froztbyte @ froztbyte @awful.systems Posts 17Comments 1,755Joined 2 yr. ago
many of the proponents of things in this field will propose/argue $x thing to be massively valuable for $x
thing is, that doesn't often work out
yes, there's some value in the tech for translation outcomes. to anyone even mildly online, "so are language teaching apps/sites using this?" is probably a very nearby question. and rightly so!
and then when you go digging into how that's going in practice, wow fuck damn doesn't that Glorious AI Future sheen just fall right off...
on the one hand I feel for other people who'll maybe read this thread somewhen down the line
on the other, it's not exactly like I clipped words in my post
I moderately regret this post
because the counterposter in question went on to have some decidedly "fucking ugggggggh" posts
ah well. so we learn.
maybe train your model better! I know I know, they were already supposed to be taking over the world... alas...
jesus fuck how do you fail to understand any post of this kind this badly
RIP my hopes and dreams :<
like LLM like shithead
fuck, there's potential here, but a bit too specific for a t-shirt?
like like like idiot
perhaps?
oh, I get it, you personally choose not to make these structurally-repeatable-by-foundation errors? you personally choose to be a Unique And Correct Snowflake?
wow shit damn, I sure want to read your eventual uni paper, see what kind of distinctly novel insight you've had to wrangle this domain!
(also I did my disclaimer at the start there, so, y'know (but also igwym))
woo! but still also check out pomsky, it's legit handy!
a'ight, sure bub, let's play
tell me what hw spec I need to deploy some kind of interactive user-facing prompt system backed by whatever favourite LLM/transformer-model you want to pick. idgaf if it's llama or qwen or some shit you've got brewing in your back shed - if it's on huggingface, fair game. here's the baselines:
- expected response latencies: human, or better
- expected topical coherence: mid-support capability or above
- expected correctness: at worst "I misunderstood $x" in the sense of "whoops, sorry, I thought you were asking about ${foo} but I answered about ${bar}"; i.e. actual, contextual, concrete contextual understanding
(so, basically, anything a competent L2 support engineer at some random ISP or whatever could do)
hit it, I'm waiting.
how dare I use someone's own words against them!
You can experiment on your own GPU
you have lost the game
you have been voted off the island
you are the weakest list
etc etc etc
also
I’ve spent 6+ years of my life in compsci academia
eh. look.
I realize you'll probably receive/perceive this post negatively, ranging as anywhere from "criticism"/"extremely harsh" through ... "condemnation"?
but, nonetheless, I have a request for you
please, for the love of ${deity}, go out and meet people. get out of your niche, explore a bit. you are so damned close to stepping in the trap, and you could do not-that.
(just think! you've spent a whole 6+ years on compsci? now imagine what your next 80+ years could be!)
ah yes, my ability to read a pdf immediately confers upon me all the resources required to engage in materially equivalent experimentation of the thing that I just read! no matter whether the publisher spent cents or billions in the execution and development of said publication, oh no! it is so completely a cost paid just once, and thereafter it's totally free!
oh, wait, hang on. no. no it's the other thing. that one where all the criticisms continue to hold! my bad, sorry for mistaking those. guess I was roleplaying a LLM for a moment there!
space alien technology!!~
My most honest goal is to educate people
oh and I suppose you can back that up with verifiable facts, yes?
and that you, yourself, can stand as a sole beacon against the otherwise regularly increasing evidence and studies that both indicate toward and also prove your claims to be full of shit? you are the saviour that can help enlighten us poor unenlightened mortals?
sounds very hard. managing your calendar must be quite a skill
this isn't the place to decide which seed generator you want for your autoplag runtime
Stubsack: weekly thread for sneers not worth an entire post, week ending Sunday 27 October 2024
Stubsack: weekly thread for sneers not worth an entire post, week ending Sunday 18 August 2024
Stubsack: weekly thread for sneers not worth an entire post, week ending Sunday 11 August 2024
Stubsack: weekly thread for sneers not worth an entire post, week ending Sunday 30 June 2024
hot off the presses: automatic wrong information without even going to the wrong-information deliveries store
Stubsack: weekly thread for sneers not worth an entire post, week ending Sunday 2 June 2024
In which folks once again don’t learn the same lesson as the last few times
History of (extremely predictable) failures catching up to you? Quick, write a book!