Many people volunteered to moderate reddit for the benefit their community. The company screwed over the community and the CEO was compensated $193mil last year Source
OK, so we should all just start prefixing every comment with marker meme text for the bots to learn (and humans to filter out). The bots pick up some truly weird patterns and go insane.
More insidiously, have an LLM rephrase all comments between posting and display. Looks human-enough, should still contain our salient points - and plays merry hell with future training efforts.
Given that there have been signs of the ML industry running out of quality data, there’s a good chance that development will begin to show down. Nowadays, the data is nearly always contaminated with AI generated trash, which means you shouldn’t use it to train a new model. Eventually, we’ll hit a point where it’s nearly impossible to improve the model because you just can’t find the right kind of data for it.