"ooh it's more advanced but don't worry- it's not conscious"
is as much a marketing tactic as "how it feels to chew 5 gum" or buzzfeedesque "top 10 celebrity mistakes - number 3 will blow your mind"
it's a tech product that runs a series of complicated loops against a large series of texts and returns the closest comparison, as it stands it's never going to be dangerous in and of itself.
it's a tech product that runs a series of complicated loops against a large series of texts and returns the closest comparison, as it stands it's never going to be dangerous in and of itself.
That's not how it works. I really don't get what's with people these days being so willing to be confidently incorrect. It's like after the pandemic people just decided that if everyone else was spewing BS from their "gut feelings," well gosh darnit they could too!
It uses gradient descent on a large series of texts to build a neural network capable of predicting those texts as accurately as possible.
How that network actually operates ends up a black box, especially for larger models.
But research over the past year and a half in simpler toy models has found that there's a rather extensive degree of abstraction. For example, a small GPT trained only on legal Othello or Chess moves ends up building a virtual representation of the board and tracks "my pieces" and "opponent pieces" on it, despite never being fed anything that directly describes the board or the concept of 'mine' vs 'other'. In fact, in the Chess model, the research found there was even a single vector in the neural network that could be flipped to have the model play well or play like shit regardless of the surrounding moves fed in.
It's fairly different from what you seem to think it is. Though I suspect that's not going to matter to you in the least, as I've come to find that explaining transformers to people spouting misinformation about them online has about the same result as a few years ago explaining vaccine research to people spouting misinformation about that.
AI can be dangerous. The point is not that it's likely but that in the very unlikely event of it going rogue it can at worst have civilication ending consequences.
Imagine how easy it is to trick a child as an adult. The difference in intelligence between a human and superintelligent AGI would be orders of magnitude greater that that.
Exactly. People try to scare into regulatory capture talking about paperclip maximizers when meanwhile it's humans and our corporations that are literally making excess shit to the point of human extinction.
To say nothing for how often theorizing around 'superintelligence' imagines the stupidest tendencies of humanity being passed on to it while denying our smartest tendencies as "uniquely human" despite existing models largely already rejecting the projected features and modeling the 'unique' ones like empathy.
I was reflecting on this myself the other day. For all my criticisms of Zuckerberg/Meta (which are very valid), they really didn't have to release anything concerning LLaMA. They're practically the only reason we have viable open source weights/models and an engine.
It is a model squarely in the "fancy autocomplete" category along with GPT-3 and fails miserably at variations of logic puzzles in ways other contemporary models do not.
It seems that the larger training data set allows for better modeling around the fancy autocomplete parts, but even other similarly sized models like Mistral appear to have developed better underlying critical thinking capacities when you scratch below the surface that are absent here.
I don't think it's a coincidence that Meta's lead AI researcher is one of the loudest voices criticizing the views around emergent capabilities. There seems to be a degree of self-fulfilling prophecy going on. A lot of useful learnings in the creation of Llama 3, but once other models (i.e. Mistral) also start using extended training my guess is that any apparent advantages to Llama 3 right now are going to go out the window.
Meta launched the latest iteration of its AI chatbot on Thursday with Llama 3, and CEO Mark Zuckerberg says it's supposed to be really good.
The new model boasts "state-of-the-art" performance on various industry-standard benchmarks and comes with "improved reasoning," according to a company blog post.
"In terms of all of the concerns around the more existential risks, I don't think that anything at the level of what we or others in the field are working on in the next year is really in the ballpark of those types of risks," he told the publication.
It's one reason Zuckerberg feels that the company can continue making Llama open-source or available for the public or researchers to tinker with.
If Meta's model achieves multimodality — meaning the ability to deliver results in various forms of media, including text, images, and video — then that may be a case when the company won't want to make all aspects of its model open-source, Zuckerberg said.
"For example, image generation is one that we're looking at closely Especially in an election year, is that a net positive thing to do?
The original article contains 314 words, the summary contains 186 words. Saved 41%. I'm a bot and I'm open source!