this post was submitted on 11 Feb 2024
329 points (85.2% liked)
Technology
59377 readers
3673 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
How can we expect a predictive language model trained on our violent history to come up with non-violent solutions in any consistent fashion?
By debating itself (paper) regarding pros and cons of options.
There's too much focus on trying to get models to behave on initial generation right now, which isn't even at all how human brains work.
Humans have intrusive thoughts all the time. If you sat in front of a big red button labeled "nuke everything" it's pretty much a guarantee that you'd generate a thought of pushing the button.
But then your prefrontal cortex would kick in with its impulse control, modeling the outcomes and consequences of the thought and shutting that shit down quick.
The most advanced models are at a stage where we could build something similar in terms of self-guidance. It's just that it would be more expensive than it being an all-in-one generation, so there's a continued focus on safety to the point the loss in capabilities has become a subject of satire.
Make it play Tic-Tac-Toe.
How about a nice game of chess