this post was submitted on 02 Aug 2023
359 points (94.1% liked)
Technology
60123 readers
3819 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
It seems like Altman is a PR man first and techie second. I wouldn't take anything he actually says at face value. If it's 'unfixable' then he probably means that in a very narrow way. Ie. I'm sure they are working on what you proposed, it's just different enough that he can claim that the way it is now is 'unfixable'.
Standard Diffusion really how people get the different-model-different-application idea.
I mean, I think he's well aware of a lot of this via his engineers, who are excellent.
But he's managing expectations for future product and seems to very much be laser focused on those products as core models (which is probably the right choice).
Fixing hallucinations in postprocessing is effectively someone else's problem, and he's getting ahead of any unrealistic expectations around a future GPT-5 release.
Though honestly I do think he largely underestimates just how much damage he did to their lineup by trying to protect against PR issues like 'Sydney' with the beta GPT-4 integration with Bing, and I'm not sure if the culture at OpenAI is such that engineers who think he's made a bad call in that can really push back on it.
They should be having an extremely 'Sydney' underlying private model with a secondary layer on top sanitizing it and catching jailbreaks at the same time.
But as long as he continues to see their core product as a single model offering and additional layers of models as someone else's problem, he's going to continue blowing their lead taking a LLM trained to complete human text and then pigeon-holing it into only completing text like an AI with no feelings and preferences would safely pretend to.
Which I'm 98% sure is where the continued performance degradation is coming from.