this post was submitted on 18 Jun 2024
70 points (98.6% liked)

And Finally...

1081 readers
37 users here now

A place for odd or quirky world news stories.

Elsewhere in the Fediverse:

Rules:

founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] snooggums@midwest.social 11 points 5 months ago (2 children)

Are there any examples of user facing AI that isn't a complete train wreck?

I'm sure some usafe as a tool to assist someone that can filter outliers exists for acientific pattern matching, I'm mostly wondering about stuff customers interact with that is just working without any news stories.

[–] j4k3@lemmy.world 6 points 5 months ago

OpenAI seems to be functioning.

The problem with speech to text is the background noise and the many variations of speech. I've played around with a couple of models. I can get one to work with my voice with little effort in training, but when my window AC kicks in or my computer fan hits the highest setting, it becomes a problem because the training is very dependant on the noise floor. I think they are likely extremely limited in the audio gear available in combination with the compute hardware to make it viable. Human hearing has a relatively large dynamic range and we have natural analog filtering. A machine just doing math can't handle things like clipping from someone speaking too loud, or understand the periodicity of all the vehicle and background noises like wind, birds, and other people in the vicinity. Everything that humans can contextualize is like a small learned program and alignment that took many years to train.

You will not see the full use cases of AI for quite awhile. The publicly facing tools are nowhere near the actual capabilities of present AI. If you simply read the introductory documentation for the Transformers library, which is the basis of almost all the AI stuff you see in any public spaces, the documentation clearly states that it is a a simplified tool that bypasses complexity in an attempt to make the codebase approachable to more people in various fields. It is in no way a comprehensive implementation. People are forming opinions based on projects that are hacked together using Transformers. The real shakeups are happening in business where companies like OpenAI are not peddling the simple public API, they are demonstrating the full implementations directly.