this post was submitted on 22 Apr 2024
That sounds interesting. I was hoping for something that I could use on a mobile app. I'm not sure what "adapting the model size" means so this might be more complicated than I'm looking for.
Record now, then transcribe later? You can also try https://whisper.ggerganov.com (it runs in your browser and nothing is sent anywhere, so it works even on your Android/iOS phone). The website owner is a trusted dev who made whisper.cpp and llama.cpp, the latter being the backbone of a lot of local LLM tooling.
A bit of complexity is generally the price to pay for freedom from constant surveillance and data gathering. Plus, it's actually super easy. "Model size" just means which Whisper model you download (tiny, base, small, medium or large). A bigger model means better transcription quality but slower processing, and the smaller ones are already really good. base.en is probably all you need anyway.
On PC, you can generally try any app from GitHub. They basically all use the same backend.
I found a few: https://whishper.net/ and https://github.com/chidiwilliams/buzz
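If you'd rather script it yourself on a PC, here's a rough sketch of what "picking a model size" looks like in practice. It assumes the `openai-whisper` Python package (plus ffmpeg) is installed, which is not one of the apps mentioned above. The helper names `model_name` and `transcribe` and the file name `recording.m4a` are made up for illustration.

```python
# Approximate parameter counts in millions, as listed in the openai/whisper README.
MODEL_PARAMS_M = {"tiny": 39, "base": 74, "small": 244, "medium": 769, "large": 1550}


def model_name(size: str = "base", english_only: bool = True) -> str:
    """Build the Whisper model name for a given size, e.g. 'base.en'."""
    if size not in MODEL_PARAMS_M:
        raise ValueError(f"unknown size {size!r}, expected one of {sorted(MODEL_PARAMS_M)}")
    # The large model has no English-only variant.
    if english_only and size != "large":
        return f"{size}.en"
    return size


def transcribe(audio_path: str, size: str = "base", english_only: bool = True) -> str:
    """Transcribe a local audio file entirely on this machine."""
    # Imported here so the helper above works without the package installed.
    # Assumption: `pip install openai-whisper` and ffmpeg on PATH.
    import whisper

    model = whisper.load_model(model_name(size, english_only))
    return model.transcribe(audio_path)["text"]


if __name__ == "__main__":
    # Hypothetical file name; the model is downloaded on first use.
    print(transcribe("recording.m4a"))
```

Swapping `size="base"` for `"small"` or `"medium"` is all "adapting the model size" amounts to: the bigger ones are slower and need more memory, but transcribe more accurately.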