this post was submitted on 04 Sep 2024
35 points (97.3% liked)

Linux

48008 readers
882 users here now

From Wikipedia, the free encyclopedia

Linux is a family of open source Unix-like operating systems based on the Linux kernel, an operating system kernel first released on September 17, 1991 by Linus Torvalds. Linux is typically packaged in a Linux distribution (or distro for short).

Distributions include the Linux kernel and supporting system software and libraries, many of which are provided by the GNU Project. Many Linux distributions use the word "Linux" in their name, but the Free Software Foundation uses the name GNU/Linux to emphasize the importance of GNU software, causing some controversy.

Rules

Related Communities

Community icon by Alpár-Etele Méder, licensed under CC BY 3.0

founded 5 years ago
MODERATORS
 

Hi folks, I'm in a bit of a personal crisis currently and need to quickly find a piece of speech transcription software that works on Linux and does not require a significant time investment to set up and can help me transcribe a number of audio clips <15 min. each.

  • Can someone recommend a program that can transcribe some audio recordings for me and is relatively simple to set up and use?
  • Do such programs need a GPU to run effectively? I'm running a Dell XPS 9370 laptop which only has internal graphics.

My backup plan is to listen and transcribe by hand, so recommendations of a program that will allow me to self-transcribe by typing while listening at a reduced rate are also appreciated.

  • If any experienced transcribers are reading this, have you found that your pedals worked well with Linux?

Normally I would try out all the different programs and do more than the small number of searches I've done, but my timeline doesn't allow time for to build a cluster of custom-coded transcription bots running gentoo on hand-soldered hardware.

My environment is EndeavorOS running on a Dell XPS 9370,internet is over Wifi, with no external dongles or anything currently hooked up.

you are viewing a single comment's thread
view the rest of the comments
[–] just_another_person@lemmy.world 2 points 2 months ago (1 children)

Depends on what the audio is. What's the crisis?

Generally, you can use CPU for anything based on pytorch, it will just take substantially longer.

[–] njordomir@lemmy.world 1 points 2 months ago (1 children)

Transcription of numerous voice mails and phone calls for a legal matter. Would like to supply transcripts with the audio files so we don't have to pay as much time for the lawyer's paralegals to review and decide what is actually going to be useful.

[–] just_another_person@lemmy.world 2 points 2 months ago (1 children)

Start with Whisper as someone else mentioned. DeepSpeech by Mozilla is another simple one.

Both are similar in performance and accuracy for normal spoken conversation with no extra auditory noise.

[–] njordomir@lemmy.world 2 points 2 months ago

Whisper worked for me. I'll have to go back through and tag speakers and fox a few spots but you guys have saved me 80-90% of the work. Thank you.