• 1 Post
  • 2 Comments
Joined 2 years ago
cake
Cake day: June 15th, 2023

help-circle

  • Working with an LLM, I’ve been able to use vosk and python to get something that works well, given a small delay. Using pyautogui, sounddevice, and keyboard, it works. I also implemented a toggle for transcription using a keyboard shortcut. Not as nice as having a gui and more options, but it does the job.

    I’d still be interested if anyone knows of a simpler “just works” solution, but this does seem like it’ll suit my needs. Hopefully others can find this helpful.