Hacker Newsnew | past | comments | ask | show | jobs | submit | broland's commentslogin

Website says Speech to text is done by Groq and Whisper.


I use whisper and pyannote (https://github.com/m-bain/whisperX), but it is a pain to run locally - I run it on a 4080. This seems to be actually trying to identify the speakers. Not sure what they are doing for that.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: