Search: [speech-to-text] - Biapy Web Directory

Whisper.c++ https://github.com/ggerganov/whisper.cpp

Tue Apr 16 08:52:16 2024

📧email

Port of OpenAI's Whisper model in C/C++

BetterDictation.com https://betterdictation.com/

Fri Mar 1 08:22:57 2024

📧email

Type so fast, your boss will think there's 3 of you!

BetterDictation is your personal scribe. You speak, and it will quickly and flawless transcribe into any app.

S4E10 - Quel destin pour l’Apple Vision Pro ? @ Underscore_'s Acast :fr:.

Distil-Whisper https://github.com/huggingface/distil-whisper

Thu Dec 7 10:09:15 2023

📧email

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Otter.ai https://otter.ai/

Tue Oct 3 08:39:52 2023

📧email

Voice Meeting Notes & Real-time Transcription

AI Transcriptions by Riverside https://riverside.fm/transcription

Mon Jul 17 08:42:58 2023

📧email

Accurate AI Transcriptions in Minutes.

Web service proposing to transcribe video and/or audio content using AI

Buzz Captions https://buzzcaptions.com/

Tue May 30 11:47:52 2023

📧email

Offline audio transcription and translation.

Transcribe and translate audio offline on your personal computer. Powered by OpenAI's Whisper.

Buzz Captions @ GitHub.

Whisper https://openai.com/research/whisper

Wed Mar 15 08:26:45 2023

📧email

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.

Whisper @ GitHub.

writeout.ai – Transcribe and translate any audio file https://writeout.ai/

Mon Mar 13 10:31:28 2023

📧email

Transcribe and translate any audio file.

Free, fast and accurate transcription of audio files. 100% free to use.

writeout.ai @ GitHub

Good Tape https://www.mygoodtape.com/

Mon Mar 6 09:51:05 2023

📧email

Good Tape is a transcription service for your interview tape. (available in french)

Coqui STT https://coqui.ai/

Mon Feb 27 14:35:19 2023

📧email

Coqui STT (frogSTT) is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. frogSTT is battle tested in both production and research rocket

Coqui STT @ GitHub.

Whisper https://openai.com/blog/whisper/

Sun Jan 29 12:02:22 2023

📧email

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.
Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. We show that the use of such a large and diverse dataset leads to improved robustness to accents, background noise and technical language. Moreover, it enables transcription in multiple languages, as well as translation from those languages into English. We are open-sourcing models and inference code to serve as a foundation for building useful applications and for further research on robust speech processing.

Whisper @ GitHub.

Amberscript https://www.amberscript.com/en/

Mon Jan 16 08:28:16 2023

📧email

Audio & Video Transcription | Speech-to-text.
Smarter subtitling and transcription.
We combine artificial and human intelligence to bring you accurate and fast transcripts, captions, and translated subtitles with ease.

Links per page

Filters