Audience

Anyone looking for a tool to recognize speech automatically and improve text transcription

About Whisper

We’ve trained and are open-sourcing a neural net called Whisper that approaches human-level robustness and accuracy in English speech recognition. Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. We show that the use of such a large and diverse dataset leads to improved robustness to accents, background noise, and technical language. Moreover, it enables transcription in multiple languages, as well as translation from those languages into English. We are open-sourcing models and inference code to serve as a foundation for building useful applications and for further research on robust speech processing. The Whisper architecture is a simple end-to-end approach, implemented as an encoder-decoder Transformer. Input audio is split into 30-second chunks, converted into a log-Mel spectrogram, and then passed into an encoder.

Integrations

API:
Yes, Whisper offers API access

Ratings/Reviews

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Company Information

OpenAI
United States
openai.com/blog/whisper/

Videos and Screen Captures

Whisper Screenshot 1
Other Useful Business Software
Our Free Plans just got better! | Auth0 Icon
Our Free Plans just got better! | Auth0

With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
Try free now

Product Details

Platforms Supported
Cloud
Training
Documentation
Webinars
Videos
Support
Online

Whisper Frequently Asked Questions

Q: What kinds of users and organization types does Whisper work with?
Q: What languages does Whisper support in their product?
Q: What other applications or services does Whisper integrate with?
Q: Does Whisper have an API?
Q: What type of training does Whisper provide?

Whisper Product Features

Speech Recognition

Concatenated Speech
Customizable Macros
Variable Frequency
Specialty Vocabularies
Automatic Form Fill
Continuous Speech
Call Analysis
Speech-to-Text Analysis
Automatic Transcription
Voice Recognition
Multi-Languages
Audio Capture

Transcription

Annotations
Automatic Transcription
Audio/Video File Upload
AI / Machine Learning
Collaboration Tools
For Manual Transcription
Multi-Language Support
Playback Controls
Subtitles
Speech Recognition
Timecoding
Full Text Search
Natural Language Processing (NLP)
Text Editor
File Sharing

Whisper Additional Categories