List of speech recognition software


Speech recognition software is available for many computing platforms, operating systems, use models, and software licenses. Here is a listing of such, grouped in various useful ways.

Acoustic models and speech corpus (compilation)

The following list presents notable speech recognition software engines with a brief synopsis of characteristics.
Application nameDescriptionOpen-sourceLicenseOperating systemProgramming languageSupported language, noteOffline or online
CMU SphinxHMMBSD styleCross-platformJavaEnglish, German, French, Mandarin, RussianOffline
HTKHMM neural netHTK specificCross-platformCEnglish; version 3.5 released December 2015
JuliusHMM trigramsBSD style, non-commercialCross-platformCJapanese, English; Offline
KaldiNeural netApacheCross-platformC++English
RWTH ASRRWTH Aachen UniversityRWTH ASR, non-commercial use onlyLinux, macOSC++English
WhisperEncoder/decoder transformerMIT licenseCross-platformPython (programming language)MultilingualOnline and Offline

Cross-platform web apps based on Chrome

The following list presents notable speech recognition software that operate in a Chrome browser as web apps. They make use of HTML5 Web-Speech-API.
Application nameDescriptionOpen-sourceLicensePriceNote
SpeechmaticsCloud based and on-premise automatic speech recognitionFrom £0.06 per minute of audio

Mobile devices and smartphones

Many mobile phone handsets, including feature phones and smartphones such as iPhones and BlackBerrys, have basic dial-by-voice features built in. Many third-party apps have implemented natural-language speech recognition support, including:
Application nameDescriptionOpen-sourceLicensePriceNote
Assistant.aiAssistant for Android, iOS and Windows Phone, freewareFreeDiscontinued
Dragon Dictation, freewareFree
Google NowAndroid voice search, freewareFree
Google Voice Search, freewareFree
Microsoft CortanaMicrosoft voice search, freewareFree
Siri Personal AssistantApple's virtual personal assistant, freewareFree
Alexa – Amazon EchoAmazon's personal assistant
SILVIAAndroid and iOS
Vlingo

Windows

Windows built-in speech recognition

The Windows Speech Recognition version 8.0 by Microsoft comes built into Windows Vista, Windows 7, Windows 8 and Windows 10.
Speech Recognition is available only in English, French, Spanish, German, Japanese, Simplified Chinese, and Traditional Chinese and only in the corresponding version of Windows; meaning you cannot use the speech recognition engine in one language if you use a version of Windows in another language. Windows 7 Ultimate and Windows 8 Pro allow you to change the system language, and therefore change which speech engine is available. Windows Speech Recognition evolved into Cortana (software), a personal assistant included in Windows 10.

Windows 7, 8, 10, 11 third-party speech recognition

Microsoft Speech API

The first version of the Microsoft Speech API was released for Windows NT 3.51 and Windows 95 in 1995, it was then part of Windows up to Windows Vista. This initial version already contained Direct Speech Recognition and Direct Text To Speech APIs which applications could use to directly control engines, as well as simplified 'higher-level' Voice Command and Voice Talk APIs. Speech recognition functionality included as part of Microsoft Office and on Tablet PCs running Microsoft Windows XP Tablet PC Edition. It can also be downloaded as part of the Speech SDK 5.1 for Windows applications, but since that is aimed at developers building speech applications, the pure SDK form lacks any user interface, and thus is unsuitable for end users.

Built-in software

Interactive voice response

The following are interactive voice response systems:

Unix-like x86 and x86-64 speech transcription software

  • Janus Recognition Toolkit
  • Mozilla DeepSpeech was developing an open-source Speech-To-Text engine based on Baidu's deep speech research paper.
  • Weesper Neon Flow – professional voice-dictation software that provides offline speech-to-text processing on macOS and Windows using local AI models. It is not open source and offers a paid subscription after a 15‑day free trial.

Discontinued software