List of speech recognition software
Speech recognition software is available for many computing platforms, operating systems, use models, and software licenses. The lists below group notable software by platform and category.
Acoustic models and speech corpus (compilation)
The following list presents notable speech recognition software engines with a brief synopsis of characteristics.
Application name | Description | License | Programming language | Supported language, note | Offline or online |
---|---|---|---|---|---|
| | | | English, German, French, Mandarin, Russian | Offline |
HTK | | HTK specific | | English; version 3.5 released December 2015 | |
Julius | HMM trigrams | BSD style, non-commercial | C | Japanese, English; https://github.com/julius-speech/julius#english | Offline |
| | | | English | |
RWTH ASR | RWTH Aachen University | RWTH ASR, non-commercial use only | C++ | English | |
| | | | Multilingual | Online (through API) and Offline |
Macintosh
Application name | Description | Price | Note |
---|---|---|---|
Dragon for Mac (discontinued 2018) | macOS; by Nuance | | |
Dragon Dictate (discontinued) | macOS; by Nuance | | |
MacSpeech Scribe (discontinued) | Transcription from recorded text; acquired by Nuance | | |
iListen (discontinued) | PowerPC Macintosh; discontinued by MacSpeech; acquired by Nuance | | |
| Included with macOS | | |
ViaVoice (discontinued) | IBM product; acquired by Nuance | | |
| Original GUI voice control; 1989 | | |
Cross-platform web apps based on Chrome
The following list presents notable speech recognition software that operates in a Chrome browser as web apps. These apps use the HTML5 Web Speech API.[1]
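As a rough illustration of how such a web app might reach the Web Speech API, the sketch below looks up the recognition constructor, configures a recognizer, and joins the transcripts from a result event. The helper names (`findRecognitionCtor`, `createRecognizer`, `transcriptOf`) and the local `RecognitionLike` type are hypothetical and exist only for this example; the `SpeechRecognition` and `webkitSpeechRecognition` constructors, the `lang`, `continuous`, and `interimResults` properties, and `start()` come from the API itself. It is a minimal sketch, not how any listed app is implemented.

```typescript
// Minimal structural type for the parts of the Web Speech API used here.
// Declared locally (an assumption of this sketch) so the example does not
// depend on DOM typings that may lack SpeechRecognition.
interface RecognitionLike {
  lang: string;
  continuous: boolean;
  interimResults: boolean;
  onresult: ((event: any) => void) | null;
  start(): void;
}

type RecognitionCtor = new () => RecognitionLike;

// Look up the constructor on a scope object (normally the browser's window).
// Chrome exposes it under the webkit prefix.
function findRecognitionCtor(scope: any): RecognitionCtor | null {
  return scope?.SpeechRecognition ?? scope?.webkitSpeechRecognition ?? null;
}

// Create and configure a recognizer, or return null when the API is missing.
function createRecognizer(scope: any, lang: string = "en-US"): RecognitionLike | null {
  const Ctor = findRecognitionCtor(scope);
  if (Ctor === null) {
    return null;
  }
  const recognizer = new Ctor();
  recognizer.lang = lang;             // BCP 47 language tag
  recognizer.continuous = false;      // stop after one utterance
  recognizer.interimResults = false;  // deliver final results only
  return recognizer;
}

// Join the top-ranked transcript of every result in a result event.
function transcriptOf(event: any): string {
  const parts: string[] = [];
  for (let i = 0; i < event.results.length; i++) {
    parts.push(event.results[i][0].transcript);
  }
  return parts.join(" ");
}

// Browser usage (commented out because it needs a microphone and user consent):
// const recognizer = createRecognizer(globalThis);
// if (recognizer !== null) {
//   recognizer.onresult = (event) => console.log(transcriptOf(event));
//   recognizer.start();
// }
```

Passing the scope as a parameter, rather than reading `window` directly, keeps the lookup testable outside a browser and makes the unsupported case an explicit `null` instead of a thrown error.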
Mobile devices and smartphones
Many mobile phone handsets, including feature phones and smartphones such as iPhones and BlackBerrys, have basic dial-by-voice features built in. Many third-party apps have implemented natural-language speech recognition support, including:
Application name | Description | Price | Note |
---|---|---|---|
| Assistant for Android, iOS and Windows Phone | Free | Discontinued |
| | Free | |
| Android voice search | Free | |
| | Free | |
| Microsoft voice search | Free | |
| Apple's virtual personal assistant | Free | |
| Amazon's personal assistant | | |
| Android and iOS | | |
Windows
Windows built-in speech recognition
Windows Speech Recognition version 8.0 by Microsoft comes built into Windows Vista, Windows 7, Windows 8 and Windows 10. Speech Recognition is available only in English, French, Spanish, German, Japanese, Simplified Chinese, and Traditional Chinese, and only in the corresponding language version of Windows; the speech recognition engine for one language cannot be used on a version of Windows in another language. Windows 7 Ultimate and Windows 8 Pro allow the system language to be changed, which changes the speech engine that is available. Windows Speech Recognition evolved into Cortana, a personal assistant included in Windows 10.
Windows 7, 8, 10, 11 third-party speech recognition
- Braina – Dictate into third-party software and websites,[3] fill web forms and execute vocal commands.[4]
- Dragon NaturallySpeaking from Nuance Communications – Successor to the older DragonDictate product. Focus on dictation. 64-bit Windows support since version 10.1.
- Tazti – Create speech command profiles to play PC games and control applications. Create speech commands to open files, folders, webpages, and applications. Available for Windows 7, Windows 8 and Windows 8.1.[5]
- Voice Finger – Software that extends the Windows speech recognition system so that the mouse and keyboard can be controlled by voice alone. It is aimed at users with disabilities and users recovering from computer-related injuries.
Windows XP or 2000 only
- Microsoft Speech API – Speech recognition functionality included as part of Microsoft Office and on Tablet PCs running Microsoft Windows XP Tablet PC Edition. It can also be downloaded as part of the Speech SDK 5.1 for Windows applications, but since that is aimed at developers building speech applications, the pure SDK form lacks any user interface, and thus is unsuitable for end users.
Built-in software
Interactive voice response
The following are interactive voice response (IVR) systems:
Unix-like x86 and x86-64 speech transcription software
Discontinued software
Notes and References
- "Web Speech API Specification". dvcs.w3.org. Archived at https://web.archive.org/web/20160621225102/https://dvcs.w3.org/hg/speech-api/raw-file/tip/speechapi.html on 2016-06-21.
- Orlowski, Andrew. "Total recog: British AI makes universal speech breakthrough". The Register. Situation Publishing. 17 May 2018.
- "Speech Recognition Software for Windows PC – Braina". www.brainasoft.com. Archived at https://web.archive.org/web/20150407054442/http://www.brainasoft.com/braina/speech-to-text.html on 2015-04-07.
- "Dynamic Faceting-List of Most 57 Speech Recognition SWs and Web Services". February 23, 2019. Archived at https://web.archive.org/web/20190213161952/https://www.capterra.com/speech-recognition-software/ on February 13, 2019.
- O'Neill, Mark. "Control your PC with these 5 speech recognition programs". 2013-11-06. 2013-12-30. Archived at https://web.archive.org/web/20140101030044/http://www.pcworld.com/article/2055599/control-your-pc-with-these-5-speech-recognition-programs.html on 2014-01-01.
- "Interactive Voice Response". Genesys. Archived at https://web.archive.org/web/20161014010400/http://www.genesys.com/platform-services/intelligent-voice-response on 2016-10-14.
- http://isl.ira.uka.de/downloads/asru_hagen.ps
- Lavie, A.; Waibel, A.; Levin, L.; Finke, M.; Gates, D.; Gavalda, M.; Zeppenfeld, T.; Zhan, Puming. "Janus-III: speech-to-speech translation in multiple languages". 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing. 1 April 1997. IEEE Xplore. Vol. 1, pp. 99–102. doi:10.1109/ICASSP.1997.599557. ISBN 978-0-8186-7919-3. CiteSeerX 10.1.1.36.6967. 1514209.
- "A TensorFlow implementation of Baidu's DeepSpeech architecture". 2017-12-05. Mozilla. 2017-12-05.
- "IBM - Embedded ViaVoice - Embedded ViaVoice - Software". 2010-06-29. Archived at https://web.archive.org/web/20100808052606/http://www-01.ibm.com/software/pervasive/embedded_viavoice/ on 2010-08-08.
- "Nuance product support for Microsoft Windows 7". Nuance Communications, Customer Help. 2019-03-16.
- "ViaVoice for Mac OS X on Intel Chipset". Nuance Communications, Customer Help. 2019-03-16.