List of speech recognition software

From Wikipedia, the free encyclopedia - View original article

 
Jump to: navigation, search

Open source acoustic models and speech corpus[edit]

The following list presents notable speech recognition software with a brief synopsis of characteristics.

Application nameDescriptionWebsiteOpen SourceLicenseOperating SystemNote
Cmu SphinxCMU: SourceforgeYesLinux
JuliusJuilus Sourceforge licenseYes
KaldiKalidi SourceforgeYesApache
simonSimonYes
iATROSiATROSYes
RWTH ASRNo
VoxForgeYesGPL

Macintosh[edit]

Application nameDescriptionWebsiteOpen SourceLicensePriceNote
Dragon DictateMac OSNoProprietary
MacSpeech Dictate MedicalMedical dictation product
Macspeech Dictate LegalLegal-focused dictation
Macspeech ScribeTranscription from recorded text
iListenPowerPC Macintosh
Speakable itemsIncluded with Mac OS
ViaVoiceIBM Product. Supported ended 2007.
Voice NavigatorOriginal GUI voice control (1989)
Power Secretary[1]

Mobile Devices / Smartphones[edit]

Many cell phone handsets have basic dial-by-voice features built in. Smartphones such as iPhone or Blackberry also support this. A number of 3rd party Apps have implemented natural language speech recognition support, including:

Application nameDescriptionWebsiteOpen SourceLicensePriceNote
Sonic Cloud Online Speech
Verbio ASR embeddedEmbedded Speech Recognition
Dragon Dictation
Google NowAndroid voice search
Google Voice Search
BingMicrosoft voice search
GoVivaceCloud based speech recognition
Siri Personal AssistantApple vrtual personal assistant
MeMeMe Mobile
SILVIAAndroid and iOS
Shoutout
DriveSafe.lySpeech Recognition
Vlingo
JeannieAndroid
SpeaktoitAssistant for Android, iOS and Windows Phone
ZiriAndroid
Microsoft TellmeWindows Phone 7
Ask ZiggyWindows Phone 7
fcGlobalTerminal
Vocreios
Zowi[2]
eremAndroid and iOserem

Windows[edit]

Windows 7[edit]

Windows 7 built-in speech recognition[edit]

The Windows Speech Recognition by Microsoft is the speech recognition system that comes built into Windows Vista and Windows 7. Windows Vista and Windows 7 include version 8.0 of the Microsoft speech recognition engine. Speech Recognition is available only in English, French, Spanish, German, Japanese, Simplified Chinese, and Traditional Chinese and only in the corresponding version of Windows. That means that you can not use the French speech recognition engine if you use an English version of Windows.

Add-ons for Windows 7 speech recognition[edit]

Windows 7 third-party speech recognition[edit]

  • Auditory Sciences[2]—transcription software for captioning whatever someone says.
  • Dragon NaturallySpeaking from Nuance Communications – Successor to the older DragonDictate product. Focus on dictation. 64-bit Windows support since version 10.1.
  • Freesr Speech Recognition Software – Create voice interfaces for any application, window in an application, or website/webpage. Works with Windows Speech Recognition or as add-on to NaturallySpeaking.
  • SpeechGear's Interact - combines speech recognition with language translation.
  • Sonic Extractor from Digital Syphon – Supports 22 languages. Focus on broadcasting and telephony.
  • SpeechMagicNuance Communications acquired Philips owned. Medical industry focus according to Frost & Sullivan. Standalone or embedded.[3][4]
  • Tazti - Create speech command profiles to play PC games and control applications - programs. Create speech commands to open files, folders, webpages, applications. Windows 7, Windows 8 and Windows 8.1 versions.[5]
  • VoxCommando – Voice command utility for Windows Vista or later. It interfaces with various programs and devices to allow control of multimedia, communication, and home automation.

Windows XP or 2000 only[edit]

  • e-Speaking – software for Windows XP that facilitates use of the Microsoft Speech API by adding ability to create commands to perform custom actions.
  • Microsoft Speech API – Speech recognition functionality included as part of Microsoft Office and on Tablet PCs running Microsoft Windows XP Tablet PC Edition. It can also be downloaded as part of the Speech SDK 5.1 for Windows applications, but since that is aimed at developers building speech applications, the pure SDK form lacks any user interface, and thus is unsuitable for end users.

Programs for controlling a computer's screens and desktop applications with claps[edit]

Built-in Software[edit]

  • Microsoft Kinect includes built-in software which allow speech-recognition of commands.
  • Older generation of Nokia phones like Nokia N97 series ( before using Windows 7 mobile technology) used speech-recognition with family names from contact list and other few commands.

Interactive voice response[edit]

The following are IVR/Interactive Voice response systems:

Unix-like x86 and x86_64 Speech Transcription Software[edit]

Discontinued software[edit]

  • SpeechWorks from Nuance Communications.
  • Quack.com (acquired by AOL) The name has now been reused for an iPad search app.
  • IBM ViaVoice – Embedded version still maintained by IBM.[9] No longer supported for versions above Windows Vista.[10] Untested above Mac OS X 10.4 or on Macintoshes with an Intel chipset.[11]
  • Game Commander 2 by Mindmaker. Gaming oriented voice recognition. Voice commands can be assigned to issue keystrokes and key combinations.

See also[edit]

References[edit]

  1. ^ "PowerSecretary Announcement". 
  2. ^ http://www.massmatch.org/aboutus/listserv/2010/2010-03-31.html
  3. ^ [1]
  4. ^ Philips SpeechMagic named European Technology Leader by Frost & Sullivan
  5. ^ O'Neill, Mark (2013-11-06). "Control your PC with these 5 speech recognition programs". PC World. Retrieved 2013-12-30. 
  6. ^ http://isl.ira.uka.de/downloads/asru_hagen.ps
  7. ^ http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=599557
  8. ^ http://www.vocapia.com/voxsigma-speech-to-text.html
  9. ^ http://www-01.ibm.com/software/pervasive/embedded_viavoice/
  10. ^ http://nuance.custhelp.com/app/answers/detail/a_id/5775/p/31/c/980/r_id/100023
  11. ^ http://nuance.custhelp.com/app/answers/detail/a_id/4987/related/1/p/31/c/980/r_id/100023