Kaldi (software) explained

Kaldi
Developer:	Daniel Povey and others
Latest Release Version:	Revision 3122
Repo:	https://github.com/kaldi-asr/kaldi
Programming Language:	C++
Operating System:	Unix systems (Linux, BSD, OSX 10. etc.), Windows (via Cygwin)
Genre:	Speech recognition
License:	Apache License v.2.0^[1]

Kaldi is an open-source speech recognition toolkit written in C++ for speech recognition and signal processing, freely available under the Apache License v2.0.

Kaldi aims to provide software that is flexible and extensible,^[2] and is intended for use by automatic speech recognition (ASR) researchers for building a recognition system.

It supports linear transforms, MMI, boosted MMI and MCE discriminative training, feature-space discriminative training, and deep neural networks.^[3]

Kaldi is capable of generating features like mfcc, fbank, fMLLR, etc. Hence in recent deep neural network research, a popular usage of Kaldi is to pre-process raw waveform into acoustic feature for end-to-end neural models.

Kaldi has been incorporated as part of the CHiME Speech Separation and Recognition Challenge over several successive events.^[4] ^[5] ^[6] The software was initially developed as part of a 2009 workshop at Johns Hopkins University.^[7]

Kaldi is named after the legendary Ethiopian goat herder Kaldi who was said to have discovered the coffee plant.^[8]

External links

- Kaldi – The official GitHub project
Kaldi paper - The Kaldi Speech Recognition Toolkit
VOSK – open source and commercial models from Alpha Cephei on Kaldi foundations

Notes and References

Web site: Kaldi: Legal stuff. kaldi-asr.org.
Web site: Kaldi: About the Kaldi project. kaldi-asr.org.
Web site: Kaldi: Deep Neural Networks in Kaldi. kaldi-asr.org.
Web site: The 4th CHiME Speech Separation and Recognition Challenge. . 15 February 2017. 16 February 2017. https://web.archive.org/web/20170216210537/http://spandh.dcs.shef.ac.uk/chime_challenge/software.html. dead.
Web site: The 3rd CHiME Speech Separation and Recognition Challenge. . 15 February 2017.
Emmanuel Vincent, Jon Barker, Shinji Watanabe, Jonathan Le Roux, Francesco Nesta, etal.. The second 'CHiME' Speech Separation and Recognition Challenge: Datasets, tasks andbaselines. ICASSP - 38th International Conference on Acoustics, Speech, and Signal Processing- 2013, May 2013, Vancouver, Canada. pp.126-130, 2013.
Web site: History of the Kaldi project. 26 July 2017.
Web site: Kaldi: About the Kaldi project.

Kaldi (software) explained

See also

External links

Notes and References