Alex Graves (computer scientist) explained

Alex Graves
Thesis Title:Supervised sequence labelling with recurrent neural networks
Thesis Year:2008
Thesis Url:https://www.worldcat.org/oclc/1184353689
Workplaces:DeepMind
University of Toronto
Dalle Molle Institute for Artificial Intelligence Research
Doctoral Advisor:Jürgen Schmidhuber

Alex Graves is a computer scientist.

Education

Graves earned his Bachelor of Science degree in Theoretical Physics from the University of Edinburgh and a PhD in artificial intelligence from the Technical University of Munich supervised by Jürgen Schmidhuber at the Dalle Molle Institute for Artificial Intelligence Research.[1] [2]

Career and research

After his PhD, Graves was postdoc working with Schmidhuber at the Technical University of Munich and Geoffrey Hinton[3] at the University of Toronto.

At the Dalle Molle Institute for Artificial Intelligence Research, Graves trained long short-term memory (LSTM) neural networks by a novel method called connectionist temporal classification (CTC).[4] This method outperformed traditional speech recognition models in certain applications.[5] In 2009, his CTC-trained LSTM was the first recurrent neural network (RNN) to win pattern recognition contests, winning several competitions in connected handwriting recognition.[6] [7] Google uses CTC-trained LSTM for speech recognition on the smartphone.[8] [9]

Graves is also the creator of neural Turing machines[10] and the closely related differentiable neural computer.[11] [12] In 2023, he wrote the paper Bayesian Flow Networks.[13]

Notes and References

  1. Alex. Graves. PhD. 1184353689. Supervised sequence labelling with recurrent neural networks . 2008. Technischen Universitat Munchen.
  2. Web site: Alex Graves . Canadian Institute for Advanced Research . https://web.archive.org/web/20150501222647/http://www.cifar.ca/alex-graves . 1 May 2015.
  3. Web site: Marginally Interesting: What is going on with DeepMind and Google? . Blog.mikiobraun.de . 28 January 2014. May 17, 2016.
  4. Alex Graves, Santiago Fernandez, Faustino Gomez, and Jürgen Schmidhuber (2006). Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural nets. Proceedings of ICML’06, pp. 369–376.
  5. Book: 10.1007/978-3-540-74695-9_23 . An Application of Recurrent Neural Networks to Discriminative Keyword Spotting . Artificial Neural Networks – ICANN 2007 . Lecture Notes in Computer Science . 2007 . Fernández . Santiago . Graves . Alex . Schmidhuber . Jürgen . 4669 . 220–229 . 978-3-540-74693-5 .
  6. Graves, Alex; and Schmidhuber, Jürgen; Offline Handwriting Recognition with Multidimensional Recurrent Neural Networks, in Bengio, Yoshua; Schuurmans, Dale; Lafferty, John; Williams, Chris K. I.; and Culotta, Aron (eds.), Advances in Neural Information Processing Systems 22 (NIPS'22), December 7th–10th, 2009, Vancouver, BC, Neural Information Processing Systems (NIPS) Foundation, 2009, pp. 545–552 https://dl.acm.org/doi/10.5555/2981780.2981848
  7. 10.1109/TPAMI.2008.137 . A Novel Connectionist System for Unconstrained Handwriting Recognition . 2009 . Graves . A. . Liwicki . M. . Fernandez . S. . Bertolami . R. . Bunke . H. . Schmidhuber . J. . IEEE Transactions on Pattern Analysis and Machine Intelligence . 31 . 5 . 855–868 . 19299860 .
  8. Google Research Blog. The neural networks behind Google Voice transcription. August 11, 2015. By Françoise Beaufays http://googleresearch.blogspot.co.at/2015/08/the-neural-networks-behind-google-voice.html
  9. Google Research Blog. Google voice search: faster and more accurate. September 24, 2015. By Haşim Sak, Andrew Senior, Kanishka Rao, Françoise Beaufays and Johan Schalkwyk – Google Speech Team http://googleresearch.blogspot.co.uk/2015/09/google-voice-search-faster-and-more.html
  10. Web site: Google's Secretive DeepMind Startup Unveils a "Neural Turing Machine" . May 17, 2016.
  11. Graves. Alex. Wayne. Greg. Reynolds. Malcolm. Harley. Tim. Danihelka. Ivo. Grabska-Barwińska. Agnieszka. Colmenarejo. Sergio Gómez. Grefenstette. Edward. Ramalho. Tiago. 2016-10-12. Hybrid computing using a neural network with dynamic external memory. Nature. en. 538. 7626. 10.1038/nature20101. 1476-4687. 471–476. 27732574. 2016Natur.538..471G. 205251479.
  12. Web site: Differentiable neural computers DeepMind. DeepMind. 2016-10-19.
  13. 2308.07037 . Graves . Alex . Rupesh Kumar Srivastava . Atkinson . Timothy . Gomez . Faustino . Bayesian Flow Networks . 2023 . cs.LG .