Ashish Vaswani Explained
Ashish Vaswani |
Birth Date: | 1986 |
Known For: | Transformer (deep learning architecture) |
Website: | https://www.isi.edu/~avaswani/ |
Thesis Title: | Smaller, Faster, and Accurate Models for Statistical Machine Translation |
Thesis Year: | 2014 |
Ashish Vaswani is a computer scientist working in deep learning,[1] who is known for his significant contributions to the field of artificial intelligence (AI) and natural language processing (NLP). He is one of the co-authors of the seminal paper "Attention Is All You Need"[2] which introduced the Transformer model, a novel architecture that uses a self-attention mechanism and has since become foundational to many state-of-the-art models in NLP. Transformer architecture is the core of language models that power applications such as ChatGPT.[3] [4] [5] He was a co-founder of Adept AI Labs[6] [7] and a former staff research scientist at Google Brain.[8] [9]
Career
Vaswani completed his engineering in Computer Science from BIT Mesra in 2002. In 2004, he moved to the US to pursue higher studies at University of Southern California.[10] He did his PhD at the University of Southern California under the supervision of Prof. David Chiang.[11] He has worked as a researcher at Google,[12] where he was part of the Google Brain team. He was a co-founder of Adept AI Labs but has since left the company.[13] [14]
Notable works
Vaswani's most notable work is the paper "Attention Is All You Need", published in 2017.[15] The paper introduced the Transformer model, which eschews the use of recurrence in sequence-to-sequence tasks and relies entirely on self-attention mechanisms. The model has been instrumental in the development of several subsequent state-of-the-art models in NLP, including BERT,[16] GPT-2, and GPT-3.
Notes and References
- Web site: Ashish Vaswani . 2023-07-11 . scholar.google.com.
- Vaswani . Ashish . Ashish Vaswani . Shazeer . Noam . Parmar . Niki . Uszkoreit . Jakob . Jones . Llion . Gomez . Aidan N . Aidan Gomez . Kaiser . Ćukasz . Polosukhin . Illia . Attention is All you Need . Advances in Neural Information Processing Systems . 2017 . 30 . Curran Associates, Inc..
- Web site: Inside the brain of ChatGPT . 2023-07-12 . stackbuilders.com . en.
- Web site: 2023-01-18 . Understanding ChatGPT as explained by ChatGPT . 2023-07-12 . Advancing Analytics . en-US.
- News: Seetharaman . Deepa . Jin . Berber . 2023-05-08 . ChatGPT Fever Has Investors Pouring Billions Into AI Startups, No Business Plan Required . en-US . Wall Street Journal . 2023-07-12 . 0099-9660.
- Web site: Introducing Adept .
- News: Top ex-Google AI researchers raise $8 million in funding from Thrive Capital. The Economic Times . May 4, 2023.
- Attention is All You Need. Ashish. Vaswani. Noam. Shazeer. Niki. Parmar. Jakob. Uszkoreit. Llion. Jones. Aidan N.. Gomez. Lukasz. Kaiser. Illia. Polosukhin. May 21, 2017. cs.CL . 1706.03762 .
- Web site: Shead . Sam . 2022-06-10 . A.I. gurus are leaving Big Tech to work on buzzy new start-ups . 2023-07-12 . CNBC . en.
- Web site: The Indian Researchers Whose Work Led To The Creation Of ChatGPT. OfficeChai. Team. February 4, 2023. OfficeChai.
- Web site: Ashish Vaswani's webpage at ISI. www.isi.edu.
- Web site: Transformer: A Novel Neural Network Architecture for Language Understanding. August 31, 2017. ai.googleblog.com.
- News: AI startup Adept raises $350 mln in fresh funding. Ananya Mariam. Rajesh. Krystal. Hu. Ananya Mariam. Rajesh. Krystal. Hu. Reuters . March 16, 2023. www.reuters.com.
- News: Tong . Anna . Hu . Krystal . Tong . Anna . Hu . Krystal . 2023-05-04 . Top ex-Google AI researchers raise funding from Thrive Capital . en . Reuters . 2023-07-11.
- Web site: USC Alumni Paved Path for ChatGPT. USC Viterbi | School of Engineering.
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Jacob. Devlin. Ming-Wei. Chang. Kenton. Lee. Kristina. Toutanova. May 24, 2019. cs.CL . 1810.04805.