Outline of natural language processing explained

The following outline is provided as an overview of and topical guide to natural-language processing:

natural-language processing  - computer activity in which computers are entailed to analyze, understand, alter, or generate natural language. This includes the automation of any or all linguistic forms, activities, or methods of communication, such as conversation, correspondence, reading, written composition, dictation, publishing, translation, lip reading, and so on. Natural-language processing is also the name of the branch of computer science, artificial intelligence, and linguistics concerned with enabling computers to engage in communication using natural language(s) in all forms, including but not limited to speech, print, writing, and signing.

Natural-language processing

Natural-language processing can be described as all of the following:

Prerequisite technologies

The following technologies make natural-language processing possible:

Subfields of natural-language processing

Related fields

Natural-language processing contributes to, and makes use of (the theories, tools, and methodologies from), the following fields:

Structures used in natural-language processing

Processes of NLP

Applications

Component processes

Component processes of natural-language understanding

Component processes of natural-language generation

Natural-language generation  - task of converting information from computer databases into readable human language.

History of natural-language processing

History of natural-language processing

Timeline of NLP software

Software Year  CreatorDescriptionReference
Georgetown experiment1954Georgetown University and IBMinvolved fully automatic translation of more than sixty Russian sentences into English.
STUDENT1964Daniel Bobrowcould solve high school algebra word problems.[10]
ELIZA1964Joseph Weizenbauma simulation of a Rogerian psychotherapist, rephrasing her (referred to as her not it) response with a few grammar rules.[11]
SHRDLU1970Terry Winograda natural-language system working in restricted "blocks worlds" with restricted vocabularies, worked extremely well
PARRY1972Kenneth ColbyA chatterbot
KL-ONE1974Sondheimer et al.a knowledge representation system in the tradition of semantic networks and frames; it is a frame language.
MARGIE1975Roger Schank
TaleSpin (software)1976Meehan
QUALMLehnert
LIFER/LADDER1978Hendrixa natural-language interface to a database of information about US Navy ships.
SAM (software)1978Cullingford
PAM (software)1978Robert Wilensky
Politics (software)1979Carbonell
Plot Units (software)1981Lehnert
Jabberwacky1982Rollo Carpenterchatterbot with stated aim to "simulate natural human chat in an interesting, entertaining and humorous manner".
MUMBLE (software)1982McDonald
Racter1983William Chamberlain and Thomas Etterchatterbot that generated English language prose at random.
MOPTRANS1984Lytinen
KODIAK (software)1986Wilensky
Absity (software)1987Hirst
AeroText1999Lockheed MartinOriginally developed for the U.S. intelligence community (Department of Defense) for information extraction & relational link analysis
Watson2006IBMA question answering system that won the Jeopardy! contest, defeating the best human players in February 2011.
MeTA2014Sean Massung, Chase Geigle, Chengiang ZhaiMeTA is a modern C++ data sciences toolkit featuringL text tokenization, including deep semantic features like parse trees; inverted and forward indexes with compression and various caching strategies; a collection of ranking functions for searching the indexes; topic models; classification algorithms; graph algorithms; language models; CRF implementation (POS-tagging, shallow parsing); wrappers for liblinear and libsvm (including libsvm dataset parsers); UTF8 support for analysis on various languages; multithreaded algorithms
Tay2016MicrosoftAn artificial intelligence chatterbot that caused controversy on Twitter by releasing inflammatory tweets and was taken offline shortly after.

General natural-language processing concepts

Natural-language processing tools

Corpora

Natural-language processing toolkits

The following natural-language processing toolkits are notable collections of natural-language processing software. They are suites of libraries, frameworks, and applications for symbolic, statistical natural-language and speech processing.

NameLanguageLicenseCreators
(various)
Deeplearning4jJava, ScalaApache 2.0Adam Gibson, Skymind
DELPH-INLISP, C++LGPL, MIT, ... Deep Linguistic Processing with HPSG Initiative
DistinguoC++Commercial Ultralingua Inc.
DKPro CoreJavaApache 2.0 / Varying for individual modules Technische Universität Darmstadt / Online community
General Architecture for Text Engineering (GATE)JavaLGPLGATE open source community
GensimPythonLGPLRadim Řehůřek
LinguaStreamJavaFree for research University of Caen, France
MalletJavaCommon Public LicenseUniversity of Massachusetts Amherst
Modular Audio Recognition FrameworkJavaBSDThe MARF Research and Development Group, Concordia University
MontyLinguaPython, JavaFree for research MIT
Natural Language Toolkit (NLTK) PythonApache 2.0
Apache OpenNLPJavaApache License 2.0Online community
spaCyPython, CythonMITMatthew Honnibal, Explosion AI
UIMAJava / C++Apache 2.0Apache

Named-entity recognizers

Translation software

Other software

Chatterbots

See main article: List of chatbots. Chatterbot  - a text-based conversation agent that can interact with human users through some medium, such as an instant message service. Some chatterbots are designed for specific purposes, while others converse with human users on a wide range of topics.

Classic chatterbots

General chatterbots

Instant messenger chatterbots

Natural-language processing organizations

Natural-language processing-related conferences

Companies involved in natural-language processing

Natural-language processing publications

Books

Book series

Journals

People influential in natural-language processing

Bibliography

Notes and References

  1. "... modern science is a discovery as well as an invention. It was a discovery that nature generally acts regularly enough to be described by laws and even by mathematics; and required invention to devise the techniques, abstractions, apparatus, and organization for exhibiting the regularities and securing their law-like descriptions." —p.vii, J. L. Heilbron, (2003, editor-in-chief) The Oxford Companion to the History of Modern Science New York: Oxford University Press
    • Encyclopedia: Merriam-Webster Online Dictionary . science . 2011-10-16 . Merriam-Webster, Inc . 3 a: knowledge or a system of knowledge covering general truths or the operation of general laws especially as obtained and tested through scientific method b: such knowledge or such a system of knowledge concerned with the physical world and its phenomena .
  2. [Software Engineering Body of Knowledge|SWEBOK]
  3. Web site: ACM . 2006 . Computing Degrees & Careers . ACM . 2010-11-23 . 2011-06-17 . https://web.archive.org/web/20110617053818/http://computingcareers.acm.org/?page_id=12 . dead .
  4. Book: Laplante, Phillip . What Every Engineer Should Know about Software Engineering . CRC . Boca Raton . 2007 . 978-0-8493-7228-5 . 2011-01-21 .
  5. http://www.computerhope.com/jargon/i/inputdev.htm Input device
  6. McQuail, Denis. (2005). Mcquail's Mass Communication Theory. 5th ed. London: SAGE Publications.
  7. Yucong Duan, Christophe Cruz (2011), [http –//www.ijimt.org/abstract/100-E00187.htm Formalizing Semantic of Natural Language through Conceptualization from Existence]. International Journal of Innovation, Management and Technology(2011) 2 (1), pp. 37–42.
  8. Web site: Tool Module: Chomsky's Universal Grammar. thebrain.mcgill.ca.
  9. [Roger Schank]
  10. ,,
  11. ,
  12. Web site: МНОГОЦЕЛЕВОЙ ЛИНГВИСТИЧЕСКИЙ ПРОЦЕССОР ЭТАП-3 . Iitp.ru . 2012-02-14.
  13. News: Aiming to Learn as We Do, a Machine Teaches Itself . Since the start of the year, a team of researchers at Carnegie Mellon University — supported by grants from the Defense Advanced Research Projects Agency and Google, and tapping into a research supercomputing cluster provided by Yahoo — has been fine-tuning a computer system that is trying to master semantics by learning more like a human. . . October 4, 2010 . 2010-10-05 .
  14. http://rtw.ml.cmu.edu/rtw/overview Project Overview
  15. Web site: Loebner Prize Contest 2013 . People.exeter.ac.uk . 2013-09-14 . 2013-12-02.
  16. News: Gibes. Al. Circle of buddies grows ever wider. Las Vegas Review-Journal (Nevada) . 2009-03-30-->. 2002-03-25.
  17. News: ActiveBuddy Introduces Software to Create and Deploy Interactive Agents for Text Messaging; ActiveBuddy Developer Site Now Open: www.BuddyScript.com. Business Wire. 2014-01-16. 2002-07-15.
  18. Lenzo. Kevin. Infobots and Purl. The Perl Journal. 3. 2. Summer 1998. 2010-07-26.
  19. Book: Laorden. Carlos. Galan-Garcia. Patxi. Santos. Igor. Sanz. Borja. Hidalgo. Jose Maria Gomez. Bringas. Pablo G.. Negobot: A conversational agent based on game theory for the detection of paedophile behaviour. 23 August 2012. 978-3-642-33018-6. dead. https://web.archive.org/web/20130917013039/http://paginaspersonales.deusto.es/isantos/publications/2012/Laorden_2012_CISIS_Negobot.pdf. 2013-09-17.
  20. Book: Wermter . Stephan . Connectionist, Statistical and Symbolic Approaches to Learning for Natural Language Processing . 1996. Springer . Ellen Riloff . Gabriele Scheler .
  21. Book: Jurafsky . Dan . Speech and Language Processing. An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition . 2008. Prentice Hall . Upper Saddle River (N.J.) . 2 . James H. Martin . 2nd.
  22. Web site: SEM1A5 - Part 1 - A brief history of NLP. 2010-06-25.
  23. [Roger Schank]
  24. http://hdl.handle.net/2042/14456 Ibrahim, Amr Helmy. 2002. "Maurice Gross (1934-2001). À la mémoire de Maurice Gross". Hermès 34.
  25. http://www.nyu.edu/pages/linguistics/kaliedoscope/mauricegross13.pdf Dougherty, Ray. 2001. Maurice Gross Memorial Letter.
  26. Web site: Programming with Natural Language Is Actually Going to Work—Wolfram Blog. 16 November 2010 .