UIMA explained
UIMA,[1] short for Unstructured Information Management Architecture, is an OASIS standard[2] for content analytics, originally developed at IBM. It provides a component software architecture for the development, discovery, composition, and deployment of multi-modal analytics for the analysis of unstructured information and integration with search technologies.
Structure
The UIMA architecture can be thought of in four dimensions:
- It specifies component interfaces in an analytics pipeline.
- It describes a set of design patterns.
- It suggests two data representations: an in-memory representation of annotations for high-performance analytics and an XML representation of annotations for integration with remote web services.
- It suggests development roles allowing tools to be used by users with diverse skills.
Implementations and uses
Apache UIMA, a reference implementation of UIMA, is maintained by the Apache Software Foundation.
UIMA is used in a number of software projects:
- IBM Research's Watson uses UIMA for analyzing unstructured data.[4]
- The Clinical Text Analysis and Knowledge Extraction System (Apache cTAKES) is a UIMA-based system for information extraction from medical records.
- DKPro Core is a collection of reusable UIMA components for general-purpose natural language processing.
See also
External links
Notes and References
- http://uima.apache.org/d/uimaj-2.4.0/overview_and_setup.html#ugr.faqs UIMA Frequently Asked Questions (FAQ's)
- http://uima.apache.org/uima-specification.html UIMA Specification
- Web site: Apache UIMA - News. uima.apache.org. 11 December 2019.
- Web site: Apache Innovation Bolsters IBM's "Smartest Machine on Earth" in First-ever Man vs. Machine Competition on Jeopardy! Quiz Show : The Apache Software Foundation Blog. blogs.apache.org. 14 February 2011 . 23 April 2018.