Resource Description
Overview
UIMA-connectors mainly aims to bridge markup languages and UIMA's data structure, the Common Analysis Structure (CAS).
In comparison, the Tika project aims at detecting and extracting metadata and structured text content from documents of various MIME types.
UIMA-connectors is more focused on mapping between text formats and the CAS, in both directions. It provides solutions for handling formats such as Extensible Markup Language (XML), Comma-Separated Values (CSV), and whitespace-delimited tokens with newline-delimited sentences, as well as applications of these formats such as those used by the Message Understanding Conferences (MUC) and Apache OpenNLP.
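
To give a rough idea of what mapping the "whitespace token and newline sentence" format to the CAS involves, here is a minimal sketch in plain Java. It is not UIMA-connectors' own code and does not call the UIMA API. The class `WhitespaceNewlineSketch`, the `Span` type, and the `toSpans` method are hypothetical names introduced for illustration only. The sketch relies on one fact about the CAS: annotations are stand-off, meaning each one records begin and end character offsets into the document text.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration: derive token and sentence offsets from text in
// which tokens are separated by whitespace and sentences by newlines.
final class WhitespaceNewlineSketch {

    // Stand-in for a CAS annotation: a type name plus begin/end offsets
    // (end is exclusive). This is an assumption, not a UIMA class.
    static final class Span {
        final String type;
        final int begin;
        final int end;

        Span(String type, int begin, int end) {
            this.type = type;
            this.begin = begin;
            this.end = end;
        }
    }

    static List<Span> toSpans(String text) {
        List<Span> spans = new ArrayList<>();
        int tokenBegin = -1;     // -1 means "no token currently open"
        int sentenceBegin = -1;  // -1 means "no sentence currently open"
        int lastTokenEnd = -1;   // end offset of the most recent token

        // Iterate one position past the end and treat it as a newline,
        // so the final token and sentence are closed as well.
        for (int i = 0; i <= text.length(); i++) {
            char c = (i == text.length()) ? '\n' : text.charAt(i);
            if (Character.isWhitespace(c)) {
                if (tokenBegin >= 0) {
                    spans.add(new Span("Token", tokenBegin, i));
                    lastTokenEnd = i;
                    tokenBegin = -1;
                }
                // A newline closes the sentence; blank lines are skipped.
                if (c == '\n' && sentenceBegin >= 0) {
                    spans.add(new Span("Sentence", sentenceBegin, lastTokenEnd));
                    sentenceBegin = -1;
                }
            } else {
                if (tokenBegin < 0) tokenBegin = i;
                if (sentenceBegin < 0) sentenceBegin = i;
            }
        }
        return spans;
    }

    public static void main(String[] args) {
        String text = "Hello world\nBye";
        for (Span s : toSpans(text)) {
            System.out.println(s.type + " [" + s.begin + "," + s.end + ") "
                    + text.substring(s.begin, s.end));
        }
    }
}
```

In an actual UIMA pipeline, each span would presumably be added to the CAS as an annotation of a type declared in the type system. The reverse mapping, from CAS to text, would write out the covered text of each token annotation separated by spaces, with a newline at each sentence boundary.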