资 源 简 介
A rule-based sentence boundary recognizer written in java. Comes with a rule base for Hungarian.
About JSentence
JSentence is a rule-based sentence boundary detector written in java as a part of Mihály Héder"s diploma work in 2009. If you understand Hungarian you can read the relevant parts of Héder"s diploma thesis:
https://jsentence.googlecode.com/svn-history/r5/jsentence/docs/JSentence.pdf
Also if your work with this software results in an academic publication, please cite:
**Héder M, Tikk D
Docuphet - a dialogue assisted content annotation tool
In: Cena F, Farzan R, Lops P (szerk.)
Proc. of Workshop on Web 3.0: Merging Semantic Web and Social Web 2009
CEUR Workshop Proceedings vol 476**
Usage
Download the jar file and the Hungarian configuration (HUConfig.xml) from the downloads section.
Run java -cp jsentence.jar jsentence.JSentenceMain [input file] [output file]
exa