Apache Stanbol Bundlelist for Language Support: Kuromoji Japanese

Provides modules that bring language support for Japanese using the Solr/Lucene kuromoji analyzer. This includes a (1) Bundle providing the Solr Analyzer; (2) an NLP processing Engine that Tokenizes, detects sentences, POS taggs, extracts Named Entities and Lemmatizes Japanese text (3) an LabelTokenizer needed to match tokens of the analyzed text with the labels of Entities in the matched vocabularies.

Homepage POM file JAR file Javadoc
'org.apache.stanbol:org.apache.stanbol.launchers.bundlelists.languageextras.kuromoji:0.12.0'

Dependencies

no dependencies