The Apache OpenNLP library is a free and open source machine learning based toolkit for the processing of natural language text.
Read more
The Linux Portal Site
The Apache OpenNLP library is a free and open source machine learning based toolkit for the processing of natural language text.
Read more
Java is one of the most widely used programming languages. We explore the best free and open source Java based NLP tools.
Read more
Stanford CoreNLP is an extensible annotation-based NLP pipeline that provides core natural language analysis.
Read more
CogComp-NLP provides a suite of state-of-the-art Natural Language Processing (NLP) tools that allows you to annotate plain text inputs.
Read more
ReVerb automatically identifies and extracts binary relationships from English sentences. It’s designed for Web-scale information extraction.
Read more
The Natural Language Processing for JVM languages (NLP4J) project provides NLP tools, frameworks, and an API. Free and open source.
Read more
MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling…
Read more
Apache Lucene is an open source high-performance, full-featured information retrieval software library written entirely in Java.
Read more
The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).
Read more
Apache UIMA is an Apache-licensed open source implementation of the UIMA specification. Frameworks are available for Java and C++.
Read more