CogComp-NLP provides a suite of state-of-the-art Natural Language Processing (NLP) tools that allow you to annotate plain text inputs.
There’s a JVM API and a Python API.
CogComp NLP Pipeline bundles some basic preprocessing steps that a lot of NLP applications need, with the goal of making them run locally.
CogComp Corpusreaders includes NLP corpus readers that read into data structures provided by the cogcomp-core-utilities package.
CogComp’s main NLP libraries provide the following modules:
- nlp-pipeline – provides an end-to-end NLP processing application that runs a variety of NLP tools on input text.
- core-utilities – provides a set of NLP-friendly data structures and a number of NLP-related utilities that support writing NLP applications, running experiments, etc.
- corpusreaders – provides classes to read documents from corpora into core-utilities data structures.
- curator – supports use of CogComp NLP Curator, a tool to run NLP applications as services.
- edison – a feature extraction framework that uses the data structures of core-utilities to extract features used in NLP applications. The library uses a Feature Extraction Language (FEX) as a declarative way of defining features.
- lemmatizer – an application that uses WordNet and simple rules to find the root forms of words in plain text.
- tokenizer – an application that identifies sentence and word boundaries in plain text.
- transliteration – an application that transliterates names between different scripts.
- pos – an application that identifies the part of speech (e.g. verb + tense, noun + number) of each word in plain text.
- ner – an application that identifies named entities in plain text according to two different sets of categories.
- md – an application that identifies entity mentions in plain text.
- relation-extraction – an application that identifies entity mentions, then identifies relation pairs among the mentions detected.
- quantifier – this tool detects mentions of quantities in the text and normalizes them to a standard form.
- inference – a suite of unified wrappers to a set of optimization libraries, as well as some basic approximate solvers.
- depparse – an application that identifies the dependency parse tree of a sentence.
- verbsense – this system addresses the verb sense disambiguation (VSD) problem for English.
- prepsrl – an application that identifies semantic relations expressed by prepositions and develops statistical learning models for predicting the relations.
- commasrl – this software extracts relations that commas participate in.
- similarity – this software compares objects, especially strings, and returns a score indicating how similar they are.
- temporal-normalizer – a temporal extractor and normalizer.
- dataless-classifier – classifies text into a user-specified label hierarchy from just the textual label descriptions.
- external-annotators – a collection of useful external annotators.
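To make the idea behind the similarity module concrete, here is a minimal, self-contained Java sketch of scoring how similar two strings are. It is an illustration only and does not use or reproduce the CogComp-NLP API: the class name `SimilaritySketch`, its methods, and the choice of a normalized Levenshtein (edit) distance are all assumptions made for this example, and the library's own metrics may work quite differently.

```java
// Hypothetical illustration; not part of CogComp-NLP.
final class SimilaritySketch {

    private SimilaritySketch() {}

    // Levenshtein distance: the minimum number of single-character
    // insertions, deletions, and substitutions turning a into b.
    // Only two rows of the dynamic-programming table are kept.
    static int editDistance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // distance from the empty prefix of a
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i; // distance to the empty prefix of b
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                int insertOrDelete = Math.min(curr[j - 1] + 1, prev[j] + 1);
                curr[j] = Math.min(insertOrDelete, prev[j - 1] + cost);
            }
            int[] tmp = prev; // swap rows for the next iteration
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    // Maps the distance to a score in [0, 1]; 1.0 means identical strings.
    static double similarity(String a, String b) {
        int longest = Math.max(a.length(), b.length());
        if (longest == 0) {
            return 1.0; // two empty strings are treated as identical
        }
        return 1.0 - (double) editDistance(a, b) / longest;
    }

    public static void main(String[] args) {
        System.out.println(similarity("Barack Obama", "Barak Obama"));
        System.out.println(similarity("kitten", "sitting"));
    }
}
```

A character-level score like this is cheap and language-independent, but it knows nothing about meaning, so "car" and "automobile" would score as dissimilar.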
Website: cogcomp.seas.upenn.edu
Support: GitHub Code Repository
Developer: Cognitive Computation Group
License: Research and Academic Use License
CogComp-NLP is written in Java.