Natural language processing (NLP) is a field of computer science, artificial intelligence, and computational linguistics concerned with the interactions between computers and human (natural) languages.
Read moreTag: nlp
tidytext – text mining using dplyr, ggplot2, and other tidy tools
tidytext is text mining for word processing and sentiment analysis using ‘dplyr’, ‘ggplot2’, and other tidy tools.
Read morequanteda – R package for Quantitative Analysis of Textual Data
quanteda (Quantitative Analysis of Textual Data) is an R package for managing and analyzing text.
Read morewordcloud – create attractive word clouds, visualize differences and similarity between documents
wordcloud creates attractive word clouds, visualize differences and similarity between documents, and avoid over-plotting in scatter plots.
Read moretm – Text Mining Infrastructure in R
tm (shorthand for Text Mining Infrastructure in R) provides a framework for text mining applications within R.
Read morestringi – fast and portable character string processing in R
stringi is an R package for fast, portable, correct, consistent, and convenient string/text processing in any locale or character encoding.
Read morestringr – string manipulation in R
The stringr package provides a cohesive set of functions designed to make working with strings as easy as possible.
Read moreUDPipe – R package for Tokenization, Tagging, Lemmatization and Dependency Parsing
UDPipe provides language-agnostic tokenization, tagging, lemmatization and dependency parsing of raw text.
Read moreWord Vectors – R package for building and exploring word embedding models
Word Vectors is an R package for building and exploring word2vec and other word embedding models.
Read morespacyr – R wrapper around the Python spaCy package
spacyr provides a convenient R wrapper around the Python spaCy package.
Read moreMITIE: MIT Information Extraction
MITIE: MIT Information Extraction offers state-of-the-art information extraction tools. MITIE is free and open source software.
Read moretext2vec – R package – framework with API for text analysis and natural language processing
text2vec is an R package which provides an efficient framework with a concise API for text analysis and natural language processing (NLP).
Read moreMoses – statistical machine translation system
Moses is a statistical machine translation system to automatically train translation models for any language pair.
Read moreTiMBL – Tilburg Memory-Based Learner
TiMBL is an open source tool for NLP research, and for many other domains where classification tasks are learned from examples.
Read moreMeTA – modern C++ data sciences toolkit
MeTA is a C++ data sciences toolkit. A suite of natural language processing, classification, information retrieval, data mining, and more.
Read more