Big Data analysis can be performed with data mining software. Here’s the best free tools to perform data analysis on big data.
Read moreTag: big data
Apache Flink – framework and distributed processing engine
Apache Flink is a framework and distributed processing engine for stateful computations over unbounded and bounded data streams.
Read moreApache Spark – unified analytics engine for large-scale data processing
Apache Spark is a multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters.
Read more14 Best Free and Open Source Key Value Stores for Big Data
This article recommends the best free and open source software key value stores for Big Data.
Read moreScyllaDB – real-time big data database
ScyllaDB is a real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB.
Read more16 Best Free and Open Source Natural Language Processing Tools
Natural language processing (NLP) is a field of computer science, artificial intelligence, and computational linguistics concerned with the interactions between computers and human (natural) languages.
Read more20 Best Free and Open Source Python Visualization Packages
Python has a fantastic range of packages to produce mesmerizing visualizations. We recommend the best free and open source Python tools.
Read moreApache Drill – schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage
Apache Drill is an open source distributed system for interactive analysis of large-scale datasets. Drill is similar to Google’s Dremel.
Read moreHPCC Systems – massively scalable supercomputing platform
HPCC is a data-intensive computing system platform designed for the enterprise to solve Big Data challenges.
Read moreStorm – big-data processing system
Storm is an open source, big-data processing system that is different from other systems. Storm is designed for distributed real-time processing.
Read moreApache Hadoop – reliable, scalable, distributed computing
The Apache Hadoop software library is an open source framework that allows for the distributed processing of large data sets across clusters.
Read moreApache Solr – enterprise search platform
Solr is a popular, stand alone, fast, open source enterprise search platform from the Apache Lucene project.
Read moreElasticSearch – distributed RESTful search engine and analytics engine
ElasticSearch is a flexible and powerful open source, distributed RESTful search engine and analytics engine for the cloud.
Read moreMeiliSearch – fast, open-source, easy to use and deploy search engine
MeiliSearch is a RESTful search API. A ready-to-go solution for everyone who wants a fast and relevant search experience for their end-users.
Read moreXapian – information retrieval library
Xapian is an open source probabilistic information retrieval library. It is a full text search engine library for programmers.
Read more