The Linux Portal Site
Optimus is the missing framework to profile, clean, process and do ML in a distributed fashion using Apache Spark (PySpark).
yt is an open-source Python package for analyzing and visualizing volumetric data. yt focuses on driving physically meaningful inquiry.
AWS Data Wrangler extends the power of the Pandas library to AWS, connecting DataFrames with AWS data-related services.
The R Project for Statistical Computing (R) is a free software environment for statistical computing and graphics.
MOA is a software environment for implementing algorithms and running experiments for online learning from evolving data streams.
Orange is a component-based framework for machine learning and data mining. It includes a range of data visualization and exploration tools.
Environment for DeveLoping KDD-Applications Supported by Index-Structures (ELKI) is a data mining software framework.
DataMelt is an environment for scientific computation, data analysis and data visualization.
Weka (Waikato Environment for Knowledge Analysis) is a popular, comprehensive suite of machine learning software written in Java.
Rattle provides a GNOME-based, open-source interface to R functionality for binary classification tasks and data mining.