Optimus is the missing framework to profile, clean, process and do ML in a distributed fashion using Apache Spark (PySpark).
Read moreTag: data analysis
yt – Multi-code Toolkit for Analyzing and Visualizing Volumetric Data
yt is an open-source Python package for analyzing and visualizing volumetric data. yt focuses on driving physically-meaningful inquiry.
Read moreAWS Data Wrangler – extends the power of Pandas library
AWS Data Wrangler extends the power of Pandas library to AWS connecting DataFrames and AWS data related services.
Read moreOrange – data mining software
Orange is a component-based framework for machine learning and data mining. It includes a range of data visualization, and exploration.
Read morePentaho – manage and process data in hybrid and multicloud environments
The Pentaho BI Project is application software for enterprise reporting, analysis, dashboard, data mining, workflow and ETL capabilities.
Read more6 Essential Python Tools for Data Science
Data science is an emerging, multidisciplinary field of scientific methods, processes, algorithm development and technology to extract knowledge or insights in ingenious ways from structured or unstructured data.
Read more