Optimus is the missing framework to profile, clean, process and do ML in a distributed fashion using Apache Spark (PySpark).
Read more
The Linux Portal Site
Optimus is the missing framework to profile, clean, process and do ML in a distributed fashion using Apache Spark (PySpark).
Read moreyt is an open-source Python package for analyzing and visualizing volumetric data. yt focuses on driving physically-meaningful inquiry.
Read moreAWS Data Wrangler extends the power of Pandas library to AWS connecting DataFrames and AWS data related services.
Read moreOrange is a component-based framework for machine learning and data mining. It includes a range of data visualization, and exploration.
Read moreThe Pentaho BI Project is application software for enterprise reporting, analysis, dashboard, data mining, workflow and ETL capabilities.
Read moreData science is an emerging, multidisciplinary field of scientific methods, processes, algorithm development and technology to extract knowledge or insights in ingenious ways from structured or unstructured data.
Read more