OpenRefine is a powerful tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data.
It has similarities with spreadsheet applications, and can handle spreadsheet file formats such as CSV, but it’s closer to acting like a database. Unlike spreadsheets, OpenRefine doesn’t store formulas and display the output of those calculations; it only shows the value inside each cell. It doesn’t support cell colors or text formatting.
OpenRefine lets users clean, correct, codify, and extend data. Without ever needing to type inside a single cell, users can automatically fix typos, convert things to the right format, and add structured categories from trusted sources.
This is free and open source software.
Features include:
- Faceting – drill through large datasets using facets and apply operations on filtered views of your dataset.
- Clustering – use a variety of comparison methods to find text entries that are similar but not exact, then shares those results with you so that you can merge the cells that should match.
- Transformation of data – convert values to other formats, normalizing and denormalizing.
- Reconciliation – matches your dataset with that of an external source.
- Infinite undo/redo – go back to any previous state of your dataset and replay your operation history.
- Privacy – data is cleaned locally. It doesn’t require internet access to run its basic functions.
- Export – CSV, Excel, Google spreadsheet, HTML table, and TSV.
- Import – CSV, Google spreadsheet, JSON, RDF triples, TSV, and XML.
- Cross-platform support – runs under Linux, macOS, and Windows.
Website: openrefine.org
Support: GitHub Code Repository
Developer: Community
License: BSD 3-Clause “New” or “Revised” License
OpenRefine is written in Java. Learn Java with our recommended free books and free tutorials.
Popular series | |
---|---|
![]() | The largest compilation of the best free and open source software in the universe. Each article is supplied with a legendary ratings chart helping you to make informed decisions. |
![]() | Hundreds of in-depth reviews offering our unbiased and expert opinion on software. We offer helpful and impartial information. |
![]() | The Big List of Active Linux Distros is a large compilation of actively developed Linux distributions. |
![]() | Replace proprietary software with open source alternatives: Google, Microsoft, Apple, Adobe, IBM, Autodesk, Oracle, Atlassian, Corel, Cisco, Intuit, and SAS. |
![]() | Awesome Free Linux Games Tools showcases a series of tools that making gaming on Linux a more pleasurable experience. This is a new series. |
![]() | Machine Learning explores practical applications of machine learning and deep learning from a Linux perspective. We've written reviews of more than 40 self-hosted apps. All are free and open source. |
![]() | New to Linux? Read our Linux for Starters series. We start right at the basics and teach you everything you need to know to get started with Linux. |
![]() | Alternatives to popular CLI tools showcases essential tools that are modern replacements for core Linux utilities. |
![]() | Essential Linux system tools focuses on small, indispensable utilities, useful for system administrators as well as regular users. |
![]() | Linux utilities to maximise your productivity. Small, indispensable tools, useful for anyone running a Linux machine. |
![]() | Surveys popular streaming services from a Linux perspective: Amazon Music Unlimited, Myuzi, Spotify, Deezer, Tidal. |
![]() | Saving Money with Linux looks at how you can reduce your energy bills running Linux. |
![]() | Home computers became commonplace in the 1980s. Emulate home computers including the Commodore 64, Amiga, Atari ST, ZX81, Amstrad CPC, and ZX Spectrum. |
![]() | Now and Then examines how promising open source software fared over the years. It can be a bumpy ride. |
![]() | Linux at Home looks at a range of home activities where Linux can play its part, making the most of our time at home, keeping active and engaged. |
![]() | Linux Candy reveals the lighter side of Linux. Have some fun and escape from the daily drudgery. |
![]() | Getting Started with Docker helps you master Docker, a set of platform as a service products that delivers software in packages called containers. |
![]() | Best Free Android Apps. We showcase free Android apps that are definitely worth downloading. There's a strict eligibility criteria for inclusion in this series. |
![]() | These best free books accelerate your learning of every programming language. Learn a new language today! |
![]() | These free tutorials offer the perfect tonic to our free programming books series. |
![]() | Linux Around The World showcases usergroups that are relevant to Linux enthusiasts. Great ways to meet up with fellow enthusiasts. |
![]() | Stars and Stripes is an occasional series looking at the impact of Linux in the USA. |