Last Updated on May 22, 2022
Summary
OCRmyPDF is a great utility. It receives our strong recommendation. It’s easy to install, it works really well, and there’s a lot of options available. OCRmyPDF has all you need to work efficiently. It produces valid PDFs, handles accents and multilingual characters, and doesn’t bloat the output file. We particularly like that the program uses unpaper, which offers a great range of image processing filters to improve images.
Out of the whole process, only the OCR phase actually makes use of more than 1 core of your CPU. The other parts of the process would really benefit from multi-core support although you can use the GNU Parallel utility to apply OCRmyPDF simultaneously to multiple files.
There are issues that can affect the quality of the OCR output, such as poor image quality of the scanned document, a mixture of fonts used in the scanned documents, the italicized and underlining of fonts, all of which can blur the quality and shape of the individual characters. Because of this, it is much more difficult to ensure that the character that is “recognized” by the OCR software is the character on the scanned document. Fortunately, Tesseract has very good accuracy although it doesn’t recognize handwriting, and has problems with documents with multiple columns. But there isn’t a better open source tool available.
OCRmyPDF is under active development, seeing regular updates roughly on a fortnightly basis. The project has attracted more than 2,500 GitHub stars. There’s clear and concise documentation available.
OCRmyPDF is written in Python. Learn Python with our recommended free books and free tutorials.
Website: github.com/ocrmypdf/OCRmyPDF
Support: Documentation
Developer: James R. Barlow and contributors
License: GNU General Public License v3.0
Pages in this article:
Page 1 – Introduction / Installation
Page 2 – In Operation
Page 3 – Summary
Complete list of articles in this series:
Excellent Utilities | |
---|---|
AES Crypt | Encrypt files using the Advanced Encryption Standard |
Ananicy | Shell daemon created to manage processes’ IO and CPU priorities |
broot | Next gen tree explorer and customizable launcher |
Cerebro | Fast application launcher |
cheat.sh | Community driven unified cheat sheet |
CopyQ | Advanced clipboard manager |
croc | Securely transfer files and folders from the command-line |
Deskreen | Live streaming your desktop to a web browser |
duf | Disk usage utility with more polished presentation than the classic df |
eza | A turbo-charged alternative to the venerable ls command |
Extension Manager | Browse, install and manage GNOME Shell Extensions |
fd | Wonderful alternative to the venerable find |
fkill | Kill processes quick and easy |
fontpreview | Quickly search and preview fonts |
horcrux | File splitter with encryption and redundancy |
Kooha | Simple screen recorder |
KOReader | Document viewer for a wide variety of file formats |
Imagine | A simple yet effective image optimization tool |
LanguageTool | Style and grammar checker for 30+ languages |
Liquid Prompt | Adaptive prompt for Bash & Zsh |
lnav | Advanced log file viewer for the small-scale; great for troubleshooting |
lsd | Like exa, lsd is a turbo-charged alternative to ls |
Mark Text | Simple and elegant Markdown editor |
McFly | Navigate through your bash shell history |
mdless | Formatted and highlighted view of Markdown files |
navi | Interactive cheatsheet tool |
noti | Monitors a command or process and triggers a notification |
Nushell | Flexible cross-platform shell with a modern feel |
nvitop | GPU process management for NVIDIA graphics cards |
OCRmyPDF | Add OCR text layer to scanned PDFs |
Oh My Zsh | Framework to manage your Zsh configuration |
Paperwork | Designed to simplify the management of your paperwork |
pastel | Generate, analyze, convert and manipulate colors |
PDF Mix Tool | Perform common editing operations on PDF files |
peco | Simple interactive filtering tool that's remarkably useful |
ripgrep | Recursively search directories for a regex pattern |
Rnote | Sketch and take handwritten notes |
scrcpy | Display and control Android devices |
Sticky | Simulates the traditional “sticky note” style stationery on your desktop |
tldr | Simplified and community-driven man pages |
tmux | A terminal multiplexer that offers a massive boost to your workflow |
Tusk | An unofficial Evernote client with bags of potential |
Ulauncher | Sublime application launcher |
Watson | Track the time spent on projects |
Whoogle Search | Self-hosted and privacy-focused metasearch engine |
Zellij | Terminal workspace with batteries included |