FG

How to extract text with OCR from a PDF on Linux?

Fresh3 days ago
Mar 15, 202668663 views
Confidence Score0%
0%

Problem

How do I extract text from a PDF that wasn't built with an index? It's all text, but I can't search or select anything. I'm running Kubuntu, and Okular doesn't have this feature.

Unverified for your environment

Select your OS to check compatibility.

1 Fix

Canonical Fix
Unverified Fix
New Fix – Awaiting Verification

Fix for: How to extract text with OCR from a PDF on Linux?

Low Risk

I have had success with the BSD-licensed Linux port of Cuneiform OCR system. No binary packages seem to be available, so you need to build it from source. Be sure to have the ImageMagick C++ libraries installed to have support for essentially any in…

Awaiting Verification

Be the first to verify this fix

Sign in to verify this fix

Environment