Home » Questions » Computers [ Ask a new question ]

How to do OCR on a PDF document? [duplicate]

How to do OCR on a PDF document? [duplicate]

I have a few documents in English and Hebrew that I scanned in and converted to PDF format.

Asked by: Guest | Views: 311
Total answers/comments: 2
bert [Entry]

"I found a list of free OCR software for Windows.

FreeOCR
Tesseract
WeOcr Tesseract Web Interface
GOCR
Windows GUI for GOCR
OCR Desktop
Simple OCR
TopOCR

However, these programs need an image input, not a PDF input. For this, try a PDF-to-JPG converter."
bert [Entry]

Personally, I would use Ghostview to convert them to an image, then Tesseract to convert them to text. This is a totally free, open source, cross platform solution that I have had very good results with when trying to convert plain text. I don't use it for complex documents with tables and such, but for plain text you can't beat the price.