Search found 4 matches

by geohei
2014-06-04T04:04:58-07:00
Forum: Users
Topic: Best scan options for PDF
Replies: 6
Views: 8793

Re: Best scan options for PDF

I spent quite some time now on experimenting. Your suggested options are not as good as this here: convert -density 600 -blur 1x2 -monochrome in.pdf out.tif ... seems to give the best results, but tesseract still fails while hitting partially overlapping characters. I don't know the meaning and ...
by geohei
2014-06-03T09:37:34-07:00
Forum: Users
Topic: Best scan options for PDF
Replies: 6
Views: 8793

Re: Best scan options for PDF

If I don't use -monochrome, tesseract gives the follwoing error: Error in pixReadFromTiffStream: can't handle bpp > 32 Error in pixReadStreamTiff: pix not read Error in pixReadStream: tiff: no pix returned Error in pixRead: pix not read Error in pixGetInputFormat: pix not defined Reading tmp/tmp.tif ...
by geohei
2014-06-01T09:40:42-07:00
Forum: Users
Topic: Best scan options for PDF
Replies: 6
Views: 8793

Re: Best scan options for PDF

Uuuhhh... this will eat up a lot of resources (not monchrome and desity 2400). I'll give it a try.

What about some fancy image processing features like -adaptive-blur, ... ?
This was more the direction I was thinking about.
by geohei
2014-06-01T02:00:33-07:00
Forum: Users
Topic: Best scan options for PDF
Replies: 6
Views: 8793

Best scan options for PDF

Hi. I need to convert the following PDF into TIF. https://www.dropbox.com/s/6yo378t1mu4j74b/1400769600930.HEI-dmz2-prd-crewlink.2336.idp.pdf Later on, it should be parsed using tesseract (OCR software). tesseract produces quite a lot of errors and false character recognition. Its available options ...