Search found 4 matches
- 2014-06-04T04:04:58-07:00
- Forum: Users
- Topic: Best scan options for PDF
- Replies: 6
- Views: 8793
Re: Best scan options for PDF
I spent quite some time now on experimenting. Your suggested options are not as good as this here: convert -density 600 -blur 1x2 -monochrome in.pdf out.tif ... seems to give the best results, but tesseract still fails while hitting partially overlapping characters. I don't know the meaning and ...
- 2014-06-03T09:37:34-07:00
- Forum: Users
- Topic: Best scan options for PDF
- Replies: 6
- Views: 8793
Re: Best scan options for PDF
If I don't use -monochrome, tesseract gives the follwoing error: Error in pixReadFromTiffStream: can't handle bpp > 32 Error in pixReadStreamTiff: pix not read Error in pixReadStream: tiff: no pix returned Error in pixRead: pix not read Error in pixGetInputFormat: pix not defined Reading tmp/tmp.tif ...
- 2014-06-01T09:40:42-07:00
- Forum: Users
- Topic: Best scan options for PDF
- Replies: 6
- Views: 8793
Re: Best scan options for PDF
Uuuhhh... this will eat up a lot of resources (not monchrome and desity 2400). I'll give it a try.
What about some fancy image processing features like -adaptive-blur, ... ?
This was more the direction I was thinking about.
What about some fancy image processing features like -adaptive-blur, ... ?
This was more the direction I was thinking about.
- 2014-06-01T02:00:33-07:00
- Forum: Users
- Topic: Best scan options for PDF
- Replies: 6
- Views: 8793
Best scan options for PDF
Hi. I need to convert the following PDF into TIF. https://www.dropbox.com/s/6yo378t1mu4j74b/1400769600930.HEI-dmz2-prd-crewlink.2336.idp.pdf Later on, it should be parsed using tesseract (OCR software). tesseract produces quite a lot of errors and false character recognition. Its available options ...