Crop various scanned newspaper-pages
Posted: 2011-07-21T00:47:53-07:00
Good day!
I am working on digitalizing a newspaper archive. Some guy has scanned thousands of pages from microfilm, and I'm taking the job of turning the scans into something useful.
The pages are readable, but many of the images have big black borders surrounding the pages. Normally the borders cover the right side and/or the bottom of the images. I'm wondering if there is any way to crop these black borders, so only the white newspaper-pages are left in the images. The borders are inconsistent and varies from image to image. I need a command that automatically recognizes the black borders, if any, and crops them away. We are talking about ~16,000 pages, so I need to make a batch script that goes through all the images automatically. That part I can handle myself, if I know the appropriate command for the 'convert' binary.
Here's an example of a scanned page:

I am working on digitalizing a newspaper archive. Some guy has scanned thousands of pages from microfilm, and I'm taking the job of turning the scans into something useful.
The pages are readable, but many of the images have big black borders surrounding the pages. Normally the borders cover the right side and/or the bottom of the images. I'm wondering if there is any way to crop these black borders, so only the white newspaper-pages are left in the images. The borders are inconsistent and varies from image to image. I need a command that automatically recognizes the black borders, if any, and crops them away. We are talking about ~16,000 pages, so I need to make a batch script that goes through all the images automatically. That part I can handle myself, if I know the appropriate command for the 'convert' binary.
Here's an example of a scanned page:
