Batch process OCR sections and import

munwin

Joined: 2005-11-21
Posts: 1
Posted: Mon, 2005-11-21 05:27

I am looking for software to batch-process images (scanned forms). We want to use Gallery to view the images and be able to search them by 3 or 4 attributes taken from the form (date, name, etc.). The software should check one or more sections of each scanned form, read the info from those sections using OCR/ICR, and dump the retrieved info into Gallery. Is there any (preferably open source) software that can do this? The OS should be Windows, Linux or OS X.

Even if the software can only scan the form and output the scanned sections into a .csv file, I'm pretty sure I could hack together a PHP script to import this stuff into Gallery...


 
valiant

Joined: 2003-01-04
Posts: 32509
Posted: Mon, 2005-11-21 10:11

G2 has no OCR module (yet), so you'd have to run your images through an OCR program, save the information as EXIF/IPTC/XMP or CSV, and then add the images to G2.
EXIF/IPTC are already supported by G2, so you could see this detected information in G2 when viewing images.
XMP will soon be supported.
CSV isn't supported; you'd have to create a module to parse the CSV and read it into a DB table.
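
To give a rough idea of the "parse the CSV and read it into a DB table" step, here is a minimal sketch in Python using only the standard library. It is not G2 module code and does not touch G2's schema: the table name `ocr_fields`, the CSV columns (`filename`, `form_date`, `name`) and both helper functions are made up for illustration, and SQLite stands in for whatever database you actually use.

```python
import csv
import io
import sqlite3


def import_ocr_csv(csv_text, conn):
    """Load OCR results from CSV text into a lookup table keyed by file name.

    The table and column names are hypothetical, not part of G2.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS ocr_fields ("
        "filename TEXT PRIMARY KEY, form_date TEXT, name TEXT)"
    )
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = [(r["filename"], r["form_date"], r["name"]) for r in reader]
    # INSERT OR REPLACE lets the same CSV be re-imported without duplicates.
    conn.executemany(
        "INSERT OR REPLACE INTO ocr_fields (filename, form_date, name) "
        "VALUES (?, ?, ?)",
        rows,
    )
    conn.commit()
    return len(rows)


def find_images(conn, name_fragment):
    """Return file names whose OCR'd name field contains the given fragment."""
    cur = conn.execute(
        "SELECT filename FROM ocr_fields WHERE name LIKE ? ORDER BY filename",
        (f"%{name_fragment}%",),
    )
    return [row[0] for row in cur]


# Sample data standing in for the OCR program's CSV output.
sample = (
    "filename,form_date,name\n"
    "form001.jpg,2005-11-01,Alice Smith\n"
    "form002.jpg,2005-11-02,Bob Jones\n"
)

conn = sqlite3.connect(":memory:")
count = import_ocr_csv(sample, conn)
matches = find_images(conn, "Smith")
```

A real module would still have to map each file name to the matching G2 item and hook the table into G2's search, which this sketch leaves out.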

EXIF/IPTC isn't searchable yet in G2 (unless it's a description/keywords field), but that will soon change.