Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Hardware and software for scanning and OCR old magazines
6 points by scrpn 3 months ago | hide | past | favorite | 5 comments
I have a collection of old history magazines, starting from 1969, that I would like to scan and convert to text. What tools can be used?


Book scanning: https://en.wikipedia.org/wiki/Book_scanning

awesome-scanning lists Devices, Software: https://github.com/ad-si/awesome-scanning

book scanner: https://hn.algolia.com/?q=book+scanner :

- Foot pedal book scanner


Thank you!


NP!

/?awesome-selfhosted "scan" : https://github.com/awesome-selfhosted/awesome-selfhosted#doc... :

- DMS: Document Management System

- paperless-ngx

- papermerge

But then to do search snippets and/or genai with citations of scanned PDFs, images, and hopefully .txt and .md too: https://news.ycombinator.com/item?id=44321180 :

> paperai/paperetl, paperqa2, paperqa-zotero,


One that I see that looks promising is CZUR scanners, they are relatively low-cost and have a lot of ease-of-use features like OCR, page separation (scanning an open book and creating two pages), auto rotation, curl flattening (corrects warped scans due to book spines), foot control, etc.

I haven't used it but maybe someone will pipe-in with a reply.


A process I’ve been following for a genealogy project is to scan with iPhone, or use a webcam on your computer, organize photos into pdf, upload pdf to ai studio to ocr and transcribe, and then I will convert that to a Latex document for a cleanly formatted text based version.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: