Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Do we still need OCR? An implementation of a pure vision-based agent (pageindex.ai)
7 points by mingtianzhang 82 days ago | hide | past | favorite | 1 comment


We discuss the limitations of the classic OCR pipeline and provide a pure vision-based RAG system for document analysis (https://github.com/VectifyAI/PageIndex/blob/main/cookbook/vi...)

Any feedback is welcome!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: