better pre-processing pdf files #1697
cloudrage999
started this conversation in
Ideas
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
i think private GPT needs to better parse and pre-process the pdf files , maybe using unstructured io or OCR tools
now that we have lamaparse , i think we should add such things to privategpt
this will make a big difference in getting accurate answers
currently if you want to run privategpt locally, you wont have a good experience if you got bunch of complex pdf files that have tables,messy format,pictures,diagrams etc
Beta Was this translation helpful? Give feedback.
All reactions