Replies: 2 comments 3 replies
-
PyPDF2 is deprecated and PyPDF3 / PyPDF4 are different projects. I've edited your question to be only about pypdf. |
Beta Was this translation helpful? Give feedback.
0 replies
-
Is the layout mode text extraction what you're looking for? https://pypdf.readthedocs.io/en/stable/user/extract-text.html |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Consider the following article: https://arxiv.org/pdf/2106.13823.pdf
It is an academic paper formatted in two columns.
Can pypdf correctly extract text from PDF documents with a two-column format?
Beta Was this translation helpful? Give feedback.
All reactions