Trying to extract both annotations and text from a PDF. #342
Santoshzzz
started this conversation in
Ask for help with specific PDFs
Replies: 1 comment 2 replies
-
PDFs represent annotations in a distinctly different way than standard text. You can access them via |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
I'm trying to extract both annotations and text from a PDF. I'm using "text = p0.extract_words(x_tolerance=3, y_tolerance=3, keep_blank_chars=False, use_text_flow=False, horizontal_ltr=True, vertical_ttb=True, extra_attrs=[])
print(text)" to extract text but unable to extract the annotations in the same way.
Any help?
Beta Was this translation helpful? Give feedback.
All reactions