-
Do you have the accuracy of identifying pdf table? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
Hi @hhhhjjj, and thanks for your interest in this library. I take your question to mean something like: Has The short answer is: no, it has not. The longer answer is that |
Beta Was this translation helpful? Give feedback.
Hi @hhhhjjj, and thanks for your interest in this library. I take your question to mean something like: Has
pdfplumber
's table-detection algorithm been tested against a benchmark, and evaluated re. whether the tables are extracted correctly? (If that's not your question, please do let me know.)The short answer is: no, it has not. The longer answer is that
pdfplumber
's table-detection algorithm does not take a probabilistic approach, but rather a deterministic one. And although it aims to provide utility with just its default settings, it provides the most utility when you customize the detection settings to the particular PDF you are parsing. So although it's theoretically possible to test