pdf_table_extraction.ipynb notebook (dem) error #241

ashuYB · 2022-11-19T16:53:40Z

Hello team,

I am trying to execute the demo notebook (pdf_data_extraction) and 'am getting an error while importing:

from src.data.s3_communication import S3Communication

ImportError: cannot import name 'PDFTableExtractor' from 'src.components.preprocessing' (/opt/app-root/lib64/python3.8/site-packages/src/components/preprocessing/init.py)

I was speaking with Ryan Day and he suggested that I get your counsel.

Thanks
Ashu

PS: I have been following teh instructions in https://github.com/os-climate/aicoe-osc-demo/blob/master/README.md

HeatherAck · 2022-11-21T19:05:42Z

@Shreyanand are the instructions still accurate and branch accurate

Shreyanand · 2022-11-21T23:35:49Z

@HeatherAck The instructions are accurate but since we are not using the NLP models for table extraction, it is not updated and has errors. The pdf text extraction and other notebooks in the inference.pipeline and training.pipeline should work fine.

ashuYB · 2022-11-23T14:57:18Z

Hello team, @Shreyanand and I made progress but the pdf_text_extraction notebook doesn't create the extracted output in the ~/data/extraction

HeatherAck assigned Shreyanand and JeremyGohBNP Nov 21, 2022

HeatherAck added the bug Something isn't working label Nov 21, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pdf_table_extraction.ipynb notebook (dem) error #241

pdf_table_extraction.ipynb notebook (dem) error #241

ashuYB commented Nov 19, 2022

HeatherAck commented Nov 21, 2022

Shreyanand commented Nov 21, 2022

ashuYB commented Nov 23, 2022

pdf_table_extraction.ipynb notebook (dem) error #241

pdf_table_extraction.ipynb notebook (dem) error #241

Comments

ashuYB commented Nov 19, 2022

HeatherAck commented Nov 21, 2022

Shreyanand commented Nov 21, 2022

ashuYB commented Nov 23, 2022