Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pdf_table_extraction.ipynb notebook (dem) error #241

Open
ashuYB opened this issue Nov 19, 2022 · 3 comments
Open

pdf_table_extraction.ipynb notebook (dem) error #241

ashuYB opened this issue Nov 19, 2022 · 3 comments
Assignees
Labels
bug Something isn't working

Comments

@ashuYB
Copy link

ashuYB commented Nov 19, 2022

Hello team,

I am trying to execute the demo notebook (pdf_data_extraction) and 'am getting an error while importing:

from src.data.s3_communication import S3Communication

ImportError: cannot import name 'PDFTableExtractor' from 'src.components.preprocessing' (/opt/app-root/lib64/python3.8/site-packages/src/components/preprocessing/init.py)

I was speaking with Ryan Day and he suggested that I get your counsel.

Thanks
Ashu

PS: I have been following teh instructions in https://github.com/os-climate/aicoe-osc-demo/blob/master/README.md

@HeatherAck
Copy link
Contributor

@Shreyanand are the instructions still accurate and branch accurate

@HeatherAck HeatherAck added the bug Something isn't working label Nov 21, 2022
@Shreyanand
Copy link
Member

@HeatherAck The instructions are accurate but since we are not using the NLP models for table extraction, it is not updated and has errors. The pdf text extraction and other notebooks in the inference.pipeline and training.pipeline should work fine.

@ashuYB
Copy link
Author

ashuYB commented Nov 23, 2022

Hello team, @Shreyanand and I made progress but the pdf_text_extraction notebook doesn't create the extracted output in the ~/data/extraction

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

4 participants