Skip to content

Issues: adbar/trafilatura

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Question regarding title extraction question Further information is requested
#770 opened Dec 16, 2024 by unsleepy22
Loss format or data when li contains p bug Something isn't working
#769 opened Dec 16, 2024 by ezscode
Documentation: on precision documentation Docs in need of update or extension
#766 opened Dec 10, 2024 by DesBw
Backticks produce extra line breaks bug Something isn't working
#755 opened Nov 30, 2024 by klvbdmh
CLI: better control of output file names enhancement New feature or request
#754 opened Nov 30, 2024 by DesBw
Support for sidemap parsing from text instead of urls feedback Feedback from users requested
#751 opened Nov 27, 2024 by NiClassic
Performance bottleneck in prune_unwanted_nodes causing 200ms per call question Further information is requested
#750 opened Nov 23, 2024 by thsunkid
Review input type for is_probably_readerable() function enhancement New feature or request
#749 opened Nov 22, 2024 by adbar
Documentation about settings could use examples documentation Docs in need of update or extension
#746 opened Nov 15, 2024 by georgedorn
Review HTML element list and conversion enhancement New feature or request
#720 opened Oct 15, 2024 by adbar
2 tasks
Docs: add page explaining how to run tests documentation Docs in need of update or extension
#698 opened Sep 9, 2024 by adbar
Downloads: add support to switch between proxies enhancement New feature or request
#697 opened Sep 9, 2024 by adbar
Empty Results When Using Spider Function with Category URL question Further information is requested
#696 opened Sep 9, 2024 by felipehertzer
Investigate spacing in element tails question Further information is requested
#661 opened Jul 26, 2024 by adbar
Faulty extraction for very short documents enhancement New feature or request
#660 opened Jul 26, 2024 by Psynbiotik
Missing h1 heading if <header> outside of <article> question Further information is requested
#642 opened Jul 11, 2024 by chrisgoddard
some extraction duplicated in xml question Further information is requested
#634 opened Jun 27, 2024 by fortyfourforty
Image/Video caption and credits removal documentation Docs in need of update or extension question Further information is requested
#616 opened Jun 6, 2024 by hamsarajan
It's set include_images=True, but there is no picture bug Something isn't working
#610 opened May 31, 2024 by dark2star
New port of readability.js? question Further information is requested
#604 opened May 23, 2024 by zirkelc
Add option to provide XPaths for content extraction enhancement New feature or request
#596 opened May 16, 2024 by klvbdmh
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.