Superparagraph merging #1

Open
keynmol opened this issue Nov 6, 2014 · 0 comments
keynmol commented Nov 6, 2014

Sometimes jusText looks at this structure:

paragraph A
paragraph B
paragraph C

and extracts A and C but not B. Example (A and C are what gets extracted, B is what gets dropped):

A: A Catholic priest from Derby invited a seven-year-old boy to his house, promising to give him something he would never forget.
C: The parents were shocked by what the boy told them afterwards, but the priest convinced them that it was God's will.
....
B: The priest decided to sponsor the boy's Sunday school and all the expenses it might incur.

See how awkward it can get?

We need to also extract skipped paragraphs that have a low link density and the same xpath as the nearest extracted paragraph.
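
A rough sketch of how that rule could look, to make the proposal concrete. Everything here is an assumption for illustration: the `Paragraph` record, its field names, the `merge_superparagraphs` helper and the `0.2` link density threshold are hypothetical and are not jusText's actual API. It also assumes `xpath` is a structural path that is identical for sibling paragraphs (no positional indices).

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Paragraph:
    # Hypothetical stand-in for an extractor's paragraph record.
    text: str
    xpath: str           # assumed to be index-free, so siblings compare equal
    link_density: float  # share of characters that sit inside links, 0.0 to 1.0
    extracted: bool      # True if the classifier already kept this paragraph


def nearest_extracted(paragraphs: List[Paragraph], index: int) -> Optional[Paragraph]:
    """Return the closest extracted paragraph, preferring the earlier one on ties."""
    for distance in range(1, len(paragraphs)):
        for j in (index - distance, index + distance):
            if 0 <= j < len(paragraphs) and paragraphs[j].extracted:
                return paragraphs[j]
    return None


def merge_superparagraphs(
    paragraphs: List[Paragraph], max_link_density: float = 0.2
) -> List[Paragraph]:
    """Return a copy where skipped low-link paragraphs are rescued when they
    share an xpath with the nearest originally extracted paragraph."""
    result = []
    for i, p in enumerate(paragraphs):
        keep = p.extracted
        if not keep and p.link_density <= max_link_density:
            # Only the original flags are consulted, so rescues do not cascade.
            neighbour = nearest_extracted(paragraphs, i)
            keep = neighbour is not None and neighbour.xpath == p.xpath
        result.append(Paragraph(p.text, p.xpath, p.link_density, keep))
    return result
```

With paragraphs A, B, C sharing one xpath and only A and C extracted, B would be rescued as long as its link density stays under the threshold, while a link-heavy navigation item elsewhere in the page would stay excluded. Whether the decision should depend on the single nearest neighbour or on both neighbours is an open design choice.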
