Fix sub-definitions not being listed #68

imlutr · 2020-09-09T16:28:17Z

Previously, a word with a layered definition, such as grapple, would return the following definition:

['grapple (countable and uncountable, plural grapples)',
 'A tool with claws or hooks which is used to catch or hold something.',
 'A close hand-to-hand struggle.',
 '(uncountable) The act of grappling.']

So the sub-definitions of 'A tool with claws or hooks which is used to catch or hold something.' weren't listed.

This PR makes it so the returned definition becomes:

['grapple (countable and uncountable, plural grapples)',
 ['A tool with claws or hooks which is used to catch or hold something.',
  '(nautical) A device consisting of iron claws, attached to the end of a rope, used for grasping and holding an enemy ship prior to boarding; a grappling iron.',
  '(nautical) A grapnel (“type of anchor”).'],
 'A close hand-to-hand struggle.',
 '(uncountable) The act of grappling.']

thus fixing #57.

However, this structure (using arrays to depict a layered definition, where the first element represents the top-level definition) may not be ideal. I'm not sure.

This causes two of the tests to fail (grapple and house), as they both have such sub-definitions that were previously ignored.

The sub-definitions were being deleted in the parse_examples() function, which removes quotations; they were accidentally being treated as quotations.

Sub-definitions were previously appended to the list as:

["Top definition A", "Sub definition ASub definition B", "Top definition B"]

Now, they are appended to the "definitions" list as:

[["Top definition A", "Sub definition A", "Sub definition B"], "Top definition B"]

However, I am not sure if this is the best structure. You may change it if you find a better way.
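For anyone consuming the new output, here is a minimal sketch of how the mixed list could be walked. It assumes the shape described above (a nested list is never empty, and its first element is the top-level definition). The helper name `iter_definitions` and the sample data are made up for illustration and are not part of WiktionaryParser.

```python
# Hand-written sample in the shape this PR returns; not real parser output.
definitions = [
    "grapple (countable and uncountable, plural grapples)",
    [
        "A tool with claws or hooks which is used to catch or hold something.",
        "(nautical) A grapnel.",
    ],
    "A close hand-to-hand struggle.",
]


def iter_definitions(definitions):
    """Yield (depth, text) pairs: depth 0 is top-level, 1 is a sub-definition.

    Hypothetical helper; assumes every nested list has at least one element.
    """
    for item in definitions:
        if isinstance(item, list):
            # The first element is the top-level definition,
            # the remaining elements are its sub-definitions.
            head, *subs = item
            yield 0, head
            for sub in subs:
                yield 1, sub
        else:
            yield 0, item


pairs = list(iter_definitions(definitions))
for depth, text in pairs:
    print("    " * depth + text)
```

The depth value keeps the hierarchy available to the caller without requiring a type check at every call site.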

suyashb95 · 2020-09-09T17:40:51Z

Thanks for this! I'll take a look. Can you fix the failing tests in the meantime?


imlutr · 2020-09-11T13:21:35Z

Done. Everything should be fine now. However, I'll test some more words on my own, to make sure I didn't miss any edge cases.

reeseovine · 2020-09-21T04:40:43Z

Thanks so much for this! I just tested it out with my script that uses WiktionaryParser and (with slight modification) it works great. Hope it gets merged soon! 😁

suyashb95 · 2020-09-23T04:03:47Z

@Luca1152 @katacarbix while this change works, I'm not sure the definitions list should contain different data types. For your use cases, is it a problem if the sub-definitions are included in the list as top level definitions?

@katacarbix what modifications did you have to make? I was wondering whether the parser could be enhanced to include them.
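To make the two options concrete, here is a rough sketch of what each could look like when applied to the structure from this PR. Both functions are hypothetical and not part of the parser: `flatten` promotes sub-definitions to top-level entries (losing the hierarchy), while `to_uniform` keeps the hierarchy but gives every entry the same type. The dictionary keys `text` and `subdefinitions` are assumptions chosen for illustration.

```python
# Hand-written sample in the shape this PR returns; not real parser output.
sample = [
    ["Top definition A", "Sub definition A", "Sub definition B"],
    "Top definition B",
]


def flatten(definitions):
    """Return a flat list of strings; sub-definitions become top-level items."""
    flat = []
    for item in definitions:
        if isinstance(item, list):
            flat.extend(item)
        else:
            flat.append(item)
    return flat


def to_uniform(definitions):
    """Return a list of dicts so that every entry has the same type."""
    uniform = []
    for item in definitions:
        if isinstance(item, list):
            # First element is the top-level text, the rest are sub-definitions.
            uniform.append({"text": item[0], "subdefinitions": item[1:]})
        else:
            uniform.append({"text": item, "subdefinitions": []})
    return uniform


flat = flatten(sample)
uniform = to_uniform(sample)
```

With `flatten`, existing callers that expect a list of strings keep working, but they can no longer tell which definition a sub-definition belongs to. With `to_uniform`, callers need a one-time change but never have to check the type of an entry.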


reeseovine · 2020-09-23T05:21:46Z

I added a section to loop through sub-definitions and output them indented. I also put back the colons that I was removing before for formatting.

Old:

word = parser.fetch(query)
entries = []
for section in word:
    for defn in section['definitions']:
        for entry in defn['text']:
            if entry[:len(query)] != query:
                entries.append('('+defn['partOfSpeech']+') ' + entry.rstrip(':'))

New:

word = parser.fetch(query)
entries = []
for section in word:
    for defn in section['definitions']:
        for entry in defn['text']:
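            if isinstance(entry, list):
                # A nested list: the first item is the top-level definition,
                # the remaining items are its sub-definitions.
                entries.append('(' + defn['partOfSpeech'] + ') ' + entry[0])
                for subentry in entry[1:]:
                    entries.append('    ' + subentry)
            elif entry[:len(query)] != query:
                # Skip the headword line, which starts with the query itself.
                entries.append('(' + defn['partOfSpeech'] + ') ' + entry)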

pragma- · 2021-07-11T18:51:50Z

@suyash458 This PR fixes a very serious bug. Is this ever going to get merged and is pip ever going to get a new release?

imlutr added 2 commits September 9, 2020 18:43

Fix sub-definitions not being listed

958a278

They were deleted in the parse_examples() function, which would remove quotations, sub-definitions accidentally being treated as such.

suyashb95 assigned imlutr Sep 9, 2020

suyashb95 self-requested a review September 9, 2020 17:41

imlutr added 2 commits September 11, 2020 16:16

Update unit tests to include sub-definitions

ef1ae01

Fix logic for determining the top-definition's text

419eccc

Previously, if the top definition and its sub-definitions weren't separated by '\n', the code would fail.

imlutr closed this by deleting the head repository Dec 29, 2022
