Releases: explosion/spacy-layout

v0.0.11

24 Dec 07:29
Pre-release
  • Fix a regression that caused incorrect page numbers to be reported.

v0.0.10

13 Dec 13:59
Pre-release
  • Allow DoclingDocument as input to spaCyLayout.__call__ to convert already processed documents to spaCy Doc objects.

v0.0.9

09 Dec 11:08
64c6f4a
Pre-release
  • Add Doc._.markdown with Markdown representation of the document.

v0.0.8

07 Dec 11:42
Pre-release
  • Fix serialization of extension attributes and pandas.DataFrame via spaCy's DocBin. (#11, #14)

v0.0.7

24 Nov 12:31
Pre-release
  • Fix bounding boxes for top-left origin; refactor and add tests.
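
The origin fixes in this and the earlier releases concern how a box's edges map to a position and size on the page. The following is a minimal sketch of that conversion, not the library's implementation: the `Origin` and `Box` types and the `to_top_left_rect` function are hypothetical names introduced here for illustration. It assumes a box is given by its four edges plus the origin it was measured from, and that the target is an (x, y, width, height) rectangle measured from the top-left corner of the page.

```python
from dataclasses import dataclass
from enum import Enum


class Origin(Enum):
    """Corner of the page that coordinates are measured from (hypothetical type)."""
    TOP_LEFT = "top_left"
    BOTTOM_LEFT = "bottom_left"


@dataclass(frozen=True)
class Box:
    """Edges of a box; hypothetical type, not part of spacy-layout."""
    left: float
    top: float
    right: float
    bottom: float
    origin: Origin


def to_top_left_rect(box: Box, page_height: float) -> tuple[float, float, float, float]:
    """Return (x, y, width, height) measured from the top-left page corner."""
    width = box.right - box.left
    if box.origin is Origin.BOTTOM_LEFT:
        # y grows upward here, so the top edge has the larger y value.
        y = page_height - box.top
        height = box.top - box.bottom
    else:
        # y already grows downward, so the bottom edge has the larger y value.
        y = box.top
        height = box.bottom - box.top
    return (box.left, y, width, height)


# The same region on a 792-point-tall page, described in both coordinate systems.
from_bottom = Box(left=72, top=720, right=300, bottom=650, origin=Origin.BOTTOM_LEFT)
from_top = Box(left=72, top=72, right=300, bottom=142, origin=Origin.TOP_LEFT)

print(to_top_left_rect(from_bottom, page_height=792))
print(to_top_left_rect(from_top, page_height=792))
```

Both calls describe the same region, so they should produce the same rectangle. Only the y position and the sign of the height calculation depend on the origin, which is why a mistake tends to affect one origin and not the other.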

v0.0.6

24 Nov 11:15
a6b33bf
Pre-release
  • Add support for tables as layout spans and via shortcut Doc._.tables.
  • Add Span._.data for table data as a pandas.DataFrame.
  • Allow customizing table display text in Doc.text via display_table callback option.

v0.0.5

21 Nov 13:19
Pre-release
  • Improve bounding box calculation for bottom-left origin.

v0.0.4

20 Nov 12:32
45ff613
Pre-release
  • Fix bounding boxes for bottom-left origin.

v0.0.3

20 Nov 11:00
Pre-release
  • Add spaCyLayout.pipe to process multiple documents.
  • Use nlp.pipe internally for tokenization.
  • Also accept bytes as input.

v0.0.2

19 Nov 09:36
Pre-release
  • Add Span.id with running index of layout span.
  • Add Span._.heading that returns closest heading for given span.
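
To illustrate how a running index and a closest-heading lookup can work together, here is a minimal sketch that uses plain Python objects instead of spaCy spans. It rests on assumptions that the notes above do not confirm: that "closest" means the nearest heading that precedes the span in document order, that a heading has no heading of its own, and that headings carry the labels in `HEADING_LABELS`. The `LayoutSpan` class and `closest_heading` function are hypothetical names, not the library's API.

```python
from dataclasses import dataclass
from typing import Optional

# Assumed label names for headings; the real label set may differ.
HEADING_LABELS = {"title", "section_header"}


@dataclass
class LayoutSpan:
    """Stand-in for a layout span (hypothetical type)."""
    id: int     # running index of the span in document order
    label: str  # layout label, e.g. "text" or "section_header"
    text: str


def closest_heading(spans: list[LayoutSpan], span: LayoutSpan) -> Optional[LayoutSpan]:
    """Return the nearest heading before `span`, or None if there is none."""
    if span.label in HEADING_LABELS:
        # Assumption: a heading is not assigned another heading.
        return None
    # The running index doubles as the position in `spans`, so everything
    # before `span.id` precedes the span; walk that slice backwards.
    for candidate in reversed(spans[: span.id]):
        if candidate.label in HEADING_LABELS:
            return candidate
    return None


spans = [
    LayoutSpan(0, "section_header", "Intro"),
    LayoutSpan(1, "text", "First paragraph."),
    LayoutSpan(2, "section_header", "Methods"),
    LayoutSpan(3, "text", "Second paragraph."),
    LayoutSpan(4, "list_item", "A bullet."),
]

for span in spans:
    heading = closest_heading(spans, span)
    print(span.id, span.label, "->", heading.text if heading else None)
```

Storing the running index on each span keeps the lookup to a single backward scan, with no need to search for the span's position first.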