Releases: explosion/spacy-layout
Releases · explosion/spacy-layout
v0.0.11
- Fix regression that would cause incorrect pagination numbers to be represented.
v0.0.10
- Allow
DoclingDocument
as input to spaCyLayout.__call__
to convert already processed documents to spaCy Doc
objects.
v0.0.9
- Add
Doc._.markdown
with Markdown representation of the document.
v0.0.8
- Fix serialization of extension attributes and
pandas.DataFrame
via spaCy's DocBin
. (#11, #14)
v0.0.7
- Fix bounding boxes for top left origin, refactor and add tests.
v0.0.6
- Add support for tables as layout spans and via shortcut
Doc._.tables
.
- Add
Span._.data
for table data as a pandas.DataFrame
.
- Allow customizing table display text in
Doc.text
via display_table
callback option.
v0.0.5
- Improve bounding box calculation for bottom left origin.
v0.0.4
- Fix bounding boxes for bottom left origin.
v0.0.3
- Add
spaCyLayout.pipe
to process multiple documents.
- Use
nlp.pipe
internally for tokenization.
- Also accept
bytes
as input.
v0.0.2
- Add
Span.id
with running index of layout span.
- Add
Span._.heading
that returns closest heading for given span.