[BUG] lmformatenforcer integration seems to be broken on new versions #696

hvico · 2024-12-11T04:30:59Z

OS

Linux

GPU Library

CUDA 12.x

Python version

3.10

Pytorch version

2.5

Model

No response

Describe the bug

When running the lmformatenforcer integration for JSON output, I get:

Error during execution: 'ExLlamaV2TokenEnforcerFilter' object has no attribute 'background_drop'

If I comment out that call in the generator code, it then fails on something else that is missing, related to the logit mask.

Reproduction steps

Try to enforce JSON output on v0.2.6.

Expected behavior

Integration works.

Logs

No response

Additional context

No response

Acknowledgements

I have looked for similar issues before submitting this one.
I understand that the developers have lives and my issue will be answered when possible.
I understand the developers of this program are human, and I will ask my questions politely.

turboderp · 2024-12-14T19:34:28Z

The problem with LMFE is that it implements the filter class that ExLlamaV2 needs, but it doesn't derive from the base class in ExLlamaV2. You can use the filter class from here instead until it is fixed in LMFE.
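
To illustrate the failure mode described above, here is a minimal, self-contained sketch. It does not use the real ExLlamaV2 or LMFE code: `BaseFilter`, `DuckTypedFilter`, `DerivedFilter`, `run_generator_step` and `try_step` are hypothetical stand-ins, and only the attribute name `background_drop` comes from the error message in this issue. The idea is that a class which copies an interface without inheriting from the base class stops working as soon as the caller starts using a hook that was added to the base class later.

```python
class BaseFilter:
    """Hypothetical stand-in for the generator's filter base class."""

    def background_drop(self):
        # Hook assumed to be added in a newer release; subclasses inherit
        # this default without having to change anything.
        return None

    def next(self):
        raise NotImplementedError


class DuckTypedFilter:
    """Implements the older interface but does NOT derive from BaseFilter."""

    def next(self):
        return {1, 2, 3}  # placeholder set of allowed token ids


class DerivedFilter(BaseFilter):
    """Same logic, but inherits whatever hooks the base class gains later."""

    def next(self):
        return {1, 2, 3}  # placeholder set of allowed token ids


def run_generator_step(f):
    # Hypothetical generator step: calls the newer hook unconditionally,
    # then asks the filter which tokens are allowed.
    f.background_drop()
    return f.next()


def try_step(f):
    # Run one step and report an AttributeError as text instead of raising.
    try:
        return run_generator_step(f)
    except AttributeError as exc:
        return f"AttributeError: {exc}"


if __name__ == "__main__":
    print(try_step(DerivedFilter()))
    print(try_step(DuckTypedFilter()))
```

In this sketch the derived filter keeps working, while the duck-typed one fails with the same kind of "object has no attribute" error reported above. Deriving from the base class is what lets a filter pick up defaults for methods it does not define itself.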

hvico · 2024-12-15T04:23:19Z

Thanks for the workaround!

hvico added the bug (Something isn't working) label · Dec 11, 2024
