Issues: turboderp/exllamav2
Issues list
[REQUEST] Offloading a customizable number of experts into RAM for DeepSeek V3 685B?
#706
opened Dec 26, 2024 by
TyraVex
3 tasks done
[REQUEST] Sage Attention? Anyone tried it with exllama?
#702
opened Dec 21, 2024 by
Ph0rk0z
3 tasks done
[BUG] Qwen2.5-72B-2.xxbpw/Llama-70B-2.4bpw (maybe related to KV caching code) garbage output on some specific prompts.
bug
Something isn't working
#697
opened Dec 14, 2024 by
Originalimoc
3 tasks done
[BUG] lmformatenforcer integration seems to be broken on new versions
bug
Something isn't working
#696
opened Dec 11, 2024 by
hvico
3 tasks done
[BUG] ExLlamaV2DynamicGenerator class is not multiple threads supported
bug
Something isn't working
#690
opened Nov 29, 2024 by
UTSAV-44
3 tasks done
[BUG] generator.iterate() returns corrupted result objects in some cases
bug
Something isn't working
#689
opened Nov 29, 2024 by
p-e-w
3 tasks done
[BUG] Speculative decoding regresses performance on 7900 xtx under ROCM
bug
Something isn't working
#685
opened Nov 25, 2024 by
Mushoz
3 tasks done
qwen coder32b run on colab t4
bug
Something isn't working
#682
opened Nov 23, 2024 by
werruww
3 tasks done
[BUG] [Qwen] Draft model produce garbage output
bug
Something isn't working
#674
opened Nov 14, 2024 by
Nepherpitou
3 tasks done
[REQUEST] Convert.py: Option to skip measurement when setting 8.0/8.0
#673
opened Nov 13, 2024 by
Originalimoc
3 tasks done
[PAPER] New quant method with SOTA quality and speed: QTIP
#668
opened Nov 1, 2024 by
TyraVex
3 tasks done
[REQUEST] Alternative way to the Pytorch environment variables on Windows to set Pytorch memory management parameters
#664
opened Oct 29, 2024 by
Nexesenex
3 tasks done
[BUG] AMD - Out of memory errors despite having plenty of VRAM
bug
Something isn't working
#662
opened Oct 27, 2024 by
RSAStudioGames
3 tasks done
[REQUEST] Llama 3.2 Vision Support (or already exists?)
#658
opened Oct 18, 2024 by
grimulkan
3 tasks done
[BUG] Appending-Runtime-LoRA-weights
bug
Something isn't working
#656
opened Oct 16, 2024 by
royallavanya140
3 tasks done
[BUG] Convert script fails to run on master branch as of v0.2.3
bug
Something isn't working
#655
opened Oct 15, 2024 by
iamwavecut
3 tasks done
[BUG] RAM UTILISATION IS INCREASING RAPIDLY
bug
Something isn't working
#639
opened Sep 25, 2024 by
UTSAV-44