Skip to content

Issues: LLMServe/DistServe

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[ERROR] CUDA error
#53 opened Dec 18, 2024 by MSMsssss
Adding a new model?
#43 opened Aug 22, 2024 by gursimar
编译SwiftTransformer失败
#37 opened Aug 9, 2024 by FredHuang99
Cmake build fail
#31 opened Jul 31, 2024 by hyuenmin-choi
Model not loaded error
#24 opened Jul 15, 2024 by melissadu-db
Great work!
#20 opened Jul 4, 2024 by irasin
codellama34b ttft延迟问题 question Further information is requested
#19 opened Jul 2, 2024 by sitabulaixizawaluduo
Decode Wrong Token help wanted Extra attention is needed
#16 opened Jun 18, 2024 by sitabulaixizawaluduo
How to profile help wanted Extra attention is needed
#13 opened Jun 14, 2024 by YLSnowy
How difficult will adding Llama 3 support be? enhancement New feature or request
#12 opened Jun 13, 2024 by kalradivyanshu
decoder.embed_tokens.weight.pt not found help wanted Extra attention is needed
#10 opened Jun 11, 2024 by llx-08
ProTip! Follow long discussions with comments:>50.