Releases: xorbitsai/inference

v0.15.2

20 Sep 09:05
5de46e9

What's new in 0.15.2 (2024-09-20)

These are the changes in inference v0.15.2.

New features

Bug fixes

Documentation

Full Changelog: v0.15.1...v0.15.2

v0.15.1

14 Sep 07:38
4c5e752

What's new in 0.15.1 (2024-09-14)

These are the changes in inference v0.15.1.

New features

Enhancements

Bug fixes

Documentation

New Contributors

Full Changelog: v0.15.0...v0.15.1

v0.15.0

06 Sep 08:45
e2618be

What's new in 0.15.0 (2024-09-06)

These are the changes in inference v0.15.0.

New features

  • FEAT: cosyvoice model support streaming reply by @wuminghui-coder in #2192
  • FEAT: support qwen2-vl-instruct by @Minamiyama in #2205

Enhancements

Bug fixes

  • BUG: Fix docker image startup issue due to entrypoint by @ChengjieLi28 in #2207
  • BUG: fix xinference init failure when custom path is faulty by @amumu96 in #2208
  • BUG: use default_uid to replace uid of actors which may override the xoscar actor's uid property by @qinxuye in #2214
  • BUG: fix rerank max length by @qinxuye in #2219
  • BUG: logger bug of function using generator decoration by @wxiwnd in #2215
  • BUG: fix rerank calculation of tokens number by @qinxuye in #2228
  • BUG: fix embedding token calculation & optimize memory by @qinxuye in #2221

Documentation

  • DOC: Modify the installation documentation to change single quotes to double quotes for Windows compatibility. by @nikelius in #2211

Others

New Contributors

  • @nikelius made their first contribution in #2211
  • @wuminghui-coder made their first contribution in #2192

Full Changelog: v0.14.4...v0.15.0

v0.14.4.post1

03 Sep 07:50
b1b7c44

What's new in 0.14.4.post1 (2024-09-03)

These are the changes in inference v0.14.4.post1.

Bug fixes

  • BUG: Fix docker image startup issue due to entrypoint by @ChengjieLi28 in #2207
  • BUG: fix xinference init failure when custom path is faulty by @amumu96 in #2208

Documentation

  • DOC: Modify the installation documentation to change single quotes to double quotes for Windows compatibility. by @nikelius in #2211

Others

  • CHORE: Docker image is only pushed to aliyun when releasing version by @ChengjieLi28 in #2216

New Contributors

Full Changelog: v0.14.4...v0.14.4.post1

v0.14.4

30 Aug 10:59
f3d510e

What's new in 0.14.4 (2024-08-30)

These are the changes in inference v0.14.4.

New features

Enhancements

Bug fixes

Others

New Contributors

Full Changelog: v0.14.3...v0.14.4

v0.14.3

25 Aug 04:51
2460978

What's new in 0.14.3 (2024-08-25)

These are the changes in inference v0.14.3.

New features

  • FEAT: ChatTTS speech voice support encoded speaker str by @codingl2k1 in #2096
  • FEAT: [UI] Add other parameters to other models besides the LLM model. by @yiboyasss in #2129
  • FEAT: support SD3-medium inpainting by @qinxuye in #2137
  • FEAT: 🎸 Added the model dtype parameter for embedding (currently only supported for gte-Qwen2 models). by @Zzzz1111 in #2120
  • FEAT: Support fish speech model by @codingl2k1 in #2119
  • FEAT: support CogVLM2-video by @Minamiyama in #2110
  • FEAT: Support LMDeploy for internvl2 and fix missing finish reason in internvl stream by @amumu96 in #2145

Enhancements

  • ENH: make internvl2 support video by @Minamiyama in #2104
  • ENH: support process_image with padding for image_to_image by @qinxuye in #2109
  • REF: use utils._decode_image replacing same codes in individual vl files by @Minamiyama in #2105

Bug fixes

Documentation

Others

New Contributors

Full Changelog: v0.14.2...v0.14.3

v0.14.2

16 Aug 10:29
e4d2257

What's new in 0.14.2 (2024-08-16)

These are the changes in inference v0.14.2.

New features

  • FEAT: add gemma-2-it 2b & internlm2.5-chat 1.8b and 20b & update video and sglang docs by @qinxuye in #2080
  • FEAT: support FP8 for vllm & sglang engine by @qinxuye in #2069
  • FEAT: Support internvl2 and internvl stream by @amumu96 in #2079

Enhancements

Bug fixes

Documentation

  • DOC: update readme & add tips for large image models by @qinxuye in #2056

New Contributors

Full Changelog: v0.14.1...v0.14.2

v0.14.1.post1

13 Aug 06:56
9afee76

What's new in 0.14.1.post1 (2024-08-13)

These are the changes in inference v0.14.1.post1.

Bug fixes

  • BUG: limit AutoAWQ version to fix docker issue by @qinxuye in #2067

Documentation

  • DOC: update readme & add tips for large image models by @qinxuye in #2056

Full Changelog: v0.14.1...v0.14.1.post1

v0.14.1

09 Aug 10:14
3e7ed86

What's new in 0.14.1 (2024-08-09)

These are the changes in inference v0.14.1.

New features

Enhancements

Bug fixes

Tests

Documentation

Others

New Contributors

Full Changelog: v0.14.0...v0.14.1

v0.14.0.post1

05 Aug 06:06
1112993

What's new in 0.14.0.post1 (2024-08-05)

These are the changes in inference v0.14.0.post1.

Enhancements

Bug fixes

Full Changelog: v0.14.0...v0.14.0.post1