ChatGLM + VITS + SadTalker

base on chatGLM-6B-int4/vits/SadTalker

本项目基于清华大学开源模型chatGLM-6B以及vits框架
- 主要参考https://github.com/ruoqiu6/chat-with-Elysia2.0.git 和 https://github.com/OpenTalker/SadTalker
- chatGLM-6B模型为清华大学开源，使用时请注意查看对应的使用需知，严格遵守使用规定
  - 模型下载链接点击这里请仔细阅读说明根据自己的硬件配置下载对应模型
  - 模型下载后请将模型及响应的文件放置在./chatglm-model路径下
  - 参数下载方式可参考这里的视频
  - 情绪设定和前序prompt设定暂时维持原作者default.json，可自行修改
- vits模型来自up主“saya睡大觉中”，严禁商用
  - 下载后请将模型以及配置文件放在./model-vits路径下
  - 内部含有多种模型，可根据自己的需求进行选择选择参数在soundmaker.py中的self.speaker_choice中进行修改
- Sadtalker模型下载参考https://github.com/OpenTalker/SadTalker
  - 模型下载：bash scripts/download_models.sh
自行部署项目时，使用下面命令以安装模块，注意：pip安装的torch可能为cpu版本，请按照torch官网的安装方式安装对应的cuda版本，如果出现模块兼容性问题，请使用python3.9.6

pip install -r requirements.txt
运行项目时，使用 python main.py 即可运行

在运行main文件后,按顺序，填写问题，提供人物图片，生成对话，生成对话视频

模型全文包括参数等存于链接：https://pan.baidu.com/s/1JPsijA4muq8rGsxUykrfrg?pwd=2zot 提取码：2zot

参考：

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
__pycache__		__pycache__
data		data
docs		docs
examples		examples
model-vits		model-vits
monotonic_align/monotonic_align		monotonic_align/monotonic_align
results/e66701fe-86e4-4c60-bb04-33281dfd1bc6		results/e66701fe-86e4-4c60-bb04-33281dfd1bc6
scripts		scripts
src		src
text		text
0.wav		0.wav
20230613211044.MP4		20230613211044.MP4
LICENSE		LICENSE
README.md		README.md
attentions.py		attentions.py
chat.py		chat.py
commons.py		commons.py
config.json		config.json
inference.py		inference.py
launcher.py		launcher.py
link-of-model.txt		link-of-model.txt
main.py		main.py
mel_processing.py		mel_processing.py
models.py		models.py
modules.py		modules.py
parse.py		parse.py
predict.py		predict.py
requirements.txt		requirements.txt
soundMaker.py		soundMaker.py
transforms.py		transforms.py
utils.py		utils.py