update readme
This commit is contained in:
parent
14b7772475
commit
55adec9f3f
|
@ -1,7 +1,7 @@
|
||||||
A streaming digital human based on the Ernerf model, realize audio video synchronous dialogue. It can basically achieve commercial effects.
|
A streaming digital human based on the Ernerf model, realize audio video synchronous dialogue. It can basically achieve commercial effects.
|
||||||
基于ernerf模型的流式数字人,实现音视频同步对话。基本可以达到商用效果
|
基于ernerf模型的流式数字人,实现音视频同步对话。基本可以达到商用效果
|
||||||
|
|
||||||
[![Watch the video]](/assets/demo.mp4)
|
[效果演示](https://www.bilibili.com/video/BV1PM4m1y7Q2/)
|
||||||
|
|
||||||
## Features
|
## Features
|
||||||
1. 支持声音克隆
|
1. 支持声音克隆
|
||||||
|
@ -19,12 +19,14 @@ Tested on Ubuntu 20.04, Python3.10, Pytorch 1.12 and CUDA 11.3
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
conda create -n nerfstream python=3.10
|
conda create -n nerfstream python=3.10
|
||||||
conda install pytorch==1.12.1 torchvision==0.13.1 cudatoolkit=11.3 -c pytorch
|
|
||||||
conda activate nerfstream
|
conda activate nerfstream
|
||||||
|
conda install pytorch==1.12.1 torchvision==0.13.1 cudatoolkit=11.3 -c pytorch
|
||||||
pip install -r requirements.txt
|
pip install -r requirements.txt
|
||||||
pip install "git+https://github.com/facebookresearch/pytorch3d.git"
|
pip install "git+https://github.com/facebookresearch/pytorch3d.git"
|
||||||
pip install tensorflow-gpu==2.8.0
|
pip install tensorflow-gpu==2.8.0
|
||||||
|
pip install --upgrade "protobuf<=3.20.1"
|
||||||
```
|
```
|
||||||
|
安装常见问题[FAQ](/assets/faq.md)
|
||||||
linux cuda环境搭建可以参考这篇文章 https://zhuanlan.zhihu.com/p/674972886
|
linux cuda环境搭建可以参考这篇文章 https://zhuanlan.zhihu.com/p/674972886
|
||||||
|
|
||||||
|
|
||||||
|
@ -144,6 +146,7 @@ python app.py --transport rtmp --push_url 'rtmp://localhost/live/livestream'
|
||||||
docker run --gpus all -it --network=host --rm registry.cn-hangzhou.aliyuncs.com/lipku/nerfstream:v1.3
|
docker run --gpus all -it --network=host --rm registry.cn-hangzhou.aliyuncs.com/lipku/nerfstream:v1.3
|
||||||
```
|
```
|
||||||
docker版本已经不是最新代码,可以作为一个空环境,把最新代码拷进去运行。
|
docker版本已经不是最新代码,可以作为一个空环境,把最新代码拷进去运行。
|
||||||
|
另外提供autodl镜像:https://www.codewithgpu.com/i/lipku/metahuman-stream/base
|
||||||
|
|
||||||
## 5. Data flow
|
## 5. Data flow
|
||||||

|

|
||||||
|
@ -174,8 +177,10 @@ docker版本已经不是最新代码,可以作为一个空环境,把最新
|
||||||
- [x] 添加chatgpt实现数字人对话
|
- [x] 添加chatgpt实现数字人对话
|
||||||
- [x] 声音克隆
|
- [x] 声音克隆
|
||||||
- [x] 数字人静音时用一段视频代替
|
- [x] 数字人静音时用一段视频代替
|
||||||
|
- [ ] MuseTalk
|
||||||
|
|
||||||
如果本项目对你有帮助,帮忙点个star。也欢迎感兴趣的朋友一起来完善该项目。
|
如果本项目对你有帮助,帮忙点个star。也欢迎感兴趣的朋友一起来完善该项目。
|
||||||
Email: lipku@foxmail.com
|
Email: lipku@foxmail.com
|
||||||
|
知识星球: https://t.zsxq.com/7NMyO
|
||||||
微信公众号:数字人技术
|
微信公众号:数字人技术
|
||||||

|

|
||||||
|
|
|
@ -0,0 +1,65 @@
|
||||||
|
1. pytorch3d安装不成功\
|
||||||
|
下载源码编译
|
||||||
|
|
||||||
|
```bash
|
||||||
|
git clone https://github.com/facebookresearch/pytorch3d.git
|
||||||
|
python setup.py install
|
||||||
|
```
|
||||||
|
|
||||||
|
2. websocket连接报错\
|
||||||
|
修改python/site-packages/flask\_sockets.py
|
||||||
|
|
||||||
|
```python
|
||||||
|
self.url_map.add(Rule(rule, endpoint=f)) 改成
|
||||||
|
self.url_map.add(Rule(rule, endpoint=f, websocket=True))
|
||||||
|
```
|
||||||
|
|
||||||
|
3. protobuf版本过高
|
||||||
|
|
||||||
|
```bash
|
||||||
|
pip uninstall protobuf
|
||||||
|
pip install protobuf==3.20.1
|
||||||
|
```
|
||||||
|
|
||||||
|
4. 数字人不眨眼\
|
||||||
|
训练模型时添加如下步骤
|
||||||
|
|
||||||
|
> Obtain AU45 for eyes blinking.\
|
||||||
|
> Run FeatureExtraction in OpenFace, rename and move the output CSV file to data/\<ID>/au.csv.
|
||||||
|
|
||||||
|
将au.csv拷到本项目的data目录下
|
||||||
|
|
||||||
|
5. 数字人添加背景图片
|
||||||
|
|
||||||
|
```bash
|
||||||
|
python app.py --bg_img bc.jpg
|
||||||
|
```
|
||||||
|
|
||||||
|
6. 用自己训练的模型报错维度不匹配\
|
||||||
|
训练模型时用wav2vec提取音频特征
|
||||||
|
|
||||||
|
```bash
|
||||||
|
python main.py data/ --workspace workspace/ -O --iters 100000 --asr_model cpierse/wav2vec2-large-xlsr-53-esperanto
|
||||||
|
```
|
||||||
|
|
||||||
|
7. rtmp推流时ffmpeg版本不对
|
||||||
|
网上版友反馈是需要4.2.2版本。我也不确定具体哪些版本不行。原则是运行一下ffmpeg,打印的信息里需要有libx264,如果没有肯定不行
|
||||||
|
```
|
||||||
|
--enable-libx264
|
||||||
|
```
|
||||||
|
8. 替换自己训练的模型
|
||||||
|
```python
|
||||||
|
.
|
||||||
|
├── data
|
||||||
|
│ ├── data_kf.json (对应训练数据中的transforms_train.json)
|
||||||
|
│ ├── au.csv
|
||||||
|
│ ├── pretrained
|
||||||
|
│ └── └── ngp_kf.pth (对应训练后的模型ngp_ep00xx.pth)
|
||||||
|
|
||||||
|
```
|
||||||
|
|
||||||
|
|
||||||
|
其他参考
|
||||||
|
https://github.com/lipku/metahuman-stream/issues/43#issuecomment-2008930101
|
||||||
|
|
||||||
|
|
Loading…
Reference in New Issue