1 | | -<div align="center"> |
2 | | - |
3 | | -<h1>Retrieval-based-Voice-Conversion-WebUI</h1> |
4 | | -An easy-to-use SVC framework based on VITS.<br><br> |
5 | | - |
6 | | -[](https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI) |
7 | | - |
8 | | -<img src="https://counter.seku.su/cmoe?name=rvc&theme=r34" /><br> |
9 | | - |
10 | | -[](https://colab.research.google.com/github/liujing04/Retrieval-based-Voice-Conversion-WebUI/blob/main/Retrieval_based_Voice_Conversion_WebUI.ipynb) |
11 | | -[](https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/blob/main/%E4%BD%BF%E7%94%A8%E9%9C%80%E9%81%B5%E5%AE%88%E7%9A%84%E5%8D%8F%E8%AE%AE-LICENSE.txt) |
12 | | -[](https://huggingface.co/lj1995/VoiceConversionWebUI/tree/main/) |
13 | | - |
14 | | -</div> |
15 | | - |
16 | | ------- |
17 | | -[**Changelog**](https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/blob/main/Changelog_CN.md) |
18 | | - |
19 | | -[**English**](./README.md) | [**中文简体**](./README_zh_CN.md) |
20 | | - |
21 | | -> Check our [Demo Video](https://www.bilibili.com/video/BV1pm4y1z7Gm/) here! |
22 | | - |
23 | | -> Realtime Voice Conversion Software using RVC : [w-okada/voice-changer](https://github.com/w-okada/voice-changer) |
24 | | - |
25 | | -## Summary |
26 | | -This repository has the following features: |
27 | | -+ Reduce tone leakage by replacing source feature to training-set feature using top1 retrieval; |
28 | | -+ Easy and fast training, even on relatively poor graphics cards; |
29 | | -+ Training with a small amount of data also obtains relatively good results (>=10min low noise speech recommended); |
30 | | -+ Supporting model fusion to change timbres (using ckpt processing tab->ckpt merge); |
31 | | -+ Easy-to-use Webui interface; |
32 | | -+ Use the UVR5 model to quickly separate vocals and instruments. |
33 | | -+ The dataset for the pre-training model uses nearly 50 hours of high quality VCTK open source dataset, and high quality licensed song datasets will be added to training-set one after another for your use, without worrying about copyright infringement. |
34 | | -## Preparing the environment |
35 | | -We recommend you install the dependencies through poetry. |
36 | | - |
37 | | -The following commands need to be executed in the environment of Python version 3.8 or higher: |
38 | | -```bash |
39 | | -# Install PyTorch-related core dependencies, skip if installed |
40 | | -# Reference: https://pytorch.org/get-started/locally/ |
41 | | -pip install torch torchvision torchaudio |
42 | | - |
43 | | -#For Windows + 30-series Nvidia cards, you need to specify the cuda version corresponding to pytorch according to the experience of https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/issues/21 |
44 | | - |
45 | | -pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117 |
46 | | - |
47 | | -# Install the Poetry dependency management tool, skip if installed |
48 | | -# Reference: https://python-poetry.org/docs/#installation |
49 | | -curl -sSL https://install.python-poetry.org | python3 - |
50 | | - |
51 | | -# Install the project dependencies |
52 | | -poetry install |
53 | | -``` |
54 | | -You can also use pip to install the dependencies |
55 | | - |
56 | | -**Notice**: `faiss 1.7.2` will raise Segmentation Fault: 11 under `MacOS`, please change corresponding line in `requirements.txt` to `faiss-cpu==1.7.0` |
57 | | - |
58 | | -```bash |
59 | | -pip install -r requirements.txt |
60 | | -``` |
61 | | - |
62 | | -## Preparation of other Pre-models |
63 | | -RVC requires other pre-models to infer and train. |
64 | | - |
65 | | -You need to download them from our [Huggingface space](https://huggingface.co/lj1995/VoiceConversionWebUI/tree/main/). |
66 | | - |
67 | | -Here's a list of Pre-models and other files that RVC needs: |
68 | | -```bash |
69 | | -hubert_base.pt |
70 | | - |
71 | | -./pretrained |
72 | | - |
73 | | -./uvr5_weights |
74 | | - |
75 | | -#If you are using Windows, you may also need this dictionary, skip if FFmpeg is installed |
76 | | -ffmpeg.exe |
77 | | -``` |
78 | | -Then use this command to start Webui: |
79 | | -```bash |
80 | | -python infer-web.py |
81 | | -``` |
82 | | -If you are using Windows, you can download and extract `RVC-beta.7z` to use RVC directly and use `go-web.bat` to start Webui. |
83 | | - |
84 | | -We will develop an English version of the WebUI in 2 weeks. |
85 | | - |
86 | | -There's also a tutorial on RVC in Chinese and you can check it out if needed. |
87 | | - |
88 | | -## Credits |
89 | | - |
90 | | -## Thanks to all contributors for their efforts |
91 | | - |
92 | | -<a href="https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/graphs/contributors" target="_blank"> |
93 | | - <img src="https://contrib.rocks/image?repo=liujing04/Retrieval-based-Voice-Conversion-WebUI" /> |
94 | | -</a> |
95 | | - |
| 1 | +<div align="center"> |
| 2 | + |
| 3 | +<h1>Retrieval-based-Voice-Conversion-WebUI</h1> |
| 4 | +An easy-to-use SVC framework based on VITS.<br><br> |
| 5 | + |
| 6 | +[](https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI) |
| 7 | + |
| 8 | +<img src="https://counter.seku.su/cmoe?name=rvc&theme=r34" /><br> |
| 9 | + |
| 10 | +[](https://colab.research.google.com/github/liujing04/Retrieval-based-Voice-Conversion-WebUI/blob/main/Retrieval_based_Voice_Conversion_WebUI.ipynb) |
| 11 | +[](https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/blob/main/%E4%BD%BF%E7%94%A8%E9%9C%80%E9%81%B5%E5%AE%88%E7%9A%84%E5%8D%8F%E8%AE%AE-LICENSE.txt) |
| 12 | +[](https://huggingface.co/lj1995/VoiceConversionWebUI/tree/main/) |
| 13 | + |
| 14 | +</div> |
| 15 | + |
| 16 | +------ |
| 17 | +[**Changelog**](https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/blob/main/Changelog_CN.md) |
| 18 | + |
| 19 | +[**English**](./README.en.md) | [**中文简体**](./README.md) | [**日本語**](./README.ja.md) |
| 20 | + |
| 21 | +> Check our [Demo Video](https://www.bilibili.com/video/BV1pm4y1z7Gm/) here! |
| 22 | + |
| 23 | +> Realtime Voice Conversion Software using RVC : [w-okada/voice-changer](https://github.com/w-okada/voice-changer) |
| 24 | + |
| 25 | +## Summary |
| 26 | +This repository has the following features: |
| 27 | ++ Reduce tone leakage by replacing source feature to training-set feature using top1 retrieval; |
| 28 | ++ Easy and fast training, even on relatively poor graphics cards; |
| 29 | ++ Training with a small amount of data also obtains relatively good results (>=10min low noise speech recommended); |
| 30 | ++ Supporting model fusion to change timbres (using ckpt processing tab->ckpt merge); |
| 31 | ++ Easy-to-use Webui interface; |
| 32 | ++ Use the UVR5 model to quickly separate vocals and instruments. |
| 33 | ++ The dataset for the pre-training model uses nearly 50 hours of high quality VCTK open source dataset, and high quality licensed song datasets will be added to training-set one after another for your use, without worrying about copyright infringement. |
| 34 | +## Preparing the environment |
| 35 | +We recommend you install the dependencies through poetry. |
| 36 | + |
| 37 | +The following commands need to be executed in the environment of Python version 3.8 or higher: |
| 38 | +```bash |
| 39 | +# Install PyTorch-related core dependencies, skip if installed |
| 40 | +# Reference: https://pytorch.org/get-started/locally/ |
| 41 | +pip install torch torchvision torchaudio |
| 42 | + |
| 43 | +#For Windows + Nvidia Ampere Architecture(RTX30xx), you need to specify the cuda version corresponding to pytorch according to the experience of https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/issues/21 |
| 44 | + |
| 45 | +pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117 |
| 46 | + |
| 47 | +# Install the Poetry dependency management tool, skip if installed |
| 48 | +# Reference: https://python-poetry.org/docs/#installation |
| 49 | +curl -sSL https://install.python-poetry.org | python3 - |
| 50 | + |
| 51 | +# Install the project dependencies |
| 52 | +poetry install |
| 53 | +``` |
| 54 | +You can also use pip to install the dependencies |
| 55 | + |
| 56 | +**Notice**: `faiss 1.7.2` will raise Segmentation Fault: 11 under `MacOS`, please change corresponding line in `requirements.txt` to `faiss-cpu==1.7.0` |
| 57 | + |
| 58 | +```bash |
| 59 | +pip install -r requirements.txt |
| 60 | +``` |
| 61 | + |
| 62 | +## Preparation of other Pre-models |
| 63 | +RVC requires other pre-models to infer and train. |
| 64 | + |
| 65 | +You need to download them from our [Huggingface space](https://huggingface.co/lj1995/VoiceConversionWebUI/tree/main/). |
| 66 | + |
| 67 | +Here's a list of Pre-models and other files that RVC needs: |
| 68 | +```bash |
| 69 | +hubert_base.pt |
| 70 | + |
| 71 | +./pretrained |
| 72 | + |
| 73 | +./uvr5_weights |
| 74 | + |
| 75 | +#If you are using Windows, you may also need this dictionary, skip if FFmpeg is installed |
| 76 | +ffmpeg.exe |
| 77 | +``` |
| 78 | +Then use this command to start Webui: |
| 79 | +```bash |
| 80 | +python infer-web.py |
| 81 | +``` |
| 82 | +If you are using Windows, you can download and extract `RVC-beta.7z` to use RVC directly and use `go-web.bat` to start Webui. |
| 83 | + |
| 84 | +We will develop an English version of the WebUI in 2 weeks. |
| 85 | + |
| 86 | +There's also a tutorial on RVC in Chinese and you can check it out if needed. |
| 87 | + |
| 88 | +## Credits |
| 89 | + |
| 90 | +## Thanks to all contributors for their efforts |
| 91 | + |
| 92 | +<a href="https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/graphs/contributors" target="_blank"> |
| 93 | + <img src="https://contrib.rocks/image?repo=liujing04/Retrieval-based-Voice-Conversion-WebUI" /> |
| 94 | +</a> |
| 95 | + |
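
The "Preparation of other Pre-models" section of the README lists `hubert_base.pt`, `./pretrained`, and `./uvr5_weights` as required before running `python infer-web.py`. A minimal sketch of a pre-flight check for those entries follows. It is not part of the repository: the helper name `missing_premodels` is hypothetical, and treating `hubert_base.pt` as a file and the other two entries as non-empty directories is an assumption based only on how the README writes them.

```python
from pathlib import Path

# Entries the README lists as required. Whether each one is a file or a
# directory is inferred from how it is written there (assumption).
REQUIRED_FILES = ["hubert_base.pt"]
REQUIRED_DIRS = ["pretrained", "uvr5_weights"]


def missing_premodels(root):
    """Return the required entries that are absent under `root` (hypothetical helper)."""
    root = Path(root)
    missing = [name for name in REQUIRED_FILES if not (root / name).is_file()]
    # A directory only counts as present if it contains at least one entry.
    for name in REQUIRED_DIRS:
        folder = root / name
        if not folder.is_dir() or not any(folder.iterdir()):
            missing.append(name + "/")
    return missing


if __name__ == "__main__":
    absent = missing_premodels(".")
    if absent:
        print("Missing pre-models:", ", ".join(absent))
    else:
        print("All pre-models found; run `python infer-web.py` next.")
```

Run from the repository root, the script prints which of the listed entries still need to be downloaded from the Huggingface space. It does not check for `ffmpeg.exe`, which the README describes as needed only on Windows when FFmpeg is not already installed.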