Skip to content

Commit 99996fb

Browse files
authored
优化笔记本、翻译并重新引入自动push (#48)
* optimize: 精简未用到的配置项并在特征提取初步引入mps * add cmd argument: --noautoopen * fix: i18n * fix * fix * add genlocale workflow * add unitest * fix * fix * fix * 优化笔记本 * reintroduce Push changes * disable genlocale on non-main branch * 将笔记本checkout改为stable
1 parent 55135e0 commit 99996fb

6 files changed

Lines changed: 89 additions & 24 deletions

File tree

.github/workflows/genlocale.yml

Lines changed: 11 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,8 @@
11
name: genlocale
2-
on: [ push ]
2+
on:
3+
push:
4+
branches:
5+
- main
36
jobs:
47
golangci:
58
name: genlocale
@@ -22,3 +25,10 @@ jobs:
2225
git config --local user.email '41898282+github-actions[bot]@users.noreply.github.com'
2326
git add --all
2427
git commit -m "🎨 同步 locale"
28+
29+
- name: Push changes
30+
if: ${{ !github.head_ref }}
31+
uses: ad-m/github-push-action@master
32+
with:
33+
github_token: ${{ secrets.GITHUB_TOKEN }}
34+
branch: main

Retrieval_based_Voice_Conversion_WebUI.ipynb

Lines changed: 59 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -58,7 +58,7 @@
5858
"source": [
5959
"#@title 克隆仓库\n",
6060
"\n",
61-
"!git clone --depth=1 https://github.com/fumiama/Retrieval-based-Voice-Conversion-WebUI\n",
61+
"!git clone --depth=1 -b stable https://github.com/fumiama/Retrieval-based-Voice-Conversion-WebUI\n",
6262
"%cd /content/Retrieval-based-Voice-Conversion-WebUI\n",
6363
"!mkdir -p pretrained uvr5_weights"
6464
],
@@ -80,11 +80,22 @@
8080
"execution_count": null,
8181
"outputs": []
8282
},
83+
{
84+
"cell_type": "code",
85+
"source": [
86+
"#@title 安装aria2\n",
87+
"!apt -y install -qq aria2"
88+
],
89+
"metadata": {
90+
"id": "pqE0PrnuRqI2"
91+
},
92+
"execution_count": null,
93+
"outputs": []
94+
},
8395
{
8496
"cell_type": "code",
8597
"source": [
8698
"#@title 下载底模\n",
87-
"!apt -y install -qq aria2\n",
8899
"!aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/pretrained/D32k.pth -d /content/Retrieval-based-Voice-Conversion-WebUI/pretrained -o D32k.pth\n",
89100
"!aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/pretrained/D40k.pth -d /content/Retrieval-based-Voice-Conversion-WebUI/pretrained -o D40k.pth\n",
90101
"!aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/pretrained/D48k.pth -d /content/Retrieval-based-Voice-Conversion-WebUI/pretrained -o D48k.pth\n",
@@ -96,15 +107,35 @@
96107
"!aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/pretrained/f0D48k.pth -d /content/Retrieval-based-Voice-Conversion-WebUI/pretrained -o f0D48k.pth\n",
97108
"!aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/pretrained/f0G32k.pth -d /content/Retrieval-based-Voice-Conversion-WebUI/pretrained -o f0G32k.pth\n",
98109
"!aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/pretrained/f0G40k.pth -d /content/Retrieval-based-Voice-Conversion-WebUI/pretrained -o f0G40k.pth\n",
99-
"!aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/pretrained/f0G48k.pth -d /content/Retrieval-based-Voice-Conversion-WebUI/pretrained -o f0G48k.pth\n",
100-
"\n",
110+
"!aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/pretrained/f0G48k.pth -d /content/Retrieval-based-Voice-Conversion-WebUI/pretrained -o f0G48k.pth"
111+
],
112+
"metadata": {
113+
"id": "UG3XpUwEomUz"
114+
},
115+
"execution_count": null,
116+
"outputs": []
117+
},
118+
{
119+
"cell_type": "code",
120+
"source": [
121+
"#@title 下载人声分离模型\n",
101122
"!aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/uvr5_weights/HP2-人声vocals+非人声instrumentals.pth -d /content/Retrieval-based-Voice-Conversion-WebUI/uvr5_weights -o HP2-人声vocals+非人声instrumentals.pth\n",
102-
"!aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/uvr5_weights/HP5-主旋律人声vocals+其他instrumentals.pth -d /content/Retrieval-based-Voice-Conversion-WebUI/uvr5_weights -o HP5-主旋律人声vocals+其他instrumentals.pth\n",
103-
"\n",
123+
"!aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/uvr5_weights/HP5-主旋律人声vocals+其他instrumentals.pth -d /content/Retrieval-based-Voice-Conversion-WebUI/uvr5_weights -o HP5-主旋律人声vocals+其他instrumentals.pth"
124+
],
125+
"metadata": {
126+
"id": "HugjmZqZRuiF"
127+
},
128+
"execution_count": null,
129+
"outputs": []
130+
},
131+
{
132+
"cell_type": "code",
133+
"source": [
134+
"#@title 下载hubert_base\n",
104135
"!aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/hubert_base.pt -d /content/Retrieval-based-Voice-Conversion-WebUI -o hubert_base.pt"
105136
],
106137
"metadata": {
107-
"id": "UG3XpUwEomUz"
138+
"id": "2RCaT9FTR0ej"
108139
},
109140
"execution_count": null,
110141
"outputs": []
@@ -223,8 +254,12 @@
223254
"#@title 手动预处理(不推荐)\n",
224255
"#@markdown 模型名\n",
225256
"MODELNAME = \"lulu\" #@param {type:\"string\"}\n",
257+
"#@markdown 采样率\n",
258+
"BITRATE = 48000 #@param {type:\"integer\"}\n",
259+
"#@markdown 使用的进程数\n",
260+
"THREADCOUNT = 8 #@param {type:\"integer\"}\n",
226261
"\n",
227-
"!python3 trainset_preprocess_pipeline_print.py /content/dataset 48000 8 logs/{MODELNAME} True\n"
262+
"!python3 trainset_preprocess_pipeline_print.py /content/dataset {BITRATE} {THREADCOUNT} logs/{MODELNAME} True\n"
228263
],
229264
"metadata": {
230265
"id": "ZKAyuKb9J6dz"
@@ -238,8 +273,14 @@
238273
"#@title 手动提取特征(不推荐)\n",
239274
"#@markdown 模型名\n",
240275
"MODELNAME = \"lulu\" #@param {type:\"string\"}\n",
276+
"#@markdown 使用的进程数\n",
277+
"THREADCOUNT = 8 #@param {type:\"integer\"}\n",
278+
"#@markdown 音高提取算法\n",
279+
"ALGO = \"harvest\" #@param {type:\"string\"}\n",
280+
"\n",
281+
"!python3 extract_f0_print.py logs/{MODELNAME} {THREADCOUNT} {ALGO}\n",
241282
"\n",
242-
"!python3 extract_feature_print.py 1 0 0 logs/{MODELNAME}\n"
283+
"!python3 extract_feature_print.py cpu 1 0 0 logs/{MODELNAME}\n"
243284
],
244285
"metadata": {
245286
"id": "CrxJqzAUKmPJ"
@@ -253,14 +294,22 @@
253294
"#@title 手动训练(不推荐)\n",
254295
"#@markdown 模型名\n",
255296
"MODELNAME = \"lulu\" #@param {type:\"string\"}\n",
297+
"#@markdown 使用的GPU\n",
298+
"USEGPU = \"0\" #@param {type:\"string\"}\n",
299+
"#@markdown 批大小\n",
300+
"BATCHSIZE = 32 #@param {type:\"integer\"}\n",
256301
"#@markdown 停止的epoch\n",
257302
"MODELEPOCH = 3200 #@param {type:\"integer\"}\n",
258303
"#@markdown 保存epoch间隔\n",
259304
"EPOCHSAVE = 100 #@param {type:\"integer\"}\n",
260305
"#@markdown 采样率\n",
261306
"MODELSAMPLE = \"48k\" #@param {type:\"string\"}\n",
307+
"#@markdown 是否缓存训练集\n",
308+
"CACHEDATA = 1 #@param {type:\"integer\"}\n",
309+
"#@markdown 是否仅保存最新的ckpt文件\n",
310+
"ONLYLATEST = 0 #@param {type:\"integer\"}\n",
262311
"\n",
263-
"!python3 train_nsf_sim_cache_sid_load_pretrain.py -e lulu -sr {MODELSAMPLE} -f0 1 -bs 32 -g 0 -te {MODELEPOCH} -se {EPOCHSAVE} -pg pretrained/f0G{MODELSAMPLE}.pth -pd pretrained/f0D{MODELSAMPLE}.pth -l 0 -c 1\n"
312+
"!python3 train_nsf_sim_cache_sid_load_pretrain.py -e lulu -sr {MODELSAMPLE} -f0 1 -bs {BATCHSIZE} -g {USEGPU} -te {MODELEPOCH} -se {EPOCHSAVE} -pg pretrained/f0G{MODELSAMPLE}.pth -pd pretrained/f0D{MODELSAMPLE}.pth -l {ONLYLATEST} -c {CACHEDATA}\n"
264313
],
265314
"metadata": {
266315
"id": "IMLPLKOaKj58"

gui.py

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -165,7 +165,7 @@ def launcher(self):
165165
layout=[
166166
[
167167
sg.Frame(title=i18n('加载模型'),layout=[
168-
[sg.Input(default_text='TEMP\\hubert_base.pt',key='hubert_path'),sg.FileBrowse(i18n('Hubert File'))],
168+
[sg.Input(default_text='TEMP\\hubert_base.pt',key='hubert_path'),sg.FileBrowse(i18n('Hubert模型'))],
169169
[sg.Input(default_text='TEMP\\atri.pth',key='pth_path'),sg.FileBrowse(i18n('选择.pth文件'))],
170170
[sg.Input(default_text='TEMP\\added_IVF512_Flat_atri_baseline_src_feat.index',key='index_path'),sg.FileBrowse(i18n('选择.index文件'))],
171171
[sg.Input(default_text='TEMP\\big_src_feature_atri.npy',key='npy_path'),sg.FileBrowse(i18n('选择.npy文件'))]
@@ -187,10 +187,10 @@ def launcher(self):
187187
[sg.Text(i18n("采样长度")),sg.Slider(range=(0.1,3.0),key='block_time',resolution=0.1,orientation='h',default_value=1.0)],
188188
[sg.Text(i18n("淡入淡出长度")),sg.Slider(range=(0.01,0.15),key='crossfade_length',resolution=0.01,orientation='h',default_value=0.08)],
189189
[sg.Text(i18n("额外推理时长")),sg.Slider(range=(0.05,3.00),key='extra_time',resolution=0.01,orientation='h',default_value=0.05)],
190-
[sg.Checkbox(i18n('Input Noisereduce'),key='I_noise_reduce'),sg.Checkbox(i18n('Output Noisereduce'),key='O_noise_reduce')]
190+
[sg.Checkbox(i18n('输入降噪'),key='I_noise_reduce'),sg.Checkbox(i18n('输出降噪'),key='O_noise_reduce')]
191191
],title=i18n("性能设置"))
192192
],
193-
[sg.Button(i18n("开始音频转换"),key='start_vc'),sg.Button(i18n("停止音频转换"),key='stop_vc'),sg.Text(i18n("Infer Time(ms):")),sg.Text("0",key='infer_time')]
193+
[sg.Button(i18n("开始音频转换"),key='start_vc'),sg.Button(i18n("停止音频转换"),key='stop_vc'),sg.Text(i18n("推理时间(ms):")),sg.Text("0",key='infer_time')]
194194
]
195195

196196
self.window=sg.Window("RVC - GUI",layout=layout)

locale/en_US.json

Lines changed: 5 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -76,6 +76,7 @@
7676
"点击查看交流、问题反馈群号": "Click to view the communication and problem feedback group number",
7777
"xxxxx": "xxxxx",
7878
"加载模型": "加载模型",
79+
"Hubert模型": "Hubert File",
7980
"选择.pth文件": "选择.pth文件",
8081
"选择.index文件": "选择.index文件",
8182
"选择.npy文件": "选择.npy文件",
@@ -88,8 +89,10 @@
8889
"采样长度": "采样长度",
8990
"淡入淡出长度": "淡入淡出长度",
9091
"额外推理时长": "额外推理时长",
91-
"输出降噪/Output Noisereduce": "输出降噪/Output Noisereduce",
92+
"输入降噪": "Input Noisereduce",
93+
"输出降噪": "Output Noisereduce",
9294
"性能设置": "性能设置",
9395
"开始音频转换": "开始音频转换",
94-
"停止音频转换": "停止音频转换"
96+
"停止音频转换": "停止音频转换",
97+
"推理时间(ms):": "Infer Time(ms):"
9598
}

locale/ja_JP.json

Lines changed: 7 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -36,7 +36,7 @@
3636
"请指定说话人id": "話者IDを指定してください",
3737
"处理数据": "データ処理",
3838
"step2b: 使用CPU提取音高(如果模型带音高), 使用GPU提取特征(选择卡号)": "ステップ2b: CPUを使用して音高を抽出する(モデルに音高がある場合)、GPUを使用して特徴を抽出する(カード番号を選択する)",
39-
"以-分隔输入使用的卡号, 例如 0-1-2 使用卡0和卡1和卡2": "ハイフンで区切って使用するカード番号を入力します。例えば0-1-2はカード0、カード1、カード2を使用します" ,
39+
"以-分隔输入使用的卡号, 例如 0-1-2 使用卡0和卡1和卡2": "ハイフンで区切って使用するカード番号を入力します。例えば0-1-2はカード0、カード1、カード2を使用します",
4040
"显卡信息": "カード情報",
4141
"提取音高使用的CPU进程数": "抽出に使用するCPUプロセス数",
4242
"选择音高提取算法:输入歌声可用pm提速,高质量语音但CPU差可用dio提速,harvest质量更好但慢": "音高抽出アルゴリズムの選択:歌声を入力する場合は、pmを使用して速度を上げることができます。CPUが低い場合はdioを使用して速度を上げることができます。harvestは品質が高く、精度が高いですが、遅いです。",
@@ -72,10 +72,11 @@
7272
"模型是否带音高指导,1是0否": "モデルに音高ガイドを付けるかどうか、1は付ける、0は付けない",
7373
"提取": "抽出",
7474
"招募音高曲线前端编辑器": "音高曲線フロントエンドエディターを募集",
75-
"加开发群联系我xxxxx": "開発グループに参加して私に連絡してくださいxxxxx" ,
75+
"加开发群联系我xxxxx": "開発グループに参加して私に連絡してくださいxxxxx",
7676
"点击查看交流、问题反馈群号": "クリックして交流、問題フィードバックグループ番号を表示",
7777
"xxxxx": "xxxxx",
7878
"加载模型": "モデルをロードする",
79+
"Hubert模型": "Hubert模型",
7980
"选择.pth文件": ".pthファイルを選択する",
8081
"选择.index文件": ".indexファイルを選択する",
8182
"选择.npy文件": ".npyファイルを選択する",
@@ -88,8 +89,10 @@
8889
"采样长度": "サンプル長",
8990
"淡入淡出长度": "フェードイン/フェードアウト長",
9091
"额外推理时长": "追加推論時間",
91-
"输出降噪/Output Noisereduce": "出力ノイズリダクション",
92+
"输入降噪": "输入降噪",
93+
"输出降噪": "输出降噪",
9294
"性能设置": "パフォーマンス設定",
9395
"开始音频转换": "音声変換を開始する",
94-
"停止音频转换": "音声変換を停止する"
96+
"停止音频转换": "音声変換を停止する",
97+
"推理时间(ms):": "推理时间(ms):"
9598
}

locale/zh_CN.json

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -76,7 +76,7 @@
7676
"点击查看交流、问题反馈群号": "点击查看交流、问题反馈群号",
7777
"xxxxx": "xxxxx",
7878
"加载模型": "加载模型",
79-
"Hubert File":"Hubert模型",
79+
"Hubert模型": "Hubert模型",
8080
"选择.pth文件": "选择.pth文件",
8181
"选择.index文件": "选择.index文件",
8282
"选择.npy文件": "选择.npy文件",
@@ -89,10 +89,10 @@
8989
"采样长度": "采样长度",
9090
"淡入淡出长度": "淡入淡出长度",
9191
"额外推理时长": "额外推理时长",
92-
"Input Noisereduce":"输入降噪",
93-
"Output Noisereduce": "输出降噪",
92+
"输入降噪": "输入降噪",
93+
"输出降噪": "输出降噪",
9494
"性能设置": "性能设置",
9595
"开始音频转换": "开始音频转换",
9696
"停止音频转换": "停止音频转换",
97-
"Infer Time(ms):":"推理时间(ms):"
97+
"推理时间(ms):": "推理时间(ms):"
9898
}

0 commit comments

Comments
 (0)