Voice Datasets

This directory contains the audio datasets for training custom RVC models.

Structure

Each subdirectory corresponds to a specific voice type:

Collect Audio: Gather 10-15 minutes of clean, single-speaker audio for the desired category.
Place Files: Put the raw audio files (mp3, wav, etc.) into a temporary folder or directly here.
Process: Use the provided tool to normalize and split the audio.

# Example: Processing a raw file into the male_low dataset
python tools/audio_preprocessor.py -i raw_audio/my_voice.mp3 -o datasets/male_low

Format: WAV (will be converted automatically)
Sample Rate: 40kHz or 48kHz (will be converted automatically)
Channels: Mono (will be converted automatically)
Quality: No background noise, music, or reverb. Use UVR5 to clean if necessary.