How to Use

If you're also struggling without a GPU, you might want to try the Featurize platform. Here's my invitation link.

Python Version

Python 3.10

pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu116

Python 3.7

pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu110

Clone the Repository

git clone https://github.com/CjangCjengh/vits.git

Choose Cleaners

The repository has been modified for training in Chinese, so you can skip this step if training in Chinese.

Fill in "text_cleaners" in config.json
Edit text/symbols.py
Remove unnecessary imports from text/cleaners.py

Install Dependencies

pip install -r requirements_py310.txt  # or requirements.txt

Create Dataset

Single Speaker

Set "n_speakers" to 0 in config.json.

Format:

path/to/XXX.wav|transcribed text

Example:

dataset/001.wav|こんにちは。

Multiple Speakers

Speaker IDs should start from 0.

Format:

path/to/XXX.wav|speaker ID|transcribed text

Example:

dataset/001.wav|0|こんにちは。

Preprocessing

If you have already completed this step, set "cleaned_text" to true in config.json.

# Single speaker
python preprocess.py --text_index 1 --filelists path/to/filelist_train.txt path/to/filelist_val.txt --text_cleaners chinese_cleaners

# Multiple speakers
python preprocess.py --text_index 2 --filelists path/to/filelist_train.txt path/to/filelist_val.txt --text_cleaners chinese_cleaners

Build Monotonic Alignment Search

cd monotonic_align
mkdir "monotonic_align"
python setup.py build_ext --inplace
cd ..

Training

# Single speaker
python train.py -c <config> -m <folder>

# Multiple speakers
python train_ms.py -c <config> -m <folder>

Inference

Online

See inference.ipynb

Offline

See MoeGoe

Running in Docker

docker run -itd --gpus all --name "container name" -e NVIDIA_DRIVER_CAPABILITIES=compute,utility -e NVIDIA_VISIBLE_DEVICES=all "image name"

Name		Name	Last commit message	Last commit date
Latest commit History 93 Commits
Libtorch C++ Infer		Libtorch C++ Infer
configs		configs
filelists		filelists
monotonic_align		monotonic_align
resources		resources
text		text
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
README_zh.md		README_zh.md
attentions.py		attentions.py
colab.ipynb		colab.ipynb
commons.py		commons.py
config_parameters.md		config_parameters.md
data_utils.py		data_utils.py
inference.ipynb		inference.ipynb
losses.py		losses.py
mel_processing.py		mel_processing.py
models.py		models.py
modules.py		modules.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
requirements_py310.txt		requirements_py310.txt
train.py		train.py
train_ms.py		train_ms.py
transforms.py		transforms.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

How to Use

Python Version

Python 3.10

Python 3.7

Clone the Repository

Choose Cleaners

Install Dependencies

Create Dataset

Single Speaker

Multiple Speakers

Preprocessing

Build Monotonic Alignment Search

Training

Inference

Online

Offline

Running in Docker

About

Uh oh!

Releases

Packages

Languages

License

LIEGU0317/vits

Folders and files

Latest commit

History

Repository files navigation

How to Use

Python Version

Python 3.10

Python 3.7

Clone the Repository

Choose Cleaners

Install Dependencies

Create Dataset

Single Speaker

Multiple Speakers

Preprocessing

Build Monotonic Alignment Search

Training

Inference

Online

Offline

Running in Docker

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages