8000 GitHub - kkorolev1/nv_dla: Implementation of a HiFiGAN
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

kkorolev1/nv_dla

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NV HW 4

Implementation of a HiFiGAN

WanDB Report

See the results at the end of this README.

Checkpoints

Installation guide

pip install -r ./requirements.txt

To reproduce training download LJSpeech.

sh scripts/download_data.sh

Configs can be found in hw_tts/configs folder. In particular, for testing use config_server.json.

Training

One can redefine parameters which are set within config by passing them in terminal via flags.

python train.py -c CONFIG -r CHECKPOINT -k WANDB_KEY --wandb_run_name WANDB_RUN_NAME --n_gpu NUM_GPU --batch_size BATCH_SIZE --len_epoch ITERS_PER_EPOCH --data_path PATH_TO_WAVS

Testing

python test.py -c hw_tts/configs/config_server.json -r CHECKPOINT -t test_audio -o output_audio
  • test_audio is a directory with 3 wavs for evaluation.
  • output_audio is a directory to save the result.

Results

Generation of these 3 sentences. Filename corresponds to the order of a sentence.

A defibrillator is a device that gives a high energy electric shock to the heart of someone who is in cardiac arrest

Massachusetts Institute of Technology may be best known for its math, science and engineering education

Wasserstein distance or Kantorovich Rubinstein metric is a distance function defined between probability distributions on a given metric space

audio_1.mp4
audio_2.mp4
audio_3.mp4

About

Implementation of a HiFiGAN

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
0