-
fiftyone Public
Forked from voxel51/fiftyone
The open-source tool for building high-quality datasets and computer vision models
Python Apache License 2.0 Updated Feb 23, 2023 -
clearml-server Public
Forked from clearml/clearml-server
ClearML - Auto-Magical Suite of tools to streamline your ML workflow. Experiment Manager, ML-Ops and Data-Management
Python Other Updated Jan 24, 2023 -
DeepLearning-500-questions Public
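Forked from scutan90/DeepLearning-500-questions
500 Questions on Deep Learning: a question-and-answer treatment of frequently asked topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas, written to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out any shortcomings. To be continued... For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be pursued. Tan 2018.06
JavaScript GNU General Public License v3.0 Updated Jul 12, 2021 -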
FaceSwap-1 Public
Forked from MarekKowalski/FaceSwap
3D face swapping implemented in Python
Python MIT License Updated Apr 14, 2021 -
espnet Public
Forked from espnet/espnet
End-to-End Speech Processing Toolkit
Python Apache License 2.0 Updated Dec 14, 2020 -
TensorFlowASR Public
Forked from TensorSpeech/TensorFlowASR
⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Python Apache License 2.0 Updated Dec 12, 2020 -
TensorFlowTTS Public
Forked from TensorSpeech/TensorFlowTTS
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese, German and Easy to adapt for other languages)
Python Apache License 2.0 Updated Dec 10, 2020 -
voice_activity_detection Public
Forked from filippogiruzzi/voice_activity_detection
Voice Activity Detection based on Deep Learning & TensorFlow
Python Updated Dec 3, 2020 -
asteroid Public
Forked from asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
Python MIT License Updated Nov 30, 2020 -
kaldi Public
Forked from kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Shell Other Updated Nov 26, 2020 -
fairseq Public
Forked from facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python MIT License Updated Nov 24, 2020 -
DeepSpeech Public
Forked from mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
C++ Mozilla Public License 2.0 Updated Nov 20, 2020 -
wer_are_we Public
Forked from syhw/wer_are_we
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
Updated Nov 17, 2020 -
tacotron2 Public
Forked from NVIDIA/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Jupyter Notebook BSD 3-Clause "New" or "Revised" License Updated Nov 13, 2020 -
text-detection-ctpn Public
Forked from eragonruan/text-detection-ctpn
Text detection mainly based on the CTPN (connectionist text proposal network) model in TensorFlow; ID card detection
Python MIT License Updated Nov 13, 2020 -
rnnt-speech-recognition Public
Forked from noahchalifour/rnnt-speech-recognition
End-to-end speech recognition using RNN Transducers in Tensorflow 2.0
Python MIT License Updated Nov 13, 2020 -
Real-Time-Voice-Cloning Public
Forked from CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Python Other Updated Nov 4, 2020 -
ParallelWaveGAN Public
Forked from kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Jupyter Notebook MIT License Updated Nov 2, 2020 -
tensorpack Public
Forked from tensorpack/tensorpack
A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility
Python Apache License 2.0 Updated Nov 1, 2020 -
AudioSignalProcessingForML Public
Forked from musikalkemist/AudioSignalProcessingForML
Code and slides of my YouTube series called "Audio Signal Processing for Machine Learning"
Jupyter Notebook MIT License Updated Oct 31, 2020 -
docs Public
Forked from tensorflow/docs
TensorFlow documentation
Jupyter Notebook Apache License 2.0 Updated Oct 23, 2020 -
OpenSeq2Seq Public
Forked from NVIDIA/OpenSeq2Seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Python Apache License 2.0 Updated Oct 19, 2020 -
EAST Public
Forked from argman/EAST
A tensorflow implementation of EAST text detector
C++ GNU General Public License v3.0 Updated Oct 8, 2020 -
openpilot Public
Forked from commaai/openpilot
openpilot is an open source driver assistance system. openpilot performs the functions of Automated Lane Centering and Adaptive Cruise Control for over 85 supported car makes and models.
C++ MIT License Updated Oct 6, 2020 -
transformer-tensorflow Public
Forked from lilianweng/transformer-tensorflow
Implementation of Transformer Model in Tensorflow
Python Updated Sep 30, 2020 -
warp-ctc Public
Forked from baidu-research/warp-ctc
Fast parallel CTC.
Cuda Apache License 2.0 Updated Sep 18, 2020 -
faceswap-GAN Public
Forked from shaoanlu/faceswap-GAN
A denoising autoencoder + adversarial losses and attention mechanisms for face swapping.
Jupyter Notebook Updated Sep 12, 2020 -
rnnoise Public
Forked from xiph/rnnoise
Recurrent neural network for audio noise reduction
C BSD 3-Clause "New" or "Revised" License Updated Sep 9, 2020