Takuhiro Kaneko
2020 – today
- 2024
- [j8] Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Shogo Seki:
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion With Annealed Langevin Dynamics. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2213-2226 (2024)
- [c44] Takuhiro Kaneko:
Improving Physics-Augmented Continuum Neural Radiance Field-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization. CVPR 2024: 5470-5480
- [c43] Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko, Noboru Harada:
Learning to Assess Subjective Impressions from Speech. EUSIPCO 2024: 381-385
- [c42] Yuto Kondo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko:
Selecting N-Lowest Scores for Training MOS Prediction Models. ICASSP 2024: 1451-1455
- [c41] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka:
Training Generative Adversarial Network-Based Vocoder with Limited Data Using Augmentation-Conditional Discriminator. ICASSP 2024: 12561-12565
- [i33] Shogo Sato, Takuhiro Kaneko, Kazuhiko Murasaki, Taiga Yoshida, Ryuichi Tanida, Akisato Kimura:
Unsupervised Intrinsic Image Decomposition with LiDAR Intensity Enhanced Training. CoRR abs/2403.14089 (2024)
- [i32] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka:
Training Generative Adversarial Network-Based Vocoder with Limited Data Using Augmentation-Conditional Discriminator. CoRR abs/2403.16464 (2024)
- [i31] Takuhiro Kaneko:
Improving Physics-Augmented Continuum Neural Radiance Field-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization. CoRR abs/2406.04155 (2024)
- [i30] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Yuto Kondo:
FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation. CoRR abs/2409.02245 (2024)
- 2023
- [j7] Shogo Seki, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka:
Non-Parallel Whisper-to-Normal Speaking Style Conversion Using Auxiliary Classifier Variational Autoencoder. IEEE Access 11: 44590-44599 (2023)
- [c40] Shogo Sato, Yasuhiro Yao, Taiga Yoshida, Takuhiro Kaneko, Shingo Ando, Jun Shimamura:
Unsupervised Intrinsic Image Decomposition with LiDAR Intensity. CVPR 2023: 13466-13475
- [c39] Shogo Seki, Kanami Imamura, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Noboru Harada:
W2N-AVSC: Audiovisual Extension For Whisper-To-Normal Speech Conversion. EUSIPCO 2023: 296-300
- [c38] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki:
Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis. ICASSP 2023: 1-5
- [c37] Shogo Seki, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko:
JSV-VC: Jointly Trained Speaker Verification and Voice Conversion Models. ICASSP 2023: 1-5
- [c36] Takuhiro Kaneko:
MIMO-NeRF: Fast Neural Rendering with Multi-input Multi-output Neural Radiance Fields. ICCV 2023: 3250-3260
- [c35] Ayaka Ideno, Takuhiro Kaneko, Tatsuya Harada:
Frame-Level Event Representation Learning for Semantic-Level Generation and Editing of Avatar Motion. ICMI 2023: 292-300
- [c34] Kou Tanaka, Takuhiro Kaneko, Hirokazu Kameoka, Shogo Seki:
CFVC: Conditional Filtering for Controllable Voice Conversion. INTERSPEECH 2023: 2058-2062
- [c33] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki:
iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN. INTERSPEECH 2023: 4369-4373
- [c32] Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko:
PRVAE-VC: Non-Parallel Many-to-Many Voice Conversion with Perturbation-Resistant Variational Autoencoder. SSW 2023: 88-93
- [i29] Shogo Sato, Yasuhiro Yao, Taiga Yoshida, Takuhiro Kaneko, Shingo Ando, Jun Shimamura:
Unsupervised Intrinsic Image Decomposition with LiDAR Intensity. CoRR abs/2303.10820 (2023)
- [i28] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki:
Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis. CoRR abs/2303.13909 (2023)
- [i27] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki:
iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN. CoRR abs/2308.07117 (2023)
- [i26] Takuhiro Kaneko:
MIMO-NeRF: Fast Neural Rendering with Multi-input Multi-output Neural Radiance Fields. CoRR abs/2310.01821 (2023)
- 2022
- [c31] Takuhiro Kaneko:
AR-NeRF: Unsupervised Learning of Depth and Defocus Effects from Natural Images with Aperture Rendering Neural Radiance Fields. CVPR 2022: 18366-18376
- [c30] Takuhiro Kaneko, Kou Tanaka, Hirokazu Kameoka, Shogo Seki:
ISTFTNET: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform. ICASSP 2022: 6207-6211
- [c29] Hirokazu Kameoka, Takuhiro Kaneko, Shogo Seki, Kou Tanaka:
CAUSE: Crossmodal Action Unit Sequence Estimation from Speech. INTERSPEECH 2022: 506-510
- [c28] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki:
MISRNet: Lightweight Neural Vocoder Using Multi-Input Single Shared Residual Blocks. INTERSPEECH 2022: 1631-1635
- [c27] Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Shogo Seki:
Distilling Sequence-to-Sequence Voice Conversion Models for Streaming Conversion Applications. SLT 2022: 1022-1028
- [i25] Takuhiro Kaneko, Kou Tanaka, Hirokazu Kameoka, Shogo Seki:
iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform. CoRR abs/2203.02395 (2022)
- [i24] Takuhiro Kaneko:
AR-NeRF: Unsupervised Learning of Depth and Defocus Effects from Natural Images with Aperture Rendering Neural Radiance Fields. CoRR abs/2206.06100 (2022)
- 2021
- [j6] Hirokazu Kameoka, Wen-Chin Huang, Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo, Tomoki Toda:
Many-to-Many Voice Transformer Network. IEEE ACM Trans. Audio Speech Lang. Process. 29: 656-670 (2021)
- [c26] Takuhiro Kaneko, Tatsuya Harada:
Blur, Noise, and Compression Robust Generative Adversarial Networks. CVPR 2021: 13579-13589
- [c25] Takuhiro Kaneko:
Unsupervised Learning of Depth and Depth-of-Field Effect From Natural Images With Aperture Rendering Generative Adversarial Networks. CVPR 2021: 15679-15688
- [c24] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
Maskcyclegan-VC: Learning Non-Parallel Voice Conversion with Filling in Frames. ICASSP 2021: 5919-5923
- [i23] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames. CoRR abs/2102.12841 (2021)
- [i22] Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko:
FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion. CoRR abs/2104.06900 (2021)
- [i21] Takuhiro Kaneko:
Unsupervised Learning of Depth and Depth-of-Field Effect from Natural Images with Aperture Rendering Generative Adversarial Networks. CoRR abs/2106.13041 (2021)
- 2020
- [j5] Hirokazu Kameoka, Kou Tanaka, Damian Kwasny, Takuhiro Kaneko, Nobukatsu Hojo:
ConvS2S-VC: Fully Convolutional Sequence-to-Sequence Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1849-1863 (2020)
- [j4] Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo:
Nonparallel Voice Conversion With Augmented Classifier Star Generative Adversarial Networks. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2982-2995 (2020)
- [c23] Takuhiro Kaneko, Tatsuya Harada:
Noise Robust Generative Adversarial Networks. CVPR 2020: 8401-8411
- [c22] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-Spectrogram Conversion. INTERSPEECH 2020: 2017-2021
- [i20] Takuhiro Kaneko, Tatsuya Harada:
Blur, Noise, and Compression Robust Generative Adversarial Networks. CoRR abs/2003.07849 (2020)
- [i19] Hirokazu Kameoka, Wen-Chin Huang, Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo, Tomoki Toda:
Many-to-Many Voice Transformer Network. CoRR abs/2005.08445 (2020)
- [i18] Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Shogo Seki:
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics. CoRR abs/2010.02977 (2020)
- [i17] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion. CoRR abs/2010.11672 (2020)
2010 – 2019
- 2019
- [j3] Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo:
ACVAE-VC: Non-Parallel Voice Conversion With Auxiliary Classifier Variational Autoencoder. IEEE ACM Trans. Audio Speech Lang. Process. 27(9): 1432-1443 (2019)
- [c21] Takuhiro Kaneko, Yoshitaka Ushiku, Tatsuya Harada:
Class-Distinct and Class-Mutual Image Generation with GANs. BMVC 2019: 16
- [c20] Takuhiro Kaneko, Yoshitaka Ushiku, Tatsuya Harada:
Label-Noise Robust Generative Adversarial Networks. CVPR 2019: 2467-2476
- [c19] Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Nobukatsu Hojo:
ATTS2S-VC: Sequence-to-sequence Voice Conversion with Attention and Context Preservation Mechanisms. ICASSP 2019: 6805-6809
- [c18] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
Cyclegan-VC2: Improved Cyclegan-based Non-parallel Voice Conversion. ICASSP 2019: 6820-6824
- [c17] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
StarGAN-VC2: Rethinking Conditional Methods for StarGAN-Based Voice Conversion. INTERSPEECH 2019: 679-683
- [i16] Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Nobukatsu Hojo:
WaveCycleGAN2: Time-domain Neural Post-filter for Speech Waveform Generation. CoRR abs/1904.02892 (2019)
- [i15] Hirokazu Kameoka, Kou Tanaka, Aaron Valero Puche, Yasunori Ohishi, Takuhiro Kaneko:
Crossmodal Voice Conversion. CoRR abs/1904.04540 (2019)
- [i14] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion. CoRR abs/1904.04631 (2019)
- [i13] Takuhiro Kaneko, Tatsuya Harada:
Label-Noise Robust Multi-Domain Image-to-Image Translation. CoRR abs/1905.02185 (2019)
- [i12] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
StarGAN-VC2: Rethinking Conditional Methods for StarGAN-Based Voice Conversion. CoRR abs/1907.12279 (2019)
- [i11] Takuhiro Kaneko, Tatsuya Harada:
Noise Robust Generative Adversarial Networks. CoRR abs/1911.11776 (2019)
- 2018
- [c16] Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino:
Generative Adversarial Image Synthesis With Decision Tree Latent Controller. CVPR 2018: 6606-6615
- [c15] Takuhiro Kaneko, Hirokazu Kameoka:
CycleGAN-VC: Non-parallel Voice Conversion Using Cycle-Consistent Adversarial Networks. EUSIPCO 2018: 2100-2104
- [c14] Nobukatsu Hojo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko:
Automatic Speech Pronunciation Correction with Dynamic Frequency Warping-Based Spectral Conversion. EUSIPCO 2018: 2310-2314
- [c13] Keisuke Oyamada, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Hiroyasu Ando:
Generative adversarial network-based approach to signal reconstruction from magnitude spectrogram. EUSIPCO 2018: 2514-2518
- [c12] Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo:
StarGAN-VC: non-parallel many-to-many Voice Conversion Using Star Generative Adversarial Networks. SLT 2018: 266-273
- [c11] Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo, Hirokazu Kameoka:
Synthetic-to-Natural Speech Waveform Conversion Using Cycle-Consistent Adversarial Networks. SLT 2018: 632-639
- [i10] Keisuke Oyamada, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Hiroyasu Ando:
Generative adversarial network-based approach to signal reconstruction from magnitude spectrograms. CoRR abs/1804.02181 (2018)
- [i9] Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino:
Generative Adversarial Image Synthesis with Decision Tree Latent Controller. CoRR abs/1805.10603 (2018)
- [i8] Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo:
StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks. CoRR abs/1806.02169 (2018)
- [i7] Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo:
ACVAE-VC: Non-parallel many-to-many voice conversion with auxiliary classifier variational autoencoder. CoRR abs/1808.05092 (2018)
- [i6] Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo, Hirokazu Kameoka:
WaveCycleGAN: Synthetic-to-natural speech waveform conversion using cycle-consistent adversarial networks. CoRR abs/1809.10288 (2018)
- [i5] Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo:
ConvS2S-VC: Fully convolutional sequence-to-sequence voice conversion. CoRR abs/1811.01609 (2018)
- [i4] Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Nobukatsu Hojo:
AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms. CoRR abs/1811.04076 (2018)
- [i3] Takuhiro Kaneko, Yoshitaka Ushiku, Tatsuya Harada:
Class-Distinct and Class-Mutual Image Generation with GANs. CoRR abs/1811.11163 (2018)
- [i2] Takuhiro Kaneko, Yoshitaka Ushiku, Tatsuya Harada:
Label-Noise Robust Generative Adversarial Networks. CoRR abs/1811.11165 (2018)
- 2017
- [c10] Keisuke Oyamada, Hirokazu Kameoka, Takuhiro Kaneko, Hiroyasu Ando, Kaoru Hiramatsu, Kunio Kashino:
Non-native speech conversion with consistency-aware recursive network and generative adversarial network. APSIPA 2017: 182-188
- [c9] Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino:
Generative Attribute Controller with Conditional Filtered Generative Adversarial Networks. CVPR 2017: 7006-7015
- [c8] Takuhiro Kaneko, Hirokazu Kameoka, Nobukatsu Hojo, Yusuke Ijima, Kaoru Hiramatsu, Kunio Kashino:
Generative adversarial network-based postfilter for statistical parametric speech synthesis. ICASSP 2017: 4910-4914
- [c7] Takuhiro Kaneko, Hirokazu Kameoka, Kaoru Hiramatsu, Kunio Kashino:
Sequence-to-Sequence Voice Conversion with Similarity Metric Learned Using Generative Adversarial Networks. INTERSPEECH 2017: 1283-1287
- [c6] Takuhiro Kaneko, Shinji Takaki, Hirokazu Kameoka, Junichi Yamagishi:
Generative Adversarial Network-Based Postfilter for STFT Spectrograms. INTERSPEECH 2017: 3389-3393
- [i1] Takuhiro Kaneko, Hirokazu Kameoka:
Parallel-Data-Free Voice Conversion Using Cycle-Consistent Adversarial Networks. CoRR abs/1711.11293 (2017)
- 2016
- [j2] Shigeyuki Odashima, Masamichi Shimosaka, Takuhiro Kaneko, Rui Fukui, Tomomasa Sato:
Collective activity localization by spatiality preservation search. Adv. Robotics 30(11-12): 784-794 (2016)
- [c5] Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino:
Adaptive Visual Feedback Generation for Facial Expression Improvement with Multi-task Deep Neural Networks. ACM Multimedia 2016: 327-331
- 2014
- [j1] Takuhiro Kaneko, Masamichi Shimosaka, Shigeyuki Odashima, Rui Fukui, Tomomasa Sato:
A fully connected model for consistent collective activity recognition in videos. Pattern Recognit. Lett. 43: 109-118 (2014)
- [c4] Masamichi Shimosaka, Takuhiro Kaneko, Kentaro Nishi:
Modeling risk anticipation and defensive driving on residential roads with inverse reinforcement learning. ITSC 2014: 1694-1700
- 2012
- [c3] Shigeyuki Odashima, Masamichi Shimosaka, Takuhiro Kaneko, Rui Fukui, Tomomasa Sato:
Collective Activity Localization with Contextual Spatial Pyramid. ECCV Workshops (3) 2012: 243-252
- [c2] Takuhiro Kaneko, Masamichi Shimosaka, Shigeyuki Odashima, Rui Fukui, Tomomasa Sato:
Viewpoint Invariant Collective Activity Recognition with Relative Action Context. ECCV Workshops (3) 2012: 253-262
- [c1] Takuhiro Kaneko, Masamichi Shimosaka, Shigeyuki Odashima, Rui Fukui, Tomomasa Sato:
Consistent collective activity recognition with fully connected CRFs. ICPR 2012: 2792-2795
last updated on 2024-11-07 21:36 CET by the dblp team
all metadata released as open data under CC0 1.0 license