More Web Proxy on the site http://driver.im/

default search action

combined dblp search
author search
venue search
publication search

ask others

Shogo Seki

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/KameokaKTHS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/KameokaKTHS24
Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Shogo Seki:
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion With Annealed Langevin Dynamics. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2213-2226 (2024)
[c25]
- view
  - electronic edition @ ieee.org
  - no references & citations available
- export record
  dblp key:
  - conf/eusipco/AyanoLSK24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/AyanoLSK24
Shoma Ayano, Li Li, Shogo Seki, Daichi Kitamura:
Audio Spotforming Using Nonnegative Tensor Factorization with Attractor-Based Regularization. EUSIPCO 2024: 121-125
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0063S24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0063S24
Li Li, Shogo Seki:
Remixed2remixed: Domain Adaptation for Speech Enhancement by Noise2noise Learning with Remixing. ICASSP 2024: 806-810
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-13982
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-13982
Li Li, Shogo Seki:
Improved Remixing Process for Domain Adaptation-Based Speech Enhancement by Mitigating Data Imbalance in Signal-to-Noise Ratio. CoRR abs/2406.13982 (2024)
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-08951
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-08951
Shoma Ayano, Li Li, Shogo Seki, Daichi Kitamura:
Audio Spotforming Using Nonnegative Tensor Factorization with Attractor-Based Regularization. CoRR abs/2407.08951 (2024)
2023
[j3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/access/SekiKKT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/SekiKKT23
Shogo Seki, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka:
Non-Parallel Whisper-to-Normal Speaking Style Conversion Using Auxiliary Classifier Variational Autoencoder. IEEE Access 11: 44590-44599 (2023)
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/SekiIKKTH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/SekiIKKTH23
Shogo Seki, Kanami Imamura, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Noboru Harada:
W2N-AVSC: Audiovisual Extension For Whisper-To-Normal Speech Conversion. EUSIPCO 2023: 296-300
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KanekoKTS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KanekoKTS23
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki:
Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis. ICASSP 2023: 1-5
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SekiKTK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SekiKTK23
Shogo Seki, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko:
JSV-VC: Jointly Trained Speaker Verification and Voice Conversion Models. ICASSP 2023: 1-5
[c20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TanakaKKS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TanakaKKS23
Kou Tanaka, Takuhiro Kaneko, Hirokazu Kameoka, Shogo Seki:
CFVC: Conditional Filtering for Controllable Voice Conversion. INTERSPEECH 2023: 2058-2062
[c19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KanekoKTS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KanekoKTS23
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki:
iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN. INTERSPEECH 2023: 4369-4373
[i6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-13909
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-13909
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki:
Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis. CoRR abs/2303.13909 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-07117
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-07117
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki:
iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN. CoRR abs/2308.07117 (2023)
[i4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-16836
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-16836
Li Li, Shogo Seki:
Remixed2Remixed: Domain adaptation for speech enhancement by Noise2Noise learning with Remixing. CoRR abs/2312.16836 (2023)
2022
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SekiKL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SekiKL22
Shogo Seki, Hirokazu Kameoka, Li Li:
Investigation And Comparison of Optimization Methods for Variational Autoencoder-Based Underdetermined Multichannel Source Separation. ICASSP 2022: 511-515
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiKS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiKS22
Li Li, Hirokazu Kameoka, Shogo Seki:
HBP: An Efficient Block Permutation Solver Using Hungarian Algorithm and Spectrogram Inpainting for Multichannel Audio Source Separation. ICASSP 2022: 516-520
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KameokaSLW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KameokaSLW22
Hirokazu Kameoka, Shogo Seki, Li Li, Chihiro Watanabe:
Attentionpit: Soft Permutation Invariant Training for Audio Source Separation with Attention Mechanism. ICASSP 2022: 706-710
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KanekoTKS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KanekoTKS22
Takuhiro Kaneko, Kou Tanaka, Hirokazu Kameoka, Shogo Seki:
ISTFTNET: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform. ICASSP 2022: 6207-6211
[c14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KameokaKST22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KameokaKST22
Hirokazu Kameoka, Takuhiro Kaneko, Shogo Seki, Kou Tanaka:
CAUSE: Crossmodal Action Unit Sequence Estimation from Speech. INTERSPEECH 2022: 506-510
[c13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KanekoKTS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KanekoKTS22
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki:
MISRNet: Lightweight Neural Vocoder Using Multi-Input Single Shared Residual Blocks. INTERSPEECH 2022: 1631-1635
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/TanakaKKS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/TanakaKKS22
Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Shogo Seki:
Distilling Sequence-to-Sequence Voice Conversion Models for Streaming Conversion Applications. SLT 2022: 1022-1028
[i3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-02395
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-02395
Takuhiro Kaneko, Kou Tanaka, Hirokazu Kameoka, Shogo Seki:
iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform. CoRR abs/2203.02395 (2022)
2021
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/SekiTT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/SekiTT21
Shogo Seki, Haruka Taga, Tomoki Toda:
Singing Fundamental Frequency Contour Generation Using Generalized Command-Response Model and Score-Conditional Variational Autoencoder. MLSP 2021: 1-3
2020
[b1]
- view
- export record
  dblp key:
  - phd/jp/Seki20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/jp/Seki20
Shogo Seki:
A Study on Utilization of Prior Knowledge in Underdetermined Source Separation and Its Application. Nagoya University, Japan, 2020
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/TakadaSTT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/TakadaSTT20
Moe Takada, Shogo Seki, Patrick Lumban Tobing, Tomoki Toda:
Semi-Supervised Enhancement and Suppression of Self-Produced Speech Using Correspondence between Air- and Body-Conducted Signals. EUSIPCO 2020: 456-460
[c9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SekiTT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SekiTT20
Shogo Seki, Moe Takada, Tomoki Toda:
Semi-Supervised Self-Produced Speech Enhancement and Suppression Based on Joint Source Modeling of Air- and Body-Conducted Signals Using Variational Autoencoder. INTERSPEECH 2020: 4039-4043
[c8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HikosakaSHKTBT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HikosakaSHKTBT20
Shu Hikosaka, Shogo Seki, Tomoki Hayashi, Kazuhiro Kobayashi, Kazuya Takeda, Hideki Banno, Tomoki Toda:
Intelligibility Enhancement Based on Speech Waveform Modification Using Hearing Impairment. INTERSPEECH 2020: 4059-4063
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-02977
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-02977
Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Shogo Seki:
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics. CoRR abs/2010.02977 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/access/SekiKLTT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/SekiKLTT19
Shogo Seki, Hirokazu Kameoka, Li Li, Tomoki Toda, Kazuya Takeda:
Underdetermined Source Separation Based on Generalized Multichannel Variational Autoencoder. IEEE Access 7: 168104-168115 (2019)
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/SekiKLTT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/SekiKLTT19
Shogo Seki, Hirokazu Kameoka, Li Li, Tomoki Toda, Kazuya Takeda:
Generalized Multichannel Variational Autoencoder for Underdetermined Source Separation. EUSIPCO 2019: 1-5
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/InoueKLSM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/InoueKLSM19
Shota Inoue, Hirokazu Kameoka, Li Li, Shogo Seki, Shoji Makino:
Joint Separation and Dereverberation of Reverberant Mixtures with Multichannel Variational Autoencoder. ICASSP 2019: 96-100
2018
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/SekiTT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/SekiTT18
Shogo Seki, Tomoki Toda, Kazuya Takeda:
Stereophonic Music Separation Based on Non-Negative Tensor Factorization with Cepstral Distance Regularization. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 101-A(7): 1057-1064 (2018)
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/TakadaST18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/TakadaST18
Moe Takada, Shogo Seki, Tomoki Toda:
Self-Produced Speech Enhancement and Suppression Method using Air- and Body-Conductive Microphones. APSIPA 2018: 1240-1245
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1810-00223
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-00223
Shogo Seki, Hirokazu Kameoka, Li Li, Tomoki Toda, Kazuya Takeda:
Generalized Multichannel Variational Autoencoder for Underdetermined Source Separation. CoRR abs/1810.00223 (2018)
2017
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/SekiTT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/SekiTT17
Shogo Seki, Tomoki Toda, Kazuya Takeda:
Stereophonic music separation based on non-negative tensor factorization with cepstrum regularization. EUSIPCO 2017: 981-985
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/SekiKTT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/SekiKTT17
Shogo Seki, Hirokazu Kameoka, Tomoki Toda, Kazuya Takeda:
Missing component restoration for masked speech signals based on time-domain spectrogram factorization. MLSP 2017: 1-6
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/sca/SekiI17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sca/SekiI17
Shogo Seki, Takeo Igarashi:
Sketch-based 3D hair posing by contour drawings. Symposium on Computer Animation 2017: 29:1-29:2
2016
[c1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OgawaSKDYNT16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OgawaSKDYNT16
Atsunori Ogawa, Shogo Seki, Keisuke Kinoshita, Marc Delcroix, Takuya Yoshioka, Tomohiro Nakatani, Kazuya Takeda:
Robust Example Search Using Bottleneck Features for Example-Based Speech Enhancement. INTERSPEECH 2016: 3733-3737

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.