Kyu Jeong Han
2020 – today
- 2023
- [c38] Felix Wu, Kwangyoun Kim, Shinji Watanabe, Kyu Jeong Han, Ryan McDonald, Kilian Q. Weinberger, Yoav Artzi:
Wav2Seq: Pre-Training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages. ICASSP 2023: 1-5
- 2022
- [j6] Tae Jin Park, Naoyuki Kanda, Dimitrios Dimitriadis, Kyu Jeong Han, Shinji Watanabe, Shrikanth Narayanan:
A review of speaker diarization: Recent advances with deep learning. Comput. Speech Lang. 72: 101317 (2022)
- [c37] Felix Wu, Kwangyoun Kim, Jing Pan, Kyu Jeong Han, Kilian Q. Weinberger, Yoav Artzi:
Performance-Efficiency Trade-Offs in Unsupervised Pre-Training for Speech Recognition. ICASSP 2022: 7667-7671
- [c36] Jing Pan, Tao Lei, Kwangyoun Kim, Kyu Jeong Han, Shinji Watanabe:
SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition. ICASSP 2022: 7872-7876
- [c35] Suwon Shon, Ankita Pasad, Felix Wu, Pablo Brusco, Yoav Artzi, Karen Livescu, Kyu Jeong Han:
SLUE: New Benchmark Tasks For Spoken Language Understanding Evaluation on Natural Speech. ICASSP 2022: 7927-7931
- [c34] Ankita Pasad, Felix Wu, Suwon Shon, Karen Livescu, Kyu Jeong Han:
On the Use of External Data for Spoken Named Entity Recognition. NAACL-HLT 2022: 724-737
- [c33] Kwangyoun Kim, Felix Wu, Yifan Peng, Jing Pan, Prashant Sridhar, Kyu Jeong Han, Shinji Watanabe:
E-Branchformer: Branchformer with Enhanced Merging for Speech Recognition. SLT 2022: 84-91
- [i15] Felix Wu, Kwangyoun Kim, Shinji Watanabe, Kyu Jeong Han, Ryan McDonald, Kilian Q. Weinberger, Yoav Artzi:
Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages. CoRR abs/2205.01086 (2022)
- [i14] Kwangyoun Kim, Felix Wu, Yifan Peng, Jing Pan, Prashant Sridhar, Kyu Jeong Han, Shinji Watanabe:
E-Branchformer: Branchformer with Enhanced merging for speech recognition. CoRR abs/2210.00077 (2022)
- 2021
- [c32] Kyu Jeong Han, Jing Pan, Venkata Krishna Naveen Tadala, Tao Ma, Dan Povey:
Multistream CNN for Robust Acoustic Modeling. ICASSP 2021: 6873-6877
- [c31] Kwangyoun Kim, Felix Wu, Prashant Sridhar, Kyu Jeong Han, Shinji Watanabe:
Multi-Mode Transformer Transducer with Stochastic Future Context. Interspeech 2021: 1827-1831
- [c30] Suwon Shon, Pablo Brusco, Jing Pan, Kyu Jeong Han, Shinji Watanabe:
Leveraging Pre-Trained Language Model for Speech Sentiment Analysis. Interspeech 2021: 3420-3424
- [i13] Tae Jin Park, Naoyuki Kanda, Dimitrios Dimitriadis, Kyu Jeong Han, Shinji Watanabe, Shrikanth Narayanan:
A Review of Speaker Diarization: Recent Advances with Deep Learning. CoRR abs/2101.09624 (2021)
- [i12] Suwon Shon, Pablo Brusco, Jing Pan, Kyu Jeong Han, Shinji Watanabe:
Leveraging Pre-trained Language Model for Speech Sentiment Analysis. CoRR abs/2106.06598 (2021)
- [i11] Kwangyoun Kim, Felix Wu, Prashant Sridhar, Kyu Jeong Han, Shinji Watanabe:
Multi-mode Transformer Transducer with Stochastic Future Context. CoRR abs/2106.09760 (2021)
- [i10] Felix Wu, Kwangyoun Kim, Jing Pan, Kyu Jeong Han, Kilian Q. Weinberger, Yoav Artzi:
Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition. CoRR abs/2109.06870 (2021)
- [i9] Jing Pan, Tao Lei, Kwangyoun Kim, Kyu Jeong Han, Shinji Watanabe:
SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition. CoRR abs/2110.05571 (2021)
- [i8] Suwon Shon, Ankita Pasad, Felix Wu, Pablo Brusco, Yoav Artzi, Karen Livescu, Kyu Jeong Han:
SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech. CoRR abs/2111.10367 (2021)
- [i7] Ankita Pasad, Felix Wu, Suwon Shon, Karen Livescu, Kyu Jeong Han:
On the Use of External Data for Spoken Named Entity Recognition. CoRR abs/2112.07648 (2021)
- 2020
- [j5] Tae Jin Park, Kyu Jeong Han, Manoj Kumar, Shrikanth Narayanan:
Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap. IEEE Signal Process. Lett. 27: 381-385 (2020)
- [c29] Jing Pan, Joshua Shapiro, Jeremy Wohlwend, Kyu Jeong Han, Tao Lei, Tao Ma:
ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech Recognition. INTERSPEECH 2020: 16-20
- [i6] Tae Jin Park, Kyu Jeong Han, Manoj Kumar, Shrikanth Narayanan:
Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap. CoRR abs/2003.02405 (2020)
- [i5] Tae Jin Park, Kyu Jeong Han, Jing Huang, Xiaodong He, Bowen Zhou, Panayiotis G. Georgiou, Shrikanth Narayanan:
Speaker Diarization with Lexical Information. CoRR abs/2004.06756 (2020)
- [i4] Jing Pan, Joshua Shapiro, Jeremy Wohlwend, Kyu Jeong Han, Tao Lei, Tao Ma:
ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech Recognition. CoRR abs/2005.10469 (2020)
- [i3] Kyu Jeong Han, Jing Pan, Venkata Krishna Naveen Tadala, Tao Ma, Dan Povey:
Multistream CNN for Robust Acoustic Modeling. CoRR abs/2005.10470 (2020)
2010 – 2019
- 2019
- [c28] Kyu Jeong Han, Ramon Prieto, Tao Ma:
State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention with Dilated 1D Convolutions. ASRU 2019: 54-61
- [c27] Tae Jin Park, Kyu Jeong Han, Jing Huang, Xiaodong He, Bowen Zhou, Panayiotis G. Georgiou, Shrikanth Narayanan:
Speaker Diarization with Lexical Information. INTERSPEECH 2019: 391-395
- [c26] Kyu Jeong Han, Jing Huang, Yun Tang, Xiaodong He, Bowen Zhou:
Multi-Stride Self-Attention for Speech Recognition. INTERSPEECH 2019: 2788-2792
- [c25] Kyu Jeong Han, Ramon Prieto, Tao Ma:
Survey Talk: When Attention Meets Speech Applications: Speech & Speaker Recognition Perspective. INTERSPEECH 2019
- [i2] Kyu Jeong Han, Ramon Prieto, Kaixing Wu, Tao Ma:
State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D Convolutions. CoRR abs/1910.00716 (2019)
- 2018
- [c24] Kyu Jeong Han, Akshay Chandrashekaran, Jungsuk Kim, Ian R. Lane:
Densely Connected Networks for Conversational Speech Recognition. INTERSPEECH 2018: 796-800
- [i1] Kyu Jeong Han, Akshay Chandrashekaran, Jungsuk Kim, Ian R. Lane:
The CAPIO 2017 Conversational Speech Recognition System. CoRR abs/1801.00059 (2018)
- 2017
- [c23] Kyu Jeong Han, Seongjun Hahm, Byung-Hak Kim, Jungsuk Kim, Ian R. Lane:
Deep Learning-Based Telephony Speech Recognition in the Wild. INTERSPEECH 2017: 1323-1327
- 2016
- [c22] Wonkyum Lee, Kyu Jeong Han, Ian R. Lane:
Semi-Supervised Speaker Adaptation for In-Vehicle Speech Recognition with Deep Neural Networks. INTERSPEECH 2016: 3843-3847
- 2014
- [c21] Sriram Ganapathy, Kyu Jeong Han, Samuel Thomas, Mohamed Kamal Omar, Maarten Van Segbroeck, Shrikanth S. Narayanan:
Robust language identification using convolutional neural network features. INTERSPEECH 2014: 1846-1850
- 2013
- [j4] Ming Li, Kyu Jeong Han, Shrikanth S. Narayanan:
Automatic speaker age and gender recognition using acoustic and prosodic level information fusion. Comput. Speech Lang. 27(1): 151-167 (2013)
- [c20] Kyu Jeong Han, Sriram Ganapathy, Ming Li, Mohamed Kamal Omar, Shrikanth S. Narayanan:
TRAP language identification system for RATS phase II evaluation. INTERSPEECH 2013: 1502-1506
- 2012
- [c19] Kyu Jeong Han, Jason W. Pelecanos, Mohamed Kamal Omar:
Keyword-conditioned phone N-gram modeling with contextual information for speaker verification. ICASSP 2012: 4797-4800
- [c18] Kyu Jeong Han, Jason W. Pelecanos:
Frame-based phonotactic Language Identification. SLT 2012: 303-306
- 2011
- [c17] Kyu Jeong Han, Mohamed Kamal Omar, Jason W. Pelecanos, Cezar Pendus, Sibel Yaman, Weizhong Zhu:
Forensically inspired approaches to automatic speaker recognition. ICASSP 2011: 5160-5163
- 2010
- [j3] Dhaval Shah, Kyu Jeong Han, Shrikanth S. Narayanan:
Robust Multimodal Person Recognition Using Low-Complexity Audio-Visual Feature Fusion Approaches. Int. J. Semantic Comput. 4(2): 155-179 (2010)
- [j2] Viktor Rozgic, Kyu Jeong Han, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Multimodal Speaker Segmentation and Identification in Presence of Overlapped Speech Segments. J. Multim. 5(4): 322-331 (2010)
- [c16] Emily Mower, Kyu Jeong Han, Sungbok Lee, Shrikanth S. Narayanan:
A cluster-profile representation of emotion using agglomerative hierarchical clustering. INTERSPEECH 2010: 797-800
- [c15] Kyu Jeong Han, Shrikanth S. Narayanan:
An improved cluster model selection method for agglomerative hierarchical speaker clustering using incremental Gaussian mixture models. INTERSPEECH 2010: 2658-2661
- [c14] Chi-Sang Jung, Kyu Jeong Han, Hyunson Seo, Shrikanth S. Narayanan, Hong-Goo Kang:
A variable frame length and rate algorithm based on the spectral kurtosis measure for speaker verification. INTERSPEECH 2010: 2754-2757
- [c13] Ming Li, Chi-Sang Jung, Kyu Jeong Han:
Combining five acoustic level modeling methods for automatic speaker age and gender recognition. INTERSPEECH 2010: 2826-2829
2000 – 2009
- 2009
- [c12] Kyu Jeong Han, Shrikanth S. Narayanan:
Improved speaker diarization of meeting speech with recurrent selection of representative speech segments and participant interaction pattern modeling. INTERSPEECH 2009: 1067-1070
- [c11] Kyu Jeong Han, Shrikanth S. Narayanan:
Signature cluster model selection for incremental Gaussian mixture cluster modeling in agglomerative hierarchical speaker clustering. INTERSPEECH 2009: 2547-2550
- [c10] Dhaval Shah, Kyu Jeong Han, Shrikanth S. Narayanan:
A Low-Complexity Dynamic Face-Voice Feature Fusion Approach to Multimodal Person Recognition. ISM 2009: 24-31
- 2008
- [j1] Kyu Jeong Han, Samuel Kim, Shrikanth S. Narayanan:
Strategies to Improve the Robustness of Agglomerative Hierarchical Clustering Under Data Source Variation for Speaker Diarization. IEEE Trans. Speech Audio Process. 16(8): 1590-1601 (2008)
- [c9] Kyu Jeong Han, Shrikanth S. Narayanan:
Novel inter-cluster distance measure combining GLR and ICR for improved agglomerative hierarchical speaker clustering. ICASSP 2008: 4373-4376
- [c8] Kyu Jeong Han, Shrikanth S. Narayanan:
Agglomerative hierarchical speaker clustering using incremental Gaussian mixture cluster modeling. INTERSPEECH 2008: 20-23
- [c7] Viktor Rozgic, Kyu Jeong Han, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Multimodal Speaker Segmentation in Presence of Overlapped Speech Segments. ISM 2008: 679-684
- [c6] Kyu Jeong Han, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
The SAIL speaker diarization system for analysis of spontaneous meetings. MMSP 2008: 966-971
- 2007
- [c5] Kyu Jeong Han, Samuel Kim, Shrikanth S. Narayanan:
Robust speaker clustering strategies to data source variation for improved speaker diarization. ASRU 2007: 262-267
- [c4] Kyu Jeong Han, Shrikanth S. Narayanan:
A robust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system. INTERSPEECH 2007: 1853-1856
- 2004
- [c3] Naveen Srinivasamurthy, Kyu Jeong Han, Shrikanth S. Narayanan:
Robust speech recognition over packet networks: an overview. INTERSPEECH 2004: 621-624
- [c2] Kyu Jeong Han, Shrikanth S. Narayanan, Naveen Srinivasamurthy:
A distributed speech recognition system in multi-user environments. INTERSPEECH 2004: 2121-2124
- 2002
- [c1] Kyu Jeong Han, Jae Hong Lee:
Iterative decoding of a differential space-time block code with low complexity. VTC Spring 2002: 1322-1325
last updated on 2024-11-14 21:04 CET by the dblp team
all metadata released as open data under CC0 1.0 license