default search action
Heinrich Dinkel
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c28]Heinrich Dinkel, Yongqing Wang, Zhiyong Yan, Junbo Zhang, Yujun Wang:
CED: Consistent Ensemble Distillation for Audio Tagging. ICASSP 2024: 291-295 - [i24]Heinrich Dinkel, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang, Bin Wang:
Scaling up masked audio encoder learning for general audio classification. CoRR abs/2406.06992 (2024) - [i23]Zhiyong Yan, Heinrich Dinkel, Yongqing Wang, Jizhong Liu, Junbo Zhang, Yujun Wang, Bin Wang:
Bridging Language Gaps in Audio-Text Retrieval. CoRR abs/2406.07012 (2024) - [i22]Jizhong Liu, Gang Li, Junbo Zhang, Heinrich Dinkel, Yongqing Wang, Zhiyong Yan, Yujun Wang, Bin Wang:
Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding. CoRR abs/2406.13275 (2024) - 2023
- [c27]Heinrich Dinkel, Yongqing Wang, Zhiyong Yan, Junbo Zhang, Yujun Wang:
Unified Keyword Spotting and Audio Tagging on Mobile Devices with Transformers. ICASSP 2023: 1-5 - [c26]Jiuxin Lin, Xinyu Cai, Heinrich Dinkel, Jun Chen, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Zhiyong Wu, Yujun Wang, Helen Meng:
Av-Sepformer: Cross-Attention Sepformer for Audio-Visual Target Speaker Extraction. ICASSP 2023: 1-5 - [c25]Jiuxin Lin, Peng Wang, Heinrich Dinkel, Jun Chen, Zhiyong Wu, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang:
Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information. INTERSPEECH 2023: 2488-2492 - [i21]Heinrich Dinkel, Yongqing Wang, Zhiyong Yan, Junbo Zhang, Yujun Wang:
Unified Keyword Spotting and Audio Tagging on Mobile Devices with Transformers. CoRR abs/2303.01812 (2023) - [i20]Heinrich Dinkel, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang:
Streaming Audio Transformers for Online Audio Tagging. CoRR abs/2305.17834 (2023) - [i19]Heinrich Dinkel, Weiji Zhuang, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang:
Understanding temporally weakly supervised training: A case study for keyword spotting. CoRR abs/2305.18794 (2023) - [i18]Jiuxin Lin, Xinyu Cai, Heinrich Dinkel, Jun Chen, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Zhiyong Wu, Yujun Wang, Helen Meng:
AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker Extraction. CoRR abs/2306.14170 (2023) - [i17]Jiuxin Lin, Peng Wang, Heinrich Dinkel, Jun Chen, Zhiyong Wu, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang:
Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information. CoRR abs/2306.16241 (2023) - [i16]Heinrich Dinkel, Yongqing Wang, Zhiyong Yan, Junbo Zhang, Yujun Wang:
CED: Consistent ensemble distillation for audio tagging. CoRR abs/2308.11957 (2023) - 2022
- [c24]Heinrich Dinkel, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang:
Pseudo Strong Labels for Large Scale Weakly Supervised Audio Tagging. ICASSP 2022: 336-340 - [c23]Guangwei Li, Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Category-Adapted Sound Event Enhancement with Weakly Labeled Data. ICASSP 2022: 851-855 - [c22]Heinrich Dinkel, Yongqing Wang, Zhiyong Yan, Junbo Zhang, Yujun Wang:
UniKW-AT: Unified Keyword Spotting and Audio Tagging. INTERSPEECH 2022: 3238-3242 - [c21]Heinrich Dinkel, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang:
An Empirical Study of Weakly Supervised Audio Tagging Embeddings for General Audio Representations. Odyssey 2022: 390-395 - [i15]Heinrich Dinkel, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang:
Pseudo strong labels for large scale weakly supervised audio tagging. CoRR abs/2204.13430 (2022) - [i14]Heinrich Dinkel, Yongqing Wang, Zhiyong Yan, Junbo Zhang, Yujun Wang:
UniKW-AT: Unified Keyword Spotting and Audio Tagging. CoRR abs/2209.11377 (2022) - [i13]Heinrich Dinkel, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang:
An empirical study of weakly supervised audio tagging embeddings for general audio representations. CoRR abs/2209.15167 (2022) - 2021
- [j4]Heinrich Dinkel, Mengyue Wu, Kai Yu:
Towards Duration Robust Weakly Supervised Sound Event Detection. IEEE ACM Trans. Audio Speech Lang. Process. 29: 887-900 (2021) - [j3]Heinrich Dinkel, Shuai Wang, Xuenan Xu, Mengyue Wu, Kai Yu:
Voice Activity Detection in the Wild: A Data-Driven Approach Using Teacher-Student Training. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1542-1555 (2021) - [c20]Xinyu Cai, Heinrich Dinkel:
A Contrastive Semi-Supervised Learning Framework For Anomaly Sound Detection. DCASE 2021: 31-34 - [c19]Xinyu Cai, Heinrich Dinkel:
A Lightweight Approach for Semi-Supervised Sound Event Detection with Unsupervised Data Augmentation. DCASE 2021: 35-39 - [c18]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Text-to-Audio Grounding: Building Correspondence Between Captions and Sound Events. ICASSP 2021: 606-610 - [c17]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Zeyu Xie, Kai Yu:
Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning. ICASSP 2021: 905-909 - [c16]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
A Lightweight Framework for Online Voice Activity Detection in the Wild. Interspeech 2021: 371-375 - [c15]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Audio Caption in a Car Setting with a Sentence-Level Loss. ISCSLP 2021: 1-5 - [c14]Pingyue Zhang, Mengyue Wu, Heinrich Dinkel, Kai Yu:
DEPA: Self-Supervised Audio Embedding for Depression Detection. ACM Multimedia 2021: 135-143 - [i12]Heinrich Dinkel, Mengyue Wu, Kai Yu:
Towards duration robust weakly supervised sound event detection. CoRR abs/2101.07687 (2021) - [i11]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Zeyu Xie, Kai Yu:
Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning. CoRR abs/2102.11457 (2021) - [i10]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Text-to-Audio Grounding: Building Correspondence Between Captions and Sound Events. CoRR abs/2102.11474 (2021) - [i9]Heinrich Dinkel, Shuai Wang, Xuenan Xu, Mengyue Wu, Kai Yu:
Voice activity detection in the wild: A data-driven approach using teacher-student training. CoRR abs/2105.04065 (2021) - 2020
- [c13]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
A CRNN-GRU Based Reinforcement Learning Approach to Audio Captioning. DCASE 2020: 225-229 - [c12]Rui Qian, Di Hu, Heinrich Dinkel, Mengyue Wu, Ning Xu, Weiyao Lin:
Multiple Sound Sources Localization from Coarse to Fine. ECCV (20) 2020: 292-308 - [c11]Heinrich Dinkel, Kai Yu:
Duration Robust Weakly Supervised Sound Event Detection. ICASSP 2020: 311-315 - [c10]Hongji Wang, Heinrich Dinkel, Shuai Wang, Yanmin Qian, Kai Yu:
Dual-Adversarial Domain Adaptation for Generalized Replay Attack Detection. INTERSPEECH 2020: 1086-1090 - [c9]Yefei Chen, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Voice Activity Detection in the Wild via Weakly Supervised Sound Event Detection. INTERSPEECH 2020: 3665-3669 - [i8]Heinrich Dinkel, Yefei Chen, Mengyue Wu, Kai Yu:
GPVAD: Towards noise robust voice activity detection via weakly supervised sound event detection. CoRR abs/2003.12222 (2020) - [i7]Rui Qian, Di Hu, Heinrich Dinkel, Mengyue Wu, Ning Xu, Weiyao Lin:
Multiple Sound Sources Localization from Coarse to Fine. CoRR abs/2007.06355 (2020) - [i6]Heinrich Dinkel, Nanxin Chen, Yanmin Qian, Kai Yu:
End-to-end spoofing detection with raw waveform CLDNNs. CoRR abs/2007.13060 (2020)
2010 – 2019
- 2019
- [c8]Mengyue Wu, Heinrich Dinkel, Kai Yu:
Audio Caption: Listen and Tell. ICASSP 2019: 830-834 - [c7]Yexin Yang, Hongji Wang, Heinrich Dinkel, Zhengyang Chen, Shuai Wang, Yanmin Qian, Kai Yu:
The SJTU Robust Anti-Spoofing System for the ASVspoof 2019 Challenge. INTERSPEECH 2019: 1038-1042 - [c6]Hongji Wang, Heinrich Dinkel, Shuai Wang, Yanmin Qian, Kai Yu:
Cross-Domain Replay Spoofing Attack Detection Using Domain Adversarial Training. INTERSPEECH 2019: 2938-2942 - [i5]Mengyue Wu, Heinrich Dinkel, Kai Yu:
Audio Caption: Listen and Tell. CoRR abs/1902.09254 (2019) - [i4]Heinrich Dinkel, Kai Yu:
Duration robust sound event detection. CoRR abs/1904.03841 (2019) - [i3]Heinrich Dinkel, Mengyue Wu, Kai Yu:
Text-based Depression Detection: What Triggers An Alert. CoRR abs/1904.05154 (2019) - [i2]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
What does a Car-ssette tape tell? CoRR abs/1905.13448 (2019) - [i1]Heinrich Dinkel, Pingyue Zhang, Mengyue Wu, Kai Yu:
Depa: Self-supervised audio embedding for depression detection. CoRR abs/1910.13028 (2019) - 2018
- [j2]Heinrich Dinkel, Yanmin Qian, Kai Yu:
Investigating Raw Wave Deep Neural Networks for End-to-End Speaker Spoofing Detection. IEEE ACM Trans. Audio Speech Lang. Process. 26(11): 2002-2014 (2018) - [c5]Shuai Wang, Heinrich Dinkel, Yanmin Qian, Kai Yu:
Covariance Based Deep Feature for Text-Dependent Speaker Verification. IScIDE 2018: 231-242 - 2017
- [j1]Yanmin Qian, Nanxin Chen, Heinrich Dinkel, Zhizheng Wu:
Deep Feature Engineering for Noise Robust Spoofing Detection. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1942-1955 (2017) - [c4]Heinrich Dinkel, Nanxin Chen, Yanmin Qian, Kai Yu:
End-to-end spoofing detection with raw waveform CLDNNS. ICASSP 2017: 4860-4864 - [c3]Heinrich Dinkel, Yanmin Qian, Kai Yu:
Small-footprint convolutional neural network for spoofing detection. IJCNN 2017: 3086-3091 - 2016
- [c2]Pavel Korshunov, Sébastien Marcel, Hannah Muckenhirn, André R. Gonçalves, A. G. Souza Mello, Ricardo Paranhos Velloso Violato, Flávio Olmos Simões, Mário Uliani Neto, Marcus de Assis Angeloni, José Augusto Stuchi, Heinrich Dinkel, Nanxin Chen, Yanmin Qian, Dipjyoti Paul, Goutam Saha, Md. Sahidullah:
Overview of BTAS 2016 speaker anti-spoofing competition. BTAS 2016: 1-6 - 2015
- [c1]Nanxin Chen, Yanmin Qian, Heinrich Dinkel, Bo Chen, Kai Yu:
Robust deep feature for spoofing detection - the SJTU system for ASVspoof 2015 challenge. INTERSPEECH 2015: 2097-2101
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-08-06 21:07 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint