default search action
Vamsi K. Ithapu
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c37]Wenqi Jia, Miao Liu, Hao Jiang, Ishwarya Ananthabhotla, James M. Rehg, Vamsi Krishna Ithapu, Ruohan Gao:
The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective. CVPR 2024: 26386-26395 - [c36]Heeseung Yun, Ruohan Gao, Ishwarya Ananthabhotla, Anurag Kumar, Jacob Donley, Chao Li, Gunhee Kim, Vamsi Krishna Ithapu, Calvin Murdock:
Spherical World-Locking for Audio-Visual Localization in Egocentric Videos. ECCV (24) 2024: 256-274 - [c35]Yufeng Yin, Ishwarya Ananthabhotla, Vamsi Krishna Ithapu, Stavros Petridis, Yu-Hsiang Wu, Christi Miller:
Hearing Loss Detection From Facial Expressions in One-On-One Conversations. ICASSP 2024: 5460-5464 - [c34]Calvin Murdock, Ishwarya Ananthabhotla, Hao Lu, Vamsi Krishna Ithapu:
Self-Motion As Supervision For Egocentric Audiovisual Localization. ICASSP 2024: 7835-7839 - [i30]Yufeng Yin, Ishwarya Ananthabhotla, Vamsi Krishna Ithapu, Stavros Petridis, Yu-Hsiang Wu, Christi Miller:
Hearing Loss Detection from Facial Expressions in One-on-one Conversations. CoRR abs/2401.08972 (2024) - [i29]Heeseung Yun, Ruohan Gao, Ishwarya Ananthabhotla, Anurag Kumar, Jacob Donley, Chao Li, Gunhee Kim, Vamsi Krishna Ithapu, Calvin Murdock:
Spherical World-Locking for Audio-Visual Localization in Egocentric Videos. CoRR abs/2408.05364 (2024) - 2023
- [c33]Changan Chen, Alexander Richard, Roman Shapovalov, Vamsi Krishna Ithapu, Natalia Neverova, Kristen Grauman, Andrea Vedaldi:
Novel-View Acoustic Synthesis. CVPR 2023: 6409-6419 - [c32]Sagnik Majumder, Hao Jiang, Pierre Moulon, Ethan Henderson, Paul Calamia, Kristen Grauman, Vamsi Krishna Ithapu:
Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations. CVPR 2023: 10554-10564 - [c31]Fiona Ryan, Hao Jiang, Abhinav Shukla, James M. Rehg, Vamsi Krishna Ithapu:
Egocentric Auditory Attention Localization in Conversations. CVPR 2023: 14663-14674 - [c30]Kuan-Lin Chen, Daniel D. E. Wong, Ke Tan, Buye Xu, Anurag Kumar, Vamsi Krishna Ithapu:
Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-Channel Speech Enhancement. ICASSP 2023: 1-5 - [c29]Arjun Gupta, Pablo Hoffmann, Sebastian Prepelita, Philip W. Robinson, Vamsi K. Ithapu, David L. Alon:
Learning to Personalize Equalization for High-Fidelity Spatial Audio Reproduction. ICASSP 2023: 1-5 - [c28]Rodrigo Mira, Buye Xu, Jacob Donley, Anurag Kumar, Stavros Petridis, Vamsi Krishna Ithapu, Maja Pantic:
LA-VOCE: LOW-SNR Audio-Visual Speech Enhancement Using Neural Vocoders. ICASSP 2023: 1-5 - [c27]Anton Ratnarajah, Ishwarya Ananthabhotla, Vamsi Krishna Ithapu, Pablo Hoffmann, Dinesh Manocha, Paul Calamia:
Towards Improved Room Impulse Response Estimation for Speech Recognition. ICASSP 2023: 1-5 - [i28]Sagnik Majumder, Hao Jiang, Pierre Moulon, Ethan Henderson, Paul Calamia, Kristen Grauman, Vamsi Krishna Ithapu:
Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations. CoRR abs/2301.02184 (2023) - [i27]Changan Chen, Alexander Richard, Roman Shapovalov, Vamsi Krishna Ithapu, Natalia Neverova, Kristen Grauman, Andrea Vedaldi:
Novel-View Acoustic Synthesis. CoRR abs/2301.08730 (2023) - [i26]Fiona Ryan, Hao Jiang, Abhinav Shukla, James M. Rehg, Vamsi Krishna Ithapu:
Egocentric Auditory Attention Localization in Conversations. CoRR abs/2303.16024 (2023) - [i25]Wenqi Jia, Miao Liu, Hao Jiang, Ishwarya Ananthabhotla, James M. Rehg, Vamsi Krishna Ithapu, Ruohan Gao:
The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective. CoRR abs/2312.12870 (2023) - 2022
- [j2]Efthymios Tzinis, Yossi Adi, Vamsi K. Ithapu, Buye Xu, Paris Smaragdis, Anurag Kumar:
RemixIT: Continual Self-Training of Speech Enhancement Models via Bootstrapped Remixing. IEEE J. Sel. Top. Signal Process. 16(6): 1329-1341 (2022) - [c26]Hao Jiang, Calvin Murdock, Vamsi Krishna Ithapu:
Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization. CVPR 2022: 10534-10542 - [c25]Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Abrham Gebreselasie, Cristina González, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jáchym Kolár, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Ziwei Zhao, Yunyi Zhu, Pablo Arbeláez, David Crandall, Dima Damen, Giovanni Maria Farinella, Christian Fuegen, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard A. Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik:
Ego4D: Around the World in 3, 000 Hours of Egocentric Video. CVPR 2022: 18973-18990 - [c24]Alexander Richard, Peter Sheridan Dodds, Vamsi Krishna Ithapu:
Deep Impulse Responses: Estimating and Parameterizing Filters with Deep Networks. ICASSP 2022: 3209-3213 - [c23]Efthymios Tzinis, Yossi Adi, Vamsi K. Ithapu, Buye Xu, Anurag Kumar:
Continual Self-Training With Bootstrapped Remixing For Speech Enhancement. ICASSP 2022: 6947-6951 - [c22]Pranay Manocha, Anurag Kumar, Buye Xu, Anjali Menon, Israel Dejene Gebru, Vamsi Krishna Ithapu, Paul Calamia:
SAQAM: Spatial Audio Quality Assessment Metric. INTERSPEECH 2022: 649-653 - [i24]Hao Jiang, Calvin Murdock, Vamsi Krishna Ithapu:
Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization. CoRR abs/2201.01928 (2022) - [i23]Alexander Richard, Peter Sheridan Dodds, Vamsi Krishna Ithapu:
Deep Impulse Responses: Estimating and Parameterizing Filters with Deep Networks. CoRR abs/2202.03416 (2022) - [i22]Efthymios Tzinis, Yossi Adi, Vamsi Krishna Ithapu, Buye Xu, Paris Smaragdis, Anurag Kumar:
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing. CoRR abs/2202.08862 (2022) - [i21]Pranay Manocha, Anurag Kumar, Buye Xu, Anjali Menon, Israel D. Gebru, Vamsi K. Ithapu, Paul Calamia:
SAQAM: Spatial Audio Quality Assessment Metric. CoRR abs/2206.12297 (2022) - [i20]Anton Ratnarajah, Ishwarya Ananthabhotla, Vamsi Krishna Ithapu, Pablo Hoffmann, Dinesh Manocha, Paul Calamia:
Towards Improved Room Impulse Response Estimation for Speech Recognition. CoRR abs/2211.04473 (2022) - [i19]Kuan-Lin Chen, Daniel D. E. Wong, Ke Tan, Buye Xu, Anurag Kumar, Vamsi Krishna Ithapu:
Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-channel Speech Enhancement. CoRR abs/2211.08624 (2022) - [i18]Rodrigo Mira, Buye Xu, Jacob Donley, Anurag Kumar, Stavros Petridis, Vamsi Krishna Ithapu, Maja Pantic:
LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders. CoRR abs/2211.10999 (2022) - 2021
- [c21]Yaxuan Zhou, Hao Jiang, Vamsi Krishna Ithapu:
On the Predictability of Hrtfs from Ear Shapes Using Deep Networks. ICASSP 2021: 441-445 - [c20]Senthil Purushwalkam, Sebastia Vicenc Amengual Gari, Vamsi Krishna Ithapu, Carl Schissler, Philip W. Robinson, Abhinav Gupta, Kristen Grauman:
Audio-Visual Floorplan Reconstruction. ICCV 2021: 1163-1172 - [c19]Hao Jiang, Vamsi Krishna Ithapu:
Egocentric Pose Estimation from Human Vision Span. ICCV 2021: 10986-10994 - [c18]Anurag Kumar, Yun Wang, Vamsi Krishna Ithapu, Christian Fuegen:
Do Sound Event Representations Generalize to Other Audio Tasks? A Case Study in Audio Transfer Learning. Interspeech 2021: 1214-1218 - [c17]Pranay Manocha, Anurag Kumar, Buye Xu, Anjali Menon, Israel D. Gebru, Vamsi K. Ithapu, Paul Calamia:
DPLM: A Deep Perceptual Spatial-Audio Localization Metric. WASPAA 2021: 6-10 - [c16]Christian J. Steinmetz, Vamsi Krishna Ithapu, Paul Calamia:
Filtered Noise Shaping for Time Domain Room Impulse Response Estimation from Reverberant Speech. WASPAA 2021: 221-225 - [i17]Hao Jiang, Vamsi Krishna Ithapu:
Egocentric Pose Estimation from Human Vision Span. CoRR abs/2104.05167 (2021) - [i16]Pranay Manocha, Anurag Kumar, Buye Xu, Anjali Menon, Israel D. Gebru, Vamsi K. Ithapu, Paul Calamia:
DPLM: A Deep Perceptual Spatial-Audio Localization Metric. CoRR abs/2105.14180 (2021) - [i15]Anurag Kumar, Yun Wang, Vamsi Krishna Ithapu, Christian Fuegen:
Do sound event representations generalize to other audio tasks? A case study in audio transfer learning. CoRR abs/2106.11335 (2021) - [i14]Jacob Donley, Vladimir Tourbabin, Jung-Suk Lee, Mark Broyles, Hao Jiang, Jie Shen, Maja Pantic, Vamsi Krishna Ithapu, Ravish Mehra:
EasyCom: An Augmented Reality Dataset to Support Algorithms for Easy Communication in Noisy Environments. CoRR abs/2107.04174 (2021) - [i13]Christian J. Steinmetz, Vamsi Krishna Ithapu, Paul Calamia:
Filtered Noise Shaping for Time Domain Room Impulse Response Estimation From Reverberant Speech. CoRR abs/2107.07503 (2021) - [i12]Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Christian Fuegen, Abrham Gebreselasie, Cristina González, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jáchym Kolár, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Yunyi Zhu, Pablo Arbeláez, David Crandall, Dima Damen, Giovanni Maria Farinella, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard A. Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik:
Ego4D: Around the World in 3, 000 Hours of Egocentric Video. CoRR abs/2110.07058 (2021) - [i11]Efthymios Tzinis, Yossi Adi, Vamsi K. Ithapu, Buye Xu, Anurag Kumar:
Continual self-training with bootstrapped remixing for speech enhancement. CoRR abs/2110.10103 (2021) - 2020
- [c15]Changan Chen, Unnat Jain, Carl Schissler, Sebastia Vicenc Amengual Gari, Ziad Al-Halah, Vamsi Krishna Ithapu, Philip W. Robinson, Kristen Grauman:
SoundSpaces: Audio-Visual Navigation in 3D Environments. ECCV (6) 2020: 17-36 - [c14]Anurag Kumar, Vamsi Krishna Ithapu:
SeCoST: : Sequential Co-Supervision for Large Scale Weakly Labeled Audio Event Detection. ICASSP 2020: 666-670 - [c13]Anurag Kumar, Vamsi K. Ithapu:
A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition. ICML 2020: 5447-5457 - [i10]Anurag Kumar, Vamsi Krishna Ithapu:
A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition. CoRR abs/2007.00144 (2020) - [i9]Senthil Purushwalkam, Sebastia Vicenc Amengual Gari, Vamsi Krishna Ithapu, Carl Schissler, Philip W. Robinson, Abhinav Gupta, Kristen Grauman:
Audio-Visual Floorplan Reconstruction. CoRR abs/2012.15470 (2020)
2010 – 2019
- 2019
- [i8]Anurag Kumar, Vamsi Krishna Ithapu:
SeCoST: Sequential Co-Supervision for Weakly Labeled Audio Event Detection. CoRR abs/1910.11789 (2019) - [i7]Changan Chen, Unnat Jain, Carl Schissler, Sebastia Vicenc Amengual Gari, Ziad Al-Halah, Vamsi Krishna Ithapu, Philip W. Robinson, Kristen Grauman:
Audio-Visual Embodied Navigation. CoRR abs/1912.11474 (2019) - 2017
- [j1]Felipe Gutierrez-Barragan, Vamsi K. Ithapu, Chris Hinrichs, Camille Maumet, Sterling C. Johnson, Thomas E. Nichols, Vikas Singh:
Accelerating permutation testing in voxel-wise analysis through subspace tracking: A new plugin for SnPM. NeuroImage 159: 79-98 (2017) - [c12]Vamsi K. Ithapu, Risi Kondor, Sterling C. Johnson, Vikas Singh:
The Incremental Multiresolution Matrix Factorization Algorithm. CVPR 2017: 692-701 - [c11]Vamsi K. Ithapu:
Decoding the Deep: Exploring Class Hierarchies of Deep Representations Using Multiresolution Matrix Factorization. CVPR Workshops 2017: 1695-1704 - [c10]Hao Henry Zhou, Yilin Zhang, Vamsi K. Ithapu, Sterling C. Johnson, Grace Wahba, Vikas Singh:
When can Multi-Site Datasets be Pooled for Regression? Hypothesis Tests, $\ell_2$-consistency and Neuroscience Applications. ICML 2017: 4170-4179 - [p1]Vamsi K. Ithapu, Vikas Singh, Sterling C. Johnson:
Randomized Deep Learning Methods for Clinical Trial Enrichment and Design in Alzheimer's Disease. Deep Learning for Medical Image Analysis 2017: 341-378 - [i6]Vamsi K. Ithapu, Sathya N. Ravi, Vikas Singh:
On architectural choices in deep learning: From network structure to gradient convergence and parameter estimation. CoRR abs/1702.08670 (2017) - [i5]Felipe Gutierrez-Barragan, Vamsi K. Ithapu, Chris Hinrichs, Camille Maumet, Sterling C. Johnson, Thomas E. Nichols, Vikas Singh:
Accelerating Permutation Testing in Voxel-wise Analysis through Subspace Tracking: A new plugin for SnPM. CoRR abs/1703.01506 (2017) - [i4]Vamsi K. Ithapu, Risi Kondor, Sterling C. Johnson, Vikas Singh:
The Incremental Multiresolution Matrix Factorization Algorithm. CoRR abs/1705.05804 (2017) - 2016
- [c9]Vamsi K. Ithapu, Sathya N. Ravi, Vikas Singh:
On the interplay of network structure and gradient convergence in deep learning. Allerton 2016: 488-495 - [c8]Sathya N. Ravi, Vamsi K. Ithapu, Sterling C. Johnson, Vikas Singh:
Experimental Design on a Budget for Sparse Linear Models and Applications. ICML 2016: 583-592 - [c7]Hao Henry Zhou, Vamsi K. Ithapu, Sathya Narayanan Ravi, Vikas Singh, Grace Wahba, Sterling C. Johnson:
Hypothesis Testing in Unsupervised Domain Adaptation with Applications in Alzheimer's Disease. NIPS 2016: 2496-2504 - 2015
- [c6]Seong Jae Hwang, Maxwell D. Collins, Sathya N. Ravi, Vamsi K. Ithapu, Nagesh Adluru, Sterling C. Johnson, Vikas Singh:
A Projection Free Method for Generalized Eigenvalue Problem with a Nonsmooth Regularizer. ICCV 2015: 1841-1849 - [c5]Lopamudra Mukherjee, Sathya N. Ravi, Vamsi K. Ithapu, Tyler Holmes, Vikas Singh:
An NMF Perspective on Binary Hashing. ICCV 2015: 4184-4192 - [i3]Chris Hinrichs, Vamsi K. Ithapu, Qinyuan Sun, Sterling C. Johnson, Vikas Singh:
Speeding up Permutation Testing in Neuroimaging. CoRR abs/1502.03536 (2015) - [i2]Vamsi K. Ithapu, Sathya N. Ravi, Vikas Singh:
Convergence of gradient based pre-training in Denoising autoencoders. CoRR abs/1502.03537 (2015) - [i1]Vamsi K. Ithapu, Sathya N. Ravi, Vikas Singh:
On the interplay of network structure and gradient convergence in deep learning. CoRR abs/1511.05297 (2015) - 2014
- [c4]Vamsi K. Ithapu, Vikas Singh, Ozioma C. Okonkwo, Sterling C. Johnson:
Randomized Denoising Autoencoders for Smaller and Efficient Imaging Based AD Clinical Trials. MICCAI (2) 2014: 470-478 - 2013
- [c3]Jia Xu, Vamsi K. Ithapu, Lopamudra Mukherjee, James M. Rehg, Vikas Singh:
GOSUS: Grassmannian Online Subspace Updates with Structured-Sparsity. ICCV 2013: 3376-3383 - [c2]Chris Hinrichs, Vamsi K. Ithapu, Qinyuan Sun, Sterling C. Johnson, Vikas Singh:
Speeding up Permutation Testing in Neuroimaging. NIPS 2013: 890-898 - 2010
- [c1]Vamsi K. Ithapu, Armin Fritsche, Ariane Oppelt, Martin Westhofen, Thomas M. Deserno:
Fundus image registration for vestibularis research. Computer-Aided Diagnosis 2010: 76243E
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-28 21:27 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint