Vamsi K. Ithapu

According to our database1, Vamsi K. Ithapu authored at least 46 papers between 2010 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Self-Motion As Supervision For Egocentric Audiovisual Localization.
Proceedings of the IEEE International Conference on Acoustics, 2024

Hearing Loss Detection From Facial Expressions in One-On-One Conversations.
Proceedings of the IEEE International Conference on Acoustics, 2024

Spherical World-Locking for Audio-Visual Localization in Egocentric Videos.
Proceedings of the Computer Vision - ECCV 2024, 2024

The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Towards Improved Room Impulse Response Estimation for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

LA-VOCE: LOW-SNR Audio-Visual Speech Enhancement Using Neural Vocoders.
Proceedings of the IEEE International Conference on Acoustics, 2023

Learning to Personalize Equalization for High-Fidelity Spatial Audio Reproduction.
Proceedings of the IEEE International Conference on Acoustics, 2023

Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-Channel Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2023

Egocentric Auditory Attention Localization in Conversations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Novel-View Acoustic Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
RemixIT: Continual Self-Training of Speech Enhancement Models via Bootstrapped Remixing.
IEEE J. Sel. Top. Signal Process., 2022

SAQAM: Spatial Audio Quality Assessment Metric.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Continual Self-Training With Bootstrapped Remixing For Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2022

Deep Impulse Responses: Estimating and Parameterizing Filters with Deep Networks.
Proceedings of the IEEE International Conference on Acoustics, 2022


Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Ego4D: Around the World in 3, 000 Hours of Egocentric Video.
CoRR, 2021

EasyCom: An Augmented Reality Dataset to Support Algorithms for Easy Communication in Noisy Environments.
CoRR, 2021

Filtered Noise Shaping for Time Domain Room Impulse Response Estimation from Reverberant Speech.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

DPLM: A Deep Perceptual Spatial-Audio Localization Metric.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Do Sound Event Representations Generalize to Other Audio Tasks? A Case Study in Audio Transfer Learning.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Audio-Visual Floorplan Reconstruction.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Egocentric Pose Estimation from Human Vision Span.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

On the Predictability of Hrtfs from Ear Shapes Using Deep Networks.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition.
Proceedings of the 37th International Conference on Machine Learning, 2020

SeCoST: : Sequential Co-Supervision for Large Scale Weakly Labeled Audio Event Detection.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

SoundSpaces: Audio-Visual Navigation in 3D Environments.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Audio-Visual Embodied Navigation.
CoRR, 2019

SeCoST: Sequential Co-Supervision for Weakly Labeled Audio Event Detection.
CoRR, 2019

2017
Accelerating permutation testing in voxel-wise analysis through subspace tracking: A new plugin for SnPM.
NeuroImage, 2017

On architectural choices in deep learning: From network structure to gradient convergence and parameter estimation.
CoRR, 2017

When can Multi-Site Datasets be Pooled for Regression? Hypothesis Tests, $\ell_2$-consistency and Neuroscience Applications.
Proceedings of the 34th International Conference on Machine Learning, 2017

The Incremental Multiresolution Matrix Factorization Algorithm.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Decoding the Deep: Exploring Class Hierarchies of Deep Representations Using Multiresolution Matrix Factorization.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Randomized Deep Learning Methods for Clinical Trial Enrichment and Design in Alzheimer's Disease.
Proceedings of the Deep Learning for Medical Image Analysis, 1st Edition, 2017

2016
Hypothesis Testing in Unsupervised Domain Adaptation with Applications in Alzheimer's Disease.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Experimental Design on a Budget for Sparse Linear Models and Applications.
Proceedings of the 33nd International Conference on Machine Learning, 2016

On the interplay of network structure and gradient convergence in deep learning.
Proceedings of the 54th Annual Allerton Conference on Communication, 2016

2015
Convergence of gradient based pre-training in Denoising autoencoders.
CoRR, 2015

An NMF Perspective on Binary Hashing.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

A Projection Free Method for Generalized Eigenvalue Problem with a Nonsmooth Regularizer.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014
Randomized Denoising Autoencoders for Smaller and Efficient Imaging Based AD Clinical Trials.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2014, 2014

2013
Speeding up Permutation Testing in Neuroimaging.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

GOSUS: Grassmannian Online Subspace Updates with Structured-Sparsity.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2010
Fundus image registration for vestibularis research.
Proceedings of the Medical Imaging 2010: Computer-Aided Diagnosis, 2010


  Loading...