Gordon Wichern

Reinhold Haeb-Umbach

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Task-Aware Unified Source Separation.

CoRR, 2024

Leveraging Audio-Only Data for Text-Queried Target Sound Extraction.

CoRR, 2024

Enhanced Reverberation as Supervision for Unsupervised Speech Separation.

CoRR, 2024

Sound Event Bounding Boxes.

CoRR, 2024

SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers.

CoRR, 2024

TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement.

Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024

Deep Neural Room Acoustics Primitive.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Improving Audio Captioning Models with Fine-Grained Audio Features, Text Embedding Supervision, and LLM Mix-Up Augmentation.

Proceedings of the IEEE International Conference on Acoustics, 2024

Late Audio-Visual Fusion for in-the-Wild Speaker Diarization.

Zexu Pan

François G. Germain

Proceedings of the IEEE International Conference on Acoustics, 2024

NeuroHeed+: Improving Neuro-Steered Speaker Extraction with Joint Auditory Attention Detection.

Proceedings of the IEEE International Conference on Acoustics, 2024

NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization.

Proceedings of the IEEE International Conference on Acoustics, 2024

Why Does Music Source Separation Benefit from Cacophony?

Proceedings of the IEEE International Conference on Acoustics, 2024

Generation or Replication: Auscultating Audio Latent Diffusion Models.

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

STFT-Domain Neural Speech Enhancement With Very Low Algorithmic Latency.

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks.

Darius Petermann

Alexander L. Stempkovskiy

IEEE ACM Trans. Audio Speech Lang. Process., 2023

The Sound Demixing Challenge 2023 - Cinematic Demixing Track.

Tatiana Habruseva

Mikhail Sukhovei

Yuki Mitsufuji

CoRR, 2023

Pac-HuBERT: Self-Supervised Music Source Separation via Primitive Auditory Clustering and Hidden-Unit BERT.

CoRR, 2023

Location as Supervision for Weakly Supervised Multi-Channel Source Separation of Machine Sounds.

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Hyperbolic Unsupervised Anomalous Sound Detection.

François G. Germain

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Cold Diffusion for Speech Enhancement.

Proceedings of the IEEE International Conference on Acoustics, 2023

Optimal Condition Training for Target Source Separation.

Proceedings of the IEEE International Conference on Acoustics, 2023

Hyperbolic Audio Source Separation.

Darius Petermann

Proceedings of the IEEE International Conference on Acoustics, 2023

Paᗧ-HuBERT: Self-Supervised Music Source Separation Via Primitive Auditory Clustering And Hidden-Unit BERT.

Proceedings of the IEEE International Conference on Acoustics, 2023

Latent Iterative Refinement for Modular Source Separation.

Proceedings of the IEEE International Conference on Acoustics, 2023

Reverberation as Supervision For Speech Separation.

Rohith Aralikatti

Christoph Böddeker

Proceedings of the IEEE International Conference on Acoustics, 2023

Synthesizing Building Operation Data with Generative Models: VAEs, GANs, or Something In Between?

Alessandro Salatiello

Christopher R. Laughman

Ankush Chakrabarty

Companion Proceedings of the 14th ACM International Conference on Future Energy Systems, 2023

Scenario-Aware Audio-Visual TF-Gridnet for Target Speech Extraction.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

Meta-Learning of Neural State-Space Models Using Data From Similar Systems.

Ankush Chakrabarty

Christopher R. Laughman

CoRR, 2022

Towards End-to-end Speaker Diarization in the Wild.

Zexu Pan

François G. Germain

CoRR, 2022

Heterogeneous Target Speech Separation.

Efthymios Tzinis

Paris Smaragdis

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Locate This, Not that: Class-Conditioned Sound Event DOA Estimation.

Proceedings of the IEEE International Conference on Acoustics, 2022

The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks.

Proceedings of the IEEE International Conference on Acoustics, 2022

Improved Domain Generalization via Disentangled Multi-Task Learning in Unsupervised Anomalous Sound Detection.

Satvik Venkatesh

Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021

Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation.

IEEE ACM Trans. Audio Speech Lang. Process., 2021

On the Compensation Between Magnitude and Phase in Speech Separation.

IEEE Signal Process. Lett., 2021

Leveraging Low-Distortion Target Estimates for Improved Speech Enhancement.

CoRR, 2021

Attentive Neural Processes and Batch Bayesian Optimization for Scalable Calibration of Physics-Informed Digital Twins.

Ankush Chakrabarty

Christopher R. Laughman

CoRR, 2021

Anomalous Sound Detection Using Attentive Neural Processes.

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Convolutive Prediction for Reverberant Speech Separation.

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Transcription Is All You Need: Learning To Separate Musical Mixtures With Score As Supervision.

Yun-Ning Hung

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Finding Strength in Weakness: Learning to Separate Sounds With Weak Supervision.

Fatemeh Pishdadian

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Autoclip: Adaptive Gradient Clipping for Source Separation Networks.

Proceedings of the 30th IEEE International Workshop on Machine Learning for Signal Processing, 2020

Hierarchical Musical Instrument Separation.

Ethan Manilow

Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

All-in-One Transformer: Unifying Speech Recognition, Audio Tagging, and Event Detection.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Learning to Separate Sounds from Weakly Labeled Scenes.

Fatemeh Pishdadian

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

WHAMR!: Noisy and Reverberant Single-Channel Speech Separation.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Phasebook and Friends: Leveraging Discrete Representations for Source Separation.

IEEE J. Sel. Top. Signal Process., 2019

Bootstrapping deep music separation from primitive auditory grouping principles.

CoRR, 2019

Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity.

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

WHAM!: Extending Speech Separation to Noisy Environments.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Class-conditional Embeddings for Music Source Separation.

Prem Seetharaman

Shrikant Venkataramani

Proceedings of the IEEE International Conference on Acoustics, 2019

Bootstrapping Single-channel Source Separation via Unsupervised Spatial Clustering on Stereo Mixtures.

Proceedings of the IEEE International Conference on Acoustics, 2019

The Phasebook: Building Complex Masks via Discrete Representations for Source Separation.

Proceedings of the IEEE International Conference on Acoustics, 2019

End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features.

Raphael Gontijo Lopes

Proceedings of the IEEE International Conference on Acoustics, 2019

Teacher-student Deep Clustering for Low-delay Single Channel Speech Separation.

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Phase Reconstruction with Learned Time-Frequency Representations for Single-Channel Speech Separation.

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Multimodal Attention for Fusion of Audio and Spatiotemporal Features for Video Description.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017

Low-Latency approximation of bidirectional recurrent networks for speech denoising.

Alexey Lukin

Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

2013

Noise adaptive optimization of matrix initialization for frequency-domain independent component analysis.

Digit. Signal Process., 2013

2011

Improving the Accuracy of Least-Squares Probabilistic Classifiers.

IEICE Trans. Inf. Syst., 2011

2010

Segmentation, Indexing, and Retrieval for Environmental and Natural Sounds.

IEEE Trans. Speech Audio Process., 2010

Direct Importance Estimation with a Mixture of Probabilistic Principal Component Analyzers.

IEICE Trans. Inf. Syst., 2010

An Ontological Framework for Retrieving Environmental Sounds Using Semantics and Acoustic Content.

EURASIP J. Audio Speech Music. Process., 2010

Acceleration of sequence kernel computation for real-time speaker identification.

Proceedings of the IEEE International Conference on Acoustics, 2010

Direct importance estimation with probabilistic principal component analyzers.

Makoto Yamada

Masashi Sugiyama

Proceedings of the IEEE International Conference on Acoustics, 2010

Automatic audio tagging using covariate shift adaptation.

Proceedings of the IEEE International Conference on Acoustics, 2010

Combining semantic, social, and acoustic similarity for retrieval of environmental sounds.

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Unifying semantic and content-based approaches for retrieval of environmental sounds.

Harvey D. Thornburg

Andreas Spanias

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Continuous observation and archival of acoustic scenes using wireless sensor networks.

Proceedings of the 16th International Conference on Digital Signal Processing, 2009

Multi-channel audio segmentation for continuous observation and archival of large spaces.

Harvey D. Thornburg

Andreas Spanias

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Fast query by example of environmental sounds via robust and efficient cluster-based indexing.

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Environmentally adaptive acoustic transmission loss prediction in turbulent and nonturbulent atmospheres.

Mahmood R. Azimi-Sadjadi

Michael Mungiole

Neural Networks, 2007

An Operationally Adaptive System for Rapid Acoustic Transmission Loss Prediction.

Michael McCarron

Mahmood R. Azimi-Sadjadi

Michael Mungiole

Proceedings of the International Joint Conference on Neural Networks, 2007

Robust Multi-Features Segmentation and Indexing for Natural Sound Environments.

Proceedings of the International Workshop on Content-Based Multimedia Indexing, 2007

2006

An Environmentally Adaptive System for Rapid Acoustic Transmission Loss Prediction.
