Gordon Wichern

Orcid: 0000-0002-8597-6795

According to our database1, Gordon Wichern authored at least 77 papers between 2006 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
The Sound Demixing Challenge 2023 - Cinematic Demixing Track.
Trans. Int. Soc. Music. Inf. Retr., January, 2024

TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Task-Aware Unified Source Separation.
CoRR, 2024

Leveraging Audio-Only Data for Text-Queried Target Sound Extraction.
CoRR, 2024

Enhanced Reverberation as Supervision for Unsupervised Speech Separation.
CoRR, 2024

Sound Event Bounding Boxes.
CoRR, 2024

SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers.
CoRR, 2024

TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement.
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024

Deep Neural Room Acoustics Primitive.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Improving Audio Captioning Models with Fine-Grained Audio Features, Text Embedding Supervision, and LLM Mix-Up Augmentation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Late Audio-Visual Fusion for in-the-Wild Speaker Diarization.
Proceedings of the IEEE International Conference on Acoustics, 2024

NeuroHeed+: Improving Neuro-Steered Speaker Extraction with Joint Auditory Attention Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024

NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization.
Proceedings of the IEEE International Conference on Acoustics, 2024

Why Does Music Source Separation Benefit from Cacophony?
Proceedings of the IEEE International Conference on Acoustics, 2024

Generation or Replication: Auscultating Audio Latent Diffusion Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
STFT-Domain Neural Speech Enhancement With Very Low Algorithmic Latency.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

The Sound Demixing Challenge 2023 - Cinematic Demixing Track.
CoRR, 2023

Pac-HuBERT: Self-Supervised Music Source Separation via Primitive Auditory Clustering and Hidden-Unit BERT.
CoRR, 2023

Location as Supervision for Weakly Supervised Multi-Channel Source Separation of Machine Sounds.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Hyperbolic Unsupervised Anomalous Sound Detection.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Cold Diffusion for Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2023

Optimal Condition Training for Target Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Hyperbolic Audio Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Paᗧ-HuBERT: Self-Supervised Music Source Separation Via Primitive Auditory Clustering And Hidden-Unit Bert.
Proceedings of the IEEE International Conference on Acoustics, 2023

Latent Iterative Refinement for Modular Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Reverberation as Supervision For Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Synthesizing Building Operation Data with Generative Models: VAEs, GANs, or Something In Between?
Proceedings of the Companion Proceedings of the 14th ACM International Conference on Future Energy Systems, 2023

Scenario-Aware Audio-Visual TF-Gridnet for Target Speech Extraction.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Meta-Learning of Neural State-Space Models Using Data From Similar Systems.
CoRR, 2022

Towards End-to-end Speaker Diarization in the Wild.
CoRR, 2022

Heterogeneous Target Speech Separation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Locate This, Not that: Class-Conditioned Sound Event DOA Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2022

The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks.
Proceedings of the IEEE International Conference on Acoustics, 2022

Improved Domain Generalization via Disentangled Multi-Task Learning in Unsupervised Anomalous Sound Detection.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021
Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

On the Compensation Between Magnitude and Phase in Speech Separation.
IEEE Signal Process. Lett., 2021

Leveraging Low-Distortion Target Estimates for Improved Speech Enhancement.
CoRR, 2021

Attentive Neural Processes and Batch Bayesian Optimization for Scalable Calibration of Physics-Informed Digital Twins.
CoRR, 2021

Anomalous Sound Detection Using Attentive Neural Processes.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Convolutive Prediction for Reverberant Speech Separation.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Transcription Is All You Need: Learning To Separate Musical Mixtures With Score As Supervision.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Finding Strength in Weakness: Learning to Separate Sounds With Weak Supervision.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Autoclip: Adaptive Gradient Clipping for Source Separation Networks.
Proceedings of the 30th IEEE International Workshop on Machine Learning for Signal Processing, 2020

Hierarchical Musical Instrument Separation.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

All-in-One Transformer: Unifying Speech Recognition, Audio Tagging, and Event Detection.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Learning to Separate Sounds from Weakly Labeled Scenes.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

WHAMR!: Noisy and Reverberant Single-Channel Speech Separation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Phasebook and Friends: Leveraging Discrete Representations for Source Separation.
IEEE J. Sel. Top. Signal Process., 2019

Bootstrapping deep music separation from primitive auditory grouping principles.
CoRR, 2019

Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

WHAM!: Extending Speech Separation to Noisy Environments.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Class-conditional Embeddings for Music Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Bootstrapping Single-channel Source Separation via Unsupervised Spatial Clustering on Stereo Mixtures.
Proceedings of the IEEE International Conference on Acoustics, 2019

The Phasebook: Building Complex Masks via Discrete Representations for Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2019

End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features.
Proceedings of the IEEE International Conference on Acoustics, 2019

Teacher-student Deep Clustering for Low-delay Single Channel Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Phase Reconstruction with Learned Time-Frequency Representations for Single-Channel Speech Separation.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Multimodal Attention for Fusion of Audio and Spatiotemporal Features for Video Description.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017
Low-Latency approximation of bidirectional recurrent networks for speech denoising.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

2013
Noise adaptive optimization of matrix initialization for frequency-domain independent component analysis.
Digit. Signal Process., 2013

2011
Improving the Accuracy of Least-Squares Probabilistic Classifiers.
IEICE Trans. Inf. Syst., 2011

2010
Segmentation, Indexing, and Retrieval for Environmental and Natural Sounds.
IEEE Trans. Speech Audio Process., 2010

Direct Importance Estimation with a Mixture of Probabilistic Principal Component Analyzers.
IEICE Trans. Inf. Syst., 2010

An Ontological Framework for Retrieving Environmental Sounds Using Semantics and Acoustic Content.
EURASIP J. Audio Speech Music. Process., 2010

Acceleration of sequence kernel computation for real-time speaker identification.
Proceedings of the IEEE International Conference on Acoustics, 2010

Direct importance estimation with probabilistic principal component analyzers.
Proceedings of the IEEE International Conference on Acoustics, 2010

Automatic audio tagging using covariate shift adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2010

Combining semantic, social, and acoustic similarity for retrieval of environmental sounds.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Unifying semantic and content-based approaches for retrieval of environmental sounds.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Continuous observation and archival of acoustic scenes using wireless sensor networks.
Proceedings of the 16th International Conference on Digital Signal Processing, 2009

Multi-channel audio segmentation for continuous observation and archival of large spaces.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Fast query by example of environmental sounds via robust and efficient cluster-based indexing.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Environmentally adaptive acoustic transmission loss prediction in turbulent and nonturbulent atmospheres.
Neural Networks, 2007

An Operationally Adaptive System for Rapid Acoustic Transmission Loss Prediction.
Proceedings of the International Joint Conference on Neural Networks, 2007

Robust Multi-Features Segmentation and Indexing for Natural Sound Environments.
Proceedings of the International Workshop on Content-Based Multimedia Indexing, 2007

2006
An Environmentally Adaptive System for Rapid Acoustic Transmission Loss Prediction.
Proceedings of the International Joint Conference on Neural Networks, 2006


  Loading...