Paris Smaragdis

Affiliations:
  • MIT, Cambridge, USA


According to our database, Paris Smaragdis authored at least 161 papers between 1996 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Awards

IEEE Fellow 2015, "For contributions to audio source separation and audio processing".

Bibliography

2024
On Class Separability Pitfalls In Audio-Text Contrastive Zero-Shot Learning.
CoRR, 2024

Rethinking Non-Negative Matrix Factorization with Implicit Neural Representations.
CoRR, 2024

Scaling Up Adaptive Filter Optimizers.
CoRR, 2024

Noise-Robust DSP-Assisted Neural Pitch Estimation With Very Low Complexity.
Proceedings of the IEEE International Conference on Acoustics, 2024

Sound Source Separation Using Latent Variational Block-Wise Disentanglement.
Proceedings of the IEEE International Conference on Acoustics, 2024

Meta-AF Echo Cancellation for Improved Keyword Spotting.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Audio Signal Processing in the 21st Century: The important outcomes of the past 25 years.
IEEE Signal Process. Mag., July, 2023

Meta-AF: Meta-Learning for Adaptive Filters.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Audio Editing with Non-Rigid Text Prompts.
CoRR, 2023

Unsupervised Improvement of Audio-Text Cross-Modal Representations.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Complete and Separate: Conditional Separation with Missing Target Source Attribute Completion.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Low-Complexity Streaming Speech Super-Resolution.
Proceedings of the 33rd IEEE International Workshop on Machine Learning for Signal Processing, 2023

A Framework for Unified Real-Time Personalized and Non-Personalized Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2023

Optimal Condition Training for Target Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Framewise WaveGAN: High Speed Adversarial Vocoder In Time Domain With Very Low Computational Complexity.
Proceedings of the IEEE International Conference on Acoustics, 2023

Generative Modeling Based Manifold Learning for Adaptive Filtering Guidance.
Proceedings of the IEEE International Conference on Acoustics, 2023

Latent Iterative Refinement for Modular Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Compute and Memory Efficient Universal Sound Source Separation.
J. Signal Process. Syst., 2022

Learning Representations for New Sound Classes With Continual Self-Supervised Learning.
IEEE Signal Process. Lett., 2022

RemixIT: Continual Self-Training of Speech Enhancement Models via Bootstrapped Remixing.
IEEE J. Sel. Top. Signal Process., 2022

Don't Look Up: Ubiquitous Data Exfiltration Pathways in Commercial Spaces.
CoRR, 2022

Semi-supervised Time Domain Target Speaker Extraction with Attention.
CoRR, 2022

Meta-Learning for Adaptive Filters with higher-order Frequency Dependencies.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

Mechatronic Generation of Datasets for Acoustics Research.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Heterogeneous Target Speech Separation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

End-to-end LPCNet: A Neural Vocoder With Fully-Differentiable LPC Estimation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Neural Speech Synthesis on a Shoestring: Improving the Efficiency of LPCNet.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Sound Event Detection with Adaptive Frequency Selection.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Separate But Together: Unsupervised Federated Learning for Speech Enhancement from Non-IID Data.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Point Cloud Audio Processing.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Auto-DSP: Learning to Optimize Acoustic Echo Cancellers.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Optimizing Short-Time Fourier Transform Parameters via Gradient Descent.
Proceedings of the IEEE International Conference on Acoustics, 2021

Unified Gradient Reweighting for Model Biasing with Applications to Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Differentiable Signal Processing With Black-Box Audio Effects.
Proceedings of the IEEE International Conference on Acoustics, 2021

Communication-Cost Aware Microphone Selection for Neural Speech Enhancement with Ad-Hoc Microphone Arrays.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Self-supervised Learning for Speech Enhancement.
CoRR, 2020

Sudo RM -RF: Efficient Networks for Universal Audio Source Separation.
Proceedings of the 30th IEEE International Workshop on Machine Learning for Signal Processing, 2020

End-To-End Non-Negative Autoencoders for Sound Source Separation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Two-Step Sound Source Separation: Training On Learned Latent Targets.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

One-Shot Parametric Audio Production Style Transfer with Application to Frequency Equalization.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Continual Learning of New Sound Classes Using Generative Replay.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

A Style Transfer Approach to Source Separation.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Deep Tensor Factorization for Spatially-Aware Scene Decomposition.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Unsupervised Deep Clustering for Source Separation: Direct Learning from Mixtures Using Spatial Information.
Proceedings of the IEEE International Conference on Acoustics, 2019

Majorization-minimization Algorithms for Convolutive NMF with the Beta-divergence.
Proceedings of the IEEE International Conference on Acoustics, 2019

Multi-view Networks for Multi-channel Audio Classification.
Proceedings of the IEEE International Conference on Acoustics, 2019

VoiceAssist: Guiding Users to High-Quality Voice Recordings.
Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, 2019

2018
End-to-end Networks for Supervised Single-channel Speech Separation.
CoRR, 2018

Learning the Base Distribution in Implicit Generative Models.
CoRR, 2018

Multi-View Networks for Denoising of Arbitrary Numbers of Channels.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Generative Adversarial Source Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Blind Estimation of the Speech Transmission Index for Speech Quality Prediction.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Bitwise Neural Networks for Efficient Single-Channel Source Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Performance Based Cost Functions for End-to-End Speech Separation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

End-To-End Source Separation With Adaptive Front-Ends.
Proceedings of the 52nd Asilomar Conference on Signals, Systems, and Computers, 2018

Audition for multimedia computing.
Proceedings of the Frontiers of Multimedia Research, 2018

2017
Neural Network Alternatives to Convolutive Audio Models for Source Separation.
CoRR, 2017

End-to-end Source Separation with Adaptive Front-Ends.
CoRR, 2017

Diagonal RNNs in symbolic music modeling.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Towards end-to-end polyphonic music transcription: Transforming music audio directly to a score.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

AutoDub: Automatic Redubbing for Voiceover Editing.
Proceedings of the 30th Annual ACM Symposium on User Interface Software and Technology, 2017

Neural network alternatives to convolutive audio models for source separation.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

A neural network alternative to non-negative audio models.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Robust Source Localization and Enhancement With a Probabilistic Steered Response Power Model.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Bitwise Neural Networks.
CoRR, 2016

NoiseOut: A Simple Way to Prune Neural Networks.
CoRR, 2016

Efficient neighborhood-based topic modeling for collaborative audio enhancement on massive crowdsourced recordings.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Guest Editorial: Machine Learning for Signal Processing.
J. Signal Process. Syst., 2015

A State-Space Approach to Dynamic Nonnegative Matrix Factorization.
IEEE Trans. Signal Process., 2015

Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Compositional Models for Audio Processing: Uncovering the structure of sound mixtures.
IEEE Signal Process. Mag., 2015

Mixtures of Local Dictionaries for Unsupervised Speech Enhancement.
IEEE Signal Process. Lett., 2015

A Dictionary Learning Approach for Factorial Gaussian Models.
CoRR, 2015

Directional NMF for joint source localization and separation.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Method of moments learning for left-to-right Hidden Markov models.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Predicting Skill-Based Task Performance and Learning with fMRI Motor and Subcortical Network Connectivity.
Proceedings of the 2015 International Workshop on Pattern Recognition in NeuroImaging, 2015

Joint acoustic and spectral modeling for speech dereverberation using non-negative representations.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Efficient manifold preserving audio source separation using locality sensitive hashing.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Adaptive Denoising Autoencoders: A Fine-Tuning Scheme to Learn from Test Mixtures.
Proceedings of the Latent Variable Analysis and Signal Separation, 2015

2014
Multichannel source separation and tracking with RANSAC and directional statistics.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Static and Dynamic Source Separation Using Nonnegative Factorizations: A unified view.
IEEE Signal Process. Mag., 2014

Spectral Learning of Mixture of Hidden Markov Models.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Multiple speaker tracking with the Factorial von Mises-Fisher Filter.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2014

Singing-Voice Separation from Monaural Recordings using Deep Recurrent Neural Networks.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Experiments on deep learning for speech denoising.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Robust interchannel phase difference modeling with wrapped regression splines.
Proceedings of the IEEE 8th Sensor Array and Multichannel Signal Processing Workshop, 2014

Phase and level difference fusion for robust multichannel source separation.
Proceedings of the IEEE International Conference on Acoustics, 2014

Deep learning for monaural speech separation.
Proceedings of the IEEE International Conference on Acoustics, 2014

Efficient model selection for speech enhancement using a deflation method for Nonnegative Matrix Factorization.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

2013
Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization.
IEEE Trans. Speech Audio Process., 2013

Privacy-Preserving Speech Processing: Cryptographic and String-Matching Frameworks Show Promise.
IEEE Signal Process. Mag., 2013

A Wrapped Kalman Filter for Azimuthal Speaker Tracking.
IEEE Signal Process. Lett., 2013

Keynote addresses: From auditory masking to binary classification: Machine learning for speech separation.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Non-negative matrix factorization for irregularly-spaced transforms.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

About this non-negative business.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Low-artifact source separation using probabilistic latent component analysis.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Simultaneous noise classification and reduction using a priori learned models.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

Single channel source separation using smooth Nonnegative Matrix Factorization with Markov Random Fields.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

EMERALD: Characterization of emerging applications and algorithms for low-power devices.
Proceedings of the 2012 IEEE International Symposium on Performance Analysis of Systems & Software, 2013

Manifold Preserving Hierarchical Topic Models for Quantization and Approximation.
Proceedings of the 30th International Conference on Machine Learning, 2013

Blind multi-channel source separation by circular-linear statistical modeling of phase differences.
Proceedings of the IEEE International Conference on Acoustics, 2013

Prediction based filtering and smoothing to exploit temporal dependencies in NMF.
Proceedings of the IEEE International Conference on Acoustics, 2013

Collaborative audio enhancement using probabilistic latent component sharing.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
The Markov selection model for concurrent speech recognition.
Neurocomputing, 2012

Optimal cost function and magnitude power for NMF-based speech separation and music interpolation.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2012

Stereophonic spectrogram segmentation using Markov random fields.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2012

Second Fiddle is Important Too: Pitch Tracking Individual Voices in Polyphonic Music.
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

Speech Enhancement by Online Non-negative Spectrogram Decomposition in Non-stationary Noise Environments.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Following musical sources by example.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Noise-robust dynamic time warping using PLCA features.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Singing-voice separation from monaural recordings using robust principal component analysis.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Clustering and synchronizing multi-camera video via landmark cross-correlation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Sound Recognition in Mixtures.
Proceedings of the Latent Variable Analysis and Signal Separation, 2012

A Non-negative Approach to Language Informed Speech Separation.
Proceedings of the Latent Variable Analysis and Signal Separation, 2012

Online PLCA for Real-Time Semi-supervised Source Separation.
Proceedings of the Latent Variable Analysis and Signal Separation, 2012

An amplitude spectral Capon estimator with a variable filter length.
Proceedings of the 20th European Signal Processing Conference, 2012

Extraction of Speech from Mixture Signals.
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012

2011
Missing Data Imputation for Time-Frequency Representations of Audio Signals.
J. Signal Process. Syst., 2011

Preface.
Speech Commun., 2011

Polyphonic pitch tracking by example.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

A convolutive spectral decomposition approach to the separation of feedback from target speech.
Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011

Approximate nearest-subspace representations for sound mixtures.
Proceedings of the IEEE International Conference on Acoustics, 2011

A non-negative approach to semi-supervised separation of speech from noise with the use of temporal dynamics.
Proceedings of the IEEE International Conference on Acoustics, 2011

An adaptive time-frequency resolution approach for Non-negative Matrix Factorization based single channel sound source separation.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Editorial for the Special Issue on Signal Models and Representations of Musical and Environmental Sounds.
IEEE Trans. Speech Audio Process., 2010

Scalable Audio-Content Analysis.
EURASIP J. Audio Speech Music. Process., 2010

Latent-variable decomposition based dereverberation of monaural and multi-channel signals.
Proceedings of the IEEE International Conference on Acoustics, 2010

Non-negative Hidden Markov Modeling of Audio with Application to Source Separation.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010

2009
Dynamic Range Extension Using Interleaved Gains.
IEEE Trans. Speech Audio Process., 2009

Separation by "humming": User-guided sound extraction from monophonic mixtures.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

User guided audio selection from complex sound mixtures.
Proceedings of the 22nd Annual ACM Symposium on User Interface Software and Technology, 2009

A Sparse Non-Parametric Approach for Single Channel Separation of Known Sounds.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

The UBS Virtual Maestro: an Interactive Conducting System.
Proceedings of the 9th International Conference on New Interfaces for Musical Expression, 2009

Probabilistic Factorization of Non-negative Data with Entropic Co-occurrence Constraints.
Proceedings of the Independent Component Analysis and Signal Separation, 2009

Relative pitch estimation of multiple instruments.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Probabilistic Latent Variable Models as Nonnegative Factorizations.
Comput. Intell. Neurosci., 2008

Advances in Nonnegative Matrix and Tensor Factorization.
Comput. Intell. Neurosci., 2008

Regularized non-negative matrix factorization with temporal dependencies for speech denoising.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Speech denoising using nonnegative matrix factorization with priors.
Proceedings of the IEEE International Conference on Acoustics, 2008

Sparse and shift-invariant feature extraction from non-negative data.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
A Framework for Secure Speech Recognition.
IEEE Trans. Speech Audio Process., 2007

Position and Trajectory Learning for Microphone Arrays.
IEEE Trans. Speech Audio Process., 2007

Convolutive Speech Bases and Their Application to Supervised Speech Separation.
IEEE Trans. Speech Audio Process., 2007

Sparse Overcomplete Latent Variable Decomposition of Counts Data.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Sparse Overcomplete Decomposition for Single Channel Speaker Separation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Bandwidth Expansion with a Pólya Urn Model.
Proceedings of the IEEE International Conference on Acoustics, 2007

Supervised and Semi-supervised Separation of Sounds from Single-Channel Mixtures.
Proceedings of the Independent Component Analysis and Signal Separation, 2007

Sensor and Data Systems, Audio-Assisted Cameras and Acoustic Doppler Sensors.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Probabilistic Decompositions of Spectra for Sound Separation.
Proceedings of the Blind Speech Separation, 2007

2006
Secure Sound Classification: Gaussian Mixture Models.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Latent Dirichlet Decomposition for Single Channel Speaker Separation.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Recognizing speech from simultaneous speakers.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Bandwidth expansion of narrowband speech using non-negative matrix factorization.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Exploiting Redundancy to Construct Listening Systems.
Proceedings of the Speech Separation by Humans and Machines, 2005

2004
Discovering auditory objects through non-negativity constraints.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

Non-negative Matrix Factor Deconvolution; Extraction of Multiple Sound Sources from Monophonic Inputs.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2004

2002
Combining Musical and Cultural Features for Intelligent Style Detection.
Proceedings of the ISMIR 2002, 2002

2000
Scanned Synthesis: An Introduction and Demonstration of a New Synthesis and Signal Processing Technique.
Proceedings of the 2000 International Computer Music Conference, 2000

1998
Blind separation of convolved mixtures in the frequency domain.
Neurocomputing, 1998

1996
NetSound: Realtime Audio from Semantic Descriptions.
Proceedings of the 1996 International Computer Music Conference, 1996

