Richard M. Stern
Orcid: 0000-0003-0557-7282Affiliations:
- Carnegie Mellon University, Electrical and Computer Engineering, Pittsburgh, PA, USA
According to our database1,
Richard M. Stern
authored at least 163 papers
between 1983 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
-
on cs.cmu.edu
On csauthors.net:
Bibliography
2024
Modeling Analog Dynamic Range Compressors using Deep Learning and State-space Models.
CoRR, 2024
2023
Sensors, September, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Reducing the Cost of Spoof Detection Labeling using Mixed-Strategy Active Learning and Pretrained Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
Investigating the Important Temporal Modulations for Deep-Learning-Based Speech Activity Detection.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
The Application of Learnable STRF Kernels to the 2021 Fearless Steps Phase-03 SAD Challenge.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
2019
On combining features for single-channel robust speech recognition in reverberant environments.
CoRR, 2019
Weighted delay-and-sum beamforming guided by visual tracking for human-robot interaction.
CoRR, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
An improved DNN-based spectral feature mapping that removes noise and reverberation for robust automatic speech recognition.
CoRR, 2018
Exploring the robustness of features and enhancement on speech recognition systems in highly-reverberant real environments.
CoRR, 2018
A Comparative Study of Spatial Speech Separation Techniques to Improve Speech Recognition.
Proceedings of the Advances in Neural Networks - ISNN 2018, 2018
A Priori SNR Estimation Based on a Recurrent Neural Network for Robust Speech Enhancement.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Sound Source Separation Using Phase Difference and Reliable Mask Selection Selection.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
IEEE Signal Process. Lett., 2017
Locally Normalized Filter Banks Applied to Deep Neural-Network-Based Robust Speech Recognition.
IEEE Signal Process. Lett., 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017
2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
A Subband-Based Stationary-Component Suppression Method Using Harmonics and Power Ratio for Reverberant Speech Recognition.
IEEE Signal Process. Lett., 2016
The Use of Locally Normalized Cepstral Coefficients (LNCC) to Improve Speaker Recognition Accuracy in Highly Reverberant Rooms.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Fusion Strategies for Robust Speech Recognition and Keyword Spotting for Channel- and Noise-Degraded Speech.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
2015
IEEE J. Sel. Top. Signal Process., 2015
A perceptually-motivated low-complexity instantaneous linear channel normalization technique applied to speaker verification.
Comput. Speech Lang., 2015
Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Robustness to additive noise of locally-normalized cepstral coefficients in speaker verification.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
Optimization of the parameters characterizing sigmoidal rate-level functions based on acoustic features.
Speech Commun., 2014
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014
Robust speech recognition in reverberant environments using subband-based steady-state monaural and binaural suppression.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
2012
IEEE Trans. Speech Audio Process., 2012
Hearing Is Believing: Biologically Inspired Methods for Robust Automatic Speech Recognition.
IEEE Signal Process. Mag., 2012
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Two-microphone source separation algorithm based on statistical modeling of angle distributions.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Histogram-based subband powerwarping and spectral averaging for robust speech recognition under matched and multistyle training.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012
2011
Mask classification for missing-feature reconstruction for robust speech recognition in unknown background noise.
Speech Commun., 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
Automatic selection of thresholds for signal separation algorithms based on interaural delay.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Maximum-likelihood-based cepstral inverse filtering for blind speech dereverberation.
Proceedings of the IEEE International Conference on Acoustics, 2010
Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring.
Proceedings of the IEEE International Conference on Acoustics, 2010
A hybrid physical and statistical dynamic articulatory framework incorporating analysis-by-synthesis for improved phone classification.
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
Spatial separation of speech signals using amplitude estimation based on interaural comparisons of zero-crossings.
Speech Commun., 2009
Feature extraction for robust speech recognition using a power-law nonlinearity and power-bias subtraction.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Signal separation for robust speech recognition based on phase difference information obtained in the frequency domain.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Towards fusion of feature extraction and acoustic model training: a top down process for robust speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Unsupervised training scheme with non-stereo data for empirical feature vector compensation.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Deriving vocal tract shapes from electromagnetic articulograph data via geometric adaptation and matching.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Power function-based power distribution normalization algorithm for robust speech recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
2008
Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Analysis of physiologically-motivated signal processing for robust speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Environment-invariant compensation for reverberation using linear post-filtering for minimum distortion.
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
"polyaural" array processing for automatic speech recognition in degraded environments.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Missing Feature Speech Recognition using Dereverberation and Echo Suppression in Reverberant Environments.
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
2006
Subband Likelihood-Maximizing Beamforming for Speech Recognition in Reverberant Environments.
IEEE Trans. Speech Audio Process., 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Physiologically-motivated synchrony-based processing for robust automatic speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Spatial Separation of Speech Signals Using Continuously-Variable Masks Estimated From Comparisons of Zero Crossings.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Band-Independent Mask Estimation for Missing-Feature Reconstruction in the Presence of Unknown Background Noise.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
IEEE Signal Process. Lett., 2005
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Signal Separation Motivated by Human Auditory Perception: Applications to Automatic Speech Recognition.
Proceedings of the Speech Separation by Humans and Machines, 2005
2004
IEEE Trans. Speech Audio Process., 2004
A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition.
Speech Commun., 2004
Speech Commun., 2004
Normalization of Time-Derivative Parameters for Robust Speech Recognition in Small Devices.
IEICE Trans. Inf. Syst., 2004
Proceedings of the MICAI 2004: Advances in Artificial Intelligence, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Parameter sharing in subband likelihood-maximizing beamforming for speech recognition using microphone arrays.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Feature generation based on maximum normalized acoustic likelihood for improved speech recognition.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Duration normalization and hypothesis combination for improved spontaneous speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Feature generation based on maximum classification probability for improved speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Subband parameter optimization of microphone arrays for speech recognition in reverberant environments.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Training of stream weights for the decoding of speech using parallel feature streams.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
IEEE Trans. Speech Audio Process., 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Speech recognizer-based microphone array processing for robust hands-free speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2002
2001
Speech Commun., 2001
Speech in Noisy Environments: robust automatic segmentation, feature extraction, and hypothesis combination.
Proceedings of the IEEE International Conference on Acoustics, 2001
Duration normalization for improved recognition of spontaneous and read speech via missing feature methods.
Proceedings of the IEEE International Conference on Acoustics, 2001
2000
Structured redefinition of sound units by merging and splitting for improved speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Classifier-based mask estimation for missing feature methods of robust speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Automatic subword unit refinement for spontaneous speech recognition via phone splitting.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Phone transition acoustic modeling: application to speaker independent and spontaneous speech systems.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Instantaneous-distortion based weighted acoustic modeling for robust recognition of coded speech.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the IEEE International Conference on Acoustics, 2000
Proceedings of the IEEE International Conference on Acoustics, 2000
1999
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Automatic clustering and generation of contextual questions for tied states in hidden Markov models.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999
1998
Speech Commun., 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
1997
Compensation for environmental and speaker variability by normalization of pole locations.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
1996
Cepstral compensation by polynomial approximation for environment-independent speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
1995
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Proceedings of the 1995 International Conference on Acoustics, 1995
Proceedings of the 1995 International Conference on Acoustics, 1995
1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Environmental robustness in automatic speech recognition using physiologic ally-motivated signal processing.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994
Environment normalization for robust speech recognition using direct cepstral comparison.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994
1993
Proceedings of the Human Language Technology: Proceedings of a Workshop Held at Plainsboro, 1993
Proceedings of the IEEE International Conference on Acoustics, 1993
1992
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992
Proceedings of the Second International Conference on Spoken Language Processing, 1992
Efficient joint compensation of speech for the effects of additive noise and linear filtering.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992
1991
Speaker adaptation in continuous speech recognition via estimation of correlated mean vectors.
Proceedings of the 1991 International Conference on Acoustics, 1991
Proceedings of the 1991 International Conference on Acoustics, 1991
1990
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990
Proceedings of the First International Conference on Spoken Language Processing, 1990
Proceedings of the 1990 International Conference on Acoustics, 1990
1989
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Cape Cod, 1989
1988
Proceedings of the IEEE International Conference on Acoustics, 1988
1987
IEEE Trans. Acoust. Speech Signal Process., 1987
Proceedings of the IEEE International Conference on Acoustics, 1987
1984
IEEE Trans. Pattern Anal. Mach. Intell., 1984
IEEE Trans. Pattern Anal. Mach. Intell., 1984
Proceedings of the IEEE International Conference on Acoustics, 1984
1983
Proceedings of the IEEE International Conference on Acoustics, 1983
Proceedings of the IEEE International Conference on Acoustics, 1983