Bhiksha Raj
Orcid: 0000-0003-0038-5513Affiliations:
- Carnegie Mellon University, Pittsburgh, USA
According to our database1,
Bhiksha Raj
authored at least 396 papers
between 1995 and 2024.
Collaborative distances:
Collaborative distances:
Awards
IEEE Fellow
IEEE Fellow 2017, "For contributions to speech recognition".
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Comput. Speech Lang., 2024
CoRR, 2024
CoRR, 2024
Did You Hear That? Introducing AADG: A Framework for Generating Benchmark Data in Audio Anomaly Detection.
CoRR, 2024
ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech.
CoRR, 2024
CoRR, 2024
Emergent Interpretable Symbols and Content-Style Disentanglement via Variance-Invariance Constraints.
CoRR, 2024
From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking.
CoRR, 2024
ED-SAM: An Efficient Diffusion Sampling Approach to Domain Generalization in Vision-Language Foundation Models.
CoRR, 2024
CoRR, 2024
CoRR, 2024
AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition.
CoRR, 2024
CoRR, 2024
AugSumm: towards generalizable speech summarization using synthetic labels from large language model.
CoRR, 2024
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Improving Continual Learning of Acoustic Scene Classification via Mutual Information Optimization.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
AugSumm: Towards Generalizable Speech Summarization Using Synthetic Labels from Large Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2024
Dynamic-Superb: Towards a Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark For Speech.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
R<sup>2</sup>-Bench: Benchmarking the Robustness of Referring Perception Models Under Perturbations.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
QDFormer: Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
AI Mag., September, 2023
IEEE Trans. Pattern Anal. Mach. Intell., 2023
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation.
Int. J. Comput. Vis., 2023
FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding in Open World.
CoRR, 2023
LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model.
CoRR, 2023
CoRR, 2023
Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech.
CoRR, 2023
UTOPIA: Unconstrained Tracking Objects without Preliminary Examination via Cross-Domain Adaptation.
CoRR, 2023
Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations.
CoRR, 2023
CoRR, 2023
Approach to Learning Generalized Audio Representation Through Batch Embedding Covariance Regularization and Constant-Q Transforms.
CoRR, 2023
Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session.
CoRR, 2023
CoRR, 2023
CoRR, 2023
Fairness Continual Learning Approach to Semantic Scene Understanding in Open-World Environments.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
The Hidden Dance of Phonemes and Visage: Unveiling the Enigmatic Link between Phonemes and Facial Features.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
There is more than one kind of robustness: Fooling Whisper with adversarial examples.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
How Many Perturbations Break This Model? Evaluating Robustness Beyond Adversarial Accuracy.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Towards Noise-Tolerant Speech-Referring Video Object Segmentation: Bridging Speech and Text.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Espnet-Summ: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning.
CoRR, 2022
CoRR, 2022
CoRR, 2022
CoRR, 2022
Watch What You Pretrain For: Targeted, Transferable Adversarial Examples on Self-Supervised Speech Recognition models.
CoRR, 2022
CoRR, 2022
R^2VOS: Robust Referring Video Object Segmentation via Relational Multimodal Cycle Consistency.
CoRR, 2022
CoRR, 2022
CoRR, 2022
On the pragmatism of using binary classifiers over data intensive neural network classifiers for detection of COVID-19 from voice.
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the 56th Asilomar Conference on Signals, Systems, and Computers, ACSSC 2022, Pacific Grove, CA, USA, October 31, 2022
2021
Frontiers Comput. Neurosci., 2021
Constant Random Perturbations Provide Adversarial Robustness with Minimal Effect on Accuracy.
CoRR, 2021
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021
Detection and Evaluation of Human and Machine Generated Speech in Spoofing Attacks on Automatic Speaker Verification Systems.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Improving Weakly Supervised Sound Event Detection with Self-Supervised Auxiliary Tasks.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
FoolHD: Fooling Speaker Identification by Highly Imperceptible Adversarial Disturbances.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Proceedings of the 32nd British Machine Vision Conference 2021, 2021
2020
CoRR, 2020
Exploring Optimal DNN Architecture for End-to-End Beamformers Based on Time-frequency References.
CoRR, 2020
Efficient Integration of Multi-channel Information for Speaker-independent Speech Separation.
CoRR, 2020
Exploring the Best Loss Function for DNN-Based Low-latency Speech Enhancement with Temporal Convolutional Networks.
CoRR, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 17th IEEE International Conference on Mobile Ad Hoc and Sensor Systems, 2020
Automatic In-the-wild Dataset Annotation with Deep Generalized Multiple Instance Learning.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Proceedings of the Advances in Visual Computing - 15th International Symposium, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Proceedings of the Eleventh International Conference on Computational Creativity, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
IEEE ACM Trans. Audio Speech Lang. Process., 2019
CoRR, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the International Joint Conference on Neural Networks, 2019
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Proceedings of the 7th International Conference on Learning Representations, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Cross Modal Audio Search and Retrieval with Joint Embeddings Based on Text and Audio.
Proceedings of the IEEE International Conference on Acoustics, 2019
Optimizing Neural Network Embeddings Using a Pair-Wise Loss for Text-Independent Speaker Verification.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
2018
EURASIP J. Audio Speech Music. Process., 2018
CoRR, 2018
Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the Statistical Language and Speech Processing, 2018
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 17th IEEE International Conference on Machine Learning and Applications, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Acoustic Scene Classification Using Discrete Random Hashing for Laplacian Kernel Machines.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the Frontiers of Multimedia Research, 2018
2017
The Incredible Shrinking Neural Network: New Perspectives on Learning Representations Through The Lens of Pruning.
CoRR, 2017
Be Careful What You Backpropagate: A Case For Linear Output Activations & Gradient Boosting.
CoRR, 2017
CoRR, 2017
Proceedings of the 2017 IEEE Workshop on Information Forensics and Security, 2017
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Audio event and scene recognition: A unified approach using strongly and weakly labeled data.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 25th European Signal Processing Conference, 2017
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017
DCASE 2017 Task 1: Acoustic Scene Classification Using Shift-Invariant Kernels and Random Features.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
Topic and Prosodic Modeling for Interruption Management in Multi-User Multitasking Communication Interactions.
Proceedings of the 2017 AAAI Fall Symposia, Arlington, Virginia, USA, November 9-11, 2017, 2017
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017
2016
Binary Sparse Coding of Convolutive Mixtures for Sound Localization and Separation via Spatialization.
IEEE Trans. Signal Process., 2016
IEEE Trans. Inf. Theory, 2016
A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research.
EURASIP J. Adv. Signal Process., 2016
AudioSentibank: Large-scale Semantic Ontology of Acoustic Concepts for Audio Content Analysis.
CoRR, 2016
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 39th International Convention on Information and Communication Technology, 2016
Proceedings of the 4th International Conference on Biometrics and Forensics, 2016
Proceedings of the 4th International Conference on Biometrics and Forensics, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the Eighth International Conference on Information and Communication Technologies and Development, 2016
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016
Detecting Psychological Distress in Adults Through Transcriptions of Clinical Interviews.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016
Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016
The Best of BothWorlds: Combining Data-Independent and Data-Driven Approaches for Action Recognition.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016
2015
Compositional Models for Audio Processing: Uncovering the structure of sound mixtures.
IEEE Signal Process. Mag., 2015
A Survey: Time Travel in Deep Learning Space: An Introduction to Deep Learning Models and How Deep Learning Models Evolved from the Initial Ideas.
CoRR, 2015
Proceedings of the 2015 IEEE International Workshop on Information Forensics and Security, 2015
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015
Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015
Rapid development of public health education systems in low-literacy multilingual environments: combating ebola through voice messaging.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Reducing communication overhead in distributed learning by an order of magnitude (almost).
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015
Efficient autism spectrum disorder prediction with eye movement: A machine learning framework.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
2014
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014
Proceedings of the Proceeding of the 1st International Workshop on Privacy-Preserving IR: When Information Retrieval Meets Privacy and Security co-located with 37th Annual International ACM SIGIR conference, 2014
Proceedings of the 37th International Convention on Information and Communication Technology, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Iterative Bayesian word segmentation for unsupervised vocabulary discovery from phoneme lattices.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the 22nd European Signal Processing Conference, 2014
Proceedings of the 22nd European Signal Processing Conference, 2014
2013
IEEE Trans. Speech Audio Process., 2013
Privacy-Preserving Speaker Verification and Identification Using Gaussian Mixture Models.
IEEE Trans. Speech Audio Process., 2013
Privacy-Preserving Speech Processing: Cryptographic and String-Matching Frameworks Show Promise.
IEEE Signal Process. Mag., 2013
Measuring prevalence of other-oriented transactive contributions using an automated measure of speech style accommodation.
Int. J. Comput. Support. Collab. Learn., 2013
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013
Swara Histogram Based Structural Analysis And Identification Of Indian Classical Ragas.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013
Secure binary embeddings of front-end factor analysis for privacy preserving speaker verification.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Discriminatively trained dependency language modeling for conversational speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Scale independent raga identification using chromagram patterns and swara based features.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the 21st European Signal Processing Conference, 2013
Event detection in short duration audio using Gaussian Mixture Model and Random Forest Classifier.
Proceedings of the 21st European Signal Processing Conference, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
2012
IEEE Trans. Dependable Secur. Comput., 2012
IEEE Trans. Speech Audio Process., 2012
Microphone Array Processing for Distant Speech Recognition: From Close-Talking Microphones to Far-Field Sensors.
IEEE Signal Process. Mag., 2012
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012
Proceedings of the Information Security - 15th International Conference, 2012
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012
Microphone Array Post-filter based on Spatially-Correlated Noise Measurements for Distant Speech Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012
Proceedings of the Future of Learning: Proceedings of the 10th International Conference of the Learning Sciences, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
An Unsupervised Dynamic Bayesian Network Approach to Measuring Speech Style Accommodation.
Proceedings of the EACL 2012, 2012
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
Microphone array processing for distant speech recognition: Towards real-world deployment.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012
2011
J. Signal Process. Syst., 2011
Trans. Data Priv., 2011
A Unifying Analysis of Projected Gradient Descent for $ell_p$-constrained Least Squares
CoRR, 2011
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011
Learning contextual relevance of audio segments using discriminative models over AUD sequences.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011
Proceedings of the SIGDIAL 2011 Conference, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
A Paradigm for Limited Vocabulary Speech Recognition Based on Redundant Spectro-Temporal Feature Sets.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Unsupervised Learning of Acoustic Unit Descriptors for Audio Content Representation and Classification.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the 19th European Signal Processing Conference, 2011
Proceedings of the 9th International Conference on Computer Supported Collaborative Learning, 2011
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
Proceedings of the Conference Record of the Forty Fifth Asilomar Conference on Signals, 2011
Reconstructing Noise-Corrupted Spectrographic Components for Robust Speech Recognition.
Proceedings of the Robust Speech Recognition of Uncertain or Missing Data, 2011
2010
Proceedings of the Privacy and Security Issues in Data Mining and Machine Learning, 2010
Proceedings of the Privacy and Security Issues in Data Mining and Machine Learning, 2010
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Non-negative matrix factorization based compensation of music for automatic speech recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Latent-variable decomposition based dereverberation of monaural and multi-channel signals.
Proceedings of the IEEE International Conference on Acoustics, 2010
A hybrid physical and statistical dynamic articulatory framework incorporating analysis-by-synthesis for improved phone classification.
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the Latent Variable Analysis and Signal Separation, 2010
2009
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009
Signal separation for robust speech recognition based on phase difference information obtained in the frequency domain.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Towards fusion of feature extraction and acoustic model training: a top down process for robust speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Deriving vocal tract shapes from electromagnetic articulograph data via geometric adaptation and matching.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Probabilistic Factorization of Non-negative Data with Entropic Co-occurrence Constraints.
Proceedings of the Independent Component Analysis and Signal Separation, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
A joint decoding algorithm for multiple-example-based addition of words to a pronunciation lexicon.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the Advances in Information Retrieval, 2009
2008
Comput. Intell. Neurosci., 2008
Regularized non-negative matrix factorization with temporal dependencies for speech denoising.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008
2007
IEEE Trans. Speech Audio Process., 2007
IEEE Signal Process. Lett., 2007
EURASIP J. Audio Speech Music. Process., 2007
Proceedings of the Advances in Neural Information Processing Systems 20, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the Independent Component Analysis and Signal Separation, 2007
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007
Proceedings of the Fourth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2007
2006
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
A Comparison Between Spoken Queries and Menu-Based Interfaces for In-car Digital Music Selection.
Proceedings of the Human-Computer Interaction, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Feature compensation with secondary sensor measurements for robust speech recognition.
Proceedings of the 13th European Signal Processing Conference, 2005
Proceedings of the Speech Separation by Humans and Machines, 2005
2004
IEEE Trans. Speech Audio Process., 2004
A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition.
Speech Commun., 2004
Speech Commun., 2004
Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
IEEE Signal Process. Lett., 2003
Classifier-based non-linear projection for adaptive endpointing of continuous speech.
Comput. Speech Lang., 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
IEEE Trans. Speech Audio Process., 2002
The MERL SpokenQuery information retrieval system a system for retrieving pertinent documents from a spoken query.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002
Speech recognizer-based microphone array processing for robust hands-free speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2002
2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Speech in Noisy Environments: robust automatic segmentation, feature extraction, and hypothesis combination.
Proceedings of the IEEE International Conference on Acoustics, 2001
2000
Structured redefinition of sound units by merging and splitting for improved speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Classifier-based mask estimation for missing feature methods of robust speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the IEEE International Conference on Acoustics, 2000
1999
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Automatic clustering and generation of contextual questions for tied states in hidden Markov models.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999
1998
Speech Commun., 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
1997
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
1996
Cepstral compensation by polynomial approximation for environment-independent speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
1995
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Proceedings of the 1995 International Conference on Acoustics, 1995