Jean-Marc Valin

Jan Büthe

IEEE Signal Process. Lett., 2024

Real-Time Stereo Speech Enhancement with Spatial-Cue Preservation Based on Dual-Path Structure.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Noise-Robust DSP-Assisted Neural Pitch Estimation With Very Low Complexity.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

NOLACE: Improving Low-Complexity Speech Codec Enhancement Through Adaptive Temporal Shaping.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Lace: A Light-Weight, Causal Model for Enhancing Coded Speech Through Adaptive Convolutions.

[BibT_eX]

[DOI]

Jan Büthe

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

A Framework for Unified Real-Time Personalized and Non-Personalized Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Low-Bitrate Redundancy Coding of Speech Using A Rate-Distortion-Optimized Variational Autoencoder.

[BibT_eX]

[DOI]

Jan Büthe

Proceedings of the IEEE International Conference on Acoustics, 2023

Framewise Wavegan: High Speed Adversarial Vocoder In Time Domain With Very Low Computational Complexity.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Semi-supervised Time Domain Target Speaker Extraction with Attention.

[BibT_eX]

[DOI]

Zhepei Wang

Ritwik Giri

Shrikant Venkataramani

CoRR, 2022

To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets.

[BibT_eX]

[DOI]

Ritwik Giri

Shrikant Venkataramani

Umut Isik

Arvindh Krishnaswamy

CoRR, 2022

Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model.

[BibT_eX]

[DOI]

Michael Klingbeil

Paris Smaragdis

Arvindh Krishnaswamy

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

End-to-end LPCNet: A Neural Vocoder With Fully-Differentiable LPC Estimation.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Improved Singing Voice Separation with Chromagram-Based Pitch-Aware Remixing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Neural Speech Synthesis on a Shoestring: Improving the Efficiency of Lpcnet.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Personalized PercepNet: Real-Time, Low-Complexity Target Voice Separation and Enhancement.

[BibT_eX]

[DOI]

Ritwik Giri

Shrikant Venkataramani

Umut Isik

Arvindh Krishnaswamy

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Multi-Channel Opus Compression for Far-Field Automatic Speech Recognition with a Fixed Bitrate Budget.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Semi-Supervised Singing Voice Separation With Noisy Self-Training.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Low-Complexity, Real-Time Joint Neural Echo Control and Speech Enhancement Based On Percepnet.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Enhancing into the Codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Improving Opus Low Bit Rate Quality with Neural Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019

A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet.

[BibT_eX]

[DOI]

CoRR, 2019

A Real-Time Wideband Neural Vocoder at 1.6kb/s Using LPCNet.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

LPCNET: Improving Neural Speech Synthesis through Linear Prediction.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

An Overview of Core Coding Tools in the AV1 Video Codec.

[BibT_eX]

[DOI]

Proceedings of the 2018 Picture Coding Symposium, 2018

A Hybrid DSP/Deep Learning Approach to Real-Time Full-Band Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 20th IEEE International Workshop on Multimedia Signal Processing, 2018

The Av1 Constrained Directional Enhancement Filter (Cdef).

[BibT_eX]

[DOI]

Steinar Midtskogen

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Updates to the Opus Audio Codec.

[BibT_eX]

[DOI]

RFC, October, 2017

2016

WebRTC Audio Codec and Processing Requirements.

[BibT_eX]

[DOI]

Cary Bran

RFC, May, 2016

Robust Recognition of Simultaneous Speech By a Mobile Robot.

[BibT_eX]

[DOI]

CoRR, 2016

Low-Complexity Iterative Sinusoidal Parameter Estimation.

[BibT_eX]

[DOI]

Daniel V. Smith

CoRR, 2016

High-Quality, Low-Delay Music Coding in the Opus Codec.

[BibT_eX]

[DOI]

Gregory Maxwell

CoRR, 2016

Improved Noise Weighting in CELP Coding of Speech - Applying the Vorbis Psychoacoustic Model To Speex.

[BibT_eX]

[DOI]

CoRR, 2016

Bandwidth Extension of Narrowband Speech for Low Bit-Rate Wideband Coding.

[BibT_eX]

[DOI]

Roch Lefebvre

CoRR, 2016

Interference-Normalised Least Mean Square Algorithm.

[BibT_eX]

[DOI]

CoRR, 2016

Channel Decorrelation For Stereo Acoustic Echo Cancellation In High-Quality Audio Communication.

[BibT_eX]

[DOI]

CoRR, 2016

Speex: A Free Codec For Free Speech.

[BibT_eX]

[DOI]

CoRR, 2016

Perceptually-Motivated Nonlinear Channel Decorrelation For Stereo Acoustic Echo Cancellation.

[BibT_eX]

[DOI]

CoRR, 2016

Extension spectrale d'un signal de parole de la bande téléphonique à la bande AM.

[BibT_eX]

[DOI]

CoRR, 2016

Auditory System for a Mobile Robot.

[BibT_eX]

[DOI]

CoRR, 2016

The Daala Directional Deringing Filter.

[BibT_eX]

[DOI]

CoRR, 2016

Daala: A Perceptually-Driven Next Generation Video Codec.

[BibT_eX]

[DOI]

CoRR, 2016

Perceptually-Driven Video Coding with the Daala Video Codec.

[BibT_eX]

[DOI]

CoRR, 2016

Daala: Building a next-generation video codec from unconventional technology.

[BibT_eX]

[DOI]

Nathan E. Egge

Thomas J. Daede

Yushin Cho

Michael Bebenita

Proceedings of the 18th IEEE International Workshop on Multimedia Signal Processing, 2016

Daala: A perceptually-driven still picture codec.

[BibT_eX]

[DOI]

Nathan E. Egge

Thomas J. Daede

Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

2015

RTP Payload Format for the Opus Speech and Audio Codec.

[BibT_eX]

[DOI]

Julian Spittka

RFC, June, 2015

Perceptual vector quantization for video coding.

[BibT_eX]

[DOI]

Proceedings of the Visual Information Processing and Communication VI, 2015

Predicting chroma from luma with frequency domain intra prediction.

[BibT_eX]

[DOI]

Nathan E. Egge

Proceedings of the Visual Information Processing and Communication VI, 2015

2012

Definition of the Opus Audio Codec.

[BibT_eX]

[DOI]

RFC, September, 2012

Guidelines for Development of an Audio Codec within the IETF.

[BibT_eX]

[DOI]

Slava Borilin

Raymond (Juin-Hwey) Chen

RFC, March, 2012

Guidelines for the Use of Variable Bit Rate Audio with Secure RTP.

[BibT_eX]

[DOI]

Colin Perkins

RFC, March, 2012

Integration of sound source localization and separation to improve Dialogue Management on a robot.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

2011

Requirements for an Internet Audio Codec.

[BibT_eX]

[DOI]

RFC, August, 2011

2010

A High-Quality Speech and Audio Codec With Less Than 10-ms Delay.

[BibT_eX]

[DOI]

Gregory Maxwell

IEEE Trans. Speech Audio Process., 2010

An iterative linearised solution to the sinusoidal parameter estimation problem.

[BibT_eX]

[DOI]

Daniel V. Smith

Comput. Electr. Eng., 2010

2009

RTP Payload Format for the Speex Codec.

[BibT_eX]

[DOI]

RFC, June, 2009

Evaluating real-time audio localization algorithms for artificial audition in robotics.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Reflected Simplex Codebooks for Limited Feedback MIMO Beamforming.

[BibT_eX]

[DOI]

Daniel J. Ryan

Proceedings of IEEE International Conference on Communications, 2009

Priority Based Dynamic Rate Control for VoIP Traffic.

[BibT_eX]

[DOI]

Fariza Sabrina

Proceedings of the Global Communications Conference, 2009. GLOBECOM 2009, Honolulu, Hawaii, USA, 30 November, 2009

A full-bandwidth audio codec with low complexity and very low delay.

[BibT_eX]

[DOI]

Gregory Maxwell

Proceedings of the 17th European Signal Processing Conference, 2009

2008

Embedded auditory system for small mobile robots.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Adaptive Rate Control for Aggregated VoIP Traffic.

[BibT_eX]

[DOI]

Fariza Sabrina

Proceedings of the Global Communications Conference, 2008. GLOBECOM 2008, New Orleans, LA, USA, 30 November, 2008

2007

Robust Recognition of Simultaneous Speech by a Mobile Robot.

[BibT_eX]

[DOI]

IEEE Trans. Robotics, 2007

On Adjusting the Learning Rate in Frequency Domain Echo Cancellation With Double-Talk.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2007

Interference-Normalized Least Mean Square Algorithm.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2007

Robust localization and tracking of simultaneous moving sound sources using beamforming and particle filtering.

[BibT_eX]

[DOI]

Robotics Auton. Syst., 2007

Spartacus attending the 2005 AAAI conference.

[BibT_eX]

[DOI]

Auton. Robots, 2007

A New Robust Frequency Domain Echo Canceller with Closed-Loop Learning Rate Adaptation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Design and implementation of a robot audition system for automatic speech recognition of simultaneous speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006

Recognition of Simultaneous Speech by Estimating Reliability of Separated Signals for Robot Audition.

[BibT_eX]

[DOI]

Proceedings of the PRICAI 2006: Trends in Artificial Intelligence, 2006

Real-Time Robot Audition System That Recognizes Simultaneous Speech in The Real World.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006

Leak energy based missing feature mask generation for ICA and GSS and its evaluation with simultaneous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2006

Genetic Algorithm-Based Improvement of Robot Hearing Capabilities in Separating and Recognizing Simultaneous Speech Signals.

[BibT_eX]

[DOI]

Proceedings of the Advances in Applied Artificial Intelligence, 2006

Robust 3D Localization and Tracking of Sound Sources Using Beamforming and Particle Filtering.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Embedded and Integrated Audition for a Mobile Robot.

[BibT_eX]

[DOI]

Proceedings of the Aurally Informed Performance: Integrating Machine Listening and Auditory Presentation in Robotic Systems, 2006

2005

Modularity and integration in the design of a socially interactive robot.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Workshop on Robot and Human Interactive Communication, 2005

Making a robot recognize three simultaneous sentences in real-time.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

Multiple moving speaker tracking by microphone array on mobile robot.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Enhanced Robot Speech Recognition Based on Microphone Array Source Separation and Missing Feature Theory.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Robotics and Automation, 2005

A Brochette of Socially Interactive Robots.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2005

2004

Autonomous Mobile Robot That Can Read.

[BibT_eX]

[DOI]

Dominic Létourneau

EURASIP J. Adv. Signal Process., 2004

Enhanced robot audition based on microphone array source separation with post-filter.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004

Code reusability tools for programming mobile robots.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004

Localization of Simultaneous Moving Sound Sources for Mobile Robot Using a Frequency- Domain Steered Beamformer Approach.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Robotics and Automation, 2004

Autonomous Initialization of Robot Formations.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Robotics and Automation, 2004

Microphone array post-filter for separation of simultaneous non-stationary sources.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Making a mobile robot read textual messages.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Systems, 2003

Robust sound source localization using a microphone array on a mobile robot.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems, Las Vegas, Nevada, USA, October 27, 2003

Textual message read by a mobile robot.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems, Las Vegas, Nevada, USA, October 27, 2003

2002

Dynamic robot formations using directional visual perception.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, Lausanne, Switzerland, September 30, 2002

1999

On the limits of speech recognition in noise.

[BibT_eX]

[DOI]

Stephen D. Peters

Peter Stubley