Jean-Marc Valin

ORCID: 0000-0002-9883-6927

According to our database, Jean-Marc Valin authored at least 92 papers between 1999 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Very Low Complexity Speech Synthesis Using Framewise Autoregressive GAN (FARGAN) With Pitch Prediction.
IEEE Signal Process. Lett., 2024

Real-Time Stereo Speech Enhancement with Spatial-Cue Preservation Based on Dual-Path Structure.
Proceedings of the IEEE International Conference on Acoustics, 2024

Noise-Robust DSP-Assisted Neural Pitch Estimation With Very Low Complexity.
Proceedings of the IEEE International Conference on Acoustics, 2024

NOLACE: Improving Low-Complexity Speech Codec Enhancement Through Adaptive Temporal Shaping.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
LACE: A Light-Weight, Causal Model for Enhancing Coded Speech Through Adaptive Convolutions.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

A Framework for Unified Real-Time Personalized and Non-Personalized Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2023

Low-Bitrate Redundancy Coding of Speech Using A Rate-Distortion-Optimized Variational Autoencoder.
Proceedings of the IEEE International Conference on Acoustics, 2023

Framewise WaveGAN: High Speed Adversarial Vocoder In Time Domain With Very Low Computational Complexity.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Semi-supervised Time Domain Target Speaker Extraction with Attention.
CoRR, 2022

To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets.
CoRR, 2022

Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

End-to-end LPCNet: A Neural Vocoder With Fully-Differentiable LPC Estimation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Improved Singing Voice Separation with Chromagram-Based Pitch-Aware Remixing.
Proceedings of the IEEE International Conference on Acoustics, 2022

Neural Speech Synthesis on a Shoestring: Improving the Efficiency of LPCNet.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Personalized PercepNet: Real-Time, Low-Complexity Target Voice Separation and Enhancement.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Multi-Channel Opus Compression for Far-Field Automatic Speech Recognition with a Fixed Bitrate Budget.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Semi-Supervised Singing Voice Separation With Noisy Self-Training.
Proceedings of the IEEE International Conference on Acoustics, 2021

Low-Complexity, Real-Time Joint Neural Echo Control and Speech Enhancement Based On PercepNet.
Proceedings of the IEEE International Conference on Acoustics, 2021

Enhancing into the Codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Improving Opus Low Bit Rate Quality with Neural Speech Synthesis.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019
A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet.
CoRR, 2019

A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

LPCNet: Improving Neural Speech Synthesis through Linear Prediction.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
A Hybrid DSP/Deep Learning Approach to Real-Time Full-Band Speech Enhancement.
Proceedings of the 20th IEEE International Workshop on Multimedia Signal Processing, 2018

The AV1 Constrained Directional Enhancement Filter (CDEF).
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Updates to the Opus Audio Codec.
RFC, October, 2017

2016
WebRTC Audio Codec and Processing Requirements.
RFC, May, 2016

Robust Recognition of Simultaneous Speech By a Mobile Robot.
CoRR, 2016

Low-Complexity Iterative Sinusoidal Parameter Estimation.
CoRR, 2016

High-Quality, Low-Delay Music Coding in the Opus Codec.
CoRR, 2016

Improved Noise Weighting in CELP Coding of Speech - Applying the Vorbis Psychoacoustic Model To Speex.
CoRR, 2016

Bandwidth Extension of Narrowband Speech for Low Bit-Rate Wideband Coding.
CoRR, 2016

Interference-Normalised Least Mean Square Algorithm.
CoRR, 2016

Channel Decorrelation For Stereo Acoustic Echo Cancellation In High-Quality Audio Communication.
CoRR, 2016

Speex: A Free Codec For Free Speech.
CoRR, 2016

Perceptually-Motivated Nonlinear Channel Decorrelation For Stereo Acoustic Echo Cancellation.
CoRR, 2016

Extension spectrale d'un signal de parole de la bande téléphonique à la bande AM.
CoRR, 2016

Auditory System for a Mobile Robot.
CoRR, 2016

The Daala Directional Deringing Filter.
CoRR, 2016

Daala: A Perceptually-Driven Next Generation Video Codec.
CoRR, 2016

Perceptually-Driven Video Coding with the Daala Video Codec.
CoRR, 2016

Daala: Building a next-generation video codec from unconventional technology.
Proceedings of the 18th IEEE International Workshop on Multimedia Signal Processing, 2016

Daala: A perceptually-driven still picture codec.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

2015
RTP Payload Format for the Opus Speech and Audio Codec.
RFC, June, 2015

Perceptual vector quantization for video coding.
Proceedings of the Visual Information Processing and Communication VI, 2015

Predicting chroma from luma with frequency domain intra prediction.
Proceedings of the Visual Information Processing and Communication VI, 2015

2012
Definition of the Opus Audio Codec.
RFC, September, 2012

Guidelines for Development of an Audio Codec within the IETF.
RFC, March, 2012

Guidelines for the Use of Variable Bit Rate Audio with Secure RTP.
RFC, March, 2012

Integration of sound source localization and separation to improve Dialogue Management on a robot.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

2011
Requirements for an Internet Audio Codec.
RFC, August, 2011

2010
A High-Quality Speech and Audio Codec With Less Than 10-ms Delay.
IEEE Trans. Speech Audio Process., 2010

An iterative linearised solution to the sinusoidal parameter estimation problem.
Comput. Electr. Eng., 2010

2009
RTP Payload Format for the Speex Codec.
RFC, June, 2009

Evaluating real-time audio localization algorithms for artificial audition in robotics.
Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Reflected Simplex Codebooks for Limited Feedback MIMO Beamforming.
Proceedings of IEEE International Conference on Communications, 2009

Priority Based Dynamic Rate Control for VoIP Traffic.
Proceedings of the Global Communications Conference, 2009. GLOBECOM 2009, Honolulu, Hawaii, USA, 30 November, 2009

A full-bandwidth audio codec with low complexity and very low delay.
Proceedings of the 17th European Signal Processing Conference, 2009

2008
Embedded auditory system for small mobile robots.
Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Adaptive Rate Control for Aggregated VoIP Traffic.
Proceedings of the Global Communications Conference, 2008. GLOBECOM 2008, New Orleans, LA, USA, 30 November, 2008

2007
Robust Recognition of Simultaneous Speech by a Mobile Robot.
IEEE Trans. Robotics, 2007

On Adjusting the Learning Rate in Frequency Domain Echo Cancellation With Double-Talk.
IEEE Trans. Speech Audio Process., 2007

Interference-Normalized Least Mean Square Algorithm.
IEEE Signal Process. Lett., 2007

Robust localization and tracking of simultaneous moving sound sources using beamforming and particle filtering.
Robotics Auton. Syst., 2007

Spartacus attending the 2005 AAAI conference.
Auton. Robots, 2007

A New Robust Frequency Domain Echo Canceller with Closed-Loop Learning Rate Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Design and implementation of a robot audition system for automatic speech recognition of simultaneous speech.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Recognition of Simultaneous Speech by Estimating Reliability of Separated Signals for Robot Audition.
Proceedings of the PRICAI 2006: Trends in Artificial Intelligence, 2006

Real-Time Robot Audition System That Recognizes Simultaneous Speech in The Real World.
Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006

Leak energy based missing feature mask generation for ICA and GSS and its evaluation with simultaneous speech recognition.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2006

Genetic Algorithm-Based Improvement of Robot Hearing Capabilities in Separating and Recognizing Simultaneous Speech Signals.
Proceedings of the Advances in Applied Artificial Intelligence, 2006

Robust 3D Localization and Tracking of Sound Sources Using Beamforming and Particle Filtering.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Embedded and Integrated Audition for a Mobile Robot.
Proceedings of the Aurally Informed Performance: Integrating Machine Listening and Auditory Presentation in Robotic Systems, 2006

2005
Modularity and integration in the design of a socially interactive robot.
Proceedings of the IEEE International Workshop on Robot and Human Interactive Communication, 2005

Making a robot recognize three simultaneous sentences in real-time.
Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

Multiple moving speaker tracking by microphone array on mobile robot.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Enhanced Robot Speech Recognition Based on Microphone Array Source Separation and Missing Feature Theory.
Proceedings of the 2005 IEEE International Conference on Robotics and Automation, 2005

2004
Autonomous Mobile Robot That Can Read.
EURASIP J. Adv. Signal Process., 2004

Enhanced robot audition based on microphone array source separation with post-filter.
Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004

Code reusability tools for programming mobile robots.
Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004

Localization of Simultaneous Moving Sound Sources for Mobile Robot Using a Frequency-Domain Steered Beamformer Approach.
Proceedings of the 2004 IEEE International Conference on Robotics and Automation, 2004

Autonomous Initialization of Robot Formations.
Proceedings of the 2004 IEEE International Conference on Robotics and Automation, 2004

Microphone array post-filter for separation of simultaneous non-stationary sources.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Making a mobile robot read textual messages.
Proceedings of the IEEE International Conference on Systems, 2003

Robust sound source localization using a microphone array on a mobile robot.
Proceedings of the 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems, Las Vegas, Nevada, USA, October 27, 2003

Textual message read by a mobile robot.
Proceedings of the 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems, Las Vegas, Nevada, USA, October 27, 2003

2002
Dynamic robot formations using directional visual perception.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, Lausanne, Switzerland, September 30, 2002

1999
On the limits of speech recognition in noise.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

