W. Bastiaan Kleijn

Orcid: 0000-0002-1973-3920

According to our database, W. Bastiaan Kleijn authored at least 295 papers between 1987 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.


IEEE Fellow

IEEE Fellow 1999, "For contributions to modeling the spectral fine structure of speech and its application to low-rate speech coding."






On csauthors.net:


On Exact Bit-level Reversible Transformers Without Changing Architectures.
CoRR, 2024

TrailBlazer: Trajectory Control for Diffusion-Based Video Generation.
Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024

An Effective MVDR Post-Processing Method for Low-Latency Convolutive Blind Source Separation.
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024

On Accelerating Diffusion-Based Sampling Processes via Improved Integration Approximation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

A Practical Online Multichannel Dereverberation Approach with Data-Reuse Technique.
Proceedings of the IEEE International Conference on Acoustics, 2024

Exact Diffusion Inversion via Bidirectional Integration Approximation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Directed Diffusion: Direct Control of Object Placement through Attention Guidance.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

CoordiNet: Constrained Dynamics Learning for State Coordination Over Graph.
IEEE Trans. Signal Inf. Process. over Networks, 2023

A DNN Optimizer that Improves over AdaBelief by Suppression of the Adaptive Stepsize Range.
Trans. Mach. Learn. Res., 2023

Exact Diffusion Inversion via Bi-directional Integration Approximation.
CoRR, 2023

On Accelerating Diffusion-Based Sampling Process via Improved Integration Approximation.
CoRR, 2023

Directed Diffusion: Direct Control of Object Placement through Attention Guidance.
CoRR, 2023

Estimation of Source and Receiver Positions, Room Geometry and Reflection Coefficients From a Single Room Impulse Response.
CoRR, 2023

A High-Rate Extension to Soundstream.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

A Deep Learning Based Fault Diagnosis Method Combining Domain Knowledge and Transfer Learning.
Proceedings of the 29th International Conference on Mechatronics and Machine Vision in Practice, 2023

Neural Optimization Of Geometry And Fixed Beamformer For Linear Microphone Arrays.
Proceedings of the IEEE International Conference on Acoustics, 2023

Multi-Channel Audio Signal Generation.
Proceedings of the IEEE International Conference on Acoustics, 2023

LMCodec: A Low Bitrate Speech Codec with Causal Transformer Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

Lookahead Diffusion Probabilistic Models for Refining Mean Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Revisiting the Primal-Dual Method of Multipliers for Optimisation Over Centralised Networks.
IEEE Trans. Signal Inf. Process. over Networks, 2022

Dirichlet Process Mixture of Generalized Inverted Dirichlet Distributions for Positive Vector Data With Extended Variational Inference.
IEEE Trans. Neural Networks Learn. Syst., 2022

On Exploiting Layerwise Gradient Statistics for Effective Training of Deep Neural Networks.
CoRR, 2022

Phase Error Analysis for First-Order Linear Differential Microphone Arrays.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

Ultra-Low-Bitrate Speech Coding with Pretrained Transformers.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Wave-Domain Approach for Cancelling Noise Entering Open Windows.
Proceedings of the IEEE International Conference on Acoustics, 2022

Distributed Task Privacy for Aggregation Using Linear Codes.
IEEE Trans. Signal Inf. Process. over Networks, 2021

Room Acoustical Parameter Estimation From Room Impulse Responses Using Deep Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Extending AdamW by Leveraging Its Second Moment and Magnitude.
CoRR, 2021

Handling Background Noise in Neural Speech Generation.
CoRR, 2021

Asynchronous Decentralized Optimization With Implicit Stochastic Variance Reduction.
Proceedings of the 38th International Conference on Machine Learning, 2021

Generative Speech Coding with Predictive Variance Regularization.
Proceedings of the IEEE International Conference on Acoustics, 2021

Microphone Array Wiener Post Filtering Using Monotone Operator Splitting.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Variance Constrained Autoencoding.
CoRR, 2020

Edge-consensus Learning: Deep Learning on P2P Networks with Nonhomogeneous Data.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Distributed Summation Privacy for Speech Enhancement.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Robust Low Rate Speech Coding Based on Cloned Networks and Wavenet.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Projected Weight Regularization to Improve Neural Network Generalization.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Linear-time Independence Criterion Based on a Finite Basis Approximation.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Handling Background Noise in Neural Speech Generation.
Proceedings of the 54th Asilomar Conference on Signals, Systems, and Computers, 2020

The HSIC Bottleneck: Deep Learning without Back-Propagation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

On the Distributed Method of Multipliers for Separable Convex Optimization Problems.
IEEE Trans. Signal Inf. Process. over Networks, 2019

Derivation and Analysis of the Primal-Dual Method of Multipliers Based on Monotone Operator Theory.
IEEE Trans. Signal Inf. Process. over Networks, 2019

Variational Bayesian Learning for Dirichlet Process Mixture of Inverted Dirichlet Distributions in Non-Gaussian Image Feature Modeling.
IEEE Trans. Neural Networks Learn. Syst., 2019

A Low Latency Approach for Blind Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Approximated Orthonormal Normalisation in Training Neural Networks.
CoRR, 2019

Rapidly Adapting Moment Estimation.
CoRR, 2019

Generative Speech Enhancement Based on Cloned Networks.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Finite Approximate Consensus for Privacy in Distributed Sensor Networks.
Proceedings of the 20th International Conference on Parallel and Distributed Computing, 2019

Maximum a posteriori Speech Enhancement Based on Double Spectrum.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Salient Speech Representations Based on Cloned Networks.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speech Enhancement with Variance Constrained Autoencoders.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Fast Edge-consensus Computing Based on Bregman Monotone Operator Splitting.
Proceedings of the IEEE International Conference on Acoustics, 2019

Function Splitting and Quadratic Approximation of the Primal-Dual Method of Multipliers for Distributed Optimization Over Graphs.
IEEE Trans. Signal Inf. Process. over Networks, 2018

An Evaluation of Intrusive Instrumental Intelligibility Metrics.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Multizone Soundfield Reproduction With Privacy- and Quality-Based Speech Masking Filters.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

An Instrumental Intelligibility Metric Based on Information Theory.
IEEE Signal Process. Lett., 2018

Directional Emphasis in Ambisonics.
IEEE Signal Process. Lett., 2018

Cross-modal subspace learning for fine-grained sketch-based image retrieval.
Neurocomputing, 2018

Kernel Density Estimation-Based Markov Models with Hidden State.
CoRR, 2018

Bounded Information Rate Variational Autoencoders.
CoRR, 2018

Edge Consensus Computing for Heterogeneous Data Sets.
Proceedings of the 2018 IEEE Statistical Signal Processing Workshop, 2018

Beamforming with Partial Knowledge of the Acoustic Scenario.
Proceedings of the 20th IEEE International Workshop on Multimedia Signal Processing, 2018

Necessary Attributes for Integrating a Virtual Source in an Acoustic Scenario.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Blind Facial Basis Discovery Using the Hilbert-Schmidt Independence Criterion.
Proceedings of the 2018 International Conference on Image and Vision Computing New Zealand, 2018

Wavenet Based Low Rate Speech Coding.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

On the Comparison of Two Room Compensation / Dereverberation Methods Employing Active Acoustic Boundary Absorption.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Training Deep Neural Networks via Optimization Over Graphs.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

New Results in Modulation-Domain Single-Channel Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Intelligibility Enhancement Based on Mutual Information.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Scatter Component Analysis: A Unified Framework for Domain Adaptation and Domain Generalization.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

On Relationship between Primal-Dual Method of Multipliers and Kalman Filter.
CoRR, 2017

Joint wideband source localization and acquisition based on a grid-shift approach.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Incoherent idempotent ambisonics rendering.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Distributed TV-L1 image fusion using PDMM.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

On the information rate of speech communication.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Machine learning based non-intrusive quality estimation with an augmented feature set.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Active speech control using wave-domain processing with a linear wall of dipole secondary sources.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Non-iterative impulse response shortening method for system latency reduction.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Robust and low-complexity blind source separation for meeting rooms.
Proceedings of the Hands-free Speech Communications and Microphone Arrays, 2017

Minimum Entropy Rate Simplification of Stochastic Processes.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Fast simulation method for room impulse responses based on the mirror image source assumption.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

An intelligibility metric based on a simple model of speech communication.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Convolutive blind source separation with low latency.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Single-Channel Speech Enhancement Using Double Spectrum.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

A distributed algorithm for robust LCMV beamforming.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Distributed sparse MVDR beamforming using the bi-alternating direction method of multipliers.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Jointly optimal near-end and far-end multi-microphone speech intelligibility enhancement based on mutual information.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Globally optimized least-squares post-filtering for microphone array speech enhancement.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Improving speech privacy in personal sound zones.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Distributed linear blind source separation over wireless sensor networks with arbitrary connectivity patterns.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

On the duality of globally constrained separable problems and its application to distributed signal processing.
Proceedings of the 24th European Signal Processing Conference, 2016

Deep Reconstruction-Classification Networks for Unsupervised Domain Adaptation.
Proceedings of the Computer Vision - ECCV 2016, 2016

Reproducing personal sound zones using a hybrid synthesis of dynamic and parametric loudspeakers.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Spectral Dynamics Recovery for Enhanced Speech Intelligibility in Noise.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Theory and Design of Multizone Soundfield Reproduction Using Sparse Methods.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Sparse Hidden Markov Models for Speech Enhancement in Non-Stationary Noise Environments.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Optimizing Speech Intelligibility in a Noisy Environment: A unified view.
IEEE Signal Process. Mag., 2015

A Simple Model of Speech Communication and its Application to Intelligibility Enhancement.
IEEE Signal Process. Lett., 2015

A method of speech periodicity enhancement using transform-domain signal decomposition.
Speech Commun., 2015

Line spectral frequencies modeling by a mixture of von Mises-Fisher distributions.
Signal Process., 2015

Domain Generalization for Object Recognition with Multi-task Autoencoders.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

A robust region-based near-field beamformer.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Sparse HMM-based speech enhancement method for stationary and non-stationary noise environments.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Fast sound field reproduction in box-shaped rooms: Rigid walls case.
Proceedings of the 23rd European Signal Processing Conference, 2015

Dirichlet mixture modeling to estimate an empirical lower bound for LSF quantization.
Signal Process., 2014

Introduction to the Special Issue on The listening talker: context-dependent speech production and perception.
Comput. Speech Lang., 2014

Domain Adaptive Neural Networks for Object Recognition.
Proceedings of the PRICAI 2014: Trends in Artificial Intelligence, 2014

Sinusoidal interpolation across missing data.
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014

On near-field beamforming with smartphone-based ad-hoc microphone arrays.
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014

On the convergence rate of the bi-alternating direction method of multipliers.
Proceedings of the IEEE International Conference on Acoustics, 2014

Diffusion-based distributed MVDR beamformer.
Proceedings of the IEEE International Conference on Acoustics, 2014

Pitch enhancement motivated by rate-distortion theory.
Proceedings of the IEEE International Conference on Acoustics, 2014

On the EM algorithm for the estimation of speech AR parameters in noise.
Proceedings of the IEEE International Conference on Acoustics, 2014

Multizone soundfield reproduction in reverberant rooms using compressed sensing techniques.
Proceedings of the IEEE International Conference on Acoustics, 2014

Deep hybrid networks with good out-of-sample object recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Calibration of distributed sound acquisition systems using TOA measurements from a moving acoustic source.
Proceedings of the IEEE International Conference on Acoustics, 2014

Multiple Description Distribution Preserving Quantization.
IEEE Trans. Signal Process., 2013

Maximizing Phoneme Recognition Accuracy for Enhanced Speech Intelligibility in Noise.
IEEE Trans. Speech Audio Process., 2013

Vector quantization of LSF parameters with a mixture of dirichlet distributions.
IEEE Trans. Speech Audio Process., 2013

Picking up the pieces: Causal states in noisy data, and how to recover them.
Pattern Recognit. Lett., 2013

Perceptual Coding of High-Quality Digital Audio.
Proc. IEEE, 2013

Large Scale LP Decoding with Low Complexity.
IEEE Commun. Lett., 2013

Rate-distortion optimization for multichannel audio compression.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Sparse representations in deep learning for noise-robust digit classification.
Proceedings of the 28th International Conference on Image and Vision Computing New Zealand, 2013

Rephrasing-based speech intelligibility enhancement.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Preservation of speech spectral dynamics enhances intelligibility.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Reliable estimation of quality scores by a small calibrated listening panel.
Proceedings of the IEEE International Conference on Acoustics, 2013

Speech coding based on pitch synchrony and two-stage transformation.
Proceedings of the IEEE International Conference on Acoustics, 2013

Multizone soundfield reproduction using orthogonal basis expansion.
Proceedings of the IEEE International Conference on Acoustics, 2013

Auto-localization in ad-hoc microphone arrays.
Proceedings of the IEEE International Conference on Acoustics, 2013

Gamma hidden Markov model as a probabilistic Nonnegative Matrix Factorization.
Proceedings of the 21st European Signal Processing Conference, 2013

Optimal Index Assignment for Multiple Description Scalar Quantization With Translated Lattice Codebooks.
IEEE Trans. Signal Process., 2012

A Quantization Theoretic Perspective on Simulcast and Layered Multicast Optimization.
IEEE/ACM Trans. Netw., 2012

A Hierarchical Bayesian Approach to Modeling Heterogeneity in Speech Quality Assessment.
IEEE Trans. Speech Audio Process., 2012

Feature set augmentation for enhancing the performance of a non-intrusive quality predictor.
Proceedings of the Fourth International Workshop on Quality of Multimedia Experience, 2012

Robust and Low Complexity Delay Estimation.
Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, RWTH Aachen University, Germany, September 4th, 2012

Improved Prediction of Nearly-Periodic Signals.
Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, RWTH Aachen University, Germany, September 4th, 2012

Distributed MVDR Beamforming for (Wireless) Microphone Networks Using Message Passing.
Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, RWTH Aachen University, Germany, September 4th, 2012

Enhancing Subjective Speech Intelligibility Using a Statistical Model of Speech.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Audio coding with power spectral density preserving quantization.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Transform-domain Wiener filter for speech periodicity enhancement.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Gaussian process dynamical models for nonparametric speech representation and synthesis.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Graph-Preserving Sparse Nonnegative Matrix Factorization With Application to Facial Expression Recognition.
IEEE Trans. Syst. Man Cybern. Part B, 2011

High-Rate Analysis of Symmetric L-Channel Multiple Description Coding.
IEEE Trans. Commun., 2011

Asymptotically Optimal Model Estimation for Quantization.
IEEE Trans. Commun., 2011

Double-Ended Quality Assessment System for Super-Wideband Speech.
IEEE Trans. Speech Audio Process., 2011

Auditory Model-Based Design and Optimization of Feature Vectors for Automatic Speech Recognition.
IEEE Trans. Speech Audio Process., 2011

Optimal Index Assignment for Multiple Description Scalar Quantization.
CoRR, 2011

On Distribution Preserving Quantization.
CoRR, 2011

Predictive audio coding using rate-distortion-optimal pre- and post-filtering.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Transform-domain speech periodicity enhancement with adaptive coefficient weighting.
Proceedings of the International Symposium on Intelligent Signal Processing and Communications Systems, 2011

Discrete Choice Models for Non-Intrusive Quality Assessment.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Intermediate-State HMMs to Capture Continuously-Changing Signal Features.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Quantization with an adjustable codeword length penalty.
Proceedings of the IEEE International Conference on Acoustics, 2011

Distributed blind source separation with an application to audio signals.
Proceedings of the IEEE International Conference on Acoustics, 2011

Bounding the Rate Region of the Two-Terminal Vector Gaussian CEO Problem.
Proceedings of the 2011 Data Compression Conference (DCC 2011), 2011

Reduction of the Impact of Distortion Outliers and Source Mismatch in Resolution-Constrained Quantization.
IEEE Trans. Speech Audio Process., 2010

Distribution Preserving Quantization With Dithering and Transformation.
IEEE Signal Process. Lett., 2010

The synergy between bounded-distance HMM and spectral subtraction for robust speech recognition.
Speech Commun., 2010

Learning from images and speech with Non-negative Matrix Factorization enhanced by input space scaling.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Complexity-outsourced low-latency video encoding through feedback under a sum-rate constraint.
Proceedings of the International Conference on Image Processing, 2010

Objective quality estimation of wide-band speech using a narrow-band prior.
Proceedings of the IEEE International Conference on Acoustics, 2010

Selecting static and dynamic features using an advanced auditory model for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

Flexcode - flexible audio coding.
Proceedings of the IEEE International Conference on Acoustics, 2010

Auditory model based modified MFCC features.
Proceedings of the IEEE International Conference on Acoustics, 2010

Adaptive playout scheduling for VoIP using the k-Erlang distribution.
Proceedings of the 18th European Signal Processing Conference, 2010

A method of speech periodicity enhancement based on transform-domain signal decomposition.
Proceedings of the 18th European Signal Processing Conference, 2010

Simplified probability models for generative tasks: A rate-distortion approach.
Proceedings of the 18th European Signal Processing Conference, 2010

Bounding the Rate Region of Vector Gaussian Multiple Descriptions with Individual and Central Receivers.
Proceedings of the 2010 Data Compression Conference (DCC 2010), 2010

Feature Selection Under a Complexity Constraint.
IEEE Trans. Multim., 2009

Scalable coding with side information for packet loss recovery.
IEEE Trans. Commun., 2009

Speech Watermarking for Analog Flat-Fading Bandpass Channels.
IEEE Trans. Speech Audio Process., 2009

A Bayesian approach to non-intrusive quality assessment of speech.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Auditory model based optimization of MFCCs improves automatic speech recognition performance.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Facial expression recognition based on graph-preserving sparse non-negative matrix factorization.
Proceedings of the International Conference on Image Processing, 2009

Compressive sensing for sparsely excited speech signals.
Proceedings of the IEEE International Conference on Acoustics, 2009

Optimal parameter estimation for model-based quantization.
Proceedings of the IEEE International Conference on Acoustics, 2009

Joint optimization of the redundancy of multiple-description coders for multicast.
Proceedings of the IEEE International Conference on Acoustics, 2009

Rate distribution between model and signal for multiple descriptions.
Proceedings of the IEEE International Conference on Acoustics, 2009

Analysis of K-Channel Multiple Description Quantization.
Proceedings of the 2009 Data Compression Conference (DCC 2009), 2009

Online Noise Estimation Using Stochastic-Gain HMM for Speech Enhancement.
IEEE Trans. Speech Audio Process., 2008

Generalized Postfilter for Speech Quality Enhancement.
IEEE Trans. Speech Audio Process., 2008

Regularized Linear Prediction of Speech.
IEEE Trans. Speech Audio Process., 2008

Autoregressive model-based speech packet-loss concealment.
Proceedings of the IEEE International Conference on Acoustics, 2008

Adaptive resolution-constrained scalar multiple-description coding.
Proceedings of the IEEE International Conference on Acoustics, 2008

Band control policy of playout scheduling for voice over IP.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

Analytical rate optimization for multicast.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

Constructive camera pose control for optimizing multiview distributed video coding.
Proceedings of the 47th IEEE Conference on Decision and Control, 2008

On the Estimation of Differential Entropy From Data Located on Embedded Manifolds.
IEEE Trans. Inf. Theory, 2007

HMM-Based Gain Modeling for Enhancement of Speech in Noise.
IEEE Trans. Speech Audio Process., 2007

Codebook-Based Bayesian Speech Enhancement for Nonstationary Environments.
IEEE Trans. Speech Audio Process., 2007

Estimation of the Instantaneous Pitch of Speech.
IEEE Trans. Speech Audio Process., 2007

The Sensitivity Matrix: Using Advanced Auditory Models in Speech and Audio Processing.
IEEE Trans. Speech Audio Process., 2007

Improving the phase vocoder approach to pitch-shifting.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Mutual information and the speech signal.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Noise suppression based on extending a speech-dominated modulation band.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Interlacing Intraframes in Multiple-Description Video Coding.
Proceedings of the International Conference on Image Processing, 2007

A Canonical Representation of Speech.
Proceedings of the IEEE International Conference on Acoustics, 2007

An Adaptive, Scalable Packet Loss Recovery Method.
Proceedings of the IEEE International Conference on Acoustics, 2007

Comparative rate-distortion performance of multiple description coding for real-time audiovisual communication over the Internet.
IEEE Trans. Commun., 2006

Rate-distortion optimized quantization in multistage audio coding.
IEEE Trans. Speech Audio Process., 2006

Codebook driven short-term predictor parameter estimation for speech enhancement.
IEEE Trans. Speech Audio Process., 2006

Estimation of the short-term predictor parameters of speech under noisy conditions.
IEEE Trans. Speech Audio Process., 2006

Low-Complexity, Nonintrusive Speech Quality Assessment.
IEEE Trans. Speech Audio Process., 2006

On causal algorithms for speech enhancement.
IEEE Trans. Speech Audio Process., 2006

Multichannel parametric speech enhancement.
IEEE Signal Process. Lett., 2006

Resolution-Constrained Quantization With JND-Based Perceptual-Distortion Measures.
IEEE Signal Process. Lett., 2006

Individual on-line variance adaptation of frequency filtered parameters for robust ASR.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Non-intrusive speech quality assessment with low computational complexity.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

HMM-Based Speech Enhancement using Explicit Gain Modeling.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Sub-Pixel Registration of Noisy Images.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Spectral Envelope Estimation and Regularization.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Multivariate block polar quantization.
IEEE Trans. Commun., 2005

Entropy-constrained polar quantization and its application to audio coding.
IEEE Trans. Speech Audio Process., 2005

On frequency quantization in sinusoidal audio coding.
IEEE Signal Process. Lett., 2005

Anthropomorphic Coding of Speech and Audio: A Model Inversion Approach.
EURASIP J. Adv. Signal Process., 2005

The COST-277 European Action: An Overview.
Proceedings of the Nonlinear Analyses and Algorithms for Speech Processing, 2005

The COST-277 Speech Database.
Proceedings of the Nonlinear Analyses and Algorithms for Speech Processing, 2005

On noise gain estimation for HMM-based speech enhancement.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Denoising through source separation and minimum tracking.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Distortion measures for vector quantization of noisy spectrum.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Codebook-Based Bayesian Speech Enhancement.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Stochastic Integration and Long Term Predictor Estimation under Noisy Conditions for Speech Enhancement.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Improved Kalman Filtering for Speech Enhancement.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Internet Low Bit Rate Codec (iLBC).
RFC, December, 2004

KLT-based adaptive classified VQ of the speech signal.
IEEE Trans. Speech Audio Process., 2004

A time-domain interpretation for the LSP decomposition.
IEEE Trans. Speech Audio Process., 2004

Speech enhancement using adaptive time-domain segmentation.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Comparison of transmitter-based packet-loss recovery techniques for voice transmission.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Towards optimal quantization in multistage audio coding.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Estimation of short-term predictor parameters for coding and enhancement of noisy speech.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Multi-variate block polar quantization and an application to audio.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Noise-dependent postfiltering.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Multiple-description vector quantization using translated lattices with local optimization [joint source/channel coding].
Proceedings of the Global Telecommunications Conference, 2004. GLOBECOM '04, Dallas, Texas, USA, 29 November, 2004

Speech enhancement using a-priori information with classified noise codebooks.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

The sensitivity matrix for a spectro-temporal auditory model.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

Shannon entropy estimation based on high-rate quantization theory.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

Robust spectrum quantization for LP parameter enhancement.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

On line spectral frequencies.
IEEE Signal Process. Lett., 2003

Time synchronization of speech.
Proceedings of the Third International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2003

Speech enhancement using a-priori information.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Polar quantization of sinusoids from speech signal blocks.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Rate-distortion comparisons between FEC and MDC based on Gilbert channel model.
Proceedings of the 11th IEEE International Conference on Networks, 2003

Minimum mean square error estimation of speech short-term predictor parameters under noisy conditions.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Sinusoidal modeling using psychoacoustic-adaptive matching pursuits.
IEEE Signal Process. Lett., 2002

On the relevance of bandwidth extension for speaker verification.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Entropy-constrained polar quantization: theory and an application to audio coding.
Proceedings of the IEEE International Conference on Acoustics, 2002

Gaussian mixture model based mutual information estimation between frequency bands in speech.
Proceedings of the IEEE International Conference on Acoustics, 2002

KLT-based classified VQ for the speech signal.
Proceedings of the IEEE International Conference on Acoustics, 2002

Spline-based continuous-time pitch estimation.
Proceedings of the IEEE International Conference on Acoustics, 2002

A time domain reformulation of linear prediction equivalent to the LSP decomposition.
Proceedings of the IEEE International Conference on Acoustics, 2002

On the relevance of bandwidth extension for speaker identification.
Proceedings of the 11th European Signal Processing Conference, 2002

Future work in COST action 277 on nonlinear speech processing.
Proceedings of the 11th European Signal Processing Conference, 2002

Squared error as a measure of phase distortion.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Modifying transients for efficient coding of audio.
Proceedings of the IEEE International Conference on Acoustics, 2001

Avoiding over-estimation in bandwidth extension of telephony speech.
Proceedings of the IEEE International Conference on Acoustics, 2001

Estimation of the excitation variances of speech and noise AR-models for enhanced speech coding.
Proceedings of the IEEE International Conference on Acoustics, 2001

All-pole modelling of mixed excitation signals.
Proceedings of the IEEE International Conference on Acoustics, 2001

Sinusoidal modeling of audio and speech using psychoacoustic-adaptive matching pursuits.
Proceedings of the IEEE International Conference on Acoustics, 2001

On time-frequency masking in voiced speech.
IEEE Trans. Speech Audio Process., 2000

Exploiting time and frequency masking in consistent sinusoidal analysis-synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2000

On the mutual information between frequency bands in speech.
Proceedings of the IEEE International Conference on Acoustics, 2000

A frame interpretation of sinusoidal coding and waveform interpolation.
Proceedings of the IEEE International Conference on Acoustics, 2000

Representing speech.
Proceedings of the 10th European Signal Processing Conference, 2000

On phase perception in speech.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

On speech coding in a perceptual domain.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

On the significance of temporal masking in speech coding.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Waveform interpolation coding with pitch-spaced subbands.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Removal of sparse-excitation artifacts in CELP.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Quantization using wavelet based temporal decomposition of the LSF.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Using a perception-based frequency scale in waveform interpolation.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Perceptual entropy rate estimates for the phonemes of American English.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

On optimal and minimum-entropy decoding.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

On memoryless quantization in speech coding.
IEEE Signal Process. Lett., 1996

A low-complexity waveform interpolation coder.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Spectral dynamics is more important than spectral distortion.
Proceedings of the 1995 International Conference on Acoustics, 1995

A speech coder based on decomposition of characteristic waveforms.
Proceedings of the 1995 International Conference on Acoustics, 1995

Interpolation of the pitch-predictor parameters in analysis-by-synthesis speech coders.
IEEE Trans. Speech Audio Process., 1994

On the periodicity of speech coded with linear-prediction based analysis by synthesis coders.
IEEE Trans. Speech Audio Process., 1994

Transformation and decomposition of the speech signal for coding.
IEEE Signal Process. Lett., 1994

The RCELP speech-coding algorithm.
Eur. Trans. Telecommun., 1994

Time-scale modification of speech based on a nonlinear oscillator model.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Encoding speech using prototype waveforms.
IEEE Trans. Speech Audio Process., 1993

A 5.85 kb/s CELP algorithm for cellular applications.
Proceedings of the IEEE International Conference on Acoustics, 1993

Efficient channel coding for CELP using source information.
Speech Commun., 1992

Generalized analysis-by-synthesis coding and its application to pitch prediction.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

Methods for waveform interpolation in speech coding.
Digit. Signal Process., 1991

Acoustic to articulatory parameter mapping using an assembly of neural networks.
Proceedings of the 1991 International Conference on Acoustics, 1991

Continuous representations in linear predictive coding.
Proceedings of the 1991 International Conference on Acoustics, 1991

Fast methods for the CELP speech coding algorithm.
IEEE Trans. Acoust. Speech Signal Process., 1990

Source-dependent channel coding for CELP.
Proceedings of the 1990 International Conference on Acoustics, 1990

Robust CELP coders for noisy backgrounds and noisy channels.
Proceedings of the IEEE International Conference on Acoustics, 1989

An efficient stochastically excited linear predictive coding algorithm for high quality low bit rate transmission of speech.
Speech Commun., 1988

Improved speech quality and efficient vector quantization in SELP.
Proceedings of the IEEE International Conference on Acoustics, 1988

Harmonic coding of speech at 4.8 kb/s.
Proceedings of the IEEE International Conference on Acoustics, 1987
