Ben P. Milner

Orcid: 0000-0001-6315-3475

Affiliations:
  • School of Computer Science, University of East Anglia, Norwich, UK


According to our database1, Ben P. Milner authored at least 104 papers between 1992 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
The UEA Digital Humans entry to the GENEA Challenge 2023.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

Investigating Imaginary Mask Estimation in Complex Masking for Speech Enhancement.
Proceedings of the 31st European Signal Processing Conference, 2023

2022
Speaker-Independent Speech Animation Using Perceptual Loss Functions and Synthetic Data.
IEEE Trans. Multim., 2022

2021
Improving The Robustness Of Right Whale Detection In Noisy Conditions Using Denoising Autoencoders And Augmented Training.
Proceedings of the IEEE International Conference on Acoustics, 2021

2019
Synthesising visual speech using dynamic visemes and deep learning architectures.
Comput. Speech Lang., 2019

2018
Using Visual Speech Information in Masking Methods for Audio Speaker Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

The Effect of Real-Time Constraints on Automatic Speech Animation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
Generating Intelligible Audio Speech From Visual Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Estimating acoustic speech features in low signal-to-noise ratios using a statistical framework.
Comput. Speech Lang., 2017

A Comparison of Perceptually Motivated Loss Functions for Binary Mask Estimation in Speech Separation.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Using visual speech information and perceptually motivated loss functions for binary mask estimation.
Proceedings of the 14th International Conference on Auditory-Visual Speech Processing, 2017

2016
Visual Speech Synthesis Using Dynamic Visemes, Contextual Features and DNNs.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Audio-to-Visual Speech Conversion Using Deep Neural Networks.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

HMM-Based Speech Enhancement Using Sub-Word Models and Noise Adaptation.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Reconstruction-based speech enhancement from robust acoustic features.
Speech Commun., 2015

Objective measures for predicting the intelligibility of spectrally smoothed speech with artificial excitation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Using audio and visual information for single channel speaker separation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Reconstructing intelligible audio speech from visual speech features.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Analysing the importance of different visual feature coefficients.
Proceedings of the Auditory-Visual Speech Processing, 2015

Voicing classification of visual speech using convolutional neural networks.
Proceedings of the Auditory-Visual Speech Processing, 2015

2014
Using hidden Markov models for speech enhancement.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013
Enhancing speech at very low signal-to-noise ratios using non-acoustic reference signals.
Speech Commun., 2013

Modelling and estimation of the fundamental frequency of speech using a hidden Markov model.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Speaker separation using visual speech features and single-channel audio.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Speaker separation using visually-derived binary masks.
Proceedings of the Auditory-Visual Speech Processing, 2013

2012
On the use of Machine Learning Methods for Speech and Voicing Classification.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Enhancing Speech by Reconstruction from Robust Acoustic Features.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011
Robust Acoustic Speech Feature Prediction From Noisy Mel-Frequency Cepstral Coefficients.
IEEE Trans. Speech Audio Process., 2011

Visually Derived Wiener Filters for Speech Enhancement.
IEEE Trans. Speech Audio Process., 2011

Fundamental Frequency Estimation Using Modified Higher Order Moments and Multiple Windows.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Maximum a posteriori Estimation of Noise from Non-Acoustic Reference Signals in Very Low Signal-to-Noise Ratio Environments.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Speech Enhancement by Reconstruction from Cleaned Acoustic Features.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010
Pitch extraction using modified higher order moments.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Reconstructing clean speech from noisy MFCC vectors.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Enhancing audio speech using visual speech features.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Effective visually-derived Wiener filtering for audio-visual speech processing.
Proceedings of the Auditory-Visual Speech Processing, 2009

2008
Kalman tracking of linear predictor and harmonic noise models for noisy speech enhancement.
Comput. Speech Lang., 2008

Applying noise compensation methods to robustly predict acoustic speech features from MFCC vectors in noise.
Proceedings of the IEEE International Conference on Acoustics, 2008

Comparing noise compensation methods for robust prediction of acoustic speech features from MFCC vectors in noise.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

Using audio-visual features for robust voice activity detection in clean and noisy speech.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

2007
Prediction of Fundamental Frequency and Voicing From Mel-Frequency Cepstral Coefficients for Unconstrained Speech Reconstruction.
IEEE Trans. Speech Audio Process., 2007

Formant tracking linear prediction model using HMMs and Kalman filters for noisy speech processing.
Comput. Speech Lang., 2007

A comparison of estimated and MAP-predicted formants and fundamental frequencies with a speech reconstruction application.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

An Investigation into the Correlation and Prediction of Acoustic Speech Features from MFCC Vectors.
Proceedings of the IEEE International Conference on Acoustics, 2007

Visually-Derived Wiener Filters for Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2007

Restoration of noisy and band limited archived speech records with linear predictor and harmonic noise models.
Proceedings of the 15th European Signal Processing Conference, 2007

Noisy audio speech enhancement using Wiener filters derived from visual speech.
Proceedings of the Auditory-Visual Speech Processing 2007, 2007

Maximising audio-visual speech correlation.
Proceedings of the Auditory-Visual Speech Processing 2007, 2007

2006
Acoustic environment classification.
ACM Trans. Speech Lang. Process., 2006

Robust speech recognition over mobile and IP networks in burst-like packet loss.
IEEE Trans. Speech Audio Process., 2006

Special Issue on Robustness Issues for Conversational Interaction.
Speech Commun., 2006

Clean speech reconstruction from MFCC vectors and fundamental frequency using an integrated front-end.
Speech Commun., 2006

Towards improving the robustness of distributed speech recognition in packet loss.
Speech Commun., 2006

MAP prediction of formant frequencies and voicing class from MFCC vectors in noise.
Speech Commun., 2006

Email classification for automated service handling.
Proceedings of the 2006 ACM Symposium on Applied Computing (SAC), 2006

HMM-based MAP prediction of voiced and unvoiced formant frequencies from noisy MFCC vectors.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Analysis of correlation between audio and visual speech features for clean audio feature prediction in noise.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Noise Reduction for Driver-To-Pit-Crew Communication in Motor Racing.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Kalman filter with linear predictor and harmonic noise models for noisy speech enhancement.
Proceedings of the 14th European Signal Processing Conference, 2006

2005
Formant-tracking linear prediction models for speech processing in noisy environments.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Fundamental frequency and voicing prediction from MFCCs for speech reconstruction from unconstrained speech.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Combining packet loss compensation methods for robust distributed speech recognition.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Formant frequency prediction from MFCC vectors in noisy environments.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Soft Decoding of Temporal Derivatives for Robust Distributed Speech Recognition in Packet Loss.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Predicting Formant Frequencies from MFCC Vectors.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
MAP prediction of pitch from MFCC vectors for speech reconstruction.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

An analysis of packet loss models for distributed speech recognition.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

A comparison of packet loss compensation methods and interleaving for speech recognition in burst-like packet loss.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Pitch prediction from MFCC vectors for speech reconstruction.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

An analysis of interleavers for robust speech recognition in burst-like packet loss.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Interleaving and estimation of lost vectors for robust speech recognition in burst-like packet loss.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

2003
Integrated pitch and MFCC extraction for speech reconstruction and speech recognition applications.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Analysis and compensation of packet loss in distributed speech recognition using interleaving.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Non-linear compression of feature vectors using transform coding and non-uniform bit allocation.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Context awareness using environmental noise classification.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Unit selection in concatenative TTS synthesis systems based on mel filter bank amplitudes and phonetic context.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Clean speech reconstruction from noisy mel-frequency cepstral coefficients using a sinusoidal model.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Low bit-rate feature vector compression using transform coding and non-uniform bit allocation.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Environmental Noise Classification for Context-Aware Applications.
Proceedings of the Database and Expert Systems Applications, 14th International Conference, 2003

2002
Transform-based feature vector compression for distributed speech recognition.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Speech reconstruction from mel-frequency cepstral coefficients using a source-filter model.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

A comparison of front-end configurations for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2002

On the derivation of the optimal payload size for packet based transmission over a binary symmetrical communication channel.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Robust speech recognition in burst-like packet loss.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Robust voice recognition over IP and mobile networks.
Proceedings of the 11th IEEE International Symposium on Personal, 2000

Robust speech recognition over IP networks.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
A comparison of techniques for tone compensation in payphone-based speech recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Joint recognition and segmentation using phonetically derived features and a hybrid phoneme model.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Improving accuracy of telephony-based, speaker-independent speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1997
Noise compensation methods for hidden Markov model speech recognition in adverse environments.
IEEE Trans. Speech Audio Process., 1997

Evaluating feature set performance using the f-ratio and j-measures.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Cepstral-time matrices and LDA for improved connected digit and sub-word recognition accuracy.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Multi-resolution phonetic/segmental features and models for HMM-based speech recognition.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
A comparitive analysis of channel-robust features and channel equalization methods for speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Inclusion of temporal information into features for speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Dynamic features for segmental speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1995
An analysis of cepstral-time matrices for noise and channel robust speech recognition.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Speech recognition in impulsive noise.
Proceedings of the 1995 International Conference on Acoustics, 1995

1994
Noisy speech recognition using cepstral-time features and spectral-time filters.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Speech modelling using cepstral-time feature matrices and hidden Markov models.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
Noise-adaptive hidden Markov models based on wiener filters.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Speech modelling using cepstral-time feature matrices.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Noisy speech recognition based on HMMs, Wiener filters and re-evaluation of most likely candidates.
Proceedings of the IEEE International Conference on Acoustics, 1993

1992
Speech recognition in noisy environments.
Proceedings of the Second International Conference on Spoken Language Processing, 1992


  Loading...