Stephen J. Cox

Orcid: 0000-0002-4443-1000

According to our database, Stephen J. Cox authored at least 58 papers between 1989 and 2021.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2021
1D Convolutional Neural Networks for Detecting Nystagmus.
IEEE J. Biomed. Health Informatics, 2021

A predictor analysis framework for surface radiation budget reprocessing using satellite data.
Int. J. Crit. Infrastructures, 2021

Detecting positional vertigo using an ensemble of 2D convolutional neural networks.
Biomed. Signal Process. Control, 2021

2019
Automatic nystagmus detection and quantification in long-term continuous eye-movement data.
Comput. Biol. Medicine, 2019

2016
Visual units and confusion modelling for automatic lip-reading.
Image Vis. Comput., 2016

Improved speaker independent lip reading using speaker adaptive training and deep neural networks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Tennis Ball Tracking Using a Two-Layered Data Association Approach.
IEEE Trans. Multim., 2015

Improving lip-reading performance for robust audiovisual speech recognition using DNNs.
Proceedings of the Auditory-Visual Speech Processing, 2015

Speaker-independent machine lip-reading with speaker-dependent viseme classifiers.
Proceedings of the Auditory-Visual Speech Processing, 2015

Detection of anomalous events in a tennis game using multimodal information.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Multimodal joint information processing in human machine interaction: recent advances.
Multim. Tools Appl., 2014

Automatic annotation of tennis games: An integration of audio, vision, and learning.
Image Vis. Comput., 2014

Unsupervised model selection for recognition of regional accented speech.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013
Native accent classification via i-vectors and speaker compensation fusion.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A two layered data association approach for ball tracking.
Proceedings of the IEEE International Conference on Acoustics, 2013

Confusion modelling for automated lip-reading using weighted finite-state transducers.
Proceedings of the Auditory-Visual Speech Processing, 2013

2012
Language Identification Using Visual Features.
IEEE Trans. Speech Audio Process., 2012

Iterative classification of regional British accents in i-vector space.
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Improved audio event detection by use of contextual noise.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Detection of ball hits in a tennis game using audio and visual information.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Inferring the Structure of a Tennis Game Using Audio Information.
IEEE Trans. Speech Audio Process., 2011

Learning Score Structure from Spoken Language for a Tennis Game.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Iterative Improvement of Speaker Segmentation in a Noisy Environment Using High-Level Knowledge.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

An Accurate and Robust Gender Identification Algorithm.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Improved detection of ball hit events in a tennis game using multimodal information.
Proceedings of the Auditory-Visual Speech Processing, 2011

2010
Using high-level information to detect key audio events in a tennis game.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Speaker independent visual-only language identification.
Proceedings of the IEEE International Conference on Acoustics, 2010

Hierarchical language modeling for audio events detection in a sports game.
Proceedings of the IEEE International Conference on Acoustics, 2010

Limitations of visual speech recognition.
Proceedings of the Auditory-Visual Speech Processing, 2010

2009
Modelling Errors in Automatic Speech Recognition for Dysarthric Speakers.
EURASIP J. Adv. Signal Process., 2009

Example-based speech recognition using formulaic phrases.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

On the estimation and the use of confusion-matrices for improving ASR accuracy.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Automatic visual-only language identification: A preliminary study.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Application of weighted finite-state transducers to improve recognition accuracy for dysarthric speech.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

The challenge of multispeaker lip-reading.
Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

2007
Modelling confusion matrices to improve speech recognition accuracy, with an application to dysarthric speech.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Analysis of User Interaction with Service Oriented Chatbot Systems.
Proceedings of the Human-Computer Interaction. HCI Intelligent Multimodal Interaction Environments, 2007

2006
Task-independent call-routing.
Speech Commun., 2006

2004
Mixture language models for call routing.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Improving phoneme recognition of telephone quality speech.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Integrated pitch and MFCC extraction for speech reconstruction and speech recognition applications.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Unit selection in concatenative TTS synthesis systems based on mel filter bank amplitudes and phonetic context.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Automatic call-routing without transcriptions.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

The use of confidence measures in vector based call-routing.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Discriminative techniques in call routing.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Extraction of Visual Features for Lipreading.
IEEE Trans. Pattern Anal. Mach. Intell., 2002

1998
Towards speech recognizer assessment using a human reference standard.
Comput. Speech Lang., 1998

Nonlinear scale decomposition based features for visual speech recognition.
Proceedings of the 9th European Signal Processing Conference, 1998

A Comparison of Active Shape Model and Scale Decomposition Based Features for Visual Speech Recognition.
Proceedings of the Computer Vision, 1998

Lipreading Using Shape, Shading and Scale.
Proceedings of the Auditory-Visual Speech Processing, 1998

1997
Evaluating feature set performance using the f-ratio and j-measures.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Lip reading from scale-space measurements.
Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97), 1997

Combining noise compensation with visual information in speech recognition.
Proceedings of the ESCA Workshop on Audio-Visual Speech Processing, 1997

1996
Audiovisual speech recognition using multiscale nonlinear image decomposition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Confidence measures for the SWITCHBOARD database.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1990
RecNorm: Simultaneous Normalisation and Classification Applied to Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 3, 1990

1989
Some statistical issues in the comparison of speech recognition algorithms.
Proceedings of the IEEE International Conference on Acoustics, 1989

Unsupervised speaker adaptation by probabilistic spectrum fitting.
Proceedings of the IEEE International Conference on Acoustics, 1989

