Andrew W. Senior

Orcid: 0000-0002-2401-5691

  • Google Research
  • IBM T. J. Watson Research Center

According to our database, Andrew W. Senior authored at least 91 papers between 1992 and 2023.

Machine Learning for Ancient Languages: A Survey.
Comput. Linguistics, September, 2023

Learning to Decode the Surface Code with a Recurrent, Transformer-Based Neural Network.
CoRR, 2023

Deep Audio-Visual Speech Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Inferring a Continuous Distribution of Atom Coordinates from Cryo-EM Images using VAEs.
CoRR, 2021

Improved protein structure prediction using potentials from deep learning.
Nat., 2020

Large-Scale Visual Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Large-Scale Visual Speech Recognition.
CoRR, 2018

Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Lip Reading Sentences in the Wild.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Raw Multichannel Processing Using Deep Neural Networks.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017

WaveNet: A Generative Model for Raw Audio.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Scaling Memory-Augmented Neural Networks with Sparse Reads and Writes.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Flat start training of CD-CTC-SMBR LSTM RNN acoustic models.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Latent Predictor Networks for Code Generation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Deep Learning for Acoustic Modeling in Parametric Speech Generation: A systematic review of existing techniques and future trends.
IEEE Signal Process. Mag., 2015

A Real-Time End-to-End Multilingual Speech Recognition Architecture.
IEEE J. Sel. Top. Signal Process., 2015

Fast and accurate recurrent neural network acoustic models for speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Learning the speech front-end with raw waveform CLDNNs.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Large vocabulary automatic speech recognition for children.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Context dependent phone models for LSTM RNN acoustic modelling.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Learning acoustic frame labeling for speech recognition with recurrent neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Acoustic modelling with CD-CTC-SMBR LSTM RNNs.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech Recognition.
CoRR, 2014

Sequence discriminative distributed training of long short-term memory recurrent neural networks.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Long short-term memory recurrent neural network architectures for large scale acoustic modeling.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Asynchronous stochastic optimization for sequence training of deep neural networks: towards big data.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Asynchronous, online, GMM-free training of a context dependent acoustic model for speech recognition.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Deep mixture density networks for acoustic modeling in statistical parametric speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2014

Fine context, low-rank, softplus deep neural networks for mobile speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Improving DNN speaker independence with I-vector inputs.
Proceedings of the IEEE International Conference on Acoustics, 2014

GMM-free DNN acoustic model training.
Proceedings of the IEEE International Conference on Acoustics, 2014

Asynchronous stochastic optimization for sequence training of deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

Accurate and compact large vocabulary speech recognition on mobile devices.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

On rectified linear units for speech processing.
Proceedings of the IEEE International Conference on Acoustics, 2013

Statistical parametric speech synthesis using deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2013

An empirical study of learning rates in deep neural networks for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Multilingual acoustic models using distributed deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2013

Large scale deep neural network acoustic modeling with semi-supervised training data for YouTube video transcription.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Robust and efficient foreground analysis in complex surveillance videos.
Mach. Vis. Appl., 2012

Large Scale Distributed Deep Networks.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Application of Pretrained Deep Neural Networks to Large Vocabulary Speech Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Learning improved linear transforms for speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Translation-Inspired OCR.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

Privacy Protection and Face Recognition.
Proceedings of the Handbook of Face Recognition, 2nd Edition, 2011

Interactive Motion Analysis for Video Surveillance and Long Term Scene Monitoring.
Proceedings of the Computer Vision - ACCV 2010 Workshops, 2010

Enhancing Privacy Protection in Multimedia Systems.
EURASIP J. Inf. Secur., 2009

Privacy Protection in a Video Surveillance System.
Proceedings of the Protecting Privacy in Video Surveillance, 2009

An Introduction to Automatic Video Surveillance.
Proceedings of the Protecting Privacy in Video Surveillance, 2009

IBM smart surveillance system (S3): event based video surveillance system with an open and extensible framework.
Mach. Vis. Appl., 2008

Privacy enablement in a surveillance system.
Proceedings of the International Conference on Image Processing, 2008

Joint face and head tracking inside multi-camera smart rooms.
Signal Image Video Process., 2007

S3: The IBM Smart Surveillance System: From Transactional Systems to Observational Systems.
Proceedings of the IEEE International Conference on Acoustics, 2007

Video analytics for retail.
Proceedings of the Fourth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2007

Searching surveillance video.
Proceedings of the Fourth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2007

Appearance models for occlusion handling.
Image Vis. Comput., 2006

Presence/Absence: The 2005 ACM Multimedia Interactive Art Exhibition.
IEEE Multim., 2006

Enabling Video Privacy through Computer Vision.
IEEE Secur. Priv., 2005

Acquiring Multi-Scale Images by Pan-Tilt-Zoom Control and Automatic Multi-Camera Calibration.
Proceedings of the 7th IEEE Workshop on Applications of Computer Vision / IEEE Workshop on Motion and Video Computing (WACV/MOTION 2005), 2005

ACM multimedia interactive art program: an introduction to the presence/absence exhibition.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

A Joint System for Person Tracking and Face Detection.
Proceedings of the Computer Vision in Human-Computer Interaction, 2005

IBM smart surveillance system (S3): a open and extensible framework for event based surveillance.
Proceedings of the Advanced Video and Signal Based Surveillance, 2005

The Relation between the ROC Curve and the CMC.
Proceedings of the Fourth IEEE Workshop on Automatic Identification Advanced Technologies (AutoID 2005), 2005

Shibboleth: exploring cultural boundaries in speech.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

S3-R1: the IBM smart surveillance system-release 1.
Proceedings of the 2004 ACM SIGMM Workshop on Effective Telepresence, 2004

Detection and tracking in the IBM PeopleVision system.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Recent advances in the automatic recognition of audiovisual speech.
Proc. IEEE, 2003

Security, Privacy, and Health.
IEEE Pervasive Comput., 2003

Face Cataloger: Multi-Scale Imaging for Relating Identity to Location.
Proceedings of the 2003 IEEE Conference on Advanced Video and Signal Based Surveillance (AVSS 2003), 2003

Absolute Head Pose Estimation From Overhead Wide-Angle Cameras.
Proceedings of the 2003 IEEE International Workshop on Analysis and Modeling of Faces and Gestures (AMFG 2003), 2003

Fingerprint Minutiae: A Constructive Definition.
Proceedings of the Biometric Authentication, 2002

Audio-Visual Speaker Recognition for Video Broadcast News.
J. VLSI Signal Process., 2001

A Combination Fingerprint Classifier.
IEEE Trans. Pattern Anal. Mach. Intell., 2001

A Cascade Visual Front End for Speaker Independent Automatic Speechreading.
Int. J. Speech Technol., 2001

Automated Biometrics.
Proceedings of the Advances in Pattern Recognition, 2001

Joint processing of audio and visual information for multimedia indexing and human-computer interaction.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications), 2000

Virtual Garden: a vision-based multimedia installation.
Proceedings of the ACM Multimedia 2000 Workshops, Los Angeles, CA, USA, October 30, 2000

Perceptual interfaces for information interaction: joint processing of audio and visual information for human-computer interaction.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Recovering Frontal-Pose Image from a Single Profile Image.
Proceedings of the 2000 International Conference on Image Processing, 2000

Audio-visual intent-to-speak detection for human-computer interaction.
Proceedings of the IEEE International Conference on Acoustics, 2000

Audio-visual speaker recognition for video broadcast news: some fusion techniques.
Proceedings of the Third IEEE Workshop on Multimedia Signal Processing, 1999

Audio-visual large vocabulary continuous speech recognition in the broadcast domain.
Proceedings of the Third IEEE Workshop on Multimedia Signal Processing, 1999

On the use of visual information for improving audio-based speaker recognition.
Proceedings of the Auditory-Visual Speech Processing, 1999

An Off-Line Cursive Handwriting Recognition System.
IEEE Trans. Pattern Anal. Mach. Intell., 1998

Writer adaptation of a HMM handwriting recognition system.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Duration modeling results for an on-line handwriting recognizer.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Initialization of hidden Markov models for unconstrained on-line handwriting recognition.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Forward-backward retraining of recurrent neural networks.
Proceedings of the Advances in Neural Information Processing Systems 8, 1995

Using constrained snakes for feature spotting in off-line cursive script.
Proceedings of the 2nd International Conference Document Analysis and Recognition, 1993

Off-line Handwriting Recognition by Recurrent Error Propagation Networks.
Proceedings of the British Machine Vision Conference, 1992
