Ivan Himawan

Fernando Villavicencio

Clinton Fookes

Comput. Speech Lang., 2019

Speaker Adaptation of Acoustic Model using a Few Utterances in DNN-based Speech Synthesis Systems.

[BibT_eX]

[DOI]

Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Investigating Domain Sensitivity of DNN Embeddings for Speaker Recognition Systems.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Improving PLDA speaker verification performance using domain mismatch compensation techniques.

[BibT_eX]

[DOI]

Md. Hafizur Rahman

David Dean

Comput. Speech Lang., 2018

Investigating Deep Neural Networks for Speaker Diarization in the DIHARD Challenge.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Domain-invariant I-vector Feature Extraction for PLDA Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Employing Phonetic Information in DNN Speaker Embeddings to Improve Speaker Recognition Performance.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Deep Learning Techniques for Koala Activity Detection.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

3D convolutional recurrent neural networks for bird sound detection.

[BibT_eX]

[DOI]

Michael Towsey

Paul Roe

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

2017

Impact of automatic region-of-interest coding on perceived quality in mobile video.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2017

Semi-Supervised Learning with Semantic Knowledge Extraction for Improved Speech Recognition in Air Traffic Control.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Domain Mismatch Modeling of Out-Domain i-Vectors for PLDA Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016

Feature mapping using far-field microphones for distant speech recognition.

[BibT_eX]

[DOI]

Speech Commun., 2016

Investigating Cross-lingual Multi-level Adaptive Networks: The Importance of the Correlation of Source and Target Languages.

[BibT_eX]

[DOI]

Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Short Utterance Variance Modelling and Utterance Partitioning for PLDA Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Speakers In The Wild (SITW): The QUT Speaker Recognition System.

[BibT_eX]

[DOI]

Clinton Fookes

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015

Exploiting foreign resources for DNN-based ASR.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2015

Integrating online i-vector extractor with information bottleneck based speaker diarization system.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Channel selection in the short-time modulation domain for distant speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Learning feature mapping using deep neural network bottleneck features for distant large vocabulary speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Towards utterance-based neural network adaptation in acoustic modeling.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Acceptability-based QoE Management for User-centric Mobile Video Delivery: A Field Study Evaluation.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Adaptive bilateral filtering using saliency map for deblocking low bit rate videos.

[BibT_eX]

[DOI]

Andrew J. Zele

Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

2013

Heteroscedastic probabilistic linear discriminant analysis for manifold learning in video-based face recognition.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Workshop on Applications of Computer Vision, 2013

Automatic region-of-interest detection and prioritisation for visually optimised coding of low bit rate videos.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Workshop on Applications of Computer Vision, 2013

2012

Impact of Region-of-Interest Video Coding on Perceived Quality in Mobile Video.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

2011

Clustered Blind Beamforming From Ad-Hoc Microphone Arrays.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2011

User-driven saliency maps for evaluating Region-of-Interest detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2011), 2011

2010

Clustering of ad-hoc microphone arrays for robust blind beamforming.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

2008

Microphone Array Shape Calibration in Diffuse Noise Fields.

[BibT_eX]

[DOI]

Mike Lincoln

IEEE Trans. Speech Audio Process., 2008

A posterior approach for microphone array based speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Dealing with uncertainty in microphone placement in a microphone array speech recognition system.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Microphone Array Beamforming Approach to Blind Speech Separation.

[BibT_eX]

[DOI]