Ivan Himawan

Orcid: 0000-0003-3848-244X

According to our database1, Ivan Himawan authored at least 36 papers between 2007 and 2022.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2022
Jointly Trained Conversion Model With LPCNet for Any-to-One Voice Conversion Using Speaker-Independent Linguistic Features.
IEEE Access, 2022

2020
Speaker Adaptation of a Multilingual Acoustic Model for Cross-Language Synthesis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Voice Presentation Attack Detection Using Convolutional Neural Networks.
Proceedings of the Handbook of Biometric Anti-Spoofing, 2019

Deep domain adaptation for anti-spoofing in speaker verification systems.
Comput. Speech Lang., 2019

Speaker Adaptation of Acoustic Model using a Few Utterances in DNN-based Speech Synthesis Systems.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Investigating Domain Sensitivity of DNN Embeddings for Speaker Recognition Systems.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Improving PLDA speaker verification performance using domain mismatch compensation techniques.
Comput. Speech Lang., 2018

Investigating Deep Neural Networks for Speaker Diarization in the DIHARD Challenge.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Domain-invariant I-vector Feature Extraction for PLDA Speaker Verification.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Employing Phonetic Information in DNN Speaker Embeddings to Improve Speaker Recognition Performance.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Deep Learning Techniques for Koala Activity Detection.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

3D convolutional recurrent neural networks for bird sound detection.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

2017
Impact of automatic region-of-interest coding on perceived quality in mobile video.
Multim. Tools Appl., 2017

Semi-Supervised Learning with Semantic Knowledge Extraction for Improved Speech Recognition in Air Traffic Control.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Domain Mismatch Modeling of Out-Domain i-Vectors for PLDA Speaker Verification.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016
Feature mapping using far-field microphones for distant speech recognition.
Speech Commun., 2016

Investigating Cross-lingual Multi-level Adaptive Networks: The Importance of the Correlation of Source and Target Languages.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Short Utterance Variance Modelling and Utterance Partitioning for PLDA Speaker Verification.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Speakers In The Wild (SITW): The QUT Speaker Recognition System.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Exploiting foreign resources for DNN-based ASR.
EURASIP J. Audio Speech Music. Process., 2015

Integrating online i-vector extractor with information bottleneck based speaker diarization system.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Channel selection in the short-time modulation domain for distant speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Learning feature mapping using deep neural network bottleneck features for distant large vocabulary speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Towards utterance-based neural network adaptation in acoustic modeling.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Acceptability-based QoE Management for User-centric Mobile Video Delivery: A Field Study Evaluation.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Adaptive bilateral filtering using saliency map for deblocking low bit rate videos.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

2013
Heteroscedastic probabilistic linear discriminant analysis for manifold learning in video-based face recognition.
Proceedings of the 2013 IEEE Workshop on Applications of Computer Vision, 2013

Automatic region-of-interest detection and prioritisation for visually optimised coding of low bit rate videos.
Proceedings of the 2013 IEEE Workshop on Applications of Computer Vision, 2013

2012
Impact of Region-of-Interest Video Coding on Perceived Quality in Mobile Video.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

2011
Clustered Blind Beamforming From Ad-Hoc Microphone Arrays.
IEEE Trans. Speech Audio Process., 2011

User-driven saliency maps for evaluating Region-of-Interest detection.
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2011), 2011

2010
Clustering of ad-hoc microphone arrays for robust blind beamforming.
Proceedings of the IEEE International Conference on Acoustics, 2010

2008
Microphone Array Shape Calibration in Diffuse Noise Fields.
IEEE Trans. Speech Audio Process., 2008

A posterior approach for microphone array based speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Dealing with uncertainty in microphone placement in a microphone array speech recognition system.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Microphone Array Beamforming Approach to Blind Speech Separation.
Proceedings of the Machine Learning for Multimodal Interaction , 2007


  Loading...