Andros Tjandra

2020

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Incremental Machine Speech Chain Towards Enabling Listening While Speaking in Real-Time.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Augmenting Images for ASR and TTS Through Single-Loop and Dual-Loop Multimodal Chain Framework.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Transformer-Based Acoustic Modeling for Hybrid Speech Recognition.

Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, 2020

DEJA-VU: Double Feature Presentation and Iterated Loss in Deep Transformer Networks.

Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, 2020

2019

Deja-vu: Double Feature Presentation in Deep Transformer Networks.

CoRR, 2019

From Speech Chain to Multimodal Chain: Leveraging Cross-modal Data Augmentation for Semi-supervised Learning.

CoRR, 2019

End-to-End Speech Recognition Sequence Training With Reinforcement Learning.

IEEE Access, 2019

Recognition and translation of code-switching speech utterances.

Proceedings of the 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2019

VQVAE Unsupervised Unit Discovery and Multi-Scale Code2Spec Inverter for Zerospeech Challenge 2019.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

End-to-end Feedback Loss in Speech Chain Framework via Straight-through Estimator.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2019

Speech-to-Speech Translation Between Untranscribed Unknown Languages.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Zero-Shot Code-Switching ASR and TTS with Multilingual Machine Speech Chain.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Listening While Speaking and Visualizing: Improving ASR Through Multimodal Chain.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018

Multi-Scale Alignment and Contextual History for Attention Mechanism in Sequence-to-Sequence Model.

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Speech Chain for Semi-Supervised Learning of Japanese-English Code-Switching ASR and TTS.

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Machine Speech Chain with One-shot Speaker Adaptation.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Compressing End-to-end ASR Networks by Tensor-Train Decomposition.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Tensor Decomposition for Compressing Recurrent Neural Network.

Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Sequence-to-Sequence ASR Optimization Via Reinforcement Learning.

Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

2017

Local Monotonic Attention Mechanism for End-to-End Speech Recognition.

CoRR, 2017

Speech recognition features based on deep latent Gaussian models.

Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Compressing recurrent neural network with tensor train.

Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Local Monotonic Attention Mechanism for End-to-End Speech And Language Processing.

Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Attention-based Wav2Text with feature transfer learning.

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Listening while speaking: Speech chain by deep learning.
