Hasim Sak
Affiliations:- Google, Inc., USA
- Bogazici University, Department of Computer Engineering, Istanbul, Turkey (PhD 2011)
According to our database1,
Hasim Sak
authored at least 54 papers
between 2005 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on linkedin.com
On csauthors.net:
Bibliography
2024
CoRR, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection.
CoRR, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
2020
Transformer Transducer: One Model Unifying Streaming and Non-streaming Speech Recognition.
CoRR, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
A Density Ratio Approach to Language Model Fusion in End-to-End Automatic Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
2017
Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Recurrent Neural Aligner: An Encoder-Decoder Neural Network Model for Sequence to Sequence Mapping.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Exploring architectures, data and units for streaming end-to-end speech recognition with RNN-transducer.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Learning acoustic frame labeling for speech recognition with recurrent neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Grapheme-to-phoneme conversion using Long Short-Term Memory recurrent neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech Recognition.
CoRR, 2014
Sequence discriminative distributed training of long short-term memory recurrent neural networks.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Long short-term memory recurrent neural network architectures for large scale acoustic modeling.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Automatic language identification using long short-term memory recurrent neural networks.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
2012
Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition.
IEEE Trans. Speech Audio Process., 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
Integrating morphology into automatic speech recognition: Morpholexical and discriminative language models for Turkish (Biçimbilimin otomatik konuşma tanımaya bütünleştirilmesi: Türkçe için biçimsözlüksel ve ayırıcı dil modelleri)
PhD thesis, 2011
J. Multimodal User Interfaces, 2011
Discriminative reranking of ASR hypotheses with morpholexical and N-best-list features.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
IEEE Trans. Speech Audio Process., 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
Proceedings of the ACL 2009, 2009
2008
Turkish Language Resources: Morphological Parser, Morphological Disambiguator and Web Corpus.
Proceedings of the Advances in Natural Language Processing, 2008
2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2007
2005
Proceedings of the 13th European Signal Processing Conference, 2005