Jindrich Zdánský
Orcid: 0000-0001-5591-7228
According to our database, Jindrich Zdánský authored at least 54 papers between 2004 and 2023.
Bibliography
2023
Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023
Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Combining Multilingual Resources and Models to Develop State-of-the-Art E2E ASR for Swedish.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
2022
Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Proceedings of the Text, Speech, and Dialogue - 25th International Conference, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Identification of related languages from spoken data: Moving from off-line to on-line scenario.
Comput. Speech Lang., 2021
Blind Extraction of Target Speech Source Guided by Supervised Speaker Identification via X-vectors.
CoRR, 2021
Identification of Scandinavian Languages from Speech Using Bottleneck Features and X-Vectors.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Blind Extraction of Moving Audio Source in a Challenging Environment Supported by Speaker Identification Via X-Vectors.
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Proceedings of the Text, Speech, and Dialogue, 2020
Proceedings of the Text, Speech, and Dialogue, 2020
Adaptive Blind Audio Source Extraction Supervised By Dominant Speaker Identification Using X-Vectors.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 11th IEEE International Conference on Cognitive Infocommunications, 2020
2019
On Practical Aspects of Multi-condition Training Based on Augmentation for Reverberation-/Noise-Robust Speech Recognition.
Proceedings of the Text, Speech, and Dialogue - 22nd International Conference, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
2018
Robust Recognition of Conversational Telephone Speech via Multi-condition Training and Data Augmentation.
Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018
Using Deep Neural Networks for Identification of Slavic Languages from Acoustic Signal.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Robust Recognition of Speech with Background Music in Acoustically Under-Resourced Scenarios.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Speech Activity Detection in online broadcast transcription using Deep Neural Networks and Weighted Finite State Transducers.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
2016
Study on the Use of Deep Neural Networks for Speech Activity Detection in Broadcast Recordings.
Proceedings of the 13th International Joint Conference on e-Business and Telecommunications (ICETE 2016), 2016
Investigation into the Use of WFSTs and DNNs for Speech Activity Detection in Broadcast Data Transcription.
Proceedings of the E-Business and Telecommunications - 13th International Joint Conference, 2016
2015
Proceedings of the 38th International Conference on Telecommunications and Signal Processing, 2015
2014
Speech-to-text technology to transcribe and disclose 100,000+ hours of bilingual documents from historical Czech and Czechoslovak radio archive.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
2013
Speaker-adaptive speech recognition using speaker diarization for improved transcription of large spoken archives.
Speech Commun., 2013
2012
J. Multim., 2012
Incorporation of the ASR output in speaker segmentation and clustering within the task of speaker diarization of broadcast streams.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012
Large-scale processing, indexing and search system for Czech audio-visual cultural heritage archives.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012
Study on Integration of Speaker Diarization with Speaker Adaptive Speech Recognition for Broadcast Transcription.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Real-Time Lecture Transcription using ASR for Czech Hearing Impaired or Deaf Students.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
2011
Voice Technology to Enable Sophisticated Access to Historical Audio Archive of the Czech Radio.
Proceedings of the Multimedia for Cultural Heritage - First International Workshop, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Challenges in Speech Processing of Slavic Languages (Case Studies in Speech Recognition of Czech and Slovak).
Proceedings of the Development of Multimodal Interfaces: Active Listening and Synchrony, 2009
2008
Proceedings of the Text, Speech and Dialogue, 11th International Conference, 2008
Proceedings of the International Workshop on Multimedia Signal Processing, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the Cross-Modal Analysis of Speech, Gestures, Gaze and Facial Expressions, 2008
Proceedings of the Multimodal Signals: Cognitive and Algorithmic Issues, 2008
Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008
2006
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
2005
The COST278 broadcast news segmentation and speaker clustering evaluation - overview, methodology, systems, results.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Detection of acoustic change-points in audio records via global BIC maximization and dynamic programming.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Fully automated system for Czech spoken broadcast transcription with very large (300k+) lexicon.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
2004
Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004
An improved preprocessor for the automatic transcription of broadcast news audio stream.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Very large vocabulary speech recognition system for automatic transcription of Czech broadcast programs.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004