Petr Cerva
ORCID: 0000-0003-0767-0106
According to our database, Petr Cerva authored at least 62 papers between 2005 and 2024.
A lightweight approach to real-time speaker diarization: from audio toward audio-visual data streams.
EURASIP J. Audio Speech Music. Process., December, 2024
Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023
Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Combining Multilingual Resources and Models to Develop State-of-the-Art E2E ASR for Swedish.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the Text, Speech, and Dialogue - 25th International Conference, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Identification of related languages from spoken data: Moving from off-line to on-line scenario.
Comput. Speech Lang., 2021
Identification of Scandinavian Languages from Speech Using Bottleneck Features and X-Vectors.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the Text, Speech, and Dialogue, 2020
Dealing with Newly Emerging OOVs in Broadcast Programs by Daily Updates of the Lexicon and Language Model.
Proceedings of the Speech and Computer - 22nd International Conference, 2020
Proceedings of the 11th IEEE International Conference on Cognitive Infocommunications, 2020
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Robust Recognition of Conversational Telephone Speech via Multi-condition Training and Data Augmentation.
Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018
Using Deep Neural Networks for Identification of Slavic Languages from Acoustic Signal.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Robust Recognition of Speech with Background Music in Acoustically Under-Resourced Scenarios.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Speech Activity Detection in online broadcast transcription using Deep Neural Networks and Weighted Finite State Transducers.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016
Study on the Use of Deep Neural Networks for Speech Activity Detection in Broadcast Recordings.
Proceedings of the 13th International Joint Conference on e-Business and Telecommunications (ICETE 2016), 2016
Study on the Use and Adaptation of Bottleneck Features for Robust Speech Recognition of Nonlinearly Distorted Speech.
Proceedings of the 13th International Joint Conference on e-Business and Telecommunications (ICETE 2016), 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Investigation into the Use of WFSTs and DNNs for Speech Activity Detection in Broadcast Data Transcription.
Proceedings of the E-Business and Telecommunications - 13th International Joint Conference, 2016
Proceedings of the 38th International Conference on Telecommunications and Signal Processing, 2015
Proceedings of the 38th International Conference on Telecommunications and Signal Processing, 2015
Cross-Lingual Adaptation of Broadcast Transcription System to Polish Language Using Public Data Sources.
Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2015
A cross-lingual adaptation approach for rapid development of speech recognizers for learning disabled users.
EURASIP J. Audio Speech Music. Process., 2014
Investigation of deep neural networks for robust recognition of nonlinearly distorted speech.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Speech-to-text technology to transcribe and disclose 100,000+ hours of bilingual documents from historical Czech and Czechoslovak radio archive.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 25th International Workshop on Database and Expert Systems Applications, 2014
Speaker-adaptive speech recognition using speaker diarization for improved transcription of large spoken archives.
Speech Commun., 2013
Proceedings of the 36th International Conference on Telecommunications and Signal Processing, 2013
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013
Downdating Lexicon and Language Model for Automatic Transcription of Czech Historical Spoken Documents.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013
Dealing with Bilingualism in Automatic Transcription of Historical Archive of Czech Radio.
Proceedings of the New Trends in Image Analysis and Processing - ICIAP 2013, 2013
Adding controlled amount of noise to improve recognition of compressed and spectrally distorted speech.
Proceedings of the IEEE International Conference on Acoustics, 2013
J. Multim., 2012
Incorporation of the ASR output in speaker segmentation and clustering within the task of speaker diarization of broadcast streams.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012
Large-scale processing, indexing and search system for Czech audio-visual cultural heritage archives.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012
Study on Integration of Speaker Diarization with Speaker Adaptive Speech Recognition for Broadcast Transcription.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Real-Time Lecture Transcription using ASR for Czech Hearing Impaired or Deaf Students.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Voice Technology to Enable Sophisticated Access to Historical Audio Archive of the Czech Radio.
Proceedings of the Multimedia for Cultural Heritage - First International Workshop, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Using Unsupervised Feature-Based Speaker Adaptation for Improved Transcription of Spoken Archives.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Rainbow Bridge - Training Center based on Voice Technology for People with Physical Disabilities.
Proceedings of the HEALTHINF 2011, 2011
Proceedings of the Analysis of Verbal and Nonverbal Communication and Enactment. The Processing Issues, 2010
Proceedings of the Computer Recognition Systems 3, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Challenges in Speech Processing of Slavic Languages (Case Studies in Speech Recognition of Czech and Slovak).
Proceedings of the Development of Multimodal Interfaces: Active Listening and Synchrony, 2009
Proceedings of the Text, Speech and Dialogue, 11th International Conference, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the Cross-Modal Analysis of Speech, Gestures, Gaze and Facial Expressions, 2008
Proceedings of the Multimodal Signals: Cognitive and Algorithmic Issues, 2008
MyVoice goes Spanish. Cross-lingual Adaptation of a Voice Controlled PC Tool for Handicapped People.
Proces. del Leng. Natural, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Two-step unsupervised speaker adaptation based on speaker and gender recognition and HMM combination.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Supervised and Unsupervised Speaker Adaptation in Large Vocabulary Continuous Speech Recognition of Czech.
Proceedings of the Text, Speech and Dialogue, 8th International Conference, 2005
Fully automated system for Czech spoken broadcast transcription with very large (300k+) lexicon.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005