György Szaszák

Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, 2023

2021

Deep Learning Methods in Speaker Recognition: A Review.

Dávid Sztahó

Period. Polytech. Electr. Eng. Comput. Sci., 2021

2020

A low latency sequential model and its user-focused evaluation for automatic punctuation of ASR closed captions.

Balázs Tarján

Comput. Speech Lang., 2020

Deep Transformer based Data Augmentation with Subword Units for Morphologically Rich Online ASR.

CoRR, 2020

On the Effectiveness of Neural Text Generation Based Data Augmentation for Recognition of Morphologically Rich Speech.

Proceedings of the Text, Speech, and Dialogue, 2020

Using ASR Posterior Probability and Acoustic Features for Voice Disorder Classification.

Miklós Gábriel Tulics

Krisztina Mészáros

Proceedings of the 11th IEEE International Conference on Cognitive Infocommunications, 2020

Improving Real-time Recognition of Morphologically Rich Speech with Transformer Language Model.

Proceedings of the 11th IEEE International Conference on Cognitive Infocommunications, 2020

2019

On the Effects of Automatic Transcription and Segmentation Errors in Hungarian Spoken Language Processing.

Valér Kaszás

Period. Polytech. Electr. Eng. Comput. Sci., 2019

Investigation on N-Gram Approximated RNNLMs for Recognition of Morphologically Rich Speech.

Proceedings of the Statistical Language and Speech Processing, 2019

Investigating Sub-Word Embedding Strategies for the Morphologically Rich and Free Phrase-Order Hungarian.

Proceedings of the 4th Workshop on Representation Learning for NLP, 2019

Assessing the Semantic Space Bias Caused by ASR Error Propagation and its Effect on Spoken Document Summarization.

Valér Kaszás

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Leveraging a Character, Word and Prosody Triplet for an ASR Error Robust and Agglutination Friendly Punctuation Approach.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Artificial Neural Network and SVM based Voice Disorder Classification.

Miklós Gábriel Tulics

Krisztina Mészáros

Proceedings of the 10th IEEE International Conference on Cognitive Infocommunications, 2019

N-gram Approximation of LSTM Recurrent Language Models for Single-pass Recognition of Hungarian Call Center Conversations.

Proceedings of the 10th IEEE International Conference on Cognitive Infocommunications, 2019

2018

Prosodic stress detection for fixed stress languages using formal atom decomposition and a statistical hidden Markov hybrid.

Branislav Gerazov

Speech Commun., 2018

User-centric Evaluation of Automatic Punctuation in ASR Closed Captioning.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Joint Word- and Character-level Embedding CNN-RNN Models for Punctuation Restoration.

Proceedings of the 9th IEEE International Conference on Cognitive Infocommunications, 2018

A semantic space approach for automatic summarization of documents.

Valér Kaszás

Proceedings of the 9th IEEE International Conference on Cognitive Infocommunications, 2018

2017

Low Latency MaxEnt- and RNN-Based Word Sequence Models for Punctuation Restoration of Closed Caption Data.

Balázs Tarján

Proceedings of the Statistical Language and Speech Processing, 2017

Semi-Supervised Learning with Semantic Knowledge Extraction for Improved Speech Recognition in Air Traffic Control.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

A Phonological Phrase Sequence Modelling Approach for Resource Efficient and Robust Real-Time Punctuation Recovery.

Anna Moró

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

A bilingual comparison of MaxEnt- and RNN-based punctuation restoration in speech transcripts.

Balázs Tarján

Proceedings of the 8th IEEE International Conference on Cognitive Infocommunications, 2017

Assessment of pathological speech prosody based on automatic stress detection and phrasing approaches.

Proceedings of the 8th IEEE International Conference on Cognitive Infocommunications, 2017

A prosody inspired RNN approach for punctuation of machine produced speech transcripts to improve human readability.

Anna Moró

Proceedings of the 8th IEEE International Conference on Cognitive Infocommunications, 2017

A context-aware speech recognition and understanding system for air traffic control domain.

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

Ensemble Deep Neural Network Based Waveform-Driven Stress Model for Speech Synthesis.

Proceedings of the Speech and Computer - 18th International Conference, 2016

Combining Atom Decomposition of the F0 Track and HMM-based Phonological Phrase Modelling for Robust Stress Detection in Speech.

Proceedings of the Speech and Computer - 18th International Conference, 2016

Design of a Speech Corpus for Research on Cross-Lingual Prosody Transfer.

Proceedings of the Speech and Computer - 18th International Conference, 2016

Automatic Summarization of Highly Spontaneous Speech.

Proceedings of the Speech and Computer - 18th International Conference, 2016

Estimating the Sincerity of Apologies in Speech by DNN Rank Learning and Prosodic Analysis.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Summarization of Spontaneous Speech using Automatic Speech Recognition and a Speech Prosody based Tokenizer.

Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2016) - Volume 1: KDIR, Porto - Portugal, November 9, 2016

Atom decomposition based stress detection and automatic phrasing of speech.

Proceedings of the 7th IEEE International Conference on Cognitive Infocommunications, 2016

2015

Toward Exploring the Role of Disfluencies from an Acoustic Point of View: A New Aspect of (Dis)continuous Speech Prosody Modelling.

Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015

Automatic Close Captioning for Live Hungarian Television Broadcast Speech: A Fast and Resource-Efficient Approach.

Proceedings of the Speech and Computer - 17th International Conference, 2015

Using automatic stress extraction from audio for improved prosody modelling in speech synthesis.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014

Combining NLP techniques and acoustic analysis for semantic focus detection in speech.

Proceedings of the 5th IEEE Conference on Cognitive Infocommunications, 2014

2013

Using phonological phrase segmentation to improve automatic keyword spotting for the highly agglutinating Hungarian language.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Evaluating intra- and crosslingual adaptation for non-native speech recognition in a bilingual environment.

Philip N. Garner

Proceedings of the IEEE 4th International Conference on Cognitive Infocommunications, 2013

Automatic phrase segmentation and clustering in spontaneous speech.

Viola Varadi

Proceedings of the IEEE 4th International Conference on Cognitive Infocommunications, 2013

2012

Exploiting Prosody for Syntactic Analysis in Automatic Speech Understanding.

J. Lang. Model., 2012

Unsupervised Clustering of Prosodic Patterns in Spontaneous Speech.

Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

Automatic prosodic and syntactic analysis from speech in cognitive infocommunication.

Proceedings of the IEEE 3rd International Conference on Cognitive Infocommunications, 2012

2011

Automatic Intonation Recognition for the Prosodic Assessment of Language-Impaired Children.

IEEE Trans. Speech Audio Process., 2011

Analysing the Correspondence Between Automatic Prosodic Segmentation and Syntactic Structure.

Katalin Nagy

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010

Using prosody to improve automatic speech recognition.

Speech Commun., 2010

2009

A szupraszegmentális jellemzők szerepe és felhasználása a gépi beszédfelismerésben [The role and use of suprasegmental features in automatic speech recognition].

PhD thesis, 2009

Automatic intonation classification for speech training systems.

Dávid Sztahó

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008

Using prosody for the improvement of ASR - sentence modality recognition.

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007

Speech Recognition Supported by Prosodic Information for Fixed Stress Languages.

Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Using Prosody in Fixed Stress Languages for Improvement of Speech Recognition.

Proceedings of the Verbal and Nonverbal Communication Behaviours, 2007

2006

Prosodic Cues for Automatic Phrase Boundary Detection in ASR.

Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

2005

Automatic Segmentation of Continuous Speech on Word Level Based on Supra-segmental Features.

Int. J. Speech Technol., 2005

2004

Examination of Pronunciation Variation from Hand-Labelled Corpora.
