Géza Németh

Orcid: 0000-0002-2311-4858

According to our database1, Géza Németh authored at least 78 papers between 1990 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
ChildTinyTalks (CTT): A Benchmark Dataset and Baseline for Expressive Child Speech Synthesis.
Proceedings of the Speech and Computer - 26th International Conference, 2024

2023
Universal Approach to Multilingual Multispeaker Child Speech SynthesisUniversal Approach to Multilingual Multispeaker Child Speech Synthesis.
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

Automated Child Voice Generation: Methodology and Implementation.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2023

Advancing Limited Data Text-to-Speech Synthesis: Non-Autoregressive Transformer for High-Quality Parallel Synthesis.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2023

Nonparallel Expressive TTS for Unseen Target Speaker using Style-Controlled Adaptive Layer and Optimized Pitch Embedding.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2023

Concept and Pictogram-Based User-Interface Design of a Helper Tool for People with Aphasia.
Proceedings of the dHealth 2023, 2023

2022

Towards Parametric Speech Synthesis Using Gaussian-Markov Model of Spectral Envelope and Wavelet-Based Decomposition of F0.
Proceedings of the 30th European Signal Processing Conference, 2022


2021
Noise and acoustic modeling with waveform generator in text-to-speech and neutral speech conversion.
Multim. Tools Appl., 2021

Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters.
CoRR, 2021

Adaptation of Tacotron2-based Text-To-Speech for Articulatory-to-Acoustic Mapping using Ultrasound Tongue Imaging.
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

Continuous Wavelet Vocoder-Based Decomposition of Parametric Speech Waveform Synthesis.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
Continuous Noise Masking Based Vocoder for Statistical Parametric Speech Synthesis.
IEICE Trans. Inf. Syst., 2020

A continuous vocoder for statistical parametric speech synthesis and its evaluation using an audio-visual phonetically annotated Arabic corpus.
Comput. Speech Lang., 2020

2019
Continuous vocoder applied in deep neural network based voice conversion.
Multim. Tools Appl., 2019

WaveTract: A hybrid generative model for speech synthesis.
Proceedings of the 2019 International Conference on Speech Technology and Human-Computer Dialogue, 2019

Parallel Voice Conversion Based on a Continuous Sinusoidal Model.
Proceedings of the 2019 International Conference on Speech Technology and Human-Computer Dialogue, 2019

Self-Attention Networks for Intent Detection.
Proceedings of the International Conference on Recent Advances in Natural Language Processing, 2019

Transformer Based Grapheme-to-Phoneme Conversion.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Ultrasound-Based Silent Speech Interface Built on a Continuous Vocoder.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

RNN-based speech synthesis using a continuous sinusoidal model.
Proceedings of the International Joint Conference on Neural Networks, 2019

2018
Text normalization with convolutional neural networks.
Int. J. Speech Technol., 2018

A Continuous Vocoder Using Sinusoidal Model for Statistical Parametric Speech Synthesis.
Proceedings of the Speech and Computer - 20th International Conference, 2018

2017
Deep Recurrent Neural Networks in Speech Synthesis Using a Continuous Vocoder.
Proceedings of the Speech and Computer - 19th International Conference, 2017

Time-Domain Envelope Modulating the Noise Component of Excitation in a Continuous Residual-Based Vocoder for Statistical Parametric Speech Synthesis.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016
Improving HMM speech synthesis of interrogative sentences by pitch track transformations.
Speech Commun., 2016

Improvements to Prosodic Variation in Long Short-Term Memory Based Intonation Models Using Random Forest.
Proceedings of the Speech and Computer - 18th International Conference, 2016

Ensemble Deep Neural Network Based Waveform-Driven Stress Model for Speech Synthesis.
Proceedings of the Speech and Computer - 18th International Conference, 2016

Design of a Speech Corpus for Research on Cross-Lingual Prosody Transfer.
Proceedings of the Speech and Computer - 18th International Conference, 2016

DNN-Based Duration Modeling for Synthesizing Short Sentences.
Proceedings of the Speech and Computer - 18th International Conference, 2016

Modeling unvoiced sounds in statistical parametric speech synthesis with a continuous vocoder.
Proceedings of the 24th European Signal Processing Conference, 2016

2015
Residual-Based Excitation with Continuous F0 Modeling in HMM-Based Speech Synthesis.
Proceedings of the Statistical Language and Speech Processing, 2015

A polyglot domain optimised text-to-speech system for railway station announcements.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Automatic transformation of irregular to regular voice by residual analysis and synthesis.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Synthesis of speaking styles with corpus- and HMM-based approaches.
Proceedings of the 6th IEEE International Conference on Cognitive Infocommunications, 2015

2014
Modeling Irregular Voice in Statistical Parametric Speech Synthesis With Residual Codebook Based Excitation.
IEEE J. Sel. Top. Signal Process., 2014

Statistical parametric speech synthesis with a novel codebook-based excitation model.
Intell. Decis. Technol., 2014

Gaps to Bridge in Speech Technology.
Proceedings of the Speech and Computer - 16th International Conference, 2014

The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

2013
A novel irregular voice model for HMM-based speech synthesis.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Some aspects of synthetic elderly voices in ambient assisted living systems.
Proceedings of the 7th Conference on Speech Technology and Human-Computer Dialogue, 2013

Application of the NAO humanoid robot in the treatment of bone marrow-transplanted children (demo).
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Speech-centric Multimodal Interaction for Easy-to-access Online Services - A Personal Life Assistant for the Elderly.
Proceedings of the 5th International Conference on Software Development for Enhancing Accessibility and Fighting Info-exclusion, 2013

2012
Optimizing HMM Speech Synthesis for Low-Resource Devices.
J. Adv. Comput. Intell. Intell. Informatics, 2012

New Features in the VoxAid Communication Aid for Speech Impaired People.
Proceedings of the Computers Helping People with Special Needs, 2012

Cognitive infocommunications preferences of active senior citizens.
Proceedings of the IEEE 3rd International Conference on Cognitive Infocommunications, 2012

A novel codebook-based excitation model for use in speech synthesis.
Proceedings of the IEEE 3rd International Conference on Cognitive Infocommunications, 2012

Application of the NAO humanoid robot in the treatment of marrow-transplanted children.
Proceedings of the IEEE 3rd International Conference on Cognitive Infocommunications, 2012

2011
The Effects of Phoneme Errors in Speaker Adaptation for HMM Speech Synthesis.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010
Improvements of Hungarian Hidden Markov Model-based Text-to-Speech Synthesis.
Acta Cybern., 2010

Special Speech Synthesis for Social Network Websites.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

Some Aspects of ASR Transcription Based Unsupervised Speaker Adaptation for HMM Speech Synthesis.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

2009
Automatic Classification of Regular vs. Irregular Phonation Types.
Proceedings of the Advances in Nonlinear Speech Processing, 2009

Human voice or prompt generation? can they co-exist in an application?
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008
Multimodal Spontaneous Expressive Speech Corpus for Hungarian.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Automated Drug Information System for Aged and Visually Impaired Persons.
Proceedings of the Computers Helping People with Special Needs, 2008

2007
Speech based drug information system for aged and visually impaired persons.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Increasing prosodic variability of text-to-speech synthesizers.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Expressive Speech Synthesis Using Emotion-Specific Speech Inventories.
Proceedings of the Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction, 2007

2006
Corpus-Based Unit Selection TTS for Hungarian.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

VoxAid 2006: Telephone Communication for Hearing and/or Vocally Impaired People.
Proceedings of the Computers Helping People with Special Needs, 2006

2004
Mobile Devices Converted into a Speaking Communication Aid.
Proceedings of the Computers Helping People with Special Needs, 2004

Design of a Hungarian Emotional Database for Speech Analysis and Synthesis.
Proceedings of the Affective Dialogue Systems, Tutorial and Research Workshop, 2004

2001
Automatic prosody generation - a model for hungarian.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Word unit based multilingual comparative analysis of text corpora.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

A flexible multilingual TTS development and speech research tool.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Profivox - A Hungarian Text-to-Speech System for Telecommunications Applications.
Int. J. Speech Technol., 2000

The Design, Implementation, and Operation of a Hungarian E-Mail Reader.
Int. J. Speech Technol., 2000

1999
Interactive, TTS supported speech message composer for large, limited vocabulary, but open information systems.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Problems of creating a flexible e-mail reader for hungarian.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1997
Prosody generation for German CTS/TTS systems (from theoretical intonation patterns to practical realisation).
Speech Commun., 1997

A flexible client-server model for multilingual CTS/TTS development.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1995
Improvement, evaluation and testing of a low cost multilingual portable speaking aid for the speech impaired.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1993
Voxaid: an interactive speaking communication aid software for the speech impaired.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Improvements of the Spanish version of the multivox text-to-speech system.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

1990
Phonetic aspects of the MULTIVOX text-to-speech system.
Proceedings of the ESCA Workshop on Speech Synthesis, 1990

Implementations aspects and the development system of the multivox text-to-speech converter.
Proceedings of the ESCA Workshop on Speech Synthesis, 1990


  Loading...