Horia Cucu

Orcid: 0000-0002-8711-3854

According to our database1, Horia Cucu authored at least 83 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Easy, Interpretable, Effective: openSMILE for voice deepfake detection.
CoRR, 2024

Exploring Native and Non-Native English Child Speech Recognition With Whisper.
IEEE Access, 2024

A novel simulations scheduler for automated circuit sizing algorithms.
Proceedings of the 20th International Conference on Synthesis, 2024

Exploring Fusion Techniques for Multimodal Emotion Recognition.
Proceedings of the 15th International Conference on Communications, 2024

2023
Towards generalisable and calibrated synthetic speech detection with self-supervised representations.
CoRR, 2023

Adaptive Planning Search Algorithm for Analog Circuit Verification.
CoRR, 2023

Augmentation Techniques for Adult-Speech to Generate Child-Like Speech Data Samples at Scale.
IEEE Access, 2023

A WAV2VEC2-Based Experimental Study on Self-Supervised Learning Methods to Improve Child Speech Recognition.
IEEE Access, 2023

A Study on Initial Population Sampling for Multi-Objective Optimization based on Differential Evolution and Bayesian Inference.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2023

Applying Multi-objective Acquisition Function Ensemble for a candidate proposal algorithm.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2023

Efficient Multi-Objective Optimization for PVT Variation-Aware Circuit Sizing Using Surrogate Models and Smart Corner Sampling.
Proceedings of the IEEE/ACM International Symposium on Low Power Electronics and Design, 2023

The SpeeD-ZevoTech submission at DISPLACE 2023.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Adaptation of Whisper models to child speech recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
FlexLip: A Controllable Text-to-Lip System.
Sensors, 2022

Automated circuit sizing with multi-objective optimization based on differential evolution and Bayesian inference.
Knowl. Based Syst., 2022

A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis.
IEEE Access, 2022

Automatic Pollen Classification and Segmentation Using U-Nets and Synthetic Data.
IEEE Access, 2022

Manufacturing Variation Estimation of On Resistance in Power Semiconductors.
Proceedings of the 18th International Conference on Synthesis, 2022

Advanced Operating Conditions Search applied in Analog Circuit Verification.
Proceedings of the 18th International Conference on Synthesis, 2022

Unsupervised deep learning models for aerosol layers segmentation.
Proceedings of the 14th International Conference on Communications, 2022

Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
Performance vs. hardware requirements in state-of-the-art automatic speech recognition.
EURASIP J. Audio Speech Music. Process., 2021

Multimodal speech recognition for unmanned aerial vehicles.
Comput. Electr. Eng., 2021

Automatic Pollen Classification Using Convolutional Neural Networks.
Proceedings of the 44th International Conference on Telecommunications and Signal Processing, 2021

Versatility and Population Diversity of Evolutionary Algorithms in Automated Circuit Sizing Applications.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2021

Improvements of SpeeD's Romanian ASR system during ReTeRom project.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2021

MARS: the First Romanian Pollen Dataset using a Rapid-E Particle Analyzer.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2021

An Evaluation of Word-Level Confidence Estimation for End-to-End Automatic Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Data-Filtering Methods for Self-Training of Automatic Speech Recognition Systems.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Speaker disentanglement in video-to-speech conversion.
Proceedings of the 29th European Signal Processing Conference, 2021

2020
Automatic Annotation of Speech Corpora using Approximate Transcripts.
Proceedings of the 43rd International Conference on Telecommunications and Signal Processing, 2020

RSC: A Romanian Read Speech Corpus for Automatic Speech Recognition.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Revisiting SincNet: An Evaluation of Feature and Network Hyperparameters for Speaker Recognition.
Proceedings of the 28th European Signal Processing Conference, 2020

2019
Overlapped Speech Detection and Competing Speaker Counting--Humans Versus Deep Learning.
IEEE J. Sel. Top. Signal Process., 2019

The Quo Vadis submission at Traffic4cast 2019.
CoRR, 2019

Automated Baby Cry Classification on a Hospital-acquired Baby Cry Database.
Proceedings of the 42nd International Conference on Telecommunications and Signal Processing, 2019

Progress on automatic annotation of speech corpora using complementary ASR systems.
Proceedings of the 42nd International Conference on Telecommunications and Signal Processing, 2019

Lemma-based Dynamic Time Warping Search for Keyword Spotting Applications in Romanian.
Proceedings of the 2019 International Conference on Speech Technology and Human-Computer Dialogue, 2019

Kaldi-based DNN Architectures for Speech Recognition in Romanian.
Proceedings of the 2019 International Conference on Speech Technology and Human-Computer Dialogue, 2019

Kite: Automatic Speech Recognition for Unmanned Aerial Vehicles.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
Multilingual Low-Resourced Prototype System for Voice-Controlled Intelligent Building Applications.
Proceedings of the Trends and Advances in Information Systems and Technologies, 2018

Why is My Baby Crying? An In-Depth Analysis of Paralinguistic Features and Classical Machine Learning Algorithms for Baby Cry Classification.
Proceedings of the 41st International Conference on Telecommunications and Signal Processing, 2018

Automatic Annotation of Speech Corpora Using Complementary GMM and DNN Acoustic Models.
Proceedings of the 41st International Conference on Telecommunications and Signal Processing, 2018

Methodology for determining the influencing factors of lifetime variation for power devices.
Proceedings of the 23rd IEEE European Test Symposium, 2018

2017
Fast method for ENF database build and search.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2017

SpeeD's DNN approach to Romanian speech recognition.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2017

Building a representative audio base of syllables for Romanian language.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2017

Speech recognition results for voice-controlled assistive applications.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2017

Observability coefficient for 2D dynamical systems.
Proceedings of the Signal Processing: Algorithms, 2017

Detecting Overlapped Speech on Short Timeframes Using Deep Learning.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Recent Experiments and Findings in Baby Cry Classification.
Proceedings of the Future Access Enablers for Ubiquitous and Intelligent Infrastructures, 2017

Autonomous System for Performing Dexterous, Human-Level Manipulation Tasks as Response to External Stimuli in Real Time.
Proceedings of the Future Access Enablers for Ubiquitous and Intelligent Infrastructures, 2017

2016
Baby cry recognition in real-world conditions.
Proceedings of the 39th International Conference on Telecommunications and Signal Processing, 2016

Automatic methods for infant cry classification.
Proceedings of the International Conference on Communications, 2016

Exploring an unsupervised, language independent, spoken document retrieval system.
Proceedings of the 14th International Workshop on Content-Based Multimedia Indexing, 2016

2015
On transcribing informally-pronounced numbers in Romanian speech.
Proceedings of the 38th International Conference on Telecommunications and Signal Processing, 2015

Sound event recognition in smart environments.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2015

Exploring multi-language resources for unsupervised spoken term discovery.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2015

Speech database acquisition for assisted living environment applications.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2015

Speech applications in the eWALL project.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2015

Estimating competing speaker count for blind speech source separation.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2015

SpeeD @ MediaEval 2015: Multilingual Phone Recognition Approach to Query by Example STD.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

Counting competing speakers in a timeframe - human versus computer.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014
SMT-based ASR domain adaptation methods for under-resourced languages: Application to Romanian.
Speech Commun., 2014

A robust diacritics restoration system using unreliable raw text data.
Proceedings of the 4th Workshop on Spoken Language Technologies for Under-resourced Languages, 2014

Unsupervised acoustic model training using multiple seed ASR systems.
Proceedings of the 4th Workshop on Spoken Language Technologies for Under-resourced Languages, 2014

SpeeD @ MediaEval 2014: Spoken Term Detection with Robust Multilingual Phone Recognition.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Detecting the number of competing speakers - human selective hearing versus spectrogram distance based estimator.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Recent improvements of the SpeeD Romanian LVCSR system.
Proceedings of the 10th International Conference on Communications, 2014

An Automatic Speech Recognition solution with speaker identification support.
Proceedings of the 10th International Conference on Communications, 2014

2013
Extensive evaluation experiments for the accumulated cross-power spectrum methods for time delay estimation.
Proceedings of the 7th Conference on Speech Technology and Human-Computer Dialogue, 2013

Multilingual query by example spoken term detection for under-resourced languages.
Proceedings of the 7th Conference on Speech Technology and Human-Computer Dialogue, 2013

Text spotting in large speech databases for under-resourced languages.
Proceedings of the 7th Conference on Speech Technology and Human-Computer Dialogue, 2013

Statistical Error Correction Methods for Domain-Specific ASR Systems.
Proceedings of the Statistical Language and Speech Processing, 2013

SpeeD @ MediaEval 2013: A Phone Recognition Approach to Spoken Term Detection.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

Fast accurate time delay estimation based on enhanced accumulated Cross-power Spectrum Phase.
Proceedings of the 21st European Signal Processing Conference, 2013

2012
ARF @ MediaEval 2012: Multimodal Video Classification.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

ARF @ MediaEval 2012: A Romanian ASR-based Approach to Spoken Term Detection.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

ASR for low-resourced languages: Building a phonetically balanced Romanian speech corpus.
Proceedings of the 20th European Signal Processing Conference, 2012

ASR domain adaptation methods for low-resourced languages: Application to Romanian language.
Proceedings of the 20th European Signal Processing Conference, 2012

2011
Word error rate improvement and complexity reduction in Automatic Speech Recognition by analyzing acoustic model uncertainty and confusion.
Proceedings of the 6th International Conference Speech Technology and Human-Computer Dialogue, 2011

Improving automatic speech recognition robustness for the Romanian language.
Proceedings of the 19th European Signal Processing Conference, 2011

Investigating the role of machine translated text in ASR domain adaptation: Unsupervised and semi-supervised methods.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011


  Loading...