Dan Oneata

Orcid: 0000-0003-4354-4393

According to our database, Dan Oneata authored at least 29 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Visually Grounded Few-Shot Word Learning in Low-Resource Settings.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Visually Grounded Speech Models Have a Mutual Exclusivity Bias.
Trans. Assoc. Comput. Linguistics, 2024

DeCLIP: Decoding CLIP representations for deepfake localization.
CoRR, 2024

Improved Visually Prompted Keyword Localisation in Real Low-Resource Settings.
CoRR, 2024

Easy, Interpretable, Effective: openSMILE for voice deepfake detection.
CoRR, 2024

Translating speech with just images.
CoRR, 2024

Weakly-supervised deepfake localization in diffusion-generated images.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

2023
Towards generalisable and calibrated synthetic speech detection with self-supervised representations.
CoRR, 2023

The SpeeD-ZevoTech submission at DISPLACE 2023.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
FlexLip: A Controllable Text-to-Lip System.
Sensors, 2022

Keyword Localisation in Untranscribed Speech Using Visually Grounded Speech Models.
IEEE J. Sel. Top. Signal Process., 2022

YFACC: A Yorùbá Speech-Image Dataset for Cross-Lingual Keyword Localisation Through Visual Grounding.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Multilingual Multimodal Learning with Machine Translated Text.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
Multimodal speech recognition for unmanned aerial vehicles.
Comput. Electr. Eng., 2021

An Evaluation of Word-Level Confidence Estimation for End-to-End Automatic Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Data-Filtering Methods for Self-Training of Automatic Speech Recognition Systems.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Speaker disentanglement in video-to-speech conversion.
Proceedings of the 29th European Signal Processing Conference, 2021

2020
Revisiting SincNet: An Evaluation of Feature and Network Hyperparameters for Speaker Recognition.
Proceedings of the 28th European Signal Processing Conference, 2020

2019
The Quo Vadis submission at Traffic4cast 2019.
CoRR, 2019

Kite: Automatic Speech Recognition for Unmanned Aerial Vehicles.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2016
A Robust and Efficient Video Representation for Action Recognition.
Int. J. Comput. Vis., 2016

2015
Robust and efficient models for action recognition and localization. (Modèles robustes et efficaces pour la reconnaissance d'action et leur localisation).
PhD thesis, 2015

2014
The INRIA-LIM-VocR and AXES submissions to TrecVid 2014 Multimedia Event Detection.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Spatio-temporal Object Detection Proposals.
Proceedings of the Computer Vision - ECCV 2014, 2014

Efficient Action Localization with Approximately Normalized Fisher Vectors.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Action and Event Recognition with Fisher Vectors on a Compact Feature Set.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2012

