Stéphane Dupont

Towards Human Performance on Sketch-Based Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the CBMI 2022: International Conference on Content-based Multimedia Indexing, Graz, Austria, September 14, 2022

2021

Multi-level Attention Fusion Network for Audio-visual Event Recognition.

[BibT_eX]

[DOI]

CoRR, 2021

Gesture of Interest: Gesture Search for Multi-Person, Multi-Perspective TV Footage.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Content-Based Multimedia Indexing, 2021

2020

Hybrid-task learning for robust automatic speech recognition.

[BibT_eX]

[DOI]

Sean U. N. Wood

Comput. Speech Lang., 2020

AVECL-UMONS database for audio-visual event classification and localization.

[BibT_eX]

[DOI]

CoRR, 2020

Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition.

[BibT_eX]

[DOI]

Noé Tits

CoRR, 2020

A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis.

[BibT_eX]

[DOI]

Noé Tits

CoRR, 2020

Intra and Inter-modality Interactions for Audio-visual Event Detection.

[BibT_eX]

[DOI]

Proceedings of the HuMA'20: Proceedings of the 1st International Workshop on Human-centric Multimedia Analysis, 2020

Are You Watching Closely? Content-based Retrieval of Hand Gestures.

[BibT_eX]

[DOI]

Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

SECL-UMons Database for Sound Event Classification and Localization.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Improved Soccer Action Spotting using both Audio and Video Streams.

[BibT_eX]

[DOI]

Bastien Vanderplaetse

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Can adversarial training learn image captioning ?

[BibT_eX]

[DOI]

Bastien Vanderplaetse

CoRR, 2019

Modulated Self-attention Convolutional Network for VQA.

[BibT_eX]

[DOI]

Antoine Maiorca

Nathan Hubens

CoRR, 2019

Adversarial reconstruction for Multi-modal Machine Translation.

[BibT_eX]

[DOI]

CoRR, 2019

Audio-Visual Fusion And Conditioning With Neural Networks For Event Recognition.

[BibT_eX]

[DOI]

Leontios J. Hadjileontiadis

Proceedings of the 29th IEEE International Workshop on Machine Learning for Signal Processing, 2019

2018

A Multimodal Approach for the Safeguarding and Transmission of Intangible Cultural Heritage: The Case of i-Treasures.

[BibT_eX]

[DOI]

Kosmas Dimitropoulos

Filareti Tsalakanidou

Vasileios S. Charisis

Athanasios Manitsaris

IEEE Intell. Syst., 2018

Object-oriented Targets for Visual Navigation using Rich Semantic Representations.

[BibT_eX]

[DOI]

CoRR, 2018

Bringing back simplicity and lightliness into neural image captioning.

[BibT_eX]

[DOI]

CoRR, 2018

UMONS Submission for WMT18 Multimodal Translation Task.

[BibT_eX]

[DOI]

Guillaume Dubuisson Duplessis

CoRR, 2018

Investigating a Hybrid Learning Approach for Robust Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Statistical Language and Speech Processing, 2018

A Dyadic Conversation Dataset on Moral Emotions.

[BibT_eX]

[DOI]

Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition, 2018

Multifaceted Engagement in Social Interaction with a Machine: The JOKER Project.

[BibT_eX]

[DOI]

Laurence Devillers

Sophie Rosset

Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition, 2018

2017

Blind Speech Separation and Enhancement With GCC-NMF.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2017

DeepSketch 3 - Analyzing deep neural networks features for better sketch recognition and sketch-based image retrieval.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2017

Modulating and attending the source image during encoding improves Multimodal Translation.

[BibT_eX]

[DOI]

CoRR, 2017

Visually Grounded Word Embeddings and Richer Visual Features for Improving Multimodal Neural Machine Translation.

[BibT_eX]

[DOI]

CoRR, 2017

Multimodal Compact Bilinear Pooling for Multimodal Neural Machine Translation.

[BibT_eX]

[DOI]

CoRR, 2017

Amused speech components analysis and classification: Towards an amusement arousal level assessment system.

[BibT_eX]

[DOI]

Comput. Electr. Eng., 2017

Noise and Speech Estimation as Auxiliary Tasks for Robust Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Statistical Language and Speech Processing, 2017

Introducing AmuS: The Amused Speech Database.

[BibT_eX]

[DOI]

Proceedings of the Statistical Language and Speech Processing, 2017

Enhanced Retrieval and Browsing in the IMOTION System.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Quadruplet Networks for Sketch-Based Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

UMONS @ MediaEval 2017: Diverse Social Images Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Working Notes Proceedings of the MediaEval 2017 Workshop co-located with the Conference and Labs of the Evaluation Forum (CLEF 2017), 2017

Investigating the impact of the training data volume for robust speech recognition using multi-task learning.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Symposium on Signal Processing and Information Technology, 2017

A corpus for experimental study of affect bursts in human-robot interaction.

[BibT_eX]

[DOI]

Proceedings of the 1st ACM SIGCHI International Workshop on Investigating Social Interactions with Artificial Agents, 2017

Triplet Networks Feature Masking for Sketch-Based Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Image Analysis and Recognition - 14th International Conference, 2017

Towards Good Practices for Image Retrieval Based on CNN Features.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

An empirical study on the effectiveness of images in Multimodal Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Intangible Cultural Heritage and New Technologies: Challenges and Opportunities for Cultural Preservation and Development.

[BibT_eX]

[DOI]

Proceedings of the Mixed Reality and Gamification for Cultural Heritage, 2017

2016

Laughter Research: A Review of the ILHAIRE Project.

[BibT_eX]

[DOI]

Proceedings of the Toward Robotic Socially Believable Behaving Systems - Volume I, 2016

The IMOTION System at TRECVID 2016: The Ad-Hoc Video Search Task.

[BibT_eX]

[DOI]

Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016

I-Vector estimation as auxiliary task for Multi-Task Learning based acoustic modeling for automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

The i-Treasures Intangible Cultural Heritage dataset.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Symposium on Movement and Computing, 2016

iAutoMotion - an Autonomous Content-Based Video Retrieval Engine.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

IMOTION - Searching for Video Sequences Using Multi-Shot Sketch Queries.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

DeepSketch2Image: Deep Convolutional Neural Networks for Partial Sketch Recognition and Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

AVAB-DBS: an Audio-Visual Affect Bursts Database for Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Semantic Sketch-Based Video Retrieval with Autocompletion.

[BibT_eX]

[DOI]

Proceedings of the Companion Publication of the 21st International Conference on Intelligent User Interfaces, 2016

Speaker-aware Multi-Task Learning for automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Towards a listening agent: a system generating audiovisual laughs and smiles to show interest.

[BibT_eX]

[DOI]

Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

Speaker-aware long short-term memory multi-task learning for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 24th European Signal Processing Conference, 2016

Audio affect burst synthesis: A multilevel synthesis system for emotional expressions.

[BibT_eX]

[DOI]

Proceedings of the 24th European Signal Processing Conference, 2016

Multi-task learning for speech recognition: an overview.

[BibT_eX]

[DOI]

Proceedings of the 24th European Symposium on Artificial Neural Networks, 2016

DeepSketch 2: Deep convolutional neural networks for partial sketch recognition.

[BibT_eX]

[DOI]

Proceedings of the 14th International Workshop on Content-Based Multimedia Indexing, 2016

2015

Novel 3D Game-like Applications Driven by Body Interactions for Learning Specific Forms of Intangible Cultural Heritage.

[BibT_eX]

[DOI]

Proceedings of the VISAPP 2015, 2015

A Novel Human Interaction Game-like Application to Learn, Perform and Evaluate Modern Contemporary Singing - "Human Beat Box".

[BibT_eX]

[DOI]

Filareti Tsalakanidou

Alexandros Kitsikidis

Francesca Maria Dagnino

Proceedings of the VISAPP 2015, 2015

IMOTION - A Content-Based Video Retrieval Engine.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

UMons at MediaEval 2015 Affective Impact of Movies Task including Violent Scenes Detection.

[BibT_eX]

[DOI]

Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

An HMM approach for synthesizing amused speech with a controllable intensity of smile.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2015

Towards a level assessment system of amusement in speech signals: Amused speech components classification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2015

Analysis and automatic recognition of Human BeatBox sounds: A comparative study.

[BibT_eX]

[DOI]

Benjamin Picart

Sandrine Brognaux

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Speech-laughs: An HMM-based approach for amused speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Shaking and speech-smile vowels classification: An attempt at amusement arousal estimation from speech signals.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015

An HMM-based speech-smile synthesis system: An approach for amusement synthesis.

[BibT_eX]

[DOI]

Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Breath and repeat: An attempt at enhancing speech-laugh synthesis quality.

[BibT_eX]

[DOI]

Proceedings of the 23rd European Signal Processing Conference, 2015

DeepSketch: Deep convolutional neural networks for sketch recognition and similarity search.

[BibT_eX]

[DOI]

Proceedings of the 13th International Workshop on Content-Based Multimedia Indexing, 2015

Investigating sparse deep neural networks for speech recognition.

[BibT_eX]

[DOI]

Guillaume Dubuisson Duplessis

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Multimodal data collection of human-robot humorous interactions in the Joker project.

[BibT_eX]

[DOI]

Laurence Devillers

Sophie Rosset

Mohamed El Amine Sehili

Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014

Arousal-Driven Synthesis of Laughter.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2014

Capturing the Intangible - An Introduction to the I-Treasures Project.

[BibT_eX]

[DOI]

Proceedings of the VISAPP 2014, 2014

Tangible needle, digital haystack: tangible interfaces for reusing media content organized by similarity.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Conference on Tangible, 2014

Scenarizing CADastre Exquisse: A Crossover between Snoezeling in Hospitals/Domes, and Authoring/Experiencing Soundful Comic Strips.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014

A Proximity Grid Optimization Method to Improve Audio Search for Sound Design.

[BibT_eX]

[DOI]

Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

AudioMetro: directing search for sound designers through content-based cues.

[BibT_eX]

[DOI]

Proceedings of the Audio Mostly 2014, AM '14, 2014

2013

VideoCycle: User-Friendly Navigation by Similarity in Video Databases.

[BibT_eX]

[DOI]

Christian Frisson

Alexis Moinet

Cécile Picard-Limpens

Thierry Ravet

Xavier Siebert

Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Improved Audio Classification Using a Novel Non-Linear Dimensionality Reduction Ensemble Approach.

[BibT_eX]

[DOI]

Thierry Ravet

Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

EGT: Enriched Guitar Transcription.

[BibT_eX]

[DOI]

Loïc Reboursière

Proceedings of the Intelligent Technologies for Interactive Entertainment, 2013

MashtaCycle: On-Stage Improvised Audio Collage by Content-Based Similarity and Gesture Recognition.

[BibT_eX]

[DOI]

Laura Colmenares Guerra

Todor Todoroff

Proceedings of the Intelligent Technologies for Interactive Entertainment, 2013

Laugh When You're Winning.

[BibT_eX]

[DOI]

Radoslaw Niewiadomski

Proceedings of the Innovative and Creative Developments in Multimodal Interaction Systems, 2013

Nonlinear dimensionality reduction approaches applied to music and textural sounds.

[BibT_eX]

[DOI]

Thierry Ravet

Cécile Picard-Limpens

Christian Frisson

Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Laugh-aware virtual agent and its impact on user amusement.

[BibT_eX]

[DOI]

Radoslaw Niewiadomski

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

2012

Left and right-hand guitar playing techniques detection.

[BibT_eX]

[DOI]

Cécile Picard-Limpens

Nicolas Riche

Proceedings of the 12th International Conference on New Interfaces for Musical Expression, 2012

LoopJam: turning the dance floor into a collaborative instrumental map.

[BibT_eX]

[DOI]

Proceedings of the 12th International Conference on New Interfaces for Musical Expression, 2012

2010

Browsing a dance video collection: dance analysis and interface design.

[BibT_eX]

[DOI]

J. Multimodal User Interfaces, 2010

DeviceCycle: Rapid and Reusable Prototyping of Gestural Interfaces, Applied to Audio Browsing by Similarity.

[BibT_eX]

[DOI]

Proceedings of the 10th International Conference on New Interfaces for Musical Expression, 2010

An interactive installation for browsing a dance video database.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

2009

AudioCycle: A similarity-based visualization of musical libraries.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

AudioCycle: Browsing Musical Loop Libraries.

[BibT_eX]

[DOI]

Proceedings of the Seventh International Workshop on Content-Based Multimedia Indexing, 2009

2007

Introduction to the Special Issue on Intrinsic Speech Variations.

[BibT_eX]

[DOI]

Speech Commun., 2007

Automatic speech recognition and speech variability: A review.

[BibT_eX]

[DOI]

Speech Commun., 2007

2006

Automatic Speech Recognition and Intrinsic Speech Variation.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

A study of implicit and explicit modeling of coarticulation and pronunciation variation.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Bimodal combination of speech and handwriting for improved word recognition.

[BibT_eX]

[DOI]

Pascale Woodruff

Proceedings of the 13th European Signal Processing Conference, 2005

2003

Robust feature extraction and acoustic modeling at multitel: experiments on the Aurora databases.

[BibT_eX]

[DOI]

Christophe Ris

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002

Qualcomm-ICSI-OGI features for ASR.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

VTS residual noise compensation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2002

2001

Assessing local noise level estimation methods: Application to noise robust ASR.

[BibT_eX]

[DOI]

Christophe Ris

Speech Commun., 2001

Robust ASR front-end using spectral-based and discriminant features: experiments on the Aurora tasks.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000

Audio-Visual Speech Modeling for Continuous Speech Recognition.

[BibT_eX]

[DOI]

Juergen Luettin

IEEE Trans. Multim., 2000

Fast speaker adaptation of artificial neural networks for automatic speech recognition.

[BibT_eX]

[DOI]

Leila Cheboub

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

Context dependent hybrid HMM/ANN systems for large vocabulary continuous speech recognition system.

[BibT_eX]

[DOI]

Olivier Deroo

Christophe Ris

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998

Using the multi-stream approach for continuous audio-visual speech recognition: experiments on the M2VTS database.

[BibT_eX]

[DOI]

Juergen Luettin

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Missing data reconstruction for robust automatic speech recognition in the framework of hybrid HMM/ANN systems.

[BibT_eX]

[DOI]

Continuous Audio-Visual Speech Recognition.

[BibT_eX]

[DOI]

Juergen Luettin