Ricard Marxer

Orcid: 0000-0001-5099-5059

According to our database, Ricard Marxer authored at least 50 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Transfer Learning from Whisper for Microscopic Intelligibility Prediction.
CoRR, 2024

PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

Speech Foundation Models on Intelligibility Prediction for Hearing-Impaired Listeners.
Proceedings of the IEEE International Conference on Acoustics, 2024

Scaling Properties of Speech Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

SUCRe: Leveraging Scene Structure for Underwater Color Restoration.
Proceedings of the International Conference on 3D Vision, 2024

2023
Eiffel Tower: A deep-sea underwater dataset for long-term visual localization.
Int. J. Robotics Res., August, 2023

Progress and Prospects for Spoken Language Technology: Results from Five Sexennial Surveys.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

On the Benefits of Self-supervised Learned Speech Representations for Predicting Human Phonetic Misperceptions.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
Homography-Based Loss Function for Camera Pose Regression.
IEEE Robotics Autom. Lett., 2022

Variable-rate hierarchical CPC leads to acoustic unit discovery in speech.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Blind Speech Separation Through Direction of Arrival Estimation Using Deep Neural Networks with a Flexibility on the Number of Speakers.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

Contrastive Prediction Strategies for Unsupervised Segmentation and Categorization of Phonemes and Words.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Aligned Contrastive Predictive Coding.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Information Retrieval for ZeroSpeech 2021: The Submission by University of Wroclaw.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021


2020
A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Robust Training of Vector Quantized Bottleneck Models.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

DOCC10: Open access dataset of marine mammal transient studies and end-to-end CNN classification.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Deep Learning and Domain Transfer for Orca Vocalization Detection.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Deep Learning Classification with Noisy Labels.
Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops, 2020

The "ScribbleLens" Dutch Historical Handwriting Corpus.
Proceedings of the 17th International Conference on Frontiers in Handwriting Recognition, 2020

2019
Real-time Passive Acoustic 3D Tracking of Deep Diving Cetacean by Small Non-uniform Mobile Surface Antenna.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
The impact of the Lombard effect on audio and visual speech recognition systems.
Speech Commun., 2018

DNN Driven Speaker Independent Audio-Visual Mask Estimation for Speech Separation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
An analysis of environment, microphone and data simulation mismatches in robust speech recognition.
Comput. Speech Lang., 2017

The third 'CHiME' speech separation and recognition challenge: Analysis and outcomes.
Comput. Speech Lang., 2017

Multi-microphone speech recognition in everyday environments.
Comput. Speech Lang., 2017

Binary Mask Estimation Strategies for Constrained Imputation-Based Speech Enhancement.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

An Innovative Speech-Based User Interface for Smarthomes and IoT Solutions to Help People with Speech and Motor Disabilities.
Proceedings of the Harnessing the Power of Technology to Improve Lives, 2017

The CHiME Challenges: Robust Speech Recognition in Everyday Environments.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017

2016
Unsupervised Incremental Online Learning and Prediction of Musical Audio Signals.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Vocal Interactivity in-and-between Humans, Animals, and Robots.
Frontiers Robotics AI, 2016

Vocal Interactivity in-and-between Humans, Animals and Robots (VIHAR) (Dagstuhl Seminar 16442).
Dagstuhl Reports, 2016

Progress and Prospects for Spoken Language Technology: Results from Four Sexennial Surveys.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Language Effects in Noise-Induced Word Misperceptions.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

CloudCAST - Remote Speech Technology for Speech Professionals.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

An Innovative Speech-Based Interface to Control AAL and IoT Solutions to Help People with Speech and Motor Disability.
Proceedings of the Ambient Assisted Living, 2016

A Data Driven Approach to Audiovisual Speech Mapping.
Proceedings of the Advances in Brain Inspired Cognitive Systems, 2016

2015
Unsupervised Incremental Learning and Prediction of Audio Signals.
CoRR, 2015

Automatic dysfluency detection in dysarthric speech using deep belief networks.
Proceedings of the 6th Workshop on Speech and Language Processing for Assistive Technologies, 2015

Remote Speech Technology for Speech Professionals - the CloudCAST initiative.
Proceedings of the 6th Workshop on Speech and Language Processing for Assistive Technologies, 2015

Knowledge transfer between speakers for personalised dialogue management.
Proceedings of the SIGDIAL 2015 Conference, 2015

A framework for the evaluation of microscopic intelligibility models.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The third 'CHiME' speech separation and recognition challenge: Dataset, task and baselines.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2012
A Tikhonov regularization method for spectrum decomposition in low latency audio source separation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Combining a harmonic-based NMF decomposition with transient analysis for instantaneous percussion separation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Low-Latency Instrument Separation in Polyphonic Audio Using Timbre Models.
Proceedings of the Latent Variable Analysis and Signal Separation, 2012

Score-informed and timbre independent lead instrument separation in real-world scenarios.
Proceedings of the 20th European Signal Processing Conference, 2012

2009
What/when causal expectation modelling applied to audio signals.
Connect. Sci., 2009

