Oriol Nieto

According to our database1, Oriol Nieto authored at least 39 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
SILA: Signal-to-Language Augmentation for Enhanced Control in Text-to-Audio Generation.
CoRR, 2024

Sketch2Sound: Controllable Audio Generation via Time-Varying Signals and Sonic Imitations.
CoRR, 2024

Video-Guided Foley Sound Generation with Multimodal Controls.
CoRR, 2024

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark.
CoRR, 2024

ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds.
CoRR, 2024

GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities.
CoRR, 2024

VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap.
CoRR, 2024

Augment, Drop & Swap: Improving Diversity in LLM Captions for Efficient Music-Text Representation Learning.
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024

CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Bridging High-Quality Audio and Video Via Language for Sound Effects Retrieval from Visual Queries.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Efficient Spoken Language Recognition via Multilabel Classification.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Audio-Text Models Do Not Yet Leverage Natural Language.
Proceedings of the IEEE International Conference on Acoustics, 2023

Language-Guided Audio-Visual Source Separation via Trimodal Consistency.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Music Enhancement via Image Translation and Vocoding.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Deep Embeddings and Section Fusion Improve Music Segmentation.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Multimodal Metric Learning for Tag-Based Music Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Audio-Based Music Structure Analysis: Current Trends, Open Challenges, and Applications.
Trans. Int. Soc. Music. Inf. Retr., 2020

Mood Classification Using Listening Data.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Data-Driven Harmonic Filters for Audio Representation Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
The Harmonix Set: Beats, Downbeats, and Functional Segment Annotations of Western Popular Music.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

2018
Multimodal Deep Learning for Music Genre Classification.
Trans. Int. Soc. Music. Inf. Retr., 2018

Predicting Audio Advertisement Quality.
Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 2018

End-to-end Learning for Music Audio Tagging at Scale.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

2017
A Deep Multimodal Approach for Cold-start Music Recommendation.
Proceedings of the 2nd Workshop on Deep Learning for Recommender Systems, 2017

Multi-Label Music Genre Classification from Audio, Text and Images Using Deep Features.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

2016
Systematic Exploration of Computational Music Structure Research.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

2015
librosa: Audio and Music Signal Analysis in Python.
Proceedings of the 14th Python in Science Conference 2015 (SciPy 2015), Austin, Texas, July 6, 2015

Hierarchical Evaluation of Segment Boundary Detection.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

2014
MIR_EVAL: A Transparent Implementation of Common MIR Metrics.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Perceptual Analysis of the F-Measure to Evaluate Section Boundaries in Music.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Identifying Polyphonic Musical Patterns From Audio Recordings Using Music Segmentation Techniques.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

JAMS: A JSON Annotated Music Specification for Reproducible MIR Research.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Music segment similarity using 2D-Fourier Magnitude Coefficients.
Proceedings of the IEEE International Conference on Acoustics, 2014

Embodying Theoretical Research in Music Cognition: Four Proposals for Theory-Driven Experimentation.
Proceedings of the 36th Annual Meeting of the Cognitive Science Society, 2014

2013
Fortissimo: Force-Feedback for Mobile Devices.
Proceedings of the 13th International Conference on New Interfaces for Musical Expression, 2013

Data Driven and Discriminative Projections for Large-Scale Cover Song Identification.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Convex non-negative matrix factorization for automatic music structure identification.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Compressing Music Recordings into Audio Summaries.
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012


  Loading...