Xavier Serra
Orcid: 0000-0003-1395-2345
According to our database1,
Xavier Serra
authored at least 285 papers
between 1986 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
1990
1995
2000
2005
2010
2015
2020
2025
0
5
10
15
20
25
30
35
40
45
5
4
4
4
9
7
3
2
4
3
2
3
2
9
11
12
14
12
11
9
16
16
10
15
6
7
5
1
1
1
2
5
2
3
3
3
1
1
3
2
28
2
1
6
1
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
On csauthors.net:
Bibliography
2025
Improving Singing Voice Transcription Generalization with AI Generated Accompaniments.
Proceedings of the MultiMedia Modeling, 2025
2024
Expert Syst. Appl., March, 2024
CoRR, 2024
CoRR, 2024
Can Audio Reveal Music Performance Difficulty? Insights from the Piano Syllabus Dataset.
CoRR, 2024
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024
Saraga Audiovisual: A Large Multimodal Open Data Collection for the Analysis of Carnatic Music.
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024
Towards Explainable and Interpretable Musical Difficulty Estimation: A Parameter-Efficient Approach.
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024
A Method for MIDI Velocity Estimation for Piano Performance by a U-Net With Attention and FiLM.
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024
Towards Assessing Data Replication in Music Generation With Music Similarity Metrics on Raw Audio.
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024
Discogs-VI: A Musical Version Identification Dataset Based on Public Editorial Metadata.
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024
Leveraging Pre-Trained Autoencoders for Interpretable Prototype Learning of Music Audio.
Proceedings of the IEEE International Conference on Acoustics, 2024
Svara-forms and coarticulation in Carnatic music: an investigation using deep clustering.
Proceedings of the 11th International Conference on Digital Libraries for Musicology, 2024
2023
Dataset, December, 2023
Repertoire-Specific Vocal Pitch Data Generation for Improved Melodic Analysis of Carnatic Music.
Trans. Int. Soc. Music. Inf. Retr., January, 2023
Multilabel Prototype Generation for data reduction in K-Nearest Neighbour classification.
Pattern Recognit., 2023
An objective evaluation of Hearing Aids and DNN-based speech enhancement in complex acoustic scenes.
CoRR, 2023
An Objective Evaluation of Hearing AIDS and DNN-Based Binaural Speech Enhancement in Complex Acoustic Scenes.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Carnatic Singing Voice Separation Using Cold Diffusion on Training Data With Bleeding.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Sounds Out of Pläce? Score-Independent Detection of Conspicuous Mistakes in Piano Performances.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Efficient Supervised Training of Audio Transformers for Music Representation Learning.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Pre-Training Strategies Using Contrastive Learning and Playlist Information for Music Classification and Similarity.
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Automatic Piano Fingering from Partially Annotated Scores using Autoregressive Neural Networks.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022
A diffusion-inspired training strategy for singing voice extraction in the waveform domain.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022
In Search of Sañc?ras: Tradition-informed Repeated Melodic Pattern Recognition in Carnatic Music.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022
MusAV: A dataset of relative arousal-valence annotations for validation of audio models.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Urban Sound & Sight: Dataset And Benchmark For Audio-Visual Urban Scene Understanding.
Proceedings of the IEEE International Conference on Acoustics, 2022
Matching Text and Audio Embeddings: Exploring Transfer-Learning Strategies for Language-Based Audio Retrieval.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022
An Overview of Automatic Piano Performance Assessment within the Music Education Context.
Proceedings of the 14th International Conference on Computer Supported Education, 2022
2021
Trans. Int. Soc. Music. Inf. Retr., 2021
Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks.
CoRR, 2021
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021
What Is Fair? Exploring the Artists' Perspective on the Fairness of Music Streaming Platforms.
Proceedings of the Human-Computer Interaction - INTERACT 2021 - 18th IFIP TC 13 International Conference, Bari, Italy, August 30, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Melon Playlist Dataset: A Public Dataset for Audio-Based Playlist Generation and Music Tagging.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Toward Interpretable Polyphonic Sound Event Detection with Attention Maps Based on Local Prototypes.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021
Evaluating Off-the-Shelf Machine Listening and Natural Language Models for Automated Audio Captioning.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021
The Matrix Profile for Motif Discovery in Audio - An Example Application in Carnatic Music.
Proceedings of the Music in the AI Era - 15th International Symposium, 2021
Proceedings of the Music in the AI Era - 15th International Symposium, 2021
Proceedings of the CHIIR '21: ACM SIGIR Conference on Human Information Interaction and Retrieval, 2021
2020
Dataset used in COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations.
Dataset, June, 2020
CoRR, 2020
COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations.
CoRR, 2020
Addressing Missing Labels in Large-scale Sound Event Recognition using a Teacher-student Framework with Loss Masking.
CoRR, 2020
Proceedings of the Twenty-Ninth Text REtrieval Conference, 2020
Maximizing the Engagement: Exploring New Signals of Implicit Feedback in Music Recommendations.
Proceedings of the Workshops on Recommendation in Complex Scenarios and the Impact of Recommender Systems co-located with 14th ACM Conference on Recommender Systems (RecSys 2020), 2020
Proceedings of the RecSys 2020: Fourteenth ACM Conference on Recommender Systems, 2020
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
How Low Can You Go? Reducing Frequency and Time Resolution in Current CNN Architectures for Music Auto-tagging.
Proceedings of the 28th European Signal Processing Conference, 2020
DCASE-Models: A Python Library for Computational Environmental Sound Analysis using Deep-Learning Models.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020
Proceedings of the 12th International Conference on Computer Supported Education, 2020
2019
Artist and style exposure bias in collaborative filtering based music recommendations.
CoRR, 2019
Music Auto-tagging Using CNNs and Mel-spectrograms With Reduced Frequency and Time Resolution.
CoRR, 2019
CoRR, 2019
Skip prediction using boosting trees based on acoustic features of tracks in sessions.
CoRR, 2019
Using offline metrics and user behavior analysis to combine multiple systems for music recommendation.
CoRR, 2019
Model-Agnostic Approaches To Handling Noisy Labels When Training Sound Event Classifiers.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
Contributing to New Musicological Theories with Computational Methods: The Case of Centonization in Arab-Andalusian Music.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
A Dataset of Rhythmic Pattern Reproductions and Baseline Automatic Assessment System.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
A Hybrid Parametric-Deep Learning Approach for Sound Event Localization and Detection.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019
Proceedings of the Perception, Representations, Image, Sound, Music, 2019
2018
Jingju a cappella singing voice test dataset for "An efficient deep learning model for musical onset detection".
Dataset, August, 2018
Dataset, June, 2018
Dataset, June, 2018
Dataset, June, 2018
Dataset, June, 2018
Experiment dataset supplementary materials for DLfM 2018 submission: file list and phoneme number information.
Dataset, June, 2018
Dataset for Interspeech 2018 submission: Singing voice phoneme segmentation by hierarchically inferring syllable and phoneme onset positions.
Dataset, February, 2018
Trans. Int. Soc. Music. Inf. Retr., 2018
Multi criteria biased randomized method for resource allocation in distributed systems: Application in a volunteer computing system.
Future Gener. Comput. Syst., 2018
CoRR, 2018
Assessing the impact of machine intelligence on human behaviour: an interdisciplinary endeavour.
CoRR, 2018
Proceedings of the Companion of the The Web Conference 2018 on The Web Conference 2018, 2018
Proceedings of the 9th Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing, 2018
Automatic playlist continuation using a hybrid recommender system combining features from text and audio.
Proceedings of the ACM Recommender Systems Challenge, 2018
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018
Audio-Aligned Jazz Harmony Dataset for Automatic Chord Transcription and Corpus-based Research.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018
Singing Voice Phoneme Segmentation by Hierarchically Inferring Syllable and Phoneme Onset Positions.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 5th International Conference on Digital Libraries for Musicology, 2018
General-purpose tagging of Freesound audio with AudioSet labels: task description, dataset, and baseline.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018
2017
ACM Trans. Intell. Syst. Technol., 2017
Frontiers Digit. Humanit., 2017
Identification of potential Music Information Retrieval technologies for computer-aided jingju singing training.
CoRR, 2017
Proceedings of the 2nd Workshop on Deep Learning for Recommender Systems, 2017
Understanding the Expressive Functions of Jingju Metrical Patterns Through Lyrics Text Mining.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Quantitative analysis of the relationship between linguistic tones and melody in jingju using music scores.
Proceedings of the 4th International Workshop on Digital Libraries for Musicology, 2017
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Score-Informed Syllable Segmentation for A Cappella Singing Voice with Convolutional Neural Networks.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Multi-Label Music Genre Classification from Audio, Text and Images Using Deep Features.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Creating an A Cappella Singing Audio Dataset for Automatic Jingju Singing Evaluation Research.
Proceedings of the 4th International Workshop on Digital Libraries for Musicology, 2017
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Quantifying Music Trends and Facts Using Editorial Metadata from the Discogs Database.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Designing efficient architectures for modeling temporal features with convolutional neural networks.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 25th European Signal Processing Conference, 2017
Proceedings of the Semantic Web: ESWC 2017 Satellite Events - ESWC 2017 Satellite Events, Portorož, Slovenia, May 28, 2017
Acoustic Scene Classification by Ensembling Gradient Boosting Machine and Convolutional Neural Networks.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017
2016
Data Knowl. Eng., 2016
Proceedings of the Winter Simulation Conference, 2016
ELMD: An Automatically Generated Entity Linking Gold Standard Dataset in the Music Domain.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016
On the Use of Note Onsets for Improved Lyrics-To-Audio Alignment in Turkish Makam Music.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016
Proceedings of the IEEE International Symposium on Multimedia, 2016
A generalized Bayesian model for tracking long metrical cycles in acoustic music signals.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Discovering rāga motifs by characterizing communities in networks of melodic patterns.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 3rd International workshop on Digital Libraries for Musicology, 2016
Proceedings of the 3rd International workshop on Digital Libraries for Musicology, 2016
Proceedings of the 14th International Workshop on Content-Based Multimedia Indexing, 2016
2015
ACM Trans. Intell. Syst. Technol., 2015
Proceedings of the 24th International Conference on World Wide Web Companion, 2015
Predicting Pairwise Pitch Contour Relations Based on Linguistic Tone Information in Beijing Opera Singing.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015
AcousticBrainz: A Community Platform for Gathering Music Information Obtained from Audio.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015
Improving Melodic Similarity in Indian Art Music Using Culture-Specific Melodic Characteristics.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015
An evaluation of methodologies for melodic similarity in audio recordings of Indian art music.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
Class-based tag recommendation and user-based evaluation in online audio clip sharing.
Knowl. Based Syst., 2014
Proceedings of the Tenth International Conference on Signal-Image Technology and Internet-Based Systems, 2014
Proceedings of the ISWC 2014 Posters & Demonstrations Track a track within the 13th International Semantic Web Conference, 2014
Creating Research Corpora for the Computational Study of Music: the case of the CompMusic Project.
Proceedings of the AES International Conference on Semantic Audio 2014, 2014
Proceedings of the AES International Conference on Semantic Audio 2014, 2014
Proceedings of the 1st International Workshop on Digital Libraries for Musicology, 2014
Proceedings of the 1st International Workshop on Digital Libraries for Musicology, 2014
Proceedings of the 1st International Workshop on Digital Libraries for Musicology, 2014
Study of the Similarity between Linguistic Tones and Melodic Pitch Contours in Beijing Opera Singing.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014
Transcription and Recognition of Syllable based Percussion Patterns: The Case of Beijing Opera.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014
Creating a Corpus of Jingju (Beijing Opera) Music and Possibilities for Melodic Analysis.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014
Proceedings of the Music Technology meets Philosophy, 2014
Proceedings of the Music Technology meets Philosophy, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
A supervised approach to hierarchical metrical cycle tracking from audio music recordings.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Int. J. Semantic Web Inf. Syst., 2013
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Proceedings of the ACM Multimedia Conference, 2013
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013
2012
IEEE Trans. Evol. Comput., 2012
Predictability of Music Descriptor Time Series and its Application to Cover Song Detection.
IEEE Trans. Speech Audio Process., 2012
Pattern Recognit. Lett., 2012
Int. J. Soc. Netw. Min., 2012
Proceedings of the 21st World Wide Web Conference, 2012
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012
Proceedings of the 3rd International Workshop on Cognitive Information Processing, 2012
2011
IEEE Trans. Multim., 2011
Proceedings of the User Modeling, Adaption and Personalization, 2011
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
Proceedings of the 4th International Symposium on Applied Sciences in Biomedical and Communication Technologies, 2011
2010
Pattern Recognit. Lett., 2010
Indexing music by mood: design and integration of an automatic content-based annotator.
Multim. Tools Appl., 2010
Ecological Acoustics Perspective for Content-Based Retrieval of Environmental Sounds.
EURASIP J. Audio Speech Music. Process., 2010
2009
Expressive Concatenative Synthesis by Reusing Samples from Real Performance Recordings.
Comput. Music. J., 2009
Proceedings of the Artificial Intelligence Research and Development, 2009
2008
J. Web Semant., 2008
IEEE Trans. Speech Audio Process., 2008
Comput. Music. J., 2008
Proceedings of the Art of Artificial Evolution: A Handbook on Evolutionary Art and Music, 2008
2007
IEEE Trans. Circuits Syst. Video Technol., 2007
IEEE Signal Process. Mag., 2007
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007
2006
Proceedings of the Poster and Demo Proceedings of the 1st International Conference on Semantic and Digital Media Technologies, 2006
2005
Proceedings of the 2005 International Computer Music Conference, 2005
2004
Proceedings of the 2004 International Computer Music Conference, 2004
Current Research in Music Technology at the Audiovisual Institute of the Pompeu Fabra University.
Proceedings of the 2004 International Computer Music Conference, 2004
2001
Singing Voice Synthesis Combining Excitation plus Resonance and Sinusoidal plus Residual Models.
Proceedings of the 2001 International Computer Music Conference, 2001
2000
Towards Instrument Segmentation for Music Content Description: a Critical Review of Instrument Classification Techniques.
Proceedings of the ISMIR 2000, 2000
Proceedings of the 2000 International Computer Music Conference, 2000
Proceedings of the 2000 International Computer Music Conference, 2000
Proceedings of the 2000 International Computer Music Conference, 2000
Proceedings of the 2000 International Computer Music Conference, 2000
1999
The Musicians' Sofware Mall: A Set of Composition and Performance Oriented Applications for Sound Synthesis.
Proceedings of the 1999 International Computer Music Conference, 1999
Proceedings of the 1999 International Computer Music Conference, 1999
1998
Proceedings of the 1998 International Computer Music Conference, 1998
1997
Proceedings of the 1997 International Computer Music Conference, 1997
Proceedings of the 1997 International Computer Music Conference, 1997
Proceedings of the 1997 International Computer Music Conference, 1997
1994
Proceedings of the 1994 International Computer Music Conference, 1994
1989
Proceedings of the 1989 International Computer Music Conference, 1989
1987
PARSHL: An Analysis/Synthesis Program for Non-Harmonic Sounds Based on a Sinusoidal Representation.
Proceedings of the 1987 International Computer Music Conference, 1987
1986
Proceedings of the 1986 International Computer Music Conference, 1986