Gerard Roma

Orcid: 0009-0004-1287-0713

According to our database, Gerard Roma authored at least 32 papers between 2008 and 2022.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.






Architecture about Dancing: Creating a Cross Environment, Cross Domain Framework for Creative Coding Musicians.
Proceedings of the 33rd Annual Workshop of the Psychology of Programming Interest Group, 2022

Enabling Programmatic Data Mining as Musicking: The Fluid Corpus Manipulation Toolkit.
Comput. Music. J., 2021

Live Coding with the Cloud and a Virtual Agent.
Proceedings of the 21st International Conference on New Interfaces for Musical Expression, 2021

Graph-Based Audio Looping And Granulation.
Proceedings of the 24th International Conference on Digital Audio Effects, 2021

Performing Audiences: Composition Strategies for Network Music using Mobile Phones.
Proceedings of the 20th International Conference on New Interfaces for Musical Expression, 2020

Adaptive Mapping of Sound Collections for Data-driven Musical Interfaces.
Proceedings of the 19th International Conference on New Interfaces for Musical Expression, 2019

Environmental sound recognition using short-time feature aggregation.
J. Intell. Inf. Syst., 2018

Live Repurposing of Sounds: MIR Explorations with Personal and Crowdsourced Databases.
Proceedings of the 18th International Conference on New Interfaces for Musical Expression, 2018

Improving Single-Network Single-Channel Separation of Musical Audio with Convolutional Layers.
Proceedings of the Latent Variable Analysis and Signal Separation, 2018

Jam with Jamendo: Querying a Large Music Collection by Chords from a Learner's Perspective.
Proceedings of the Audio Mostly 2018 on Sound in Immersion and Emotion, 2018

Two-Stage Single-Channel Audio Source Separation Using Deep Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Psychophysical Evaluation of Audio Source Separation Methods.
Proceedings of the Latent Variable Analysis and Signal Separation, 2017

Discriminative Enhancement for Single Channel Audio Source Separation Using Deep Neural Networks.
Proceedings of the Latent Variable Analysis and Signal Separation, 2017

Turn-Taking and Chatting in Collaborative Music Live Coding.
Proceedings of the 12th International Audio Mostly Conference on Augmented and Participatory Sound and Music Experiences, 2017

Handwaving: Gesture Recognition for Participatory Mobile Music.
Proceedings of the 12th International Audio Mostly Conference on Augmented and Participatory Sound and Music Experiences, 2017

Combining Mask Estimates for Single Channel Audio Source Separation Using Deep Neural Networks.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Evaluation of audio source separation models using hypothesis-driven non-parametric statistical methods.
Proceedings of the 24th European Signal Processing Conference, 2016

Algorithms and representations for supporting online music creation with large-scale audio databases.
PhD thesis, 2015

Deep Remix: Remixing Musical Mixtures Using a Convolutional Deep Neural Network.
CoRR, 2015

Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network.
Proceedings of the Latent Variable Analysis and Signal Separation, 2015

SoundXY4: Supporting Tabletop Collaboration and Awareness with Ambisonics Spatialisation.
Proceedings of the 14th International Conference on New Interfaces for Musical Expression, 2014

Recurrence quantification analysis features for environmental sound recognition.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Freesound technical demo.
Proceedings of the ACM Multimedia Conference, 2013

ESSENTIA: an open-source library for sound and music analysis.
Proceedings of the ACM Multimedia Conference, 2013

Essentia: An Audio Analysis Library for Music Information Retrieval.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Small world networks and creativity in audio clip sharing.
Int. J. Soc. Netw. Min., 2012

Active learning of custom sound taxonomies in unstructured audio data.
Proceedings of the International Conference on Multimedia Retrieval, 2012

Characterization of the Freesound online community.
Proceedings of the 3rd International Workshop on Cognitive Information Processing, 2012

Ecological Acoustics Perspective for Content-Based Retrieval of Environmental Sounds.
EURASIP J. Audio Speech Music. Process., 2010

Community Structure in Audio Clip Sharing.
Proceedings of the 2nd International Conference on Intelligent Networking and Collaborative Systems, 2010

Graph grammar representation for collaborative sample-based music creation.
Proceedings of the AM '10, 2010

A Tabletop Waveform Editor for Live Performance.
Proceedings of the 8th International Conference on New Interfaces for Musical Expression, 2008
