Dorien Herremans

Orcid: 0000-0001-8607-1640

Affiliations:
  • Singapore University of Technology and Design


According to our database1, Dorien Herremans authored at least 90 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Video2Music: Suitable music generation from videos using an Affective Multimodal Transformer model.
Expert Syst. Appl., 2024

MIRFLEX: Music Information Retrieval Feature Library for Extraction.
CoRR, 2024

DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech.
CoRR, 2024

Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction.
CoRR, 2024

Prevailing Research Areas for Music AI in the Era of Foundation Models.
CoRR, 2024

PRESENT: Zero-Shot Text-to-Prosody Control.
CoRR, 2024

BandControlNet: Parallel Transformers-based Steerable Popular Music Generation with Fine-Grained Spatiotemporal Features.
CoRR, 2024

DisfluencySpeech - Single-Speaker Conversational Speech Dataset with Paralanguage.
CoRR, 2024

Are we there yet? A brief survey of Music Emotion Prediction Datasets, Models and Outstanding Challenges.
CoRR, 2024

MidiCaps - A large-scale MIDI dataset with text captions.
CoRR, 2024

Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training.
CoRR, 2024

Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey.
CoRR, 2024

Mustango: Toward Controllable Text-to-Music Generation.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

2023
PreBit - A multimodal model with Twitter FinBERT embeddings for extreme price movement prediction of Bitcoin.
Expert Syst. Appl., December, 2023

Constructing time-series momentum portfolios with deep multi-task learning.
Expert Syst. Appl., November, 2023

MERP: A Music Dataset with Emotion Ratings and Raters' Profile Information.
Sensors, 2023

EmoMV: Affective music-video correspondence learning datasets for classification and retrieval.
Inf. Fusion, 2023

Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training.
CoRR, 2023

Diffroll: Diffusion-Based Generative Music Transcription with Unsupervised Pretraining Capability.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Domain-Knowledge-Inspired Music Embedding Space and a Novel Attention Mechanism for Symbolic Music Modeling.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Single Image Video Prediction with Auto-Regressive GANs.
Sensors, 2022

Forecasting Bitcoin volatility spikes from whale transactions and CryptoQuant data using Synthesizer Transformer models.
CoRR, 2022

SNIPER Training: Variable Sparsity Rate Training For Text-To-Speech.
CoRR, 2022

Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder.
CoRR, 2022

Jointist: Joint Learning for Multi-instrument Transcription and Its Applications.
CoRR, 2022

A multimodal model with Twitter FinBERT embeddings for extreme price movement prediction of Bitcoin.
CoRR, 2022

Understanding Audio Features via Trainable Basis Functions.
CoRR, 2022

HEAR 2021: Holistic Evaluation of Audio Representations.
CoRR, 2022

Predicting emotion from music videos: exploring the relative contribution of visual and auditory information to affective responses.
CoRR, 2022

Learning Accent Representation with Multi-Level VAE Towards Controllable Speech Synthesis.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Computationally efficient physics approximating neural networks for highly nonlinear maps.
Proceedings of the Conference on Research in Adaptive and Convergent Systems, 2022

Conditional Drums Generation Using Compound Word Representations.
Proceedings of the Artificial Intelligence in Music, Sound, Art and Design, 2022

MusIAC: An Extensible Generative Framework for Music Infilling Applications with Multi-level Control.
Proceedings of the Artificial Intelligence in Music, Sound, Art and Design, 2022

2021
Underwater Acoustic Communication Receiver Using Deep Belief Network.
IEEE Trans. Commun., 2021

AttendAffectNet-Emotion Prediction of Movie Viewers Using Multimodal Fusion with Self-Attention.
Sensors, 2021

Deep Neural Network-Based Respiratory Pathology Classification Using Cough Sounds.
Sensors, 2021

aiSTROM-A Roadmap for Developing a Successful AI Strategy.
IEEE Access, 2021


ReconVAT: A Semi-Supervised Automatic Music Transcription Framework for Low-Resource Real-World Data.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Generating Lead Sheets with Affect: A Novel Conditional seq2seq Framework.
Proceedings of the International Joint Conference on Neural Networks, 2021

Hierarchical Recurrent Neural Networks for Conditional Melody Generation with Long-term Structure.
Proceedings of the International Joint Conference on Neural Networks, 2021

Revisiting the Onsets and Frames Model with Additive Attention.
Proceedings of the International Joint Conference on Neural Networks, 2021

2020
Singing voice separation using a deep convolutional neural network trained by ideal binary mask and cross entropy.
Neural Comput. Appl., 2020

The emergence of deep learning: new opportunities for music and audio technologies.
Neural Comput. Appl., 2020

From context to concept: exploring semantic relationships in music with word2vec.
Neural Comput. Appl., 2020

Hit Song Prediction Based on Early Adopter Data and Audio Features.
CoRR, 2020

A variational autoencoder for music generation controlled by tonal tension.
CoRR, 2020

A dataset and classification model for Malay, Hindi, Tamil and Chinese music.
CoRR, 2020

PerceptionGAN: Real-world Image Construction from Provided Text through Perceptual Understanding.
CoRR, 2020

Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance.
CoRR, 2020

nnAudio: An on-the-Fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolutional Neural Networks.
IEEE Access, 2020

Acoustic prediction of flowrate: varying liquid jet stream onto a free surface.
Proceedings of the International Conference on Signal Processing and Communications, 2020

Music FaderNets: Controllable Music Generation Based On High-Level Features via Low-Level Feature Modelling.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Unsupervised Disentanglement of Pitch and Timbre for Isolated Musical Instrument Sounds.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Regression-based Music Emotion Prediction using Triplet Neural Networks.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

The Impact of Audio Input Representations on Neural Network based Music Transcription.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

AttendAffectNet: Self-Attention based Networks for Predicting Affective Responses from Movies.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

The Effect of Spectrogram Reconstruction on Automatic Music Transcription: An Alternative Approach to Improve Transcription Accuracy.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Singing Voice Conversion with Disentangled Representations of Singer and Vocal Technique Using Variational Autoencoders.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
MorpheuS: Generating Structured Music with Constrained Patterns and Tension.
IEEE Trans. Affect. Comput., 2019

nnAudio: An on-the-fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolution Neural Networks.
CoRR, 2019

Singing Voice Conversion with Disentangled Representations of Singer and Vocal Technique Using Variational Autoencoders.
CoRR, 2019

Midi Miner - A Python library for tonal tension and track classification.
CoRR, 2019

Towards robust audio spoofing detection: a detailed comparison of traditional and learned features.
CoRR, 2019

Dance Hit Song Prediction.
CoRR, 2019

Toward Robust Audio Spoofing Detection: A Detailed Comparison of Traditional and Learned Features.
IEEE Access, 2019

A Hybrid Fuzzy Logic-Neural Network Approach for Multi-Path Separation of Underwater Acoustic Signals.
Proceedings of the 89th IEEE Vehicular Technology Conference, 2019

Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Variational Autoencoders.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Multimodal Deep Models for Predicting Affective Responses Evoked by Movies.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Untangling indices of emotion in music using neural networks.
Proceedings of the 41th Annual Meeting of the Cognitive Science Society, 2019

Towards emotion based music generation: A tonal tension model based on the spiral array.
Proceedings of the 41th Annual Meeting of the Cognitive Science Society, 2019

A novel music-based game with motion capture to support cognitive and motor function in the elderly.
Proceedings of the IEEE Conference on Games, 2019

Latent Space Representation for Multi-Target Speaker Detection and Identification with a Sparse Dataset Using Triplet Neural Networks.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Doppler Invariant Demodulation for Shallow Water Acoustic Communications Using Deep Belief Networks.
Proceedings of the IEEE VTS Asia Pacific Wireless Communications Symposium, 2019

2018
Generating guitar solos by integer programming.
J. Oper. Res. Soc., 2018

Modeling Temporal Tonal Relations in Polyphonic Music Through Deep Networks With a Novel Image-Based Representation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
A variable neighborhood search algorithm to generate piano fingerings for polyphonic sheet music.
Int. Trans. Oper. Res., 2017

A Functional Taxonomy of Music Generation Systems.
ACM Comput. Surv., 2017

Modeling Musical Context with Word2vec.
CoRR, 2017

Proceedings of the First International Workshop on Deep Learning and Music.
CoRR, 2017

A Multi-modal Platform for Semantic Music Analysis: Visualizing Audio-and Score-Based Tension.
Proceedings of the 11th IEEE International Conference on Semantic Computing, 2017

IMMA-Emo: A Multimodal Interface for Visualising Score- and Audio-synchronised Emotion Annotations.
Proceedings of the 12th International Audio Mostly Conference on Augmented and Participatory Sound and Music Experiences, 2017

2016
Composer Classification Models for Music-Theory Building.
Proceedings of the Computational Music Analysis, 2016

2015
Generating structured music for bagana using quality metrics based on Markov models.
Expert Syst. Appl., 2015

Classification and Generation of Composer-Specific Music Using Global Feature Models and Variable Neighborhood Search.
Comput. Music. J., 2015

Compose ≡ compute - Computer generation and classification of music through operations research methods.
4OR, 2015

Generating Fingerings for Polyphonic Piano Music with a Tabu Search Algorithm.
Proceedings of the Mathematics and Computation in Music - 5th International Conference, 2015

2014
Sampling the extrema from statistical models of music with variable neighbourhood search.
Proceedings of the Music Technology meets Philosophy, 2014

2013
Composing fifth species counterpoint music with a variable neighborhood search algorithm.
Expert Syst. Appl., 2013

FuX, an Android app that generates counterpoint.
Proceedings of the IEEE Symposium on Computational Intelligence for Creativity and Affective Computing, 2013


  Loading...