Dorien Herremans
Orcid: 0000-0001-8607-1640Affiliations:
- Singapore University of Technology and Design
According to our database1,
Dorien Herremans
authored at least 90 papers
between 2013 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
On csauthors.net:
Bibliography
2024
Video2Music: Suitable music generation from videos using an Affective Multimodal Transformer model.
Expert Syst. Appl., 2024
DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech.
CoRR, 2024
Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction.
CoRR, 2024
BandControlNet: Parallel Transformers-based Steerable Popular Music Generation with Fine-Grained Spatiotemporal Features.
CoRR, 2024
CoRR, 2024
Are we there yet? A brief survey of Music Emotion Prediction Datasets, Models and Outstanding Challenges.
CoRR, 2024
CoRR, 2024
Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey.
CoRR, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
2023
PreBit - A multimodal model with Twitter FinBERT embeddings for extreme price movement prediction of Bitcoin.
Expert Syst. Appl., December, 2023
Expert Syst. Appl., November, 2023
Sensors, 2023
EmoMV: Affective music-video correspondence learning datasets for classification and retrieval.
Inf. Fusion, 2023
Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training.
CoRR, 2023
Diffroll: Diffusion-Based Generative Music Transcription with Unsupervised Pretraining Capability.
Proceedings of the IEEE International Conference on Acoustics, 2023
A Domain-Knowledge-Inspired Music Embedding Space and a Novel Attention Mechanism for Symbolic Music Modeling.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Forecasting Bitcoin volatility spikes from whale transactions and CryptoQuant data using Synthesizer Transformer models.
CoRR, 2022
CoRR, 2022
CoRR, 2022
A multimodal model with Twitter FinBERT embeddings for extreme price movement prediction of Bitcoin.
CoRR, 2022
Predicting emotion from music videos: exploring the relative contribution of visual and auditory information to affective responses.
CoRR, 2022
Learning Accent Representation with Multi-Level VAE Towards Controllable Speech Synthesis.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Computationally efficient physics approximating neural networks for highly nonlinear maps.
Proceedings of the Conference on Research in Adaptive and Convergent Systems, 2022
Proceedings of the Artificial Intelligence in Music, Sound, Art and Design, 2022
MusIAC: An Extensible Generative Framework for Music Infilling Applications with Multi-level Control.
Proceedings of the Artificial Intelligence in Music, Sound, Art and Design, 2022
2021
IEEE Trans. Commun., 2021
AttendAffectNet-Emotion Prediction of Movie Viewers Using Multimodal Fusion with Self-Attention.
Sensors, 2021
Sensors, 2021
Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021
ReconVAT: A Semi-Supervised Automatic Music Transcription Framework for Low-Resource Real-World Data.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Proceedings of the International Joint Conference on Neural Networks, 2021
Hierarchical Recurrent Neural Networks for Conditional Melody Generation with Long-term Structure.
Proceedings of the International Joint Conference on Neural Networks, 2021
Proceedings of the International Joint Conference on Neural Networks, 2021
2020
Singing voice separation using a deep convolutional neural network trained by ideal binary mask and cross entropy.
Neural Comput. Appl., 2020
Neural Comput. Appl., 2020
Neural Comput. Appl., 2020
CoRR, 2020
CoRR, 2020
PerceptionGAN: Real-world Image Construction from Provided Text through Perceptual Understanding.
CoRR, 2020
Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance.
CoRR, 2020
nnAudio: An on-the-Fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolutional Neural Networks.
IEEE Access, 2020
Proceedings of the International Conference on Signal Processing and Communications, 2020
Music FaderNets: Controllable Music Generation Based On High-Level Features via Low-Level Feature Modelling.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020
Unsupervised Disentanglement of Pitch and Timbre for Isolated Musical Instrument Sounds.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020
The Impact of Audio Input Representations on Neural Network based Music Transcription.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020
AttendAffectNet: Self-Attention based Networks for Predicting Affective Responses from Movies.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
The Effect of Spectrogram Reconstruction on Automatic Music Transcription: An Alternative Approach to Improve Transcription Accuracy.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Singing Voice Conversion with Disentangled Representations of Singer and Vocal Technique Using Variational Autoencoders.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
IEEE Trans. Affect. Comput., 2019
nnAudio: An on-the-fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolution Neural Networks.
CoRR, 2019
Singing Voice Conversion with Disentangled Representations of Singer and Vocal Technique Using Variational Autoencoders.
CoRR, 2019
Towards robust audio spoofing detection: a detailed comparison of traditional and learned features.
CoRR, 2019
Toward Robust Audio Spoofing Detection: A Detailed Comparison of Traditional and Learned Features.
IEEE Access, 2019
A Hybrid Fuzzy Logic-Neural Network Approach for Multi-Path Separation of Underwater Acoustic Signals.
Proceedings of the 89th IEEE Vehicular Technology Conference, 2019
Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Variational Autoencoders.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019
Proceedings of the 41th Annual Meeting of the Cognitive Science Society, 2019
Towards emotion based music generation: A tonal tension model based on the spiral array.
Proceedings of the 41th Annual Meeting of the Cognitive Science Society, 2019
A novel music-based game with motion capture to support cognitive and motor function in the elderly.
Proceedings of the IEEE Conference on Games, 2019
Latent Space Representation for Multi-Target Speaker Detection and Identification with a Sparse Dataset Using Triplet Neural Networks.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Doppler Invariant Demodulation for Shallow Water Acoustic Communications Using Deep Belief Networks.
Proceedings of the IEEE VTS Asia Pacific Wireless Communications Symposium, 2019
2018
Modeling Temporal Tonal Relations in Polyphonic Music Through Deep Networks With a Novel Image-Based Representation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
A variable neighborhood search algorithm to generate piano fingerings for polyphonic sheet music.
Int. Trans. Oper. Res., 2017
CoRR, 2017
A Multi-modal Platform for Semantic Music Analysis: Visualizing Audio-and Score-Based Tension.
Proceedings of the 11th IEEE International Conference on Semantic Computing, 2017
IMMA-Emo: A Multimodal Interface for Visualising Score- and Audio-synchronised Emotion Annotations.
Proceedings of the 12th International Audio Mostly Conference on Augmented and Participatory Sound and Music Experiences, 2017
2016
Proceedings of the Computational Music Analysis, 2016
2015
Expert Syst. Appl., 2015
Classification and Generation of Composer-Specific Music Using Global Feature Models and Variable Neighborhood Search.
Comput. Music. J., 2015
Compose ≡ compute - Computer generation and classification of music through operations research methods.
4OR, 2015
Proceedings of the Mathematics and Computation in Music - 5th International Conference, 2015
2014
Sampling the extrema from statistical models of music with variable neighbourhood search.
Proceedings of the Music Technology meets Philosophy, 2014
2013
Composing fifth species counterpoint music with a variable neighborhood search algorithm.
Expert Syst. Appl., 2013
Proceedings of the IEEE Symposium on Computational Intelligence for Creativity and Affective Computing, 2013