Juhan Nam

CoRR, January, 2025

D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription.

[BibT_eX]

[DOI]

Hounsu Kim

CoRR, January, 2025

2024

Towards Efficient and Real-Time Piano Transcription Using Neural Autoregressive Models.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Predicting User Intents and Musical Attributes from Music Discovery Conversations.

[BibT_eX]

[DOI]

Daeyong Kwon

Seungheon Doh

CoRR, 2024

Music Discovery Dialogue Generation Using Human Intent Analysis and Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

PIAST: A Multimodal Piano Dataset with Audio, Symbolic and Text.

[BibT_eX]

[DOI]

CoRR, 2024

Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound.

[BibT_eX]

[DOI]

CoRR, 2024

CONMOD: Controllable Neural Frame-based Modulation Effects.

[BibT_eX]

[DOI]

CoRR, 2024

Musical Word Embedding for Music Tagging and Retrieval.

[BibT_eX]

[DOI]

CoRR, 2024

Quantitative Analysis of Melodic Similarity in Music Copyright Infringement Cases.

[BibT_eX]

[DOI]

Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024

Music Discovery Dialogue Generation Using Human Intent Analysis and Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024

A Real-Time Lyrics Alignment System Using Chroma and Phonetic Features for Classical Vocal Performance.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Enhancing Spatial Audio Generation with Source Separation and Channel Panning Loss.

[BibT_eX]

[DOI]

Wootaek Lim

Proceedings of the IEEE International Conference on Acoustics, 2024

VoiceLDM: Text-to-Speech with Environmental Context.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Expressive Acoustic Guitar Sound Synthesis with an Instrument-Specific Input Representation and Diffusion Outpainting.

[BibT_eX]

[DOI]

Hounsu Kim

Soonbeom Choi

Proceedings of the IEEE International Conference on Acoustics, 2024

DiffRENT: A Diffusion Model for Recording Environment Transfer of Speech.

[BibT_eX]

[DOI]

Jaekwon Im

Proceedings of the IEEE International Conference on Acoustics, 2024

Enriching Music Descriptions with A Finetuned-LLM and Metadata for Text-to-Music Retrieval.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

T-Foley: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis.

[BibT_eX]

[DOI]

Yoonjin Chung

Junwon Lee

Proceedings of the IEEE International Conference on Acoustics, 2024

K-pop Lyric Translation: Dataset, Analysis, and Neural-Modelling.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023

Editorial for TISMIR Special Collection: Cultural Diversity in MIR Research.

[BibT_eX]

[DOI]

Trans. Int. Soc. Music. Inf. Retr., January, 2023

The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation.

[BibT_eX]

[DOI]

CoRR, 2023

A Phoneme-Informed Neural Network Model for Note-Level Singing Transcription.

[BibT_eX]

[DOI]

Li Su

CoRR, 2023

Music Playlist Title Generation Using Artist Information.

[BibT_eX]

[DOI]

CoRR, 2023

All-in-One Metrical and Functional Structure Analysis with Neighborhood Attentions on Demixed Audio.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Motion to Dance Music Generation using Latent Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the SIGGRAPH Asia 2023 Technical Communications, 2023

A Computational Evaluation Framework for Singable Lyric Translation.

[BibT_eX]

[DOI]

Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

LP-MusicCaps: LLM-Based Pseudo Music Captioning.

[BibT_eX]

[DOI]

Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Sense of Convergence: Exploring the Artistic Potential of Cross-modal Sensory Transfer in Virtual Reality.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Mixed and Augmented Reality Adjunct, 2023

A Phoneme-Informed Neural Network Model For Note-Level Singing Transcription.

[BibT_eX]

[DOI]

Li Su

Proceedings of the IEEE International Conference on Acoustics, 2023

A Study of Audio Mixing Methods for Piano Transcription in Violin-Piano Ensembles.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Toward Universal Text-To-Music Retrieval.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Textless Speech-to-Music Retrieval Using Emotion Similarity.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

PrimaDNN': A Characteristics-Aware DNN Customization for Singing Technique Detection.

[BibT_eX]

[DOI]

Yuya Yamamoto

Hiroko Terasawa

Proceedings of the 31st European Signal Processing Conference, 2023

2022

Deep Learning and Knowledge Integration for Music Audio Analysis (Dagstuhl Seminar 22082).

[BibT_eX]

[DOI]

Meinard Müller

Rachel M. Bittner

Dagstuhl Reports, 2022

Neural Vocoder Feature Estimation for Dry Singing Voice Separation.

[BibT_eX]

[DOI]

CoRR, 2022

Hi, KIA: A Speech Emotion Recognition Dataset for Wake-Up Words.

[BibT_eX]

[DOI]

CoRR, 2022

Analysis and detection of singing techniques in repertoires of J-POP solo singers.

[BibT_eX]

[DOI]

Yuya Yamamoto

Hiroko Terasawa

Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

YM2413-MDB: A Multi-Instrumental FM Video Game Music Dataset with Emotion Annotations.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Deformable CNN and Imbalance-Aware Feature Learning for Singing Technique Classification.

[BibT_eX]

[DOI]

Yuya Yamamoto

Hiroko Terasawa

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

A Melody-Unsupervision Model for Singing Voice Synthesis.

[BibT_eX]

[DOI]

Soonbeom Choi

Proceedings of the IEEE International Conference on Acoustics, 2022

Seung-ee and Kkaebi: A VR-Mobile Cross Platform Game based on Co-Presence for a Balanced Immersive Experience.

[BibT_eX]

[DOI]

Proceedings of the Extended Abstracts of the Annual Symposium on Computer-Human Interaction in Play, 2022

The Melody of the Mysterious Stones: A VR Mindfulness Game Using Sound Spatialization.

[BibT_eX]

[DOI]

Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

Classy Trash Monster: An Educational Game for Teaching Machine Learning to Non-major Students.

[BibT_eX]

[DOI]

Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

2021

Music Playlist Title Generation: A Machine-Translation Approach.

[BibT_eX]

[DOI]

Seungheon Doh

Junwon Lee

CoRR, 2021

PocketVAE: A Two-step Model for Groove Generation and Control.

[BibT_eX]

[DOI]

Kyungyun Lee

Wonil Kim

CoRR, 2021

Reverse-Engineering The Transition Regions of Real-World DJ Mixes using Sub-band Analysis with Convex Optimization.

[BibT_eX]

[DOI]

Yi-Hsuan Yang

Proceedings of the 21th International Conference on New Interfaces for Musical Expression, 2021

Learning a cross-domain embedding space of vocal and mixed audio with a structure-preserving triplet loss.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

EMOPIA: A Multi-Modal Pop Piano Dataset For Emotion Recognition and Emotion-based Music Generation.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Investigating Time-Frequency Representations for Audio Feature Extraction in Singing Technique Classification.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020

Semantic Tagging of Singing Voices in Popular Music Recordings.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Novelty and influence of creative works, and quantifying patterns of advances based on probabilistic references networks.

[BibT_eX]

[DOI]

Doheum Park

Juyong Park

EPJ Data Sci., 2020

Semi-supervised learning using teacher-student models for vocal melody extraction.

[BibT_eX]

[DOI]

CoRR, 2020

Musical Word Embedding: Bridging the Gap between Listening Contexts and Music.

[BibT_eX]

[DOI]

CoRR, 2020

Metric learning vs classification for disentangled music representation learning.

[BibT_eX]

[DOI]

Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Polyphonic Piano Transcription Using Autoregressive Multi-State Note Model.

[BibT_eX]

[DOI]

Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Semi-supervised learning using teacher-student models for vocal melody extraction.

[BibT_eX]

[DOI]

Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

A Computational Analysis of Real-World DJ Mixes using Mix-To-Track Subsequence Alignment.

[BibT_eX]

[DOI]

Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Disentangled Multidimensional Metric Learning for Music Similarity.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Korean Singing Voice Synthesis Based on Auto-Regressive Boundary Equilibrium Gan.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Deep Learning for Audio-Based Music Classification and Tagging: Teaching Computers to Distinguish Rock from Bach.

[BibT_eX]

[DOI]

IEEE Signal Process. Mag., 2019

Introduction to the Issue on Data Science: Machine Learning for Audio Signal Processing.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2019

Comparison and Analysis of SampleCNN Architectures for Audio Classification.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2019

Temporal Feedback Convolutional Recurrent Neural Networks for Keyword Spotting.

[BibT_eX]

[DOI]

CoRR, 2019

Representation Learning of Music Using Artist, Album, and Track Information.

[BibT_eX]

[DOI]

Jiyoung Park

CoRR, 2019

Zero-shot Learning and Knowledge Transfer in Music Classification and Tagging.

[BibT_eX]

[DOI]

CoRR, 2019

Quantifying Novelty and Influence, and the Patterns of Paradigm Shifts.

[BibT_eX]

[DOI]

Doheum Park

Juyong Park

CoRR, 2019

A Cross-Scape Plot Representation for Visualizing Symbolic Melodic Similarity.

[BibT_eX]

[DOI]

Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Learning a Joint Embedding Space of Monophonic and Mixed Music Signals for Singing Voice.

[BibT_eX]

[DOI]

Kyungyun Lee

Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

VirtuosoNet: A Hierarchical RNN-based System for Modeling Expressive Piano Performance.

[BibT_eX]

[DOI]

Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Zero-shot Learning for Audio-based Music Classification and Tagging.

[BibT_eX]

[DOI]

Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Graph Neural Network for Music Score Data and Modeling Expressive Piano Performance.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

2018

A Hybrid of Deep Audio Feature and i-vector for Artist Recognition.

[BibT_eX]

[DOI]

CoRR, 2018

Deep Content-User Embedding Model for Music Recommendation.

[BibT_eX]

[DOI]

CoRR, 2018

Representation Learning of Music Using Artist Labels.

[BibT_eX]

[DOI]

Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Revisiting Singing Voice Detection: A quantitative review and the future outlook.

[BibT_eX]

[DOI]

Kyungyun Lee

Keunwoo Choi

Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

A Timbre-based Approach to Estimate Key Velocity from Polyphonic Piano Recordings.

[BibT_eX]

[DOI]

Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Singing Expression Transfer from One Voice to Another for a Given Song.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Sample-Level CNN Architectures for Music Auto-Tagging Using Raw Waveforms.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Multi-Level and Multi-Scale Feature Aggregation Using Pretrained Convolutional Neural Networks for Music Auto-Tagging.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2017

Raw Waveform-based Audio Classification Using Sample-level CNN Architectures.

[BibT_eX]

[DOI]

CoRR, 2017

Audio-to-score alignment of piano music using RNN-based automatic music transcription.

[BibT_eX]

[DOI]

CoRR, 2017

Sample-level Deep Convolutional Neural Networks for Music Auto-tagging Using Raw Waveforms.

[BibT_eX]

[DOI]

CoRR, 2017

Multi-Level and Multi-Scale Feature Aggregation Using Sample-level Deep Convolutional Neural Networks for Music Classification.

[BibT_eX]

[DOI]

CoRR, 2017

Multi-Level and Multi-Scale Feature Aggregation Using Pre-trained Convolutional Neural Networks for Music Auto-tagging.

[BibT_eX]

[DOI]

CoRR, 2017

ForceClicks: Enabling Efficient Button Interaction with Single Finger Touch.

[BibT_eX]

[DOI]

Edward Jangwon Lee

Roshan Lalintha Peiris

Liwei Chan

Proceedings of the Tenth International Conference on Tangible, 2017

Note Intensity Estimation of Piano Recordings by Score-Informed NMF.

[BibT_eX]

[DOI]

Proceedings of the AES International Conference Semantic Audio 2017, 2017

Combining Multi-Scale Features Using Sample-Level Deep Convolutional Neural Networks for Weakly Supervised Sound Event Detection.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

Use the Force: Incorporating Touch Force Sensors into Mobile Music Interaction.

[BibT_eX]

[DOI]

Roshan Lalintha Peiris

Proceedings of the Music Technology with Swing - 13th International Symposium, 2017

2016

Melody Extraction on Vocal Segments Using Multi-Column Deep Neural Networks.

[BibT_eX]

[DOI]

Sangeun Kum

Changheun Oh

Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

2015

Augmenting Room Acoustics and System Interaction for Intentional Control of Audio Feedback.

[BibT_eX]

[DOI]

Seunghun Kim

Graham Wakefield

Proceedings of the Looking Back, 2015

Toward Certain Sonic Properties of an Audio Feedback System by Evolutionary Control of Second-Order Structures.

[BibT_eX]

[DOI]

Seunghun Kim

Graham Wakefield

Proceedings of the Evolutionary and Biologically Inspired Music, Sound, Art and Design, 2015

2013

Acoustic scene classification using sparse feature learning and event-based pooling.

[BibT_eX]

[DOI]

Kyogu Lee

Ziwon Hyung

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

2012

Optimized Polynomial Spline Basis Function Design for Quasi-Bandlimited Classical Waveform Synthesis.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2012

Learning Sparse Feature Representations for Music Annotation and Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

Sound Recognition in Mixtures.

[BibT_eX]

[DOI]