Juhan Nam
ORCID: 0000-0003-2664-2119
Affiliations:
- KAIST, Music and Audio Computing Lab, Republic of Korea
According to our database, Juhan Nam authored at least 96 papers between 2010 and 2024.
Bibliography
2024
Towards Efficient and Real-Time Piano Transcription Using Neural Autoregressive Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Music Discovery Dialogue Generation Using Human Intent Analysis and Large Language Models.
CoRR, 2024
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound.
CoRR, 2024
A Real-Time Lyrics Alignment System Using Chroma and Phonetic Features for Classical Vocal Performance.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Expressive Acoustic Guitar Sound Synthesis with an Instrument-Specific Input Representation and Diffusion Outpainting.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Enriching Music Descriptions with A Finetuned-LLM and Metadata for Text-to-Music Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2024
T-Foley: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
2023
Trans. Int. Soc. Music. Inf. Retr., January, 2023
The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation.
CoRR, 2023
CoRR, 2023
All-in-One Metrical and Functional Structure Analysis with Neighborhood Attentions on Demixed Audio.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023
Proceedings of the SIGGRAPH Asia 2023 Technical Communications, 2023
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Sense of Convergence: Exploring the Artistic Potential of Cross-modal Sensory Transfer in Virtual Reality.
Proceedings of the IEEE International Symposium on Mixed and Augmented Reality Adjunct, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
PrimaDNN': A Characteristics-Aware DNN Customization for Singing Technique Detection.
Proceedings of the 31st European Signal Processing Conference, 2023
2022
Deep Learning and Knowledge Integration for Music Audio Analysis (Dagstuhl Seminar 22082).
Dagstuhl Reports, 2022
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022
YM2413-MDB: A Multi-Instrumental FM Video Game Music Dataset with Emotion Annotations.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022
Deformable CNN and Imbalance-Aware Feature Learning for Singing Technique Classification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Seung-ee and Kkaebi: A VR-Mobile Cross Platform Game based on Co-Presence for a Balanced Immersive Experience.
Proceedings of the Extended Abstracts of the Annual Symposium on Computer-Human Interaction in Play, 2022
The Melody of the Mysterious Stones: A VR Mindfulness Game Using Sound Spatialization.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022
Classy Trash Monster: An Educational Game for Teaching Machine Learning to Non-major Students.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022
2021
Reverse-Engineering The Transition Regions of Real-World DJ Mixes using Sub-band Analysis with Convex Optimization.
Proceedings of the 21st International Conference on New Interfaces for Musical Expression, 2021
Learning a cross-domain embedding space of vocal and mixed audio with a structure-preserving triplet loss.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021
EMOPIA: A Multi-Modal Pop Piano Dataset For Emotion Recognition and Emotion-based Music Generation.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021
Investigating Time-Frequency Representations for Audio Feature Extraction in Singing Technique Classification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Novelty and influence of creative works, and quantifying patterns of advances based on probabilistic references networks.
EPJ Data Sci., 2020
CoRR, 2020
CoRR, 2020
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020
A Computational Analysis of Real-World DJ Mixes using Mix-To-Track Subsequence Alignment.
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
Deep Learning for Audio-Based Music Classification and Tagging: Teaching Computers to Distinguish Rock from Bach.
IEEE Signal Process. Mag., 2019
Introduction to the Issue on Data Science: Machine Learning for Audio Signal Processing.
IEEE J. Sel. Top. Signal Process., 2019
IEEE J. Sel. Top. Signal Process., 2019
CoRR, 2019
CoRR, 2019
CoRR, 2019
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
Learning a Joint Embedding Space of Monophonic and Mixed Music Signals for Singing Voice.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
VirtuosoNet: A Hierarchical RNN-based System for Modeling Expressive Piano Performance.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
2018
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Multi-Level and Multi-Scale Feature Aggregation Using Pretrained Convolutional Neural Networks for Music Auto-Tagging.
IEEE Signal Process. Lett., 2017
CoRR, 2017
Audio-to-score alignment of piano music using RNN-based automatic music transcription.
CoRR, 2017
Sample-level Deep Convolutional Neural Networks for Music Auto-tagging Using Raw Waveforms.
CoRR, 2017
Multi-Level and Multi-Scale Feature Aggregation Using Sample-level Deep Convolutional Neural Networks for Music Classification.
CoRR, 2017
Multi-Level and Multi-Scale Feature Aggregation Using Pre-trained Convolutional Neural Networks for Music Auto-tagging.
CoRR, 2017
Proceedings of the Tenth International Conference on Tangible, 2017
Proceedings of the AES International Conference Semantic Audio 2017, 2017
Combining Multi-Scale Features Using Sample-Level Deep Convolutional Neural Networks for Weakly Supervised Sound Event Detection.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017
Proceedings of the Music Technology with Swing - 13th International Symposium, 2017
2016
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016
2015
Augmenting Room Acoustics and System Interaction for Intentional Control of Audio Feedback.
Proceedings of the Looking Back, 2015
Toward Certain Sonic Properties of an Audio Feedback System by Evolutionary Control of Second-Order Structures.
Proceedings of the Evolutionary and Biologically Inspired Music, Sound, Art and Design, 2015
2013
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013
2012
Optimized Polynomial Spline Basis Function Design for Quasi-Bandlimited Classical Waveform Synthesis.
IEEE Signal Process. Lett., 2012
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012
Proceedings of the Latent Variable Analysis and Signal Separation, 2012
2011
A Classification-Based Polyphonic Piano Transcription Approach Using Learned Feature Representations.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
Proceedings of the 28th International Conference on Machine Learning, 2011
2010
IEEE Trans. Speech Audio Process., 2010
Efficient Antialiasing Oscillator Algorithms Using Low-Order Fractional Delay Filters.
IEEE Trans. Speech Audio Process., 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010