Yuki Mitsufuji
Orcid: 0000-0002-6806-6140
According to our database, Yuki Mitsufuji authored at least 131 papers between 2013 and 2024.
Bibliography
2024
The whole is greater than the sum of its parts: improving music source separation by bridging networks.
EURASIP J. Audio Speech Music. Process., December, 2024
Trans. Int. Soc. Music. Inf. Retr., January, 2024
Trans. Int. Soc. Music. Inf. Retr., January, 2024
Trans. Mach. Learn. Res., 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models.
CoRR, 2024
CoRR, 2024
Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning.
CoRR, 2024
Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space.
CoRR, 2024
Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models.
CoRR, 2024
DisMix: Disentangling Mixtures of Musical Instruments for Source-level Pitch and Timbre Manipulation.
CoRR, 2024
Towards Assessing Data Replication in Music Generation with Music Similarity Metrics on Raw Audio.
CoRR, 2024
SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond.
CoRR, 2024
Improving Unsupervised Clean-to-Rendered Guitar Tone Transformation Using GANs and Integrated Unaligned Clean Data.
CoRR, 2024
MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training.
CoRR, 2024
CoRR, 2024
Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning.
CoRR, 2024
CoRR, 2024
CoRR, 2024
PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher.
CoRR, 2024
CoRR, 2024
CoRR, 2024
MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage.
CoRR, 2024
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Towards reporting bias in visual-language datasets: bimodal augmentation by decoupling object-attribute association.
CoRR, 2023
Enhancing Semantic Communication with Deep Generative Models - An ICASSP Special Session Overview.
CoRR, 2023
On the Equivalence of Consistency-Type Models: Consistency Models, Consistent Diffusion Models, and Fokker-Planck Regularization.
CoRR, 2023
The Whole Is Greater than the Sum of Its Parts: Improving DNN-based Music Source Separation.
CoRR, 2023
Adversarially Slicing Generative Networks: Discriminator Slices Feature for One-Dimensional Optimal Transport.
CoRR, 2023
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration.
Proceedings of the International Conference on Machine Learning, 2023
FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
An Attention-Based Approach to Hierarchical Multi-Label Music Instrument Classification.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Music Mixing Style Transfer: A Contrastive Learning Approach to Disentangle Audio Effects.
Proceedings of the IEEE International Conference on Acoustics, 2023
DiffRoll: Diffusion-Based Generative Music Transcription with Unsupervised Pretraining Capability.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
Neurocomputing, 2022
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization.
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Multi-ACCDOA: Localizing And Detecting Overlapping Sounds From The Same Class With Auxiliary Duplicating Permutation Invariant Training.
Proceedings of the IEEE International Conference on Acoustics, 2022
Spatial Mixup: Directional Loudness Modification as Data Augmentation for Sound Event Localization and Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022
Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
STARSS22: A Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022
2021
Multichannel Blind Source Separation Based on Evanescent-Region-Aware Non-Negative Tensor Factorization in Spherical Harmonic Domain.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection.
CoRR, 2021
Proceedings of the International Joint Conference on Neural Networks, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
ACCDOA: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization and Detection.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Psychophysiological Effect of Immersive Spatial Audio Experience Enhanced Using Sound Field Synthesis.
Proceedings of the 9th International Conference on Affective Computing and Intelligent Interaction, 2021
2020
Multichannel Non-Negative Matrix Factorization Using Banded Spatial Covariance Matrices in Wavenumber Domain.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Spherical-Harmonic-Domain Feedforward Active Noise Control Using Sparse Decomposition of Reference Signals from Distributed Sensor Arrays.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
CoRR, 2020
CoRR, 2020
Sound Event Localization and Detection Using Activity-Coupled Cartesian DOA Vector and RD3Net.
CoRR, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Array-Geometry-Aware Spatial Active Noise Control Based on Direction-of-Arrival Weighting.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
J. Open Source Softw., 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Global and Local Mode-domain Adaptive Algorithms for Spatial Active Noise Control Using Higher-order Sources.
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
MMDenseLSTM: An Efficient Combination of Convolutional and Recurrent Neural Networks for Audio Source Separation.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
PhaseNet: Discretized Phase Modeling with Deep Neural Networks for Audio Source Separation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017
Improving music source separation based on deep neural networks through data augmentation and network blending.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
2016
Multichannel blind source separation based on non-negative tensor factorization in wavenumber domain.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
NMF-based blind source separation using a linear predictive coding error clustering criterion.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
On the use of a spatial cue as prior information for stereo sound source separation based on spatially weighted non-negative tensor factorization.
EURASIP J. Adv. Signal Process., 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Sound source separation based on non-negative tensor factorization incorporating spatial cue as prior knowledge.
Proceedings of the IEEE International Conference on Acoustics, 2013