Shusuke Takahashi
According to our database, Shusuke Takahashi authored at least 36 papers between 2020 and 2024.
Bibliography
2024
The whole is greater than the sum of its parts: improving music source separation by bridging networks.
EURASIP J. Audio Speech Music. Process., December, 2024
Trans. Int. Soc. Music. Inf. Retr., January, 2024
Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models.
CoRR, 2024
SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond.
CoRR, 2024
MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training.
CoRR, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
The Whole Is Greater than the Sum of Its Parts: Improving DNN-based Music Source Separation.
CoRR, 2023
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
An Attention-Based Approach to Hierarchical Multi-Label Music Instrument Classification.
Proceedings of the IEEE International Conference on Acoustics, 2023
DiffRoll: Diffusion-Based Generative Music Transcription with Unsupervised Pretraining Capability.
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
An Approach to Collecting Object Graphs for Data-structure Live Programming Based on a Language Implementation Framework.
J. Inf. Process., 2022
Neurocomputing, 2022
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization.
Proceedings of the International Conference on Machine Learning, 2022
Multi-ACCDOA: Localizing And Detecting Overlapping Sounds From The Same Class With Auxiliary Duplicating Permutation Invariant Training.
Proceedings of the IEEE International Conference on Acoustics, 2022
Improving Character Error Rate is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-Box Acoustic Models.
Proceedings of the IEEE International Conference on Acoustics, 2022
Spatial Mixup: Directional Loudness Modification as Data Augmentation for Sound Event Localization and Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022
Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
STARSS22: A Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022
2021
Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection.
CoRR, 2021
Manifold-Aware Deep Clustering: Maximizing Angles Between Embedding Vectors Based on Regular Simplex.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
ACCDOA: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization And Detection.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Sound Event Localization and Detection Using Activity-Coupled Cartesian DOA Vector and RD3net.
CoRR, 2020