Shusuke Takahashi

According to our database1, Shusuke Takahashi authored at least 36 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
The whole is greater than the sum of its parts: improving music source separation by bridging networks.
EURASIP J. Audio Speech Music. Process., December, 2024

The Sound Demixing Challenge 2023 - Cinematic Demixing Track.
Trans. Int. Soc. Music. Inf. Retr., January, 2024

Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models.
CoRR, 2024

SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond.
CoRR, 2024

MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training.
CoRR, 2024

Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation.
CoRR, 2024

Zero- and Few-Shot Sound Event Localization and Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024

Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023


The Sound Demixing Challenge 2023 - Cinematic Demixing Track.
CoRR, 2023

The Whole Is Greater than the Sum of Its Parts: Improving DNN-based Music Source Separation.
CoRR, 2023

Diffusion-based Signal Refiner for Speech Separation.
CoRR, 2023

Extending Audio Masked Autoencoders toward Audio Restoration.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

An Attention-Based Approach to Hierarchical Multi-Label Music Instrument Classification.
Proceedings of the IEEE International Conference on Acoustics, 2023

Diffroll: Diffusion-Based Generative Music Transcription with Unsupervised Pretraining Capability.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022


An Approach to Collecting Object Graphs for Data-structure Live Programming Based on a Language Implementation Framework.
J. Inf. Process., 2022

Preventing oversmoothing in VAE via generalized variance parameterization.
Neurocomputing, 2022

A Versatile Diffusion-based Generative Refiner for Speech Enhancement.
CoRR, 2022

SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization.
Proceedings of the International Conference on Machine Learning, 2022

Multi-ACCDOA: Localizing And Detecting Overlapping Sounds From The Same Class With Auxiliary Duplicating Permutation Invariant Training.
Proceedings of the IEEE International Conference on Acoustics, 2022

Improving Character Error Rate is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-Box Acoustic Models.
Proceedings of the IEEE International Conference on Acoustics, 2022

Spatial Mixup: Directional Loudness Modification as Data Augmentation for Sound Event Localization and Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

Music Source Separation With Deep Equilibrium Models.
Proceedings of the IEEE International Conference on Acoustics, 2022

STARSS22: A Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021
Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection.
CoRR, 2021

Preventing Posterior Collapse Induced by Oversmoothing in Gaussian VAE.
CoRR, 2021

Manifold-Aware Deep Clustering: Maximizing Angles Between Embedding Vectors Based on Regular Simplex.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Accdoa: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization And Detection.
Proceedings of the IEEE International Conference on Acoustics, 2021

All For One And One For All: Improving Music Separation By Bridging Networks.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Sound Event Localization and Detection Using Activity-Coupled Cartesian DOA Vector and RD3net.
CoRR, 2020


  Loading...