Yuhta Takida

Orcid: 0000-0001-7384-0842

According to our database1, Yuhta Takida authored at least 37 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes.
Trans. Mach. Learn. Res., 2024

TraSCE: Trajectory Steering for Concept Erasure.
CoRR, 2024

Classifier-Free Guidance inside the Attraction Basin May Cause Memorization.
CoRR, 2024

Music Foundation Model as Generic Booster for Music Downstream Tasks.
CoRR, 2024

Mitigating Embedding Collapse in Diffusion Models for Categorical Data.
CoRR, 2024

G2D2: Gradient-guided Discrete Diffusion for image inverse problem solving.
CoRR, 2024

Distillation of Discrete Diffusion through Dimensional Correlations.
CoRR, 2024

<i>Jump Your Steps</i>: Optimizing Sampling Schedule of Discrete Diffusion Models.
CoRR, 2024

VRVQ: Variable Bitrate Residual Vector Quantization for Audio Compression.
CoRR, 2024

DisMix: Disentangling Mixtures of Musical Instruments for Source-level Pitch and Timbre Manipulation.
CoRR, 2024

MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training.
CoRR, 2024

SoundCTM: Uniting Score-based and Consistency Models for Text-to-Sound Generation.
CoRR, 2024

PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher.
CoRR, 2024

Understanding Multimodal Contrastive Learning Through Pointwise Mutual Information.
CoRR, 2024

SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Manifold Preserving Guided Diffusion.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

BIGVSAN: Enhancing Gan-Based Neural Vocoders with Slicing Adversarial Network.
Proceedings of the IEEE International Conference on Acoustics, 2024

On the Language Encoder of Contrastive Cross-modal Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
On the Equivalence of Consistency-Type Models: Consistency Models, Consistent Diffusion Models, and Fokker-Planck Regularization.
CoRR, 2023

Adversarially Slicing Generative Networks: Discriminator Slices Feature for One-Dimensional Optimal Transport.
CoRR, 2023

Automatic Piano Transcription With Hierarchical Frequency-Time Transformer.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration.
Proceedings of the International Conference on Machine Learning, 2023

FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation.
Proceedings of the International Conference on Machine Learning, 2023

Unsupervised Vocal Dereverberation with Diffusion-Based Generative Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Preventing oversmoothing in VAE via generalized variance parameterization.
Neurocomputing, 2022

A Versatile Diffusion-based Generative Refiner for Speech Enhancement.
CoRR, 2022

Regularizing Score-based Models with Score Fokker-Planck Equations.
CoRR, 2022

SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization.
Proceedings of the International Conference on Machine Learning, 2022

2021
Preventing Posterior Collapse Induced by Oversmoothing in Gaussian VAE.
CoRR, 2021

Fast Convergent Method for Active Noise Control Over Spatial Region with Causal Constraint.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

2020
Reciprocity gap functional in spherical harmonic domain for gridless sound field decomposition.
Signal Process., 2020

Array-Geometry-Aware Spatial Active Noise Control Based on Direction-of-Arrival Weighting.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Robust Gridless Sound Field Decomposition Based on Structured Reciprocity Gap Functional in Spherical Harmonic Domain.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Gridless Sound Field Decomposition Based on Reciprocity Gap Functional in Spherical Harmonic Domain.
Proceedings of the 10th IEEE Sensor Array and Multichannel Signal Processing Workshop, 2018

Exterior and Interior Sound Field Separation Using Convex Optimization: Comparison of Signal Models.
Proceedings of the 26th European Signal Processing Conference, 2018


  Loading...