Yuma Koizumi
Orcid: 0000-0003-3645-6213
According to our database1,
Yuma Koizumi
authored at least 61 papers
between 2014 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
2023
Description and Discussion on DCASE 2023 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring.
CoRR, 2023
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
2022
Description and Discussion on DCASE 2022 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques.
CoRR, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Wavefit: an Iterative and Non-Autoregressive Neural Vocoder Based on Fixed-Point Iteration.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Description and Discussion on DCASE 2022 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022
2021
Deep Griffin-Lim Iteration: Trainable Iterative Phase Reconstruction Using Neural Network.
IEEE J. Sel. Top. Signal Process., 2021
Description and Discussion on DCASE 2021 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring under Domain Shifted Conditions.
CoRR, 2021
DF-Conformer: Integrated Architecture of Conv-Tasnet and Conformer Using Linear Complexity Self-Attention for Speech Enhancement.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021
Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method.
Proceedings of the 29th European Signal Processing Conference, 2021
Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech.
Proceedings of the 29th European Signal Processing Conference, 2021
Description and Discussion on DCASE 2021 Challenge Task 2: Unsupervised Anomalous Detection for Machine Condition Monitoring Under Domain Shifted Conditions.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021
2020
Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval.
CoRR, 2020
The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning with Keywords and Sentence Length Estimation.
CoRR, 2020
Crossmodal Sound Retrieval Based on Specific Target Co-Occurrence Denoted with Weak Labels.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Sound Event Localization Based on Sound Intensity Vector Refined by Dnn-Based Denoising and Source Separation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Stable Training of Dnn for Speech Enhancement Based on Perceptually-Motivated Black-Box Cost Function.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Sound Event Detection by Multitask Learning of Sound Events and Scenes with Soft Scene Labels.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020
Description and Discussion on DCASE2020 Challenge Task2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020
2019
Unsupervised Detection of Anomalous Sound Based on Deep Learning and the Neyman-Pearson Lemma.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
DOA Estimation by DNN-based Denoising and Dereverberation from Sound Intensity Vector.
CoRR, 2019
Batch Uniformization for Minimizing Maximum Anomaly Score of Dnn-Based Anomaly Detection in Sounds.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
ToyADMOS: A Dataset of Miniature-Machine Operating Sounds for Anomalous Sound Detection.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Finding Low-Dimensional Dynamical Structure Through Variational Auto-Encoding Dynamic Mode Decomposition.
Proceedings of the 29th IEEE International Workshop on Machine Learning for Signal Processing, 2019
AdaFlow: Domain-adaptive Density Estimator with Application to Anomaly Detection and Unpaired Cross-domain Translation.
Proceedings of the IEEE International Conference on Acoustics, 2019
Data-driven Design of Perfect Reconstruction Filterbank for DNN-based Sound Source Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
SNIPER: Few-shot Learning for Anomaly Detection to Minimize False-negative Rate with Ensured True-positive Rate.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Context-Aware Neural Voice Activity Detection Using Auxiliary Networks for Phoneme Recognition, Speech Enhancement and Acoustic Scene Classification.
Proceedings of the 27th European Signal Processing Conference, 2019
First Order Ambisonics Domain Spatial Augmentation for DNN-based Direction of Arrival Estimation.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019
2018
IEEE ACM Trans. Audio Speech Lang. Process., 2018
DNN-Based Near- and Far-Field Source Separation Using Spherical-Harmonic-Analysis-Based Acoustic Features.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
End-to-End Sound Source Enhancement Using Deep Neural Network in the Modified Discrete Cosine Transform Domain.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Distant Noise Reduction Based on Multi-delay Noise Model Using Distributed Microphone Array.
Proceedings of the 26th European Signal Processing Conference, 2018
2017
Informative Acoustic Feature Selection to Maximize Mutual Information for Collecting Target Sources.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Supervised source enhancement composed of nonnegative auto-encoders and complementarity subtraction.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
DNN-based source enhancement self-optimized by reinforcement learning using sound quality measurements.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Optimizing acoustic feature extractor for anomalous sound detection based on Neyman-Pearson lemma.
Proceedings of the 25th European Signal Processing Conference, 2017
2016
Binaural sound generation corresponding to omnidirectional video view using angular region-wise source enhancement.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Pinpoint extraction of distant sound source based on DNN mapping from multiple beamforming outputs to prior SNR.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Integrated approach of feature extraction and sound source enhancement based on maximization of mutual information.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
Informative acoustic feature selection on microphone array wiener filtering for collecting target source on sports ground.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015
Proceedings of the 10th Asia-Pacific Symposium on Information and Telecommunication Technologies, 2015
2014
Proceedings of the IEEE International Conference on Acoustics, 2014