Daisuke Niizumi

Orcid: 0000-0002-5063-0508

According to our database1, Daisuke Niizumi authored at least 28 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Masked Modeling Duo: Towards a Universal Audio Pre-Training Framework.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

SoundBeam meets M2D: Target Sound Extraction with Audio Foundation Model.
CoRR, 2024

Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring.
CoRR, 2024

M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation.
CoRR, 2024

Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection.
CoRR, 2024

Light Gated Multi Mini-Patch Extractor for Audio Classification.
Proceedings of the IEEE International Conference on Acoustics, 2024

Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval.
Proceedings of the 32nd European Signal Processing Conference, 2024

2023
BYOL for Audio: Exploring Pre-Trained General-Purpose Audio Representations.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Audio Difference Captioning Utilizing Similarity-Discrepancy Disentanglement.
CoRR, 2023

Description and Discussion on DCASE 2023 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring.
CoRR, 2023

Masked Modeling Duo for Speech: Specializing General-Purpose Audio Representation to Speech using Denoising Distillation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input.
Proceedings of the IEEE International Conference on Acoustics, 2023

First-Shot Anomaly Sound Detection for Machine Condition Monitoring: A Domain Generalization Baseline.
Proceedings of the 31st European Signal Processing Conference, 2023

Enhancing Spectrogram for Audio Classification Using Time-Frequency Enhancer.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Joint Analysis of Acoustic Scenes and Sound Events Based on Semi-Supervised Approach.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Description and Discussion on DCASE 2022 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques.
CoRR, 2022

ConceptBeam: Concept Driven Target Speech Extraction.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Introducing Auxiliary Text Query-modifier to Content-based Audio Retrieval.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model.
Proceedings of the 30th European Signal Processing Conference, 2022

Description and Discussion on DCASE 2022 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021
Description and Discussion on DCASE 2021 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring under Domain Shifted Conditions.
CoRR, 2021

BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation.
Proceedings of the International Joint Conference on Neural Networks, 2021

Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representation.
Proceedings of the HEAR: Holistic Evaluation of Audio Representations, 2021

Description and Discussion on DCASE 2021 Challenge Task 2: Unsupervised Anomalous Detection for Machine Condition Monitoring Under Domain Shifted Conditions.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

ToyADMOS2: Another Dataset of Miniature-Machine Operating Sounds for Anomalous Sound Detection under Domain Shift Conditions.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

2020
Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval.
CoRR, 2020

The Morandi Room: Entering the World of Morandi's Paintings Through Machine Learning.
Proceedings of the Advances in Artificial Intelligence, 2020

2018
Acoustic Scene Classification: a Competition Review.
Proceedings of the 28th IEEE International Workshop on Machine Learning for Signal Processing, 2018


  Loading...