Daisuke Niizumi

ORCID: 0000-0002-5063-0508

According to our database¹, Daisuke Niizumi authored at least 28 papers between 2018 and 2024.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Bibliography

2024

Masked Modeling Duo: Towards a Universal Audio Pre-Training Framework.

Daisuke Niizumi, [...], Yasunori Ohishi, [...]

IEEE ACM Trans. Audio Speech Lang. Process., 2024

SoundBeam meets M2D: Target Sound Extraction with Audio Foundation Model.

Carlos Hernandez-Olivan, [...], Daisuke Niizumi, [...], Tomohiro Nakatani, [...]

CoRR, 2024

Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring.

[...], Daisuke Niizumi, Davide Albertini, Roberto Sannino, Simone Pradolini, Filippo Augusti, [...], Yohei Kawaguchi

CoRR, 2024

M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation.

Daisuke Niizumi, [...], Yasunori Ohishi, [...], Masahiro Yasuda, Shunsuke Tsubaki, [...]

CoRR, 2024

Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection.

Daisuke Niizumi, [...], Yasunori Ohishi, [...]

CoRR, 2024

Light Gated Multi Mini-Patch Extractor for Audio Classification.

[...], Daisuke Niizumi, [...]

Proceedings of the IEEE International Conference on Acoustics, 2024

Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval.

Shunsuke Tsubaki, Daisuke Niizumi, [...], Yasunori Ohishi, [...]

Proceedings of the 32nd European Signal Processing Conference, 2024

2023

BYOL for Audio: Exploring Pre-Trained General-Purpose Audio Representations.

Daisuke Niizumi, [...], Yasunori Ohishi, [...]

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Audio Difference Captioning Utilizing Similarity-Discrepancy Disentanglement.

[...], Yasunori Ohishi, Daisuke Niizumi, [...]

CoRR, 2023

Description and Discussion on DCASE 2023 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring.

[...], Daisuke Niizumi, [...], Yohei Kawaguchi

CoRR, 2023

Masked Modeling Duo for Speech: Specializing General-Purpose Audio Representation to Speech using Denoising Distillation.

Daisuke Niizumi, [...], Yasunori Ohishi, [...]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input.

Daisuke Niizumi, [...], Yasunori Ohishi, [...]

Proceedings of the IEEE International Conference on Acoustics, 2023

First-Shot Anomaly Sound Detection for Machine Condition Monitoring: A Domain Generalization Baseline.

[...], Daisuke Niizumi, Yasunori Ohishi, [...], Masahiro Yasuda

Proceedings of the 31st European Signal Processing Conference, 2023

Enhancing Spectrogram for Audio Classification Using Time-Frequency Enhancer.

[...], Daisuke Niizumi, [...]

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Joint Analysis of Acoustic Scenes and Sound Events Based on Semi-Supervised Approach.

[...], Shunsuke Tsubaki, Daisuke Niizumi, [...], Yasunori Ohishi, [...]

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022

Description and Discussion on DCASE 2022 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques.

[...], Daisuke Niizumi, [...], Masaaki Yamamoto, Yohei Kawaguchi

CoRR, 2022

ConceptBeam: Concept Driven Target Speech Extraction.

Yasunori Ohishi, [...], Daisuke Niizumi, [...]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Introducing Auxiliary Text Query-modifier to Content-based Audio Retrieval.

[...], Yasunori Ohishi, Daisuke Niizumi, [...]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model.

Daisuke Niizumi, [...], Yasunori Ohishi, [...]

Proceedings of the 30th European Signal Processing Conference, 2022

Description and Discussion on DCASE 2022 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques.

[...], Daisuke Niizumi, [...], Masaaki Yamamoto, Yohei Kawaguchi

Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021

Description and Discussion on DCASE 2021 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring under Domain Shifted Conditions.

Yohei Kawaguchi, [...], Daisuke Niizumi, [...]

CoRR, 2021

BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation.

Daisuke Niizumi, [...], Yasunori Ohishi, [...]

Proceedings of the International Joint Conference on Neural Networks, 2021

Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representation.

Daisuke Niizumi, [...], Yasunori Ohishi, [...]

Proceedings of the HEAR: Holistic Evaluation of Audio Representations, 2021

Description and Discussion on DCASE 2021 Challenge Task 2: Unsupervised Anomalous Detection for Machine Condition Monitoring Under Domain Shifted Conditions.

Yohei Kawaguchi, [...], Daisuke Niizumi, [...]

Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

ToyADMOS2: Another Dataset of Miniature-Machine Operating Sounds for Anomalous Sound Detection under Domain Shift Conditions.

[...], Daisuke Niizumi, [...], Yasunori Ohishi, Masahiro Yasuda, Shoichiro Saito

Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

2020

Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval.

[...], Yasunori Ohishi, Daisuke Niizumi, [...], Masahiro Yasuda

CoRR, 2020

The Morandi Room: Entering the World of Morandi's Paintings Through Machine Learning.

Shigeru Kobayashi, [...], Yoshiyuki Otani, [...], Daisuke Niizumi

Proceedings of the Advances in Artificial Intelligence, 2020

2018

Acoustic Scene Classification: a Competition Review.

[...], Daisuke Niizumi, Tuukka Senttula, [...], Tuomas Virtanen, Heikki Huttunen

Proceedings of the 28th IEEE International Workshop on Machine Learning for Signal Processing, 2018
