Noboru Harada

ORCID: 0000-0002-1759-4533

According to our database, Noboru Harada authored at least 102 papers between 1995 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Natural Gradient Primal-Dual Method for Decentralized Learning.
IEEE Trans. Signal Inf. Process. over Networks, 2024

Masked Modeling Duo: Towards a Universal Audio Pre-Training Framework.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring.
CoRR, 2024

M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation.
CoRR, 2024

Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection.
CoRR, 2024

Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis.
CoRR, 2024

Unrestricted Global Phase Bias-Aware Single-Channel Speech Enhancement with Conformer-Based Metric GAN.
Proceedings of the IEEE International Conference on Acoustics, 2024

6DoF SELD: Sound Event Localization and Detection Using Microphones and Motion Tracking Sensors on Self-Motioning Human.
Proceedings of the IEEE International Conference on Acoustics, 2024

Light Gated Multi Mini-Patch Extractor for Audio Classification.
Proceedings of the IEEE International Conference on Acoustics, 2024

Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval.
Proceedings of the 32nd European Signal Processing Conference, 2024

Learning to Assess Subjective Impressions from Speech.
Proceedings of the 32nd European Signal Processing Conference, 2024

2023
BYOL for Audio: Exploring Pre-Trained General-Purpose Audio Representations.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Acousto-optic reconstruction of exterior sound field based on concentric circle sampling with circular harmonic expansion.
CoRR, 2023

Audio Difference Captioning Utilizing Similarity-Discrepancy Disentanglement.
CoRR, 2023

Description and Discussion on DCASE 2023 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring.
CoRR, 2023

Deep sound-field denoiser: optically-measured sound-field denoising using deep neural network.
CoRR, 2023

Masked Modeling Duo for Speech: Specializing General-Purpose Audio Representation to Speech using Denoising Distillation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input.
Proceedings of the IEEE International Conference on Acoustics, 2023

W2N-AVSC: Audiovisual Extension For Whisper-To-Normal Speech Conversion.
Proceedings of the 31st European Signal Processing Conference, 2023

First-Shot Anomaly Sound Detection for Machine Condition Monitoring: A Domain Generalization Baseline.
Proceedings of the 31st European Signal Processing Conference, 2023

Enhancing Spectrogram for Audio Classification Using Time-Frequency Enhancer.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Joint Analysis of Acoustic Scenes and Sound Events Based on Semi-Supervised Approach.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Description and Discussion on DCASE 2022 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques.
CoRR, 2022

ConceptBeam: Concept Driven Target Speech Extraction.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Introducing Auxiliary Text Query-modifier to Content-based Audio Retrieval.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Multi-View And Multi-Modal Event Detection Utilizing Transformer-Based Multi-Sensor Fusion.
Proceedings of the IEEE International Conference on Acoustics, 2022

Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model.
Proceedings of the 30th European Signal Processing Conference, 2022

Description and Discussion on DCASE 2022 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Prototyping of low-cost color enhancement lighting using multicolor LEDs.
Proceedings of the Color Imaging XXVII: Displaying, 2022

2021
Deep Griffin-Lim Iteration: Trainable Iterative Phase Reconstruction Using Neural Network.
IEEE J. Sel. Top. Signal Process., 2021

Color saturation control by modulating spectral power distribution of illumination using color enhancement factors.
J. Electronic Imaging, 2021

Description and Discussion on DCASE 2021 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring under Domain Shifted Conditions.
CoRR, 2021

R-R Interval Estimation for Wearable Electrocardiogram Based on Single Complex Wavelet Filtering and Morphology-Based Peak Selection.
IEEE Access, 2021

BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation.
Proceedings of the International Joint Conference on Neural Networks, 2021

Asynchronous Decentralized Optimization With Implicit Stochastic Variance Reduction.
Proceedings of the 38th International Conference on Machine Learning, 2021

An Extension of Sparse Audio Declipper to Multiple Measurement Vectors.
Proceedings of the IEEE International Conference on Acoustics, 2021

Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representation.
Proceedings of the HEAR: Holistic Evaluation of Audio Representations, 2021

Description and Discussion on DCASE 2021 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Under Domain Shifted Conditions.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

ToyADMOS2: Another Dataset of Miniature-Machine Operating Sounds for Anomalous Sound Detection under Domain Shift Conditions.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

2020
Microphone Array Wiener Post Filtering Using Monotone Operator Splitting.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Multi-Delay Sparse Approach to Residual Crosstalk Reduction for Blind Source Separation.
IEEE Signal Process. Lett., 2020

The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning with Keywords and Sentence Length Estimation.
CoRR, 2020

Edge-consensus Learning: Deep Learning on P2P Networks with Nonhomogeneous Data.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Crossmodal Sound Retrieval Based on Specific Target Co-Occurrence Denoted with Weak Labels.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Invertible DNN-Based Nonlinear Time-Frequency Transform for Speech Enhancement.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Real-Time Speech Enhancement Using Equilibriated RNN.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Phase Reconstruction Based On Recurrent Phase Unwrapping With Deep Neural Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Subjective Quality Estimation Using PESQ For Hands-Free Terminals.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

SPIDERnet: Attention Network For One-Shot Anomaly Detection In Sounds.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Frequency-Domain BSS Method Based on ℓ1 Norm, Unitary Constraint, and Cayley Transform.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Effects of Word-Frequency Based Pre- and Post- Processings for Audio Captioning.
Proceedings of the 5th Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

Description and Discussion on DCASE2020 Challenge Task2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring.
Proceedings of the 5th Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

2019
Unsupervised Detection of Anomalous Sound Based on Deep Learning and the Neyman-Pearson Lemma.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Batch Uniformization for Minimizing Maximum Anomaly Score of DNN-Based Anomaly Detection in Sounds.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

ToyADMOS: A Dataset of Miniature-Machine Operating Sounds for Anomalous Sound Detection.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Finding Low-Dimensional Dynamical Structure Through Variational Auto-Encoding Dynamic Mode Decomposition.
Proceedings of the 29th IEEE International Workshop on Machine Learning for Signal Processing, 2019

AdaFlow: Domain-adaptive Density Estimator with Application to Anomaly Detection and Unpaired Cross-domain Translation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Data-driven Design of Perfect Reconstruction Filterbank for DNN-based Sound Source Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2019

Function Designable Beamformer Based on Probabilistic Assumptions on Filter and Its Auxiliary Variables.
Proceedings of the IEEE International Conference on Acoustics, 2019

Non-negative Matrix Factorization Using Bregman Monotone Operator Splitting.
Proceedings of the IEEE International Conference on Acoustics, 2019

Deep Griffin-Lim Iteration.
Proceedings of the IEEE International Conference on Acoustics, 2019

SNIPER: Few-shot Learning for Anomaly Detection to Minimize False-negative Rate with Ensured True-positive Rate.
Proceedings of the IEEE International Conference on Acoustics, 2019

Trainable Adaptive Window Switching for Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Two-class Hyper-spherical Autoencoder for Supervised Anomaly Detection.
Proceedings of the IEEE International Conference on Acoustics, 2019

First Order Ambisonics Domain Spatial Augmentation for DNN-based Direction of Arrival Estimation.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

2018
Optimal Golomb-Rice Code Extension for Lossless Coding of Low-Entropy Exponentially Distributed Sources.
IEEE Trans. Inf. Theory, 2018

Distortionless Beamforming Optimized With ℓ1-Norm Minimization.
IEEE Signal Process. Lett., 2018

DNN-Based Near- and Far-Field Source Separation Using Spherical-Harmonic-Analysis-Based Acoustic Features.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Residual Echo Reduction for Multichannel Echo Canceller Using Echo Replica.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

End-to-End Sound Source Enhancement Using Deep Neural Network in the Modified Discrete Cosine Transform Domain.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Complementary Set Variational Autoencoder for Supervised Anomaly Detection.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Sound Field Decomposition Using SPICE Decomposition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Distant Noise Reduction Based on Multi-delay Noise Model Using Distributed Microphone Array.
Proceedings of the 26th European Signal Processing Conference, 2018

2017
Experimental evaluation of encoding parameters of MPEG-4 ALS for high-resolution audio.
Proceedings of the IEEE 6th Global Conference on Consumer Electronics, 2017

Optimizing acoustic feature extractor for anomalous sound detection based on Neyman-Pearson lemma.
Proceedings of the 25th European Signal Processing Conference, 2017

2015
RTP Payload Format for G.711.0.
RFC, November, 2015

A High-Speed Camera-Based Approach to Massive Sound Sensing With Optical Wireless Acoustic Sensors.
IEEE Trans. Computational Imaging, 2015

Optimal Coding of Generalized-Gaussian-Distributed Frequency Spectra for Low-Delay Audio Coder With Powered All-Pole Spectrum Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Resolution Warped Spectral Representation for Low-Delay and Low-Bit-Rate Audio Coder.
IEEE ACM Trans. Audio Speech Lang. Process., 2015


Adaptive selection of lag-window shape for linear predictive analysis in the 3GPP EVS codec.
Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015

Harmonic model for MDCT based audio coding with LPC envelope.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
Golomb-Rice coding optimized via LPC for frequency domain audio coder.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

Direct linear conversion of LSP parameters for perceptual control in speech and audio coding.
Proceedings of the 22nd European Signal Processing Conference, 2014

Representation of spectral envelope with warped frequency resolution for audio coder.
Proceedings of the 22nd European Signal Processing Conference, 2014

Adaptive post-filtering controlled by pitch frequency for CELP-based speech coder.
Proceedings of the 48th Asilomar Conference on Signals, Systems and Computers, 2014

2013
Image processing techniques for high speed camera-based free-field optical communication.
Proceedings of the 2013 IEEE International Conference on Signal and Image Processing Applications, 2013

A playback system that synchronizes the musical phrases with listener's respiration phases.
Proceedings of the 2013 ACM SIGCHI Conference on Human Factors in Computing Systems, 2013

2010
Archive and Preservation of Media Content Using MPEG-A.
IEEE Multim., 2010

Enhanced lossless coding tools for prediction residual.
Proceedings of the IEEE International Conference on Acoustics, 2010

Low-complexity PARCOR coefficient quantizer and prediction order estimator for lossless speech coding.
Proceedings of the IEEE International Conference on Acoustics, 2010

Emerging ITU-T standard G.711.0 - lossless compression of G.711 pulse code modulation.
Proceedings of the IEEE International Conference on Acoustics, 2010

Escaped-Huffman and adaptive recursive Rice coding for lossless compression of the mapped domain linear prediction residual.
Proceedings of the IEEE International Conference on Acoustics, 2010

Enhanced Lossless Coding Tools of LPC Residual for ITU-T G.711.0.
Proceedings of the 2010 Data Compression Conference (DCC 2010), 2010

Low-Complexity PARCOR Coefficient Quantizer and Prediction Order Estimator for G.711.0 (Lossless Speech Coding).
Proceedings of the 2010 Data Compression Conference (DCC 2010), 2010

Lossless Compression of Mapped Domain Linear Prediction Residual for ITU-T Recommendation G.711.0.
Proceedings of the 2010 Data Compression Conference (DCC 2010), 2010

2008
Multichannel Linear Prediction Method Compliant with the MPEG-4 ALS.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2008

Lossless compression of biomedical signals by MPEG-4 ALS with enhanced encoding tools.
Proceedings of the International Workshop on Multimedia Signal Processing, 2008

Interchannel dependency analysis of biomedical signals for efficient lossless compression by MPEG-4 ALS.
Proceedings of the IEEE International Conference on Acoustics, 2008

2002
A 5-kHz-bandwidth low-bit-rate speech coder: Bandwidth, noisiness, and opinion scores.
Syst. Comput. Jpn., 2002

1997
Automatic extraction and measurement of leukocyte motion in microvessels using spatiotemporal image analysis.
IEEE Trans. Biomed. Eng., 1997

1995
Measuring Microcirculation Using Spatiotemporal Image Analysis.
Proceedings of the Computer Vision, 1995

