Noboru Harada

J. Electronic Imaging, 2021

Description and Discussion on DCASE 2021 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring under Domain Shifted Conditions.

[BibT_eX]

[DOI]

CoRR, 2021

R-R Interval Estimation for Wearable Electrocardiogram Based on Single Complex Wavelet Filtering and Morphology-Based Peak Selection.

[BibT_eX]

[DOI]

IEEE Access, 2021

BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2021

Asynchronous Decentralized Optimization With Implicit Stochastic Variance Reduction.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

An Extension of Sparse Audio Declipper to Multiple Measurement Vectors.

[BibT_eX]

[DOI]

Satoru Emura

Proceedings of the IEEE International Conference on Acoustics, 2021

Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representation.

[BibT_eX]

[DOI]

Proceedings of the HEAR: Holistic Evaluation of Audio Representations, 2021

Description and Discussion on DCASE 2021 Challenge Task 2: Unsupervised Anomalous Detection for Machine Condition Monitoring Under Domain Shifted Conditions.

[BibT_eX]

[DOI]

Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

ToyADMOS2: Another Dataset of Miniature-Machine Operating Sounds for Anomalous Sound Detection under Domain Shift Conditions.

[BibT_eX]

[DOI]

Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

2020

Microphone Array Wiener Post Filtering Using Monotone Operator Splitting.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Multi-Delay Sparse Approach to Residual Crosstalk Reduction for Blind Source Separation.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2020

The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning with Keywords and Sentence Length Estimation.

[BibT_eX]

[DOI]

CoRR, 2020

Edge-consensus Learning: Deep Learning on P2P Networks with Nonhomogeneous Data.

[BibT_eX]

[DOI]

Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Crossmodal Sound Retrieval Based on Specific Target Co-Occurrence Denoted with Weak Labels.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Invertible DNN-Based Nonlinear Time-Frequency Transform for Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Real-Time Speech Enhancement Using Equilibriated RNN.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Phase Reconstruction Based On Recurrent Phase Unwrapping With Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Subjective Quality Estimation Using PESQ For Hands-Free Terminals.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

SPIDERnet: Attention Network For One-Shot Anomaly Detection In Sounds.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Frequency-Domain BSS Method Based on ℓ1 Norm, Unitary Constraint, and Cayley Transform.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Effects of Word-Frequency Based Pre- and Post- Processings for Audio Captioning.

[BibT_eX]

[DOI]

Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

Description and Discussion on DCASE2020 Challenge Task2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring.

[BibT_eX]

[DOI]

Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

2019

Unsupervised Detection of Anomalous Sound Based on Deep Learning and the Neyman-Pearson Lemma.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Batch Uniformization for Minimizing Maximum Anomaly Score of Dnn-Based Anomaly Detection in Sounds.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

ToyADMOS: A Dataset of Miniature-Machine Operating Sounds for Anomalous Sound Detection.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Finding Low-Dimensional Dynamical Structure Through Variational Auto-Encoding Dynamic Mode Decomposition.

[BibT_eX]

[DOI]

Shin Murata

Proceedings of the 29th IEEE International Workshop on Machine Learning for Signal Processing, 2019

AdaFlow: Domain-adaptive Density Estimator with Application to Anomaly Detection and Unpaired Cross-domain Translation.

[BibT_eX]

[DOI]

Masataka Yamaguchi

Proceedings of the IEEE International Conference on Acoustics, 2019

Data-driven Design of Perfect Reconstruction Filterbank for DNN-based Sound Source Enhancement.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Function Designable Beamformer Based on Probabilistic Assumptions on Filter and Its Auxiliary Variables.

[BibT_eX]

[DOI]

Ryotaro Sato

Kenta Niwa

Proceedings of the IEEE International Conference on Acoustics, 2019

Non-negative Matrix Factorization Using Bregman Monotone Operator Splitting.

[BibT_eX]

[DOI]

Kenta Niwa

Proceedings of the IEEE International Conference on Acoustics, 2019

Deep Griffin-Lim Iteration.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

SNIPER: Few-shot Learning for Anomaly Detection to Minimize False-negative Rate with Ensured True-positive Rate.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Trainable Adaptive Window Switching for Speech Enhancement.

[BibT_eX]

[DOI]

Yoichi Haneda

Proceedings of the IEEE International Conference on Acoustics, 2019

A Two-class Hyper-spherical Autoencoder for Supervised Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

First Order Ambisonics Domain Spatial Augmentation for DNN-based Direction of Arrival Estimation.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

2018

Optimal Golomb-Rice Code Extension for Lossless Coding of Low-Entropy Exponentially Distributed Sources.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2018

Distortionless Beamforming Optimized With ℓ<sub>1</sub>-Norm Minimization.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2018

DNN-Based Near- and Far-Field Source Separation Using Spherical-Harmonic-Analysis-Based Acoustic Features.

[BibT_eX]

[DOI]

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Residual Echo Reduction for Multichannel Echo Canceller Using Echo Replica.

[BibT_eX]

[DOI]

Satoru Emura

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

End-to-End Sound Source Enhancement Using Deep Neural Network in the Modified Discrete Cosine Transform Domain.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Complementary Set Variational Autoencoder for Supervised Anomaly Detection.

[BibT_eX]

[DOI]

Yuta Kawachi

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Sound Field Decomposition Using SPICE Decomposition.

[BibT_eX]

[DOI]

Satoru Emura

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Distant Noise Reduction Based on Multi-delay Noise Model Using Distributed Microphone Array.

[BibT_eX]

[DOI]

Proceedings of the 26th European Signal Processing Conference, 2018

2017

Experimental evaluation of encoding parameters of MPEG-4 ALS for high-resolution audio.

[BibT_eX]

[DOI]

Proceedings of the IEEE 6th Global Conference on Consumer Electronics, 2017

Optimizing acoustic feature extractor for anomalous sound detection based on Neyman-Pearson lemma.

[BibT_eX]

[DOI]

Proceedings of the 25th European Signal Processing Conference, 2017

2015

RTP Payload Format for G.711.0.

[BibT_eX]

[DOI]

Michael A. Ramalho

Paul E. Jones

Muthu Arul Mozhi Perumal

Lei Miao

RFC, November, 2015

A High-Speed Camera-Based Approach to Massive Sound Sensing With Optical Wireless Acoustic Sensors.

[BibT_eX]

[DOI]

IEEE Trans. Computational Imaging, 2015

Optimal Coding of Generalized-Gaussian-Distributed Frequency Spectra for Low-Delay Audio Coder With Powered All-Pole Spectrum Estimation.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Resolution Warped Spectral Representation for Low-Delay and Low-Bit-Rate Audio Coder.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Standardization of the new 3GPP EVS codec.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Adaptive selection of lag-window shape for linear predictive analysis in the 3GPP EVS codec.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015

Harmonic model for MDCT based audio coding with LPC envelope.

[BibT_eX]

[DOI]

Christian R. Helmrich

Guillaume Fuchs

Proceedings of the 23rd European Signal Processing Conference, 2015

2014

Golomb-rice coding optimized via LPC for frequency domain audio coder.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

Direct linear conversion of LSP parameters for perceptual control in speech and audio coding.

[BibT_eX]

[DOI]

Proceedings of the 22nd European Signal Processing Conference, 2014

Representation of spectral envelope with warped frequency resolution for audio coder.

[BibT_eX]

[DOI]

Proceedings of the 22nd European Signal Processing Conference, 2014

Adaptive post-filtering controlled by pitch frequency for CELP-based speech coder.

[BibT_eX]

[DOI]

Proceedings of the 48th Asilomar Conference on Signals, Systems and Computers, 2014

2013

Image processing techniques for high speed camera-based free-field optical communication.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Conference on Signal and Image Processing Applications, 2013

A playback system that synchronizes the musical phrases with listener's respiration phases.

[BibT_eX]

[DOI]

Proceedings of the 2013 ACM SIGCHI Conference on Human Factors in Computing Systems, 2013

2010

Archive and Preservation of Media Content Using MPEG-A.

[BibT_eX]

[DOI]

IEEE Multim., 2010

Enhanced lossless coding tools for prediction residual.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Low-complexity PARCOR coefficient quantizer and prediction order estimator for lossless speech coding.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Emerging ITU-T standard G.711.0 - lossless compression of G.711 pulse code modulation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Escaped-Huffman and adaptive recursive rice coding for lossless compression of the mapped domain linear prediction residual.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Enhanced Lossless Coding Tools of LPC Residual for ITU-T G.711.0.

[BibT_eX]

[DOI]

Proceedings of the 2010 Data Compression Conference (DCC 2010), 2010

Low-Complexity PARCOR Coefficient Quantizer and Prediction Order Estimator for G.711.0 (Lossless Speech Coding).

[BibT_eX]

[DOI]

Proceedings of the 2010 Data Compression Conference (DCC 2010), 2010

Lossless Compression of Mapped Domain Linear Prediction Residual for ITU-T Recommendation G.711.0.

[BibT_eX]

[DOI]

Proceedings of the 2010 Data Compression Conference (DCC 2010), 2010

2008

Multichannel Linear Prediction Method Compliant with the MPEG-4 ALS.

[BibT_eX]

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2008

Lossless compression of biomedical signals by MPEG-4 ALS with enhanced encoding tools.

[BibT_eX]

[DOI]

Proceedings of the International Workshop on Multimedia Signal Processing, 2008

Interchannel dependency analysis of biomedical signals for efficient lossless compression by MPEG-4 ALS.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

2002

A 5-kHz-bandwidth low-bit-rate speech coder: Bandwidth, noisiness, and opinion scores.

[BibT_eX]

[DOI]