Nobutaka Ono
Orcid: 0000-0003-4242-2773
According to our database, Nobutaka Ono authored at least 228 papers between 1999 and 2024.
Bibliography
2024
End-to-end training of acoustic scene classification using distributed sound-to-light conversion devices: verification through simulation experiments.
EURASIP J. Audio Speech Music. Process., December, 2024
Efficient Joint Optimization of Sampling Rate Offsets Using Entire Multichannel Signal.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Causal and Relaxed-Distortionless Response Beamforming for Online Target Source Extraction.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis.
CoRR, 2024
Flexible and Comprehensive Framework of Element Selection Based on Nonconvex Sparse Optimization.
IEEE Access, 2024
Complexity Reduction for Classification of Musical Instruments Using Element Selection.
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024
2023
Signal processing and machine learning for speech and audio in acoustic sensor networks.
EURASIP J. Audio Speech Music. Process., December, 2023
Acoustic object canceller: removing a known signal from monaural recording using blind synchronization.
EURASIP J. Audio Speech Music. Process., December, 2023
Sound Field Interpolation for Rotation-Invariant Multichannel Array Signal Processing.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Multi-Channel Target Speaker Extraction with Refinement: The WavLab Submission to the Second Clarity Enhancement Challenge.
CoRR, 2023
IEEE Access, 2023
Signal Reconstruction from Mel-Spectrogram Based on Bi-Level Consistency of Full-Band Magnitude and Phase.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023
Fast Online Source Steering Algorithm for Tracking Single Moving Source Using Online Independent Vector Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2023
Element Selection with Wide Class of Optimization Criteria Using Non-Convex Sparse Optimization.
Proceedings of the IEEE International Conference on Acoustics, 2023
Effectiveness of Inter- and Intra-Subarray Spatial Features for Acoustic Scene Classification.
Proceedings of the IEEE International Conference on Acoustics, 2023
Multi-Channel Speaker Extraction with Adversarial Training: The Wavlab Submission to The Clarity ICASSP 2023 Grand Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023
Security Evaluation of Compressible and Learnable Image Encryption against Jigsaw Puzzle Solver Attacks.
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023
Acoustic Traffic Monitoring Based on Deep Neural Network Trained by Stereo-Recorded Sound and Sensor Data.
Proceedings of the 31st European Signal Processing Conference, 2023
Proceedings of the 31st European Signal Processing Conference, 2023
Fundamental Frequency Estimation Based on Finite-Order Harmonic Constraint Differential Equation.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
Automatic Call Classification of Autism Model Marmosets by Deep Learning and Analysis of Their Vocal Development.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
Augmentation of Various Speed Data by Controlling Frame Overlap for Acoustic Traffic Monitoring.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
2022
End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022
Use of Nods Less Synchronized with Turn-Taking and Prosody During Conversations in Adults with Autism.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Joint Optimization of Sampling Rate Offsets Based on Entire Signal Relationship Among Distributed Microphones.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Instantaneous Linear Dimensionality Reduction of Multichannel Time-Series Signal for Array Signal Processing.
Proceedings of the IEEE International Conference on Acoustics, 2022
Entrainment Analysis for Assessment of Autistic Speech Prosody Using Bottleneck Features of Deep Neural Network.
Proceedings of the IEEE International Conference on Acoustics, 2022
Missing data recovery using autoencoder for multi-channel acoustic scene classification.
Proceedings of the 30th European Signal Processing Conference, 2022
2021
Time-Frequency-Bin-Wise Linear Combination of Beamformers for Distortionless Signal Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
J. Inf. Process., 2021
Rotation-Robust Beamforming Based on Sound Field Interpolation with Regularly Circular Microphone Array.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Sampling Frequency Mismatch Estimation by Auxiliary-Function-Based Iterative Maximization of Double-Cross-Correlation.
Proceedings of the 29th European Signal Processing Conference, 2021
End-to-End Training for Acoustic Scene Analysis with Distributed Sound-to-Light Conversion Devices.
Proceedings of the 29th European Signal Processing Conference, 2021
A low-computational DNN-based speech enhancement for hearing aids based on element selection.
Proceedings of the 29th European Signal Processing Conference, 2021
Investigation on Spatial and Frequency-Based Features for Asynchronous Acoustic Scene Analysis.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Causal Distortionless Response Beamforming by Alternating Direction Method of Multipliers.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Self-rotation angle estimation of circular microphone array based on sound field interpolation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Analysis on Roles of DNNs in End-to-End Acoustic Scene Analysis Framework with Distributed Sound-to-Light Conversion Devices.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Framewise Finite Impulse Response Filtering Based on Time-Frequency Mask for Low-Latency Speech Enhancement.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
2020
Independent Low-Rank Matrix Analysis Based on Time-Variant Sub-Gaussian Source Model for Determined Blind Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
MM Algorithms for Joint Independent Subspace Analysis with Application to Blind Single and Multi-Source Extraction.
CoRR, 2020
Blinkies: Open Source Sound-to-Light Conversion Sensors for Large-Scale Acoustic Sensing and Applications.
IEEE Access, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020
Faster independent low-rank matrix analysis with pairwise updates of demixing vectors.
Proceedings of the 28th European Signal Processing Conference, 2020
Proceedings of the 28th European Signal Processing Conference, 2020
Dynamic synchronous averaging for enhancement of periodic signal under sampling frequency variation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
Experimental investigation of robustness of spatial cepstrum features under various recording conditions.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
2019
IEEE ACM Trans. Audio Speech Lang. Process., 2019
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Bilevel Optimization Using Stationary Point of Lower-Level Objective Function for Discriminative Basis Learning in Nonnegative Matrix Factorization.
IEEE Signal Process. Lett., 2019
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Proceedings of the 21st IEEE International Workshop on Multimedia Signal Processing, 2019
Time-frequency-bin-wise Switching of Minimum Variance Distortionless Response Beamformer for Underdetermined Situations.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Estimation of Sampling Frequency Mismatch between Distributed Asynchronous Microphones under Existence of Source Movements with Stationary Time Periods Detection.
Proceedings of the IEEE International Conference on Acoustics, 2019
CNN-based virtual microphone signal estimation for MPDR beamforming in underdetermined situations.
Proceedings of the 27th European Signal Processing Conference, 2019
Proceedings of the 27th European Signal Processing Conference, 2019
RU Multichannel Domestic Acoustic Scenes 2019: A Multichannel Dataset Recorded by Distributed Microphones with Various Properties.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019
Autism Spectrum Disorder Discrimination Based on Voice Activities Related to Fillers and Laughter.
Proceedings of the 53rd Annual Conference on Information Sciences and Systems, 2019
Projection Back onto Filtered Observations for Speech Separation with Distributed Microphone Array.
Proceedings of the 8th IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, 2019
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Griffin-Lim phase reconstruction using short-time Fourier transform with zero-padded frame analysis.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
2018
Generalized independent low-rank matrix analysis using heavy-tailed distributions for blind source separation.
EURASIP J. Adv. Signal Process., 2018
Proceedings of the 16th Annual International Conference on Mobile Systems, 2018
Comparison of Reference Microphone Selection Algorithms for Distributed Microphone Array Based Speech Enhancement in Meeting Recognition Scenarios.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
Meeting Recognition with Asynchronous Distributed Microphone Array Using Block-Wise Refinement of Mask-Based MVDR Beamformer.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Orthogonally-Constrained Extraction of Independent Non-Gaussian Component from Non-Gaussian Background Without ICA.
Proceedings of the Latent Variable Analysis and Signal Separation, 2018
Time-Frequency-Bin-Wise Beamformer Selection and Masking for Speech Enhancement in Underdetermined Noisy Scenarios.
Proceedings of the 26th European Signal Processing Conference, 2018
Proceedings of the 26th European Signal Processing Conference, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
Maximum a posteriori estimation of spectral gain with harmonic-structure-based phase reconstruction for phase-aware speech enhancement.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Blinkies: Sound-to-light conversion sensors and their application to speech enhancement and sound source localization.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Independent Low-Rank Matrix Analysis Based on Time-Variant Sub-Gaussian Source Model.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
2017
Closed-Form and Near Closed-Form Solutions for TDOA-Based Joint Source and Sensor Localization.
IEEE Trans. Signal Process., 2017
IEEE Trans. Multim., 2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Spatial Cepstrum as a Spatial Feature Using a Distributed Microphone Array for Acoustic Scene Analysis.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Low Latency and High Quality Two-Stage Human-Voice-Enhancement System for a Hose-Shaped Rescue Robot.
J. Robotics Mechatronics, 2017
Reversible Audio Information Hiding for Tampering Detection and Localization Using Sample Scanning Method.
J. Inf. Process., 2017
Acoustic scene classification using asynchronous multichannel observations with different lengths.
Proceedings of the 19th IEEE International Workshop on Multimedia Signal Processing, 2017
Independent low-rank matrix analysis based on complex student's t-distribution for blind audio source separation.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017
Low-latency real-time blind source separation for hearing aids based on time-domain implementation of online independent vector analysis with truncation of non-causal components.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Blind source separation based on independent low-rank matrix analysis with sparse regularization for time-series activity.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Ego Noise Reduction for Hose-Shaped Rescue Robot Combining Independent Low-Rank Matrix Analysis and Multichannel Noise Cancellation.
Proceedings of the Latent Variable Analysis and Signal Separation, 2017
Proceedings of the Latent Variable Analysis and Signal Separation, 2017
Performance evaluation of nonlinear speech enhancement based on virtual increase of channels in reverberant environments.
Proceedings of the 25th European Signal Processing Conference, 2017
Multiple far noise suppression in a real environment using transfer-function-gain NMF.
Proceedings of the 25th European Signal Processing Conference, 2017
Refinement of time-difference-of-arrival measurements via rank properties in two-dimensional space.
Proceedings of the 25th European Signal Processing Conference, 2017
Experimental analysis of optimal window length for independent low-rank matrix analysis.
Proceedings of the 25th European Signal Processing Conference, 2017
Acoustic scene classification based on generative model of acoustic spatial words for distributed microphone array.
Proceedings of the 25th European Signal Processing Conference, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Performance evaluation of acoustic scene classification using DNN-GMM and frame-concatenated acoustic features.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Online sound structure analysis based on generative model of acoustic feature sequences.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
2016
Closed-Form and Near Closed-Form Solutions for TOA-Based Joint Source and Sensor Localization.
IEEE Trans. Signal Process., 2016
Determined Blind Source Separation Unifying Independent Vector Analysis and Nonnegative Matrix Factorization.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
A Real-time Audio-to-audio Karaoke Generation System for Monaural Recordings Based on Singing Voice Suppression and Key Conversion Techniques.
J. Inf. Process., 2016
Nonlinear speech enhancement by virtual increase of channels and maximum SNR beamformer.
EURASIP J. Adv. Signal Process., 2016
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016
Voice Liveness Detection for Speaker Verification based on a Tandem Single/Double-channel Pop Noise Detector.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016
Ego-noise reduction for a hose-shaped rescue robot using determined rank-1 multichannel nonnegative matrix factorization.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016
Robust TDOA-based joint source and microphone localization in a reverberant environment using medians of acceptable recovered TOAs.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016
Discriminative and reconstructive basis training for audio source separation with semi-supervised nonnegative matrix factorization.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016
Efficient initialization for nonnegative matrix factorization based on nonnegative independent component analysis.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016
A computationally cheaper method for blind speech separation based on AuxIVA and incomplete demixing transform.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016
Noise reduction using independent vector analysis and noise cancellation for a hose-shaped rescue robot.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016
Multi-Talker Speech Recognition Based on Blind Source Separation with ad hoc Microphone Array Using Smartphones and Cloud Storage.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Automatic Discrimination of Soft Voice Onset Using Acoustic Features of Breathy Voicing.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Experimental validation of TOA-based methods for microphones array positions calibration.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Music signal separation using supervised NMF with all-pole-model-based discriminative basis deformation.
Proceedings of the 24th European Signal Processing Conference, 2016
Closed-form solution for TDOA-based joint source and sensor localization in two-dimensional space.
Proceedings of the 24th European Signal Processing Conference, 2016
Proceedings of the 24th European Signal Processing Conference, 2016
Proceedings of the 24th European Signal Processing Conference, 2016
Self-localization and channel synchronization of smartphone arrays using sound emissions.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
Unsupervised detection of non-stationary segments based on single-basis non-negative matrix factorization for effective annotation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
2015
Blind Suppression of Nonstationary Diffuse Acoustic Noise Based on Spatial Covariance Matrix Decomposition.
J. Signal Process. Syst., 2015
Blind compensation of interchannel sampling frequency mismatch for ad hoc microphone array based on maximum likelihood estimation.
Signal Process., 2015
Signal Process., 2015
Autoregressive Hidden Semi-Markov Model of Symbolic Music Performance for Score Following.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015
Voice liveness detection algorithms based on pop noise caused by human breath for automatic speaker verification.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Reference-distance estimation approach for TDOA-based source and sensor localization.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Efficient multichannel nonnegative matrix factorization exploiting rank-1 spatial model.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Acoustic scene analysis from acoustic event sequence with intermittent missing event.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the Latent Variable Analysis and Signal Separation, 2015
Estimating Correlation Coefficient Between Two Complex Signals Without Phase Observation.
Proceedings of the Latent Variable Analysis and Signal Separation, 2015
Proceedings of the 23rd European Signal Processing Conference, 2015
Proceedings of the 23rd European Signal Processing Conference, 2015
Diffuse noise suppression with asynchronous microphone array based on amplitude additivity model.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
2014
Singing Voice Enhancement in Monaural Music Signals Based on Two-stage Harmonic/Percussive Sound Separation on Multiple Resolution Spectrograms.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Harmonic/percussive sound separation based on anisotropic smoothness of spectrograms.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
CoRR, 2014
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014
Amplitude-based speech enhancement with nonnegative matrix factorization for asynchronous distributed recording.
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014
Robust Audio Information Hiding Based on Stereo Phase Difference in Time-Frequency Domain.
Proceedings of the 2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2014
Merged-Output Hidden Markov Model for Score Following of MIDI Performance with Ornaments, Desynchronized Voices, Repeats and Skips.
Proceedings of the Music Technology meets Philosophy, 2014
An auxiliary-function approach to online independent vector analysis for real-time blind source separation.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014
On microphone arrangement for multichannel speech enhancement based on nonnegative matrix factorization in time-channel domain.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
Design of FPGA-based rapid prototype spectral subtraction for hands-free speech applications.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
2013
Bayesian Nonparametric Approach to Blind Separation of Infinitely Many Sparse Sources.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2013
Optimizing frame analysis with non-integrer shift for sampling mismatch compensation of long recording.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013
General algorithms for estimating spectrogram and transfer functions of target signal for blind suppression of diffuse noise.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013
Reversible Audio Information Hiding Based on Integer DCT Coefficients with Adaptive Hiding Locations.
Proceedings of the Digital-Forensics and Watermarking - 12th International Workshop, 2013
Blind compensation of inter-channel sampling frequency mismatch with maximum likelihood estimation in STFT domain.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the 21st European Signal Processing Conference, 2013
Virtually increasing microphone array elements by interpolation in complex-logarithmic domain.
Proceedings of the 21st European Signal Processing Conference, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
2012
Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, Proceedings, RWTH Aachen University, Germany, September 4th, 2012
Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, Proceedings, RWTH Aachen University, Germany, September 4th, 2012
Proceedings of the International Symposium on Communications and Information Technologies, 2012
Comparative evaluations of various harmonic/percussive sound separation algorithms based on anisotropic continuity of spectrogram.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
A tandem connectionist model using combination of multi-scale spectro-temporal features for acoustic event detection.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Auxiliary-function-based independent vector analysis with power of vector-norm type weighting functions.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
2011
Beyond Timbral Statistics: Improving Music Classification Using Percussive Patterns and Bass Lines.
IEEE ACM Trans. Audio Speech Lang. Process., 2011
IEEE Trans. Speech Audio Process., 2011
Computational auditory induction as a missing-data model-fitting problem with Bregman divergence.
Speech Commun., 2011
Polyphonic Pitch Estimation and Instrument Identification by Joint Modeling of Sustained and Attack Sounds.
IEEE J. Sel. Top. Signal Process., 2011
Stable and fast update rules for independent vector analysis based on auxiliary function technique.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011
Bayesian nonparametric spectrogram modeling based on infinite factorial infinite hidden Markov model.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011
Using Spectral Fluctuation of Speech in Multi-Feature HMM-Based Voice Activity Detection.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Concurrent Optimization of Context Clustering and GMM for Offline Handwritten Word Recognition Using HMM.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Automatic video annotation via Hierarchical Topic Trajectory Model considering cross-modal correlations.
Proceedings of the IEEE International Conference on Acoustics, 2011
Multichannel harmonic and percussive component separation by joint modeling of spatial and spectral continuity.
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
Proceedings of the Advances in Music Information Retrieval, 2010
Speech Spectrum Modeling for Joint Estimation of Spectral Envelope and Fundamental Frequency.
IEEE Trans. Speech Audio Process., 2010
Semantic Indexing and Known Item Search Based on a Unified Model with Topic Transition Representation.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010
Proceedings of the Entertainment Computing - ICEC 2010, 9th International Conference, 2010
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
Autoregressive MFCC Models for Genre Classification Improved by Harmonic-percussion Separation.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
Monophonic Instrument Sound Segregation by Clustering NMF Components Based on Basis Similarity and Gain Disjointness.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Melody line estimation in homophonic music audio signals based on temporal-variability of melodic source.
Proceedings of the IEEE International Conference on Acoustics, 2010
R-means localization: A simple iterative algorithm for range-difference-based source localization.
Proceedings of the IEEE International Conference on Acoustics, 2010
A sparse component model of source signals and its application to blind source separation.
Proceedings of the IEEE International Conference on Acoustics, 2010
Designing the Wiener post-filter for diffuse noise suppression using imaginary parts of inter-channel cross-spectra.
Proceedings of the IEEE International Conference on Acoustics, 2010
Consistent Wiener Filtering: Generalized Time-Frequency Masking Respecting Spectrogram Consistency.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010
Proceedings of the Latent Variable Analysis and Signal Separation, 2010
Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010
Crystal-MUSIC: Accurate Localization of Multiple Sources in Diffuse Noise Environments Using Crystal-Shaped Microphone Arrays.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010
Proceedings of the Latent Variable Analysis and Signal Separation, 2010
2009
Note detection with dynamic bayesian networks as a postanalysis step for NMF-based multiple pitch estimation techniques.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009
Musical Bass-Line Pattern Clustering and Its Application to Audio Genre Classification.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009
Stereo-input speech recognition using sparseness-based time-frequency masking in a reverberant environment.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Audio genre classification using percussive pattern clustering combined with timbral features.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009
Rhythm map: Extraction of unit rhythmic patterns and analysis of rhythmic structure from music acoustic signals.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Extending Nonnegative Matrix Factorization - A discussion in the context of multiple frequency estimation of musical signals.
Proceedings of the 17th European Signal Processing Conference, 2009
2008
Sound Source Localization with Front-Back Judgement by Two Microphones Asymmetrically Mounted on a Sphere.
J. Multim., 2008
Proceedings of the ISMIR 2008, 2008
Explicit consistency constraints for STFT spectrograms and their application to phase reconstruction.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Harmonic-Temporal-Timbral Clustering (HTTC) for the analysis of multi-instrument polyphonic music signals.
Proceedings of the IEEE International Conference on Acoustics, 2008
Auxiliary function approach to parameter estimation of constrained sinusoidal model for monaural speech separation.
Proceedings of the IEEE International Conference on Acoustics, 2008
A blind noise decorrelation approach with crystal arrays on designing post-filters for diffuse noise suppression.
Proceedings of the IEEE International Conference on Acoustics, 2008
Separation of a monaural audio signal into harmonic/percussive components by complementary diffusion on spectrogram.
Proceedings of the 2008 16th European Signal Processing Conference, 2008
2007
Single and Multiple F0 Contour Estimation Through Parametric Spectrogram Modeling of Speech in Noisy Environments.
IEEE Trans. Speech Audio Process., 2007
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007
Proceedings of the 8th International Conference on Music Information Retrieval, 2007
Harmonic-Temporal Clustering of Speech for Single and Multiple F0 Contour Estimation in Noisy Environments.
Proceedings of the IEEE International Conference on Acoustics, 2007
2006
Speech analyzer using a joint estimation model of spectral envelope and fine structure.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
1999
Proceedings of the Third IEEE Workshop on Multimedia Signal Processing, 1999