Masanori Morise

According to our database, Masanori Morise authored at least 76 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.





Neural electric bass guitar synthesis framework enabling attack-sustain-representation-based technique control.
EURASIP J. Audio Speech Music. Process., December, 2024

Interactive tools for making temporally variable, multiple-attributes, and multiple-instances morphing accessible: Flexible manipulation of divergent speech instances for explorational research and education.
CoRR, 2024

Measuring pitch extractors' response to frequency-modulated multi-component signals.
CoRR, 2022

Conformer-Based Lip-Reading for Japanese Sentence.
Proceedings of the Image and Vision Computing - 37th International Conference, 2022

An objective test tool for pitch extractors' response attributes.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Building a Measurement Model for Simulating Naturalness of Vibrato Based on Subjective Evaluation.
IEICE Trans. Inf. Syst., 2021

A Real-Time Drum-Wise Volume Visualization System for Learning Volume-Balanced Drum Performance.
Proceedings of the Entertainment Computing - ICEC 2021, 2021

Interactive and Real-Time Acoustic Measurement Tools for Speech Data Acquisition and Presentation: Application of an Extended Member of Time Stretched Pulses.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Mixture of Orthogonal Sequences Made from Extended Time-Stretched Pulses Enables Measurement of Involuntary Voice Fundamental Frequency Response to Pitch Perturbation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Implementation of Interactive Tools for Investigating Fundamental Frequency Response of Voiced Sounds to Auditory Stimulation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

DNN-Based Full-Band Speech Synthesis Using GMM Approximation of Spectral Envelope.
IEICE Trans. Inf. Syst., 2020

Voice Conversion for Improving Perceived Likability of Uttered Speech.
IEICE Trans. Inf. Syst., 2020

Noise Reduction Using a Plane Microphone Array and a Spatiotemporal Sound Pressure Distribution Image.
Proceedings of the IEEE International Conference on Consumer Electronics - Taiwan, 2020

Implementation of sequential real-time waveform generator for high-quality vocoder.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

PJS: phoneme-balanced Japanese singing-voice corpus.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Simultaneous measurement of time-invariant linear and nonlinear, and random and extra responses using frequency domain variant of velvet noise.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Modification of Velvet Noise for Speech Waveform Generation by Using Vocoder-Based Speech Synthesizer.
IEICE Trans. Inf. Syst., 2019

Sound Source Separation by Spectral Subtraction Based on Instantaneous Estimation of Noise Spectrum.
Proceedings of the 6th International Conference on Systems and Informatics, 2019

High-quality waveform generator from fundamental frequency, spectral envelope, and band aperiodicity.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Efficient quantization of vocoded speech parameters without degradation.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Human-in-the-loop speech-design system and its evaluation.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Frequency domain variant of Velvet noise and its application to acoustic measurements.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Frequency domain variants of velvet noise and their application to speech processing and synthesis: with appendices.
CoRR, 2018

Frequency Domain Variants of Velvet Noise and Their Application to Speech Processing and Synthesis.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Sound Source Separation by Instantaneous Estimation-Based Spectral Subtraction.
Proceedings of the 5th International Conference on Systems and Informatics, 2018

Separation of Two Sound Sources in the Same Direction by Image Signal Processing.
Proceedings of the IEEE 7th Global Conference on Consumer Electronics, 2018

Revisiting spectral envelope recovery from speech sounds generated by periodic excitation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Expanded Estimation Model for Instantaneous Presence in Audio-visual Content Incorporating Binaural Information.
J. Inf. Hiding Multim. Signal Process., 2017

A modulation property of time-frequency derivatives of filtered phase and its application to aperiodicity and fo estimation.
CoRR, 2017

Relationship Between Perception of Cuteness in Female Voices and Their Durations.
Proceedings of the Speech and Computer - 19th International Conference, 2017

Low-Dimensional Representation of Spectral Envelope Without Deterioration for Full-Band Speech Analysis/Synthesis System.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Harvest: A High-Performance Fundamental Frequency Estimator from Speech Signals.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

A New Cosine Series Antialiasing Function and its Application to Aliasing-Free Glottal Source Models for Speech and Singing Synthesis.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

A Modulation Property of Time-Frequency Derivatives of Filtered Phase and its Application to Aperiodicity and fo Estimation.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Broadbanding of a NN-based microphone-array system by decomposing into frequency components.
Proceedings of the IEEE 6th Global Conference on Consumer Electronics, 2017

Accurate estimation of f0 and aperiodicity based on periodicity detector residuals and deviations of phase derivatives.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

D4C, a band-aperiodicity estimator for high-quality speech synthesis.
Speech Commun., 2016

Development of an Estimation Model for Instantaneous Presence in Audio-Visual Content.
IEICE Trans. Inf. Syst., 2016

WORLD: A Vocoder-Based High-Quality Speech Synthesis System for Real-Time Applications.
IEICE Trans. Inf. Syst., 2016

TUSK: A Framework for Overviewing the Performance of F0 Estimators.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Superdirective microphone array based on DOA and waveform estimations of noise.
Proceedings of the IEEE 5th Global Conference on Consumer Electronics, 2016

CheapTrick, a spectral envelope estimator for high-quality speech synthesis.
Speech Commun., 2015

Instantaneous Evaluation of the Sense of Presence in Audio-Visual Content.
IEICE Trans. Inf. Syst., 2015

Error Evaluation of an F0-Adaptive Spectral Envelope Estimator in Robustness against the Additive Noise and F0 Error.
IEICE Trans. Inf. Syst., 2015

Aliasing-free implementation of discrete-time glottal source models and their applications to speech synthesis and F0 extractor evaluation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Muffled and Brisk Speech Evaluation with Criterion Based on Temporal Differentiation of Vocal Tract Area Function.
IEICE Trans. Inf. Syst., 2014

Excitation source analysis for high-quality speech manipulation systems based on an interference-free representation of group delay with minimum phase response compensation.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Development of an estimation model for instantaneous presence in audio content.
Proceedings of the IEEE Fourth International Conference on Consumer Electronics Berlin, 2014

Excitation source design for high-quality speech manipulation systems based on a temporally static group delay representation of periodic signals.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

YLAB@RU at Spoken Term Detection Task in NTCIR-10 SpokenDoc-2.
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

Controlling "shout" expression in a Japanese POP singing performance: analysis and suppression study.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Periodicity extraction for voiced sounds with multiple periodicity.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Beyond bandlimited sampling of speech spectral envelope imposed by the harmonic structure of voiced sounds.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Higher order waveform symmetry measure and its application to periodicity detectors for speech and singing with fine temporal resolution.
Proceedings of the IEEE International Conference on Acoustics, 2013

Temporally variable multi-aspect N-way morphing based on interference-free speech representations.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Incorporating dynamic features into minimum generation error training for HMM-based speech synthesis.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Deviation measure of waveform symmetry and its application to high-speed and temporally-fine F0 extraction for vocal sound texture manipulation.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Analysis and synthesis of strong vocal expressions: Extension and application of audio texture features to singing voice.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

An interference-free representation of group delay for periodic signals.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

An interference-free representation of instantaneous frequency of periodic signals and its application to F0 extraction.
Proceedings of the IEEE International Conference on Acoustics, 2011

Vocal Manipulation Based on Pitch Transcription and Its Application to Interactive Entertainment for Karaoke.
Proceedings of the Haptic and Audio Interaction Design - 6th International Workshop, 2011

Simplification and extension of non-periodic excitation source representations for high-quality speech manipulation systems.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Near field sound source localization based on cross-power spectrum phase analysis with multiple microphones.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Performance estimation of reverberant speech recognition based on reverberant criteria RSR-dn with acoustic parameters.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

High-quality and light-weight voice transformation enabling extrapolation without perceptual and objective breakdown.
Proceedings of the IEEE International Conference on Acoustics, 2010

A bottom-up procedure to extract periodicity structure of voiced sounds and its application to represent and restoration of pathological voices.
Proceedings of the Sixth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2009

v.morish'09: A Morphing-Based Singing Design Interface for Vocal Melodies.
Proceedings of the Entertainment Computing, 2009

Observation of empirical cumulative distribution of vowel spectral distances and its application to vowel based voice conversion.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Temporally variable multi-aspect auditory morphing enabling extrapolation without objective and perceptual breakdown.
Proceedings of the IEEE International Conference on Acoustics, 2009

Spectral envelope recovery beyond the nyquist limit for high-quality manipulation of speech sounds.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Tandem-STRAIGHT: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, F0, and aperiodicity estimation.
Proceedings of the IEEE International Conference on Acoustics, 2008

Group delay for acoustic event representation and its application for speech aperiodicity analysis.
Proceedings of the 15th European Signal Processing Conference, 2007

Logarithmic temporal processing applied to accurate empirical transfer function measurements in vocal sound propagation.
Proceedings of the 14th European Signal Processing Conference, 2006

Acappella synthesis demonstrations using RWC music database.
Proceedings of the New Interfaces for Musical Expression, 2004

Procedure "senza vibrato": a key component for morphing singing.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Loudspeaker equalization based on multi-location observation with reliable time-frequency region selection and its evaluation using sound propagation measurement.
Proceedings of the 2004 12th European Signal Processing Conference, 2004
