Yuki Saito

CoRR, 2024

Cross-Dialect Text-To-Speech in Pitch-Accent Language Incorporating Multi-Dialect Phoneme-Level BERT.

Kazuki Yamauchi

CoRR, 2024

A Fashion Item Recommendation Model in Hyperbolic Space.

CoRR, 2024

J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling.

CoRR, 2024

Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals.

CoRR, 2024

Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment.

CoRR, 2024

SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark.

CoRR, 2024

Explaining Black-box Model Predictions via Two-level Nested Feature Attributions with Consistency Property.

CoRR, 2024

On permutation-invariant neural networks.

CoRR, 2024

UTDUSS: UTokyo-SaruLab System for Interspeech2024 Speech Processing Using Discrete Speech Unit Challenge.

CoRR, 2024

Building speech corpus with diverse voice characteristics for its prompt-based representation.

CoRR, 2024

Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-to-Speech.

Dong Yang

Tomoki Koriyama

CoRR, 2024

Deep learned features selection algorithm: Removal operation of anomaly feature maps (RO-AFM).

Appl. Soft Comput., 2024

JVNV: A Corpus of Japanese Emotional Speech With Verbal Content and Nonverbal Expressions.

IEEE Access, 2024

Low Complexity CSI Feedback Method Using Reformer.

Mondher Bouazizi

Tomoaki Ohtsuki

Proceedings of the 100th IEEE Vehicular Technology Conference, 2024

An Empirical Analysis of GPT-4V's Performance on Fashion Aesthetic Evaluation.

Proceedings of the SIGGRAPH Asia 2024 Technical Communications, 2024

96-Core MPO-APC Connector using 4-core fiber with SMF Standard Insertion Loss Grade.

Proceedings of the Optical Fiber Communications Conference and Exhibition, 2024

Analysis of Grinding Motion using Force/Tactile Sensation.

Proceedings of the 33rd IEEE International Symposium on Industrial Electronics, 2024

StyleCap: Automatic Speaking-Style Captioning from Speech Based on Speech and Language Self-Supervised Learning Models.

Kazuki Yamauchi

Yusuke Ijima

Proceedings of the IEEE International Conference on Acoustics, 2024

An Analysis of Knowledge Representation for Anime Recommendation Using Graph Neural Networks.

Proceedings of the 16th International Conference on Agents and Artificial Intelligence, 2024

2023

JVNV: A Corpus of Japanese Emotional Speech with Verbal Content and Nonverbal Expressions.

Dataset, October, 2023

Multicore fiber interconnects for multi-terabit spine-leaf datacenter network topologies.

J. Opt. Commun. Netw., July, 2023

GUI System to Support Cardiology Examination Based on Explainable Regression CNN for Estimating Pulmonary Artery Wedge Pressure.

IEICE Trans. Inf. Syst., March, 2023

Evaluation of Lower-Limb Kinematics during Timed Up and Go (TUG) Test in Subjects with Locomotive Syndrome (LS) Using Wearable Gait Sensors (H-Gait System).

Sensors, January, 2023

Fashion intelligence system: An outfit interpretation utilizing images and rich abstract tags.

Expert Syst. Appl., 2023

Outfit Completion via Conditional Set Transformation.

Takuma Nakamura

Ryosuke Goto

CoRR, 2023

Virtual Human Generative Model: Masked Modeling Approach for Learning Human Characteristics.

Kenta Oono

Nontawat Charoenphakdee

CoRR, 2023

Monocular Depth Estimation for Tilted Images via Gravity Rectifier.

Hideo Saito

Vincent Frémont

Proceedings of the 18th International Joint Conference on Computer Vision, 2023

Federated Learning for Human-in-the-Loop Many-to-Many Voice Conversion.

Ryunosuke Hirai

Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

Verification of Anode Position and Generated Force Vector of EHD at Wire-cylinder Electrode.

Proceedings of the 32nd IEEE International Symposium on Industrial Electronics, 2023

HumanDiffusion: diffusion model using perceptual gradients.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Machine Learning-Based Performance Improvement of Bilateral Teleoperation with Hydraulic Actuator.

Proceedings of the IEEE International Conference on Mechatronics, 2023

Duration-Aware Pause Insertion Using Pre-Trained Language Model for Multi-Speaker Text-To-Speech.

Proceedings of the IEEE International Conference on Acoustics, 2023

Mid-Attribute Speaker Generation Using Optimal-Transport-Based Interpolation of Gaussian Mixture Models.

Proceedings of the IEEE International Conference on Acoustics, 2023

SHIFT15M: Fashion-specific dataset for set-to-set matching with several distribution shifts.

Masanari Kimura

Takuma Nakamura

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Coco-Nut: Corpus of Japanese Utterance and Voice Characteristics Description for Prompt-Based Control.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

Multi-Task Adversarial Training Algorithm for Multi-Speaker Neural Text-to-Speech.

CoRR, 2022

Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS.

Kenta Udagawa

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Predicting VQVAE-based Character Acting Style from Quotation-Annotated Text for Audiobook Speech Synthesis.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Validation of a Property Estimation Method Based on Sequential and Posteriori Estimation.

Proceedings of the IECON 2022, 2022

2021

Perceptual-Similarity-Aware Deep Speaker Representation Learning for Multi-Speaker Generative Modeling.

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Analysis of 3-D Kinematics Using H-Gait System during Walking on a Lower Body Positive Pressure Treadmill.

Sensors, 2021

Real-Time Full-Band Voice Conversion with Sub-Band Modeling and Data-Driven Phase Estimation of Spectral Differentials.

IEICE Trans. Inf. Syst., 2021

DNN-Based Low-Musical-Noise Single-Channel Speech Enhancement Based on Higher-Order-Moments Matching.

IEICE Trans. Inf. Syst., 2021

SHIFT15M: Multiobjective Large-Scale Fashion Dataset with Distributional Shifts.

Masanari Kimura

Takuma Nakamura

CoRR, 2021

Camera Selection for Occlusion-Less Surgery Recording via Training With an Egocentric Camera.

IEEE Access, 2021

Cross-Lingual Speaker Adaptation Using Domain Adaptation and Speaker Consistency Loss for Text-To-Speech Synthesis.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

HumanACGAN: Conditional Generative Adversarial Network with Human-Based Auxiliary Classifier and its Evaluation in Phoneme Perception.

Proceedings of the IEEE International Conference on Acoustics, 2021

Emotion-Controllable Speech Synthesis Using Emotion Soft Labels and Fine-Grained Prosody Factors.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Performance Improvement of Bilateral Teleoperation with Hydraulic Actuator by Friction Compensation.

Proceedings of the 17th IEEE International Conference on Advanced Motion Control, 2021

Motion Generation Based on Physical Property Estimation in Motion Copy System.

Proceedings of the 17th IEEE International Conference on Advanced Motion Control, 2021

Experimental Verification of a Novel Continuously Variable Transmission with Electro-Hydrostatic Actuator.

Proceedings of the 17th IEEE International Conference on Advanced Motion Control, 2021

2020

Phase reconstruction from amplitude spectrograms based on directional-statistics deep neural networks.

Signal Process., 2020

Development of Scanning Line Tool Path Generation Algorithm Using Boundary Position Information of Approximate Polyhedron of Complex Molds.

Int. J. Autom. Technol., 2020

Joint Adversarial Training of Speech Recognition and Synthesis Models for Many-to-One Voice Conversion Using Phonetic Posteriorgrams.

Kei Akuzawa

Kentaro Tachibana

IEICE Trans. Inf. Syst., 2020

Generative Moment Matching Network-Based Neural Double-Tracking for Synthesized and Natural Singing Voices.

IEICE Trans. Inf. Syst., 2020

DNN-based Speech Synthesis Using Abundant Tags of Spontaneous Speech Corpus.

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

SMASH Corpus: A Spontaneous Speech Corpus Recording Third-person Audio Commentaries on Gameplay.

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Investigating Effective Additional Contextual Factors in DNN-Based Spontaneous Speech Synthesis.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Cross-Lingual Text-To-Speech Synthesis via Domain Adaptation and Perceptual Similarity Regression in Speaker Space.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Real-Time, Full-Band, Online DNN-Based Voice Conversion System Using a Single CPU.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Face2Speech: Towards Multi-Speaker Text-to-Speech Synthesis Using an Embedding Vector Predicted from a Face Image.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Lifter Training and Sub-Band Modeling for Computationally Efficient and High-Quality Voice Conversion Using Spectral Differentials.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

HumanGAN: Generative Adversarial Network with Human-Based Discriminator and Its Evaluation in Speech Perception Modeling.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

In-Plane Rotation-Aware Monocular Depth Estimation Using SLAM.

Proceedings of the Frontiers of Computer Vision - 26th International Workshop, 2020

Exchangeable Deep Neural Networks for Set-to-Set Matching and Learning.

Proceedings of the Computer Vision - ECCV 2020, 2020

2019

Simulation of Reflectance and Vegetation Indices for Unmanned Aerial Vehicle (UAV) Monitoring of Paddy Fields.

Remote. Sens., 2019

Vocoder-free text-to-speech synthesis incorporating generative adversarial networks using low-/multi-frequency STFT amplitude spectra.

Comput. Speech Lang., 2019

Deep Set-to-Set Matching and Learning.

CoRR, 2019

JVS corpus: free Japanese multi-speaker voice corpus.

CoRR, 2019

DNN-based Speaker Embedding Using Subjective Inter-speaker Similarity for Multi-speaker Modeling in Speech Synthesis.

Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

V2S attack: building DNN-based voice conversion from automatic speaker verification.

Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Bandwidth Expansion of Bilateral Teleoperation Based on Synergy of Observer Gain and Velocity Feedback Gain.

Proceedings of the IECON 2019, 2019

A Controller Design Method of Bilateral Teleoperation for Velocity Control Driver.

Proceedings of the IECON 2019, 2019

Symmetric Operational Force Compensator for Bilateral Teleoperation Under Time Delay Based on Power Flow Direction.

Proceedings of the IEEE International Conference on Mechatronics, 2019

Generative Moment Matching Network-based Random Modulation Post-filter for DNN-based Singing Voice Synthesis and Neural Double-tracking.

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Artificial Replacement of Human Sensation Using Haptic Transplant Technology.

IEEE Trans. Ind. Electron., 2018

Statistical Parametric Speech Synthesis Incorporating Generative Adversarial Networks.

IEEE ACM Trans. Audio Speech Lang. Process., 2018

2.5D Faster R-CNN for Distance Estimation.

Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2018

Effects of EEG Electrode Positional Deviations for Classification Accuracy on Different Days.

Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2018

Physical-contact 256-core MPO Connector with Flat Polished Multi-core Fibers.

Proceedings of the Optical Fiber Communications Conference and Exposition, 2018

Phase Reconstruction from Amplitude Spectrograms Based on Von-Mises-Distribution Deep Neural Network.

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Text-to-Speech Synthesis Using STFT Spectra Based on Low-/Multi-Resolution Generative Adversarial Networks.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Non-Parallel Voice Conversion Using Variational Autoencoders Conditioned by Phonetic Posteriorgrams and D-Vectors.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Accurate Passive Rotational Alignment of Multi-Core Fibre with Double-D-Shape Cladding on V Groove.

Proceedings of the European Conference on Optical Communication, 2018

Generative approach using the noise generation models for DNN-based speech synthesis trained from noisy speech.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017

Voice Conversion Using Input-to-Output Highway Networks.

IEICE Trans. Inf. Syst., 2017

Online compensation of gravity and friction for haptics with incremental position sensors.

Proceedings of the 24th International Conference on Mechatronics and Machine Vision in Practice, 2017

Voice Conversion Using Sequence-to-Sequence Learning of Context Posterior Probabilities.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Motion-reproduction system adaptable to position fluctuation of picking objects based on image information.

Proceedings of the IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society, Beijing, China, October 29, 2017

Wearable finger exoskeleton using flexible actuator for rehabilitation.

Proceedings of the IEEE International Conference on Mechatronics, 2017

Training algorithm to deceive Anti-Spoofing Verification for DNN-based speech synthesis.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2015

Conversion of Speaker's Face Image Using PCA and Animation Unit for Video Chatting.

Proceedings of the 2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2015

2014

Statistically significant subgraphs for genome-wide association study.

Proceedings of the 1st ECML/PKDD Workshop on Statistically Sound Data Mining, 2014

Extraction and realization of human actions.

Proceedings of the IEEE 13th International Workshop on Advanced Motion Control, 2014

2013

Recognition of Grasping Motion Based on Modal Space Haptic Information Using DP Pattern-Matching Algorithm.

IEEE Trans. Ind. Informatics, 2013

Development of an atomic force microscope for measuring mechanical properties of cell population.

Proceedings of the International Symposium on Micro-NanoMechatronics and Human Science, 2013

Acceleration-based position and force control for twist drive.

Proceedings of the IEEE International Conference on Mechatronics, 2013

Variable tension control for master-slave tendon-driven robot hand.

Proceedings of the IEEE International Conference on Mechatronics, 2013

Detection and Tracking Protein Molecules in Fluorescence Microscopic Video.

Proceedings of the First International Symposium on Computing and Networking, 2013

Stability analysis of time-delay systems based on a power of the monodromy operator.

Tomomichi Hagiwara

Proceedings of the 12th European Control Conference, 2013

Widely linear LQCMV beamformer and augmented dual-domain adaptive algorithm.

Masahiro Yukawa

Proceedings of the 9th International Conference on Information, 2013

2012

Reduction of Patient Dose in Digital Mammography: Simulation of Low-Dose Image from a Routine Dose.

Proceedings of the Breast Imaging, 2012

Model-based compensation of wire elongation for tendon-driven rotary actuator.

Takahiro Nozaki

Kouhei Ohnishi

Proceedings of the 12th IEEE International Workshop on Advanced Motion Control, 2012

2010

Robust interference management to satisfy allowable outage probability using minority game.

Proceedings of the IEEE 21st International Symposium on Personal, 2010

2009

Empirical Mode Decomposition Method for MEG Phantom Data Analysis.

J. Circuits Syst. Comput., 2009

Adaptive rhythmic component extraction with regularization for EEG data analysis.

Toshihisa Tanaka

Hiroshi Higashi

Proceedings of the IEEE International Conference on Acoustics, 2009

Joint Dynamics of Spectrum Allocation and User Behavior in Spectrum Markets.

Proceedings of the Global Communications Conference, 2009. GLOBECOM 2009, Honolulu, Hawaii, USA, 30 November, 2009

2008

Rhythmic component extraction for multi-channel EEG data analysis.

Toshihisa Tanaka