Carlos Toshinori Ishi
Orcid: 0000-0001-8130-1048
According to our database, Carlos Toshinori Ishi authored at least 126 papers between 1999 and 2025.
Bibliography
2025
HiMul-LGG: A hierarchical decision fusion-based local-global graph neural network for multimodal emotion recognition in conversation.
Neural Networks, 2025
Facial action units guided graph representation learning for multimodal depression detection.
Neurocomputing, 2025
HAM-GNN: A hierarchical attention-based multi-dimensional edge graph neural network for dialogue act classification.
Expert Syst. Appl., 2025
RoboDJ: Live Commentary Robots System Driven by Physical- and Cyber-World Observations.
Proceedings of the MultiMedia Modeling, 2025
2024
Speech-Driven Gesture Generation Using Transformer-Based Denoising Diffusion Probabilistic Models.
IEEE Trans. Hum. Mach. Syst., December, 2024
Assessing the influence of an android robot's persuasive behaviors and context of violation on compliance.
Adv. Robotics, December, 2024
Is It Possible to Recognize a Speaker Without Listening? Unraveling Conversation Dynamics in Multi-Party Interactions Using Continuous Eye Gaze.
IEEE Robotics Autom. Lett., November, 2024
Gaze modeling in multi-party dialogues and extraversion expression through gaze aversion control.
Adv. Robotics, October, 2024
How an Android Expresses "Now Loading...": Examining the Properties of Thinking Faces.
Int. J. Soc. Robotics, August, 2024
Age and Spatial Cue Effects on User Performance for an Adaptable Verbal Wayfinding System.
Proceedings of the 33rd IEEE International Conference on Robot and Human Interactive Communication, 2024
Retargeting Human Facial Expression to Human-like Robotic Face through Neural Network Surrogate-based Optimization.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024
2023
An Adversarial Training Based Speech Emotion Classifier With Isolated Gaussian Regularization.
IEEE Trans. Affect. Comput., 2023
QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion.
CoRR, 2023
Proceedings of the IEEE/SICE International Symposium on System Integration, 2023
An attention-based sound selective hearing support system: evaluation by subjects with age-related hearing loss.
Proceedings of the IEEE/SICE International Symposium on System Integration, 2023
Recognizing Real-World Intentions using A Multimodal Deep Learning Approach with Spatial-Temporal Graph Convolutional Networks.
IROS, 2023
HAG: Hierarchical Attention with Graph Network for Dialogue Act Classification in Conversation.
Proceedings of the IEEE International Conference on Acoustics, 2023
I Know Your Feelings Before You Do: Predicting Future Affective Reactions in Human-Computer Dialogue.
Proceedings of the Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems, 2023
Using Joint Training Speaker Encoder With Consistency Loss to Achieve Cross-Lingual Voice Conversion and Expressive Voice Conversion.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
QUICKVC: A Lightweight VITS-Based Any-to-Many Voice Conversion Model using ISTFT for Faster Conversion.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
An improved CycleGAN-based emotional voice conversion model by augmenting temporal dependency with a transformer.
Speech Commun., 2022
Expression of Personality by Gaze Movements of an Android Robot in Multi-Party Dialogues.
Proceedings of the 31st IEEE International Conference on Robot and Human Interactive Communication, 2022
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022
Butsukusa: A Conversational Mobile Robot Describing Its Own Observations and Internal States.
Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction, 2022
Proceedings of the 10th International Conference on Affective Computing and Intelligent Interaction, ACII 2022, 2022
2021
Using an Android Robot to Improve Social Connectedness by Sharing Recent Experiences of Group Members in Human-Robot Conversations.
IEEE Robotics Autom. Lett., October, 2021
Skeleton-Based Emotion Recognition Based on Two-Stream Self-Attention Enhanced Spatial-Temporal Graph Convolutional Network.
Sensors, 2021
IEEE Robotics Autom. Lett., 2021
Advocating Attitudinal Change Through Android Robot's Intention-Based Expressive Behaviors: Toward WHO COVID-19 Guidelines Adherence.
IEEE Robotics Autom. Lett., 2021
CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer.
CoRR, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the ICMI '21 Companion: Companion Publication of the 2021 International Conference on Multimodal Interaction, Montreal, QC, Canada, October 18, 2021
MAEC: Multi-Instance Learning with an Adversarial Auto-Encoder-Based Classifier for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021
Analysis of Role-Based Gaze Behaviors and Gaze Aversions, and Implementation of Robot's Gaze Control for Multi-party Dialogue.
Proceedings of the HAI '21: International Conference on Human-Agent Interaction, Virtual Event, Japan, November 9, 2021
2020
Multi-Modality Emotion Recognition Model with GAT-Based Multi-Head Inter-Modality Attention.
Sensors, 2020
Person-Directed Pointing Gestures and Inter-Personal Relationship: Expression of Politeness to Friendliness by Android Robots.
IEEE Robotics Autom. Lett., 2020
Adv. Robotics, 2020
Analysis of sound activities and voice activity detection using in-car microphone arrays.
Proceedings of the 2020 IEEE/SICE International Symposium on System Integration, 2020
Proceedings of the IEEE 14th International Conference on Semantic Computing, 2020
Proceedings of the MuSe'20: Proceedings of the 1st International on Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop, 2020
Generation and Evaluation of Audio-Visual Anger Emotional Expression for Android Robot.
Proceedings of the Companion of the 2020 ACM/IEEE International Conference on Human-Robot Interaction, 2020
Proceedings of the 28th European Signal Processing Conference, 2020
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
2019
Probabilistic nod generation model based on speech and estimated utterance categories.
Adv. Robotics, 2019
Expressing reactive emotion based on multimodal emotion recognition for natural conversation in human-robot interaction.
Adv. Robotics, 2019
Prosodic and voice quality analyses of loud speech: differences of hot anger and far-directed speech.
Proceedings of the 2019 Workshop on Speech, Music and Mind, 2019
Analysis of factors influencing the impression of speaker individuality in android robots.
Proceedings of the 28th IEEE International Conference on Robot and Human Interactive Communication, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 19th IEEE-RAS International Conference on Humanoid Robots, 2019
2018
IEEE Robotics Autom. Lett., 2018
2017
Probabilistic 3-D Mapping of Sound-Emitting Structures Based on Acoustic Ray Casting.
IEEE Trans. Robotics, 2017
Motion Analysis in Vocalized Surprise Expressions and Motion Generation in Android Robots.
IEEE Robotics Autom. Lett., 2017
Frontiers Robotics AI, 2017
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017
Turn-Taking Estimation Model Based on Joint Embedding of Lexical and Prosodic Contents.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Emotion recognition by combining prosody and sentiment analysis for expressing reactive emotion by humanoid robot.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
2016
Proceedings of the 25th IEEE International Symposium on Robot and Human Interactive Communication, 2016
Proceedings of the 25th IEEE International Symposium on Robot and Human Interactive Communication, 2016
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016
2015
Online speech-driven head motion generating system and evaluation on a tele-operated robot.
Proceedings of the 24th IEEE International Symposium on Robot and Human Interactive Communication, 2015
Robot-assisted acoustic inspection of infrastructures - cooperative hammer sounding inspection.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015
Speech activity detection and face orientation estimation using multiple microphone arrays and human position information.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015
Bringing the Scene Back to the Tele-operator: Auditory Scene Manipulation for Tele-presence Systems.
Proceedings of the Tenth Annual ACM/IEEE International Conference on Human-Robot Interaction, 2015
2014
Analysis of relationship between head motion events and speech in dialogue conversations.
Speech Commun., 2014
Integration of Multiple Microphone Arrays and Use of Sound Reflections for 3D Localization of Sound Sources.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2014
Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014
Analysis of laughter events in real science classes by using multiple environment sensor data.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014
2013
Int. J. Humanoid Robotics, 2013
Comput. Speech Lang., 2013
Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013
Using multiple microphone arrays and reflections for 3D localization of sound sources.
Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013
Creation of radiated sound intensity maps using multi-modal measurements onboard an autonomous mobile platform.
Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013
Analysis of factors involved in the choice of rising or non-rising intonation in question utterances appearing in conversational speech.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013
2012
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Fusion of standard and alternative acoustic sensors for robust automatic speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Generation of nodding, head tilting and eye gazing for human-robot dialogue interaction.
Proceedings of the International Conference on Human-Robot Interaction, 2012
Proceedings of the 12th IEEE International Conference on Bioinformatics & Bioengineering, 2012
2011
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2011
The effects of microphone array processing on pitch extraction in real noisy environments.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011
Analysis of Acoustic-Prosodic Features Related to Paralinguistic Information Carried by Interjections in Dialogue Speech.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Range Based Multi Microphone Array Fusion for Speaker Activity Detection in Small Meetings.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Speech Production in Noisy Environments and the Effect on Automatic Speech Recognition.
Proceedings of the 17th International Congress of Phonetic Sciences, 2011
Proceedings of the Auditory-Visual Speech Processing, 2011
2010
Analysis of the Roles and the Dynamics of Breathy and Whispery Voice Qualities in Dialogue Speech.
EURASIP J. Audio Speech Music. Process., 2010
Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010
Close speaker cancellation for suppression of non-stationary background noise for hands-free speech interface.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 5th ACM/IEEE International Conference on Human Robot Interaction, 2010
Real-time audio-visual voice activity detection for speech recognition in noisy environments.
Proceedings of the Auditory-Visual Speech Processing, 2010
Investigating the role of the Lombard reflex in visual and audiovisual speech recognition.
Proceedings of the Auditory-Visual Speech Processing, 2010
2009
Evaluation of a MUSIC-based real-time sound localization of multiple sound sources in real noisy environments.
Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009
Proceedings of the Affective Computing and Intelligent Interaction, 2009
2008
IEEE Trans. Robotics, 2008
IEEE Trans. Speech Audio Process., 2008
Automatic extraction of paralinguistic information using prosodic features related to F.
Speech Commun., 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 3rd ACM/IEEE international conference on Human robot interaction, 2008
Analysis of inter- and intra-speaker variability of head motions during spoken dialogue.
Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008
2007
Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007
Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
2006
Evaluation of Prosodic and Voice Quality Features on Automatic Extraction of Paralinguistic Information.
Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006
Analysis of prosodic and linguistic cues of phrase finals for turn-taking and dialog acts.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the 2006 6th IEEE-RAS International Conference on Humanoid Robots, 2006
2005
Perceptually-Related F0 Parameters for Automatic Classification of Phrase Final Tones.
IEICE Trans. Inf. Syst., 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
2003
Mora F0 representation for accent type identification in continuous speech and considerations on its relation with perceived pitch values.
Speech Commun., 2003
Perceptually-related acoustic-prosodic features of phrase finals in spontaneous speech.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Identification of Japanese double-mora phonemes considering speaking rate for the use in CALL systems.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
1999
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999