Joakim Gustafson

Orcid: 0000-0002-0397-6442

According to our database1, Joakim Gustafson authored at least 124 papers between 1993 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Revisiting Three Text-to-Speech Synthesis Experiments with a Web-Based Audience Response System.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

The Role of Creaky Voice in Turn Taking and the Perception of Speaker Stance: Experiments Using Controllable TTS.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis.
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

Situating Speech Synthesis: Investigating Contextual Factors in the Evaluation of Conversational TTS.
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

Stuck in the MOS pit: A critical analysis of MOS test methodology in TTS evaluation.
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

Hi robot, it's not what you say, it's how you say it.
Proceedings of the 32nd IEEE International Conference on Robot and Human Interactive Communication, 2023

Generation of speech and facial animation with controllable articulatory effort for amusing conversational characters.
Proceedings of the 23rd ACM International Conference on Intelligent Virtual Agents, 2023

So-to-Speak: An Exploratory Platform for Investigating the Interplay between Style and Prosody in TTS.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Prosody-controllable Gender-ambiguous Speech Synthesis: A Tool for Investigating Implicit Bias in Speech Perception.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Beyond Style: Synthesizing Speech with Pragmatic Functions.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Pardon my disfluency: The impact of disfluency effects on the perception of speaker competence and confidence.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Automatic Evaluation of Turn-taking Cues in Conversational Speech Synthesis.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

A Comparative Study of Self-Supervised Speech Representations in Read and Spontaneous TTS.
Proceedings of the IEEE International Conference on Acoustics, 2023

Prosody-Controllable Spontaneous TTS with Neural HMMS.
Proceedings of the IEEE International Conference on Acoustics, 2023

Casual chatter or speaking up? Adjusting articulatory effort in generation of speech and animation for conversational characters.
Proceedings of the 17th IEEE International Conference on Automatic Face and Gesture Recognition, 2023

A Special Interest Group on Developing Theories of Language Use in Interaction with Conversational User Interfaces.
Proceedings of the Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022
Evaluating Sampling-based Filler Insertion with Spontaneous TTS.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Where's the uh, hesitation? The interplay between filled pause location, speech rate and fundamental frequency in perception of confidence.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Grounding behaviours with conversational interfaces: effects of embodiment and failures.
J. Multimodal User Interfaces, 2021

Towards an Engagement-Aware Attentive Artificial Listener for Multi-Party Interactions.
Frontiers Robotics AI, 2021

Multimodal Capture of Patient Behaviour for Improved Detection of Early Dementia: Clinical Feasibility and Preliminary Results.
Frontiers Comput. Sci., 2021

Perception of smiling voice in spontaneous speech synthesis.
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

Personality in the mix - investigating the contribution of fillers and speaking style to the perception of spontaneous speech synthesis.
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

Integrated Speech and Gesture Synthesis.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021

A Systematic Cross-Corpus Analysis of Human Reactions to Robot Conversational Failures.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021

2020
Augmented Prompt Selection for Evaluation of Spontaneous Speech Synthesis.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Chinese Whispers: A Multimodal Dataset for Embodied Language Grounding.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Breathing and Speech Planning in Spontaneous Speech Synthesis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Effects of Different Interaction Contexts when Evaluating Gaze Models in HRI.
Proceedings of the HRI '20: ACM/IEEE International Conference on Human-Robot Interaction, 2020

Behavioural Responses to Robot Conversational Failures.
Proceedings of the HRI '20: ACM/IEEE International Conference on Human-Robot Interaction, 2020

Embodiment Effects in Interactions with Failing Robots.
Proceedings of the CHI '20: CHI Conference on Human Factors in Computing Systems, 2020

2019
Speech Synthesis Evaluation - State-of-the-Art Assessment and Suggestion for a Novel Research Program.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

How to train your fillers: uh and um in spontaneous speech synthesis.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

PROMIS: a statistical-parametric speech synthesis system with prominence control via a prominence network.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

The Effects of Anthropomorphism and Non-verbal Social Behaviour in Virtual Assistants.
Proceedings of the 19th ACM International Conference on Intelligent Virtual Agents, 2019

Responsive Joint Attention in Human-Robot Interaction.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Spontaneous Conversational Speech Synthesis from Found Data.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Off the Cuff: Exploring Extemporaneous Speech Delivery with TTS.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Estimating Uncertainty in Task-Oriented Dialogue.
Proceedings of the International Conference on Multimodal Interaction, 2019

Casting to Corpus: Segmenting and Selecting Spontaneous Dialogue for Tts with a Cnn-lstm Speaker-dependent Breath Detector.
Proceedings of the IEEE International Conference on Acoustics, 2019

The Effects of Embodiment and Social Eye-Gaze in Conversational Agents.
Proceedings of the 41th Annual Meeting of the Cognitive Science Society, 2019

Multimodal conversational interaction with robots.
Proceedings of the Handbook of Multimodal-Multisensor Interfaces: Language Processing, Software, Commercialization, and Emerging Directions, 2019

2018
A Comparison of Visualisation Methods for Disambiguating Verbal Requests in Human-Robot Interaction.
Proceedings of the 27th IEEE International Symposium on Robot and Human Interactive Communication, 2018

A Multimodal Corpus for Mutual Gaze and Joint Attention in Multiparty Situated Interaction.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Crowdsourced Multimodal Corpora Collection Tool.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Interactive, Collaborative Robots: Challenges and Opportunities.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Multimodal Reference Resolution In Collaborative Assembly Tasks.
Proceedings of the 4th International Workshop on Multimodal Analyses Enabling Artificial Agents in Human-Machine Interaction, 2018

2017
Machine Learning and Social Robotics for Detecting Early Signs of Dementia.
CoRR, 2017

Crowd-Powered Design of Virtual Attentive Listeners.
Proceedings of the Intelligent Virtual Agents - 17th International Conference, 2017

Synthesising Uncertainty: The Interplay of Vocal Effort and Hesitation Disfluencies.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Crowd-Sourced Design of Artificial Attentive Listeners.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Controlling Prominence Realisation in Parametric DNN-Based Speech Synthesis.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Using crowd-sourcing for the design of listening agents: challenges and opportunities.
Proceedings of the 1st ACM SIGCHI International Workshop on Investigating Social Interactions with Artificial Agents, 2017

2016
WikiSpeech - enabling open source text-to-speech for Wikipedia.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Making Turn-Taking Decisions for an Active Listening Robot for Memory Training.
Proceedings of the Social Robotics - 8th International Conference, 2016

Hidden Resources ― Strategies to Acquire and Exploit Potential Spoken Language Resources in National Archives.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Towards Building an Attentive Artificial Listener: On the Perception of Attentiveness in Feedback Utterances.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Towards building an attentive artificial listener: on the perception of attentiveness in audio-visual feedback tokens.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

On data driven parametric backchannel synthesis for expressing attentiveness in conversational agents.
Proceedings of the Workshop on Multimodal Analyses enabling Artificial Agents in Human-Machine Interaction, 2016

2015
Automatic Detection of Miscommunication in Spoken Dialogue Systems.
Proceedings of the SIGDIAL 2015 Conference, 2015

Detecting repetitions in spoken dialogue systems using phonetic distances.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Audience response system-based assessment for analysis-by-synthesis.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015

Deciphering the Silent Participant: On the Use of Audio-Visual Cues for the Classification of Listener Categories in Group Discussions.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

2014
Data-driven models for timing feedback responses in a Map Task dialogue system.
Comput. Speech Lang., 2014

Crowdsourcing Street-level Geographic Information Using a Spoken Dialogue System.
Proceedings of the SIGDIAL 2014 Conference, 2014

Who Will Get the Grant?: A Multimodal Corpus for the Analysis of Conversational Behaviours in Group Interviews.
Proceedings of the 2014 Workshop on Understanding and Modeling Multiparty, 2014

Comparison of Human-Human and Human-Robot Turn-Taking Behaviour in Multiparty Situated Interaction.
Proceedings of the 2014 Workshop on Understanding and Modeling Multiparty, 2014

A comparative evaluation of vocoding techniques for HMM-based laughter synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2014

Human-robot collaborative tutoring using multiparty multimodal spoken dialogue.
Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction, 2014

Human pause and resume behaviours for unobtrusive humanlike in-car spoken dialogue systems.
Proceedings of the Workshop on Dialogue in Motion, 2014

2013
Semi-supervised methods for exploring the acoustics of simple productive feedback.
Speech Commun., 2013

Face-to-Face with a Robot: What do we actually Talk about?
Int. J. Humanoid Robotics, 2013

Head Pose Patterns in Multiparty Human-Robot Team-Building Interactions.
Proceedings of the Social Robotics - 5th International Conference, 2013

A Data-driven Model for Timing Feedback in a Map Task Dialogue System.
Proceedings of the SIGDIAL 2013 Conference, 2013

The Map Task Dialogue System: A Test-bed for Modelling Human-Like Dialogue.
Proceedings of the SIGDIAL 2013 Conference, 2013

Non-linear Pitch Modification in Voice Conversion Using Artificial Neural Networks.
Proceedings of the Advances in Nonlinear Speech Processing - 6th International Conference, 2013

Analysis of gaze and speech patterns in three-party quiz game interaction.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2012
Children and adults in dialogue with the robot head Furhat - corpus collection and initial analysis.
Proceedings of the Third Workshop on Child, Computer and Interaction, 2012

Walk This Way: Spatial Grounding for City Exploration.
Proceedings of the Natural Interaction with Robots, 2012

Gaze Patterns in Turn-Taking.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A Data-driven Approach to Understanding Spoken Route Directions in Human-Robot Dialogue.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

On the effect of the acoustic environment on the accuracy of perception of speaker orientation from auditory cues alone.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Multimodal multiparty social interaction with the furhat head.
Proceedings of the International Conference on Multimodal Interaction, 2012

2011
Enhanced visual scene understanding through human-robot dialog.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

A Dual Channel Coupled Decoder for Fillers and Feedback.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Predicting Speaker Changes and Listener Responses with and without Eye-Contact.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Tracking Pitch Contours Using Minimum Jerk Trajectories.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010
The prosody of Swedish conversational grunts.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Modeling conversational interaction using coupled Markov chains.
Proceedings of the DiSS-LPSS Joint Workshop 2010, 2010

Prosodic cues to engagement in non-lexical response tokens in Swedish.
Proceedings of the DiSS-LPSS Joint Workshop 2010, 2010

Enhanced Visual Scene Understanding through Human-Robot Dialog.
Proceedings of the Dialog with Robots, 2010

2009
Attention and Interaction Control in a Human-Human-Computer Dialogue Setting.
Proceedings of the SIGDIAL 2009 Conference, 2009

Eliciting Interactional Phenomena in Human-Human Dialogues.
Proceedings of the SIGDIAL 2009 Conference, 2009

The MonAMI reminder: a spoken dialogue system for face-to-face interaction.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Face-to-Face Interaction and the KTH Cooking Show.
Proceedings of the Development of Multimodal Interfaces: Active Listening and Synchrony, 2009

2008
Towards human-like spoken dialogue systems.
Speech Commun., 2008

Potential Benefits of Human-Like Dialogue Behaviour in the Call Routing Domain.
Proceedings of the Perception in Multimodal Dialogue Systems, 2008

EXPROS: A Toolkit for Exploratory Experimentation with Prosody in Customized Diphone Voices.
Proceedings of the Perception in Multimodal Dialogue Systems, 2008

Innovative Interfaces in MonAMI: The Reminder.
Proceedings of the Perception in Multimodal Dialogue Systems, 2008

What makes a good speaker? subject ratings, acoustic measurements and perceptual evaluations.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Innovative interfaces in MonAMI: the reminder.
Proceedings of the 10th International Conference on Multimodal Interfaces, 2008

2007
Children's convergence in referring expressions to graphical objects in a speech-enabled computer game.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006
Robust spoken language understanding in a computer game.
Speech Commun., 2006

2005
How to do Dialogue in a Fairy-tale World.
Proceedings of the 6th SIGdial Workshop on Discourse and Dialogue, 2005

Providing Computer Game Characters with Conversational Abilities.
Proceedings of the Intelligent Virtual Agents, 5th International Working Conference, 2005

The Swedish NICE corpus - spoken dialogues between children and embodied characters in a computer game scenario.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004
Voice creation for conversational fairy-tale characters.
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

The NICE Fairy-tale Game System.
Proceedings of the SIGDIAL 2004 Workshop, The 5th Annual Meeting of the Special Interest Group on Discourse and Dialogue, April 30, 2004

2003
Child and adult speaker adaptation during error resolution in a publicly available spoken dialogue system.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Developing Multimodal Spoken Dialogue Systems : Empirical Studies of Spoken Human-Computer Interaction.
PhD thesis, 2002

Voice transformations for improving children²s speech recognition in a publicly available dialogue system.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2000
Speech technology on trial: Experiences from the August system.
Nat. Lang. Eng., 2000

Adapt - a multimodal conversational dialogue system in an apartment domain.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Positive and negative user feedback in a spoken dialogue corpus.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A comparison of disfluency distribution in a unimodal and a multimodal speech interface.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
The august spoken dialogue system.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Interaction with an animated agent in a spoken dialogue system.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Web-based educational tools for speech technology.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

An educational dialogue system with a user controllable dialogue manager.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1997
An integrated system for teaching spoken dialogue systems technology.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

How do system questions influence lexical choices in user answers?
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1995
Using two-level morphology to transcribe Swedish names.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

The waxholm application database.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1993
An experimental dialogue system: waxholm.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993


  Loading...