Jens Edlund

Orcid: 0000-0001-9327-9482

Affiliations:
  • KTH Royal Institute of Technology, Stockholm, Sweden


According to our database, Jens Edlund authored at least 78 papers between 2000 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Revisiting Three Text-to-Speech Synthesis Experiments with a Web-Based Audience Response System.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Listener sensitivity to deviating obstruents in WaveNet.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Crowdsource-based Validation of the Audio Cocktail as a Sound Browsing Tool.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
Language Report Swedish.
Proceedings of the European Language Equality, 2022

2021
Understanding acceptability of disordered speech through Audience Response Systems-based evaluation.
Speech Commun., 2021

Methods of slowing down speech.
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

Human-in-the-Loop Efficiency Analysis for Binary Classification in Edyson.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

A Multimodal Digital Humanities Study of Terrorism in Swedish Politics: An Interdisciplinary Mixed Methods Project on the Configuration of Terrorism in Parliamentary Debates, Legislation, and Policy Networks 1968-2018.
Proceedings of the Intelligent Systems and Applications, 2021

2020
Augmented Prompt Selection for Evaluation of Spontaneous Speech Synthesis.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Exploring the archives for textual entry points to speech - experiences of interdisciplinary collaboration in making cultural heritage accessible for research.
Proceedings of the Twin Talks 2 and 3 Workshops at DHN 2020 and DH 2020, 2020

2019
The State of Speech in HCI: Trends, Themes and Challenges.
Interact. Comput., 2019

Speech Synthesis Evaluation - State-of-the-Art Assessment and Suggestion for a Novel Research Program.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Preliminary guidelines for the efficient management of OOV words for spoken text.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Spot the Pleasant People! Navigating the Cocktail Party Buzz.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

How to Annotate 100 Hours in 45 Minutes.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

First steps towards text profiling for speech synthesis.
Proceedings of the Digital Humanities in the Nordic Countries 4th Conference, 2019

Towards fast browsing of found audio data: 11 presidents.
Proceedings of the Digital Humanities in the Nordic Countries 4th Conference, 2019

New applications of gaze tracking in speech science.
Proceedings of the Digital Humanities in the Nordic Countries 4th Conference, 2019

Shoehorning in the name of science.
Proceedings of the 1st International Conference on Conversational User Interfaces, 2019

Mapping Theoretical and Methodological Perspectives for Understanding Speech Interface Interactions.
Proceedings of the Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems, 2019

2018
Language Technology and 3rd Wave HCI: Towards Phatic Communication and Situated Interaction.
Proceedings of the New Directions in Third Wave Human-Computer Interaction: Volume 1, 2018

The State of Speech in HCI: Trends, Themes and Challenges.
CoRR, 2018

Bringing Order to Chaos: A Non-Sequential Approach for Browsing Large Sets of Found Audio Data.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

A Tool for Exploring Large Amounts of Found Audio Data.
Proceedings of the Digital Humanities in the Nordic Countries 3rd Conference, 2018

2017
Approximating Phonotactic Input in Children's Linguistic Environments from Orthographic Transcripts.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016
WikiSpeech - enabling open source text-to-speech for Wikipedia.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Hidden Resources ― Strategies to Acquire and Exploit Potential Spoken Language Resources in National Archives.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2015
Communicative needs and respiratory constraints.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Audience response system-based assessment for analysis-by-synthesis.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015

2014
Ranking severity of speech errors by their phonological impact in context.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Human pause and resume behaviours for unobtrusive humanlike in-car spoken dialogue systems.
Proceedings of the Workshop on Dialogue in Motion, 2014

2013
D64: a corpus of richly recorded conversational interaction.
J. Multimodal User Interfaces, 2013

Timing responses to questions in dialogue.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Analysis of gaze and speech patterns in three-party quiz game interaction.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Co-present or Not?
Proceedings of the Eye Gaze in Intelligent User Interfaces, 2013

2012
Taming Mona Lisa: Communicating gaze faithfully in 2D and 3D facial projections.
ACM Trans. Interact. Intell. Syst., 2012

3rd party observer gaze as a continuous measure of dialogue flow.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Prosodic measurements and question types in the Spontal corpus of Swedish dialogues.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Gaze Patterns in Turn-Taking.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

On the Dynamics of Overlap in Multi-Party Conversation.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

On the effect of the acoustic environment on the accuracy of perception of speaker orientation from auditory cues alone.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011
The Mona Lisa Gaze Effect as an Objective Metric for Perceived Cospatiality.
Proceedings of the Intelligent Virtual Agents - 11th International Conference, 2011

Incremental Learning and Forgetting in Stochastic Turn-Taking Models.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Very Short Utterances and Timing in Turn-Taking.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A single-port non-parametric model of turn-taking in multi-party conversation.
Proceedings of the IEEE International Conference on Acoustics, 2011

Syllabification of conversational speech using Bidirectional Long-Short-Term Memory Neural Networks.
Proceedings of the IEEE International Conference on Acoustics, 2011

Kinetic data for large-scale analysis and modeling of face-to-face conversation.
Proceedings of the Auditory-Visual Speech Processing, 2011

2010
Pauses, gaps and overlaps in conversations.
J. Phonetics, 2010

Spontal-N: A Corpus of Interactional Spoken Norwegian.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

A Snack Implementation and Tcl/Tk Interface to the Fundamental Frequency Variation Spectrum Algorithm.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Spontal: A Swedish Spontaneous Dialogue Corpus of Audio, Video and Motion Capture.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Pitch similarity in the vicinity of backchannels.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Animated Faces for Robotic Heads: Gaze and Beyond.
Proceedings of the Analysis of Verbal and Nonverbal Communication and Enactment. The Processing Issues, 2010

2009
Multimodal Interaction Control.
Proceedings of the Computers in the Human Interaction Loop, 2009

Using speech technology to promote increased pitch variation in oral presentations.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2009

A general-purpose 32 ms prosodic vector for hidden Markov modeling.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Pause and gap length in face-to-face interaction.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

The MonAMI reminder: a spoken dialogue system for face-to-face interaction.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Exploring the prosody of floor mechanisms in English using the fundamental frequency variation spectrum.
Proceedings of the 17th European Signal Processing Conference, 2009

Face-to-Face Interaction and the KTH Cooking Show.
Proceedings of the Development of Multimodal Interfaces: Active Listening and Synchrony, 2009

2008
Towards human-like spoken dialogue systems.
Speech Commun., 2008

Human-Likeness in Utterance Generation: Effects of Variability.
Proceedings of the Perception in Multimodal Dialogue Systems, 2008

Potential Benefits of Human-Like Dialogue Behaviour in the Call Routing Domain.
Proceedings of the Perception in Multimodal Dialogue Systems, 2008

EXPROS: A Toolkit for Exploratory Experimentation with Prosody in Customized Diphone Voices.
Proceedings of the Perception in Multimodal Dialogue Systems, 2008

Innovative Interfaces in MonAMI: The Reminder.
Proceedings of the Perception in Multimodal Dialogue Systems, 2008

Innovative interfaces in MonAMI: the reminder.
Proceedings of the 10th International Conference on Multimodal Interfaces, 2008

An instantaneous vector representation of delta pitch for speaker-change prediction in conversational dialogue systems.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Underpinning /.
Proceedings of the Speaker Classification II, 2007

Pushy versus meek - using avatars to influence turn-taking behaviour.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006
The Effect of Prosodic Features on the Interpretation of Synthesised Backchannels.
Proceedings of the Perception and Interactive Technologies, 2006

Talking with Higgins: Research Challenges in a Spoken Dialogue System.
Proceedings of the Perception and Interactive Technologies, 2006

User responses to prosodic variation in fragmentary grounding utterances in dialog.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

/nailon/ - software for online analysis of prosody.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2005
Exploring Prosody in Interaction Control.
Phonetica, 2005

The effects of prosodic features on the interpretation of clarification ellipses.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004
Higgins - a spoken dialogue system for investigating error handling techniques.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2002
Specification and realisation of multimodal output in dialogue systems.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2000
Adapt - a multimodal conversational dialogue system in an apartment domain.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

