Roland Göcke

ORCID: 0000-0003-2279-7041

  • University of Canberra, Australia
  • Australian National University, Canberra, Australia (former)

According to our database, Roland Göcke authored at least 167 papers between 1999 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.




Deep Learning-Driven Analysis of a Six-Bar Mechanism for Personalized Gait Rehabilitation.
J. Comput. Inf. Sci. Eng., 2025

Multilevel Pain Assessment with Functional Near-Infrared Spectroscopy: Evaluating ΔHBO2 and ΔHHB Measures for Comprehensive Analysis.
Sensors, January, 2024

Mood as a Contextual Cue for Improved Emotion Inference.
CoRR, 2024

Can Expression Sensitivity Improve Macro- and Micro-Expression Spotting in Long Videos?
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

MRAC Track 1: 2nd Workshop on Multimodal, Generative and Responsible Affective Computing.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

An fNIRS Study on Standing Concurrent with Working Memory Task and Sensory Manipulation.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

Interpretation of Depression Detection Models via Feature Selection Methods.
IEEE Trans. Affect. Comput., 2023

A systematic review of neurophysiological sensing for the assessment of acute pain.
npj Digit. Medicine, 2023

Synthesis of a six-bar mechanism for generating knee and ankle motion trajectories using deep generative neural network.
Eng. Appl. Artif. Intell., 2023

Focus on Change: Mood Prediction by Learning Emotion Changes via Spatio-Temporal Attention.
CoRR, 2023

Explainable Human-centered Traits from Head Motion and Facial Expression Dynamics.
CoRR, 2023

A Systematic Review on Functional Near-Infrared Spectroscopy Concurrent With Quantitative Balance Assessment.
IEEE Access, 2023

An Investigation of Video Vision Transformers for Depression Severity Estimation from Facial Video Data.
Proceedings of the Image and Video Technology - 11th Pacific-Rim Symposium, 2023

Efficient Labelling of Affective Video Datasets via Few-Shot & Multi-Task Contrastive Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Examining the Influence of Personality and Multimodal Behavior on Hireability Impressions.
Proceedings of the Fourteenth Indian Conference on Computer Vision, 2023

Explainable Depression Detection via Head Motion Patterns.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

EmotiW 2023: Emotion Recognition in the Wild Challenge.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

Examining Subject-Dependent and Subject-Independent Human Affect Inference from Limited Video Data.
Proceedings of the 17th IEEE International Conference on Automatic Face and Gesture Recognition, 2023

"Where does it hurt?": Exploring EDA Signals to Detect and Localise Acute Pain.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023

Estimating Depression Severity from Long-Sequence Face Videos via an Ensemble Global Diverse Convolutional Model.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2023

A Weakly Supervised Approach to Emotion-change Prediction and Improved Mood Inference.
Proceedings of the 11th International Conference on Affective Computing and Intelligent Interaction, 2023

Analyzing Group-Level Emotion with Global Alignment Kernel based Approach.
IEEE Trans. Affect. Comput., 2022

Affective Computational Advertising Based on Perceptual Metrics.
CoRR, 2022

Automated Parkinson's Disease Detection and Affective Analysis from Emotional EEG Signals.
CoRR, 2022

To Improve Is to Change: Towards Improving Mood Prediction by Learning Changes in Emotion.
Proceedings of the International Conference on Multimodal Interaction, 2022

A Multidimensional Approach to Develop Sway Index Using Gaussian Mixture Model: A Way of Postural Sway Measurement and Analysis in Different Age Groups.
IEEE Trans. Instrum. Meas., 2021

Deeply Supervised Discriminative Learning for Adversarial Defense.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Micro-Expression Recognition Based On Video Motion Magnification And Pre-Trained Neural Network.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Characterizing Hirability via Personality and Behavior.
CoRR, 2020

Automatic Segmentation of Achilles Tendon Tissues Using Deep Convolutional Neural Network.
Proceedings of the Machine Learning in Medical Imaging - 11th International Workshop, 2020

CNN Depression Severity Level Estimation from Upper Body vs. Face-Only Images.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

EmotiW 2020: Driver Gaze, Group Emotion, Student Engagement and Physiological Signal based Challenges.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020

Investigating LSTM for Micro-Expression Recognition.
Proceedings of the Companion Publication of the 2020 International Conference on Multimodal Interaction, 2020

Towards the Design of a Human-Inspired Gripper for Textile Manipulation.
Proceedings of the 16th IEEE International Conference on Automation Science and Engineering, 2020

Feature Map Augmentation to Improve Rotation Invariance in Convolutional Neural Networks.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2020

Automatic depression classification based on affective read sentences: Opportunities for text-dependent analysis.
Speech Commun., 2019

Evaluating and Validating Emotion Elicitation Using English and Arabic Movie Clips on a Saudi Sample.
Sensors, 2019

An investigation of linguistic stress and articulatory vowel characteristics for automatic depression classification.
Comput. Speech Lang., 2019

Hierarchical Adversarial Network for Human Pose Estimation.
IEEE Access, 2019

Adversarial Defense by Restricting the Hidden Space of Deep Neural Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Estimation of Missing Human Body Parts Via Bidirectional LSTM.
Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019

Automated Measurement of Head Movement Synchrony during Dyadic Depression Severity Interviews.
Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019

Sway Risk Analysis Based on Age Group Classification.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

Gait Estimation and Analysis from Noisy Observations.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

Multimodal Framework for Analyzing the Affect of a Group of People.
IEEE Trans. Multim., 2018

MSMCT: Multi-State Multi-Camera Tracker.
IEEE Trans. Circuits Syst. Video Technol., 2018

Multimodal Depression Detection: Fusion Analysis of Paralinguistic, Head Pose and Eye Gaze Behaviors.
IEEE Trans. Affect. Comput., 2018

A Global Alignment Kernel based Approach for Group-level Happiness Intensity Estimation.
CoRR, 2018

EmotiW 2018: Audio-Video, Student Engagement and Group-Level Affect Prediction.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

Multimodal assessment of depression from behavioral signals.
Proceedings of the Handbook of Multimodal-Multisensor Interfaces: Foundations, User Modeling, and Common Modality Combinations, 2018

Investigating Word Affect Features and Fusion of Probabilistic Predictions Incorporating Uncertainty in AVEC 2017.
Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, CA, USA, October 23, 2017

Elicitation Design for Acoustic Depression Classification: An Investigation of Articulation Effort, Linguistic Complexity, and Word Affect.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

From individual to group-level emotion recognition: EmotiW 5.0.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017

A multimodal system to characterise melancholia: cascaded bag of words approach.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017

Human Postural Sway Estimation from Noisy Observations.
Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition, 2017

A Video-Based Facial Behaviour Analysis Approach to Melancholia.
Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition, 2017

Joint Registration and Representation Learning for Unconstrained Face Identification.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Heart rate estimation from facial videos for depression analysis.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

Emotion recognition in the wild.
J. Multimodal User Interfaces, 2016

Dimensionality reduction of Fisher vectors for human action recognition.
IET Comput. Vis., 2016

Efficient multi-target tracking via discovering dense subgraphs.
Comput. Vis. Image Underst., 2016

Analyzing the Affect of a Group of People Using Multi-modal Framework.
CoRR, 2016

An Investigation of Emotional Speech in Depression Classification.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Cross-Cultural Depression Recognition from Vocal Biomarkers.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

EmotiW 2016: video and group-level emotion recognition challenges.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

Emotion recognition in the wild challenge 2016.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

Extending Long Short-Term Memory for Multi-View Structured Learning.
Proceedings of the Computer Vision - ECCV 2016, 2016

Automatic Group Happiness Intensity Analysis.
IEEE Trans. Affect. Comput., 2015

Ordered trajectories for human action recognition with large number of classes.
Image Vis. Comput., 2015

Occlusion-Aware Human Pose Estimation with Mixtures of Sub-Trees.
CoRR, 2015

Harnessing the Deep Net Object Models for Enhancing Human Action Recognition.
CoRR, 2015

AVEC 2015 Keynote Talk: From Facial Expression Analysis to Multimodal Mood Analysis.
Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge, 2015

Video and Image based Emotion Recognition Challenges in the Wild: EmotiW 2015.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Multi-level action detection via learning latent structure.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Injury Mechanism Classification in Soccer Videos.
Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015

Play with me - Measuring a child's engagement in a social interaction.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

The more the merrier: Analysing the affect of a group of people in images.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Cross-cultural detection of depression from nonverbal behaviour.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Riesz-based Volume Local Binary Pattern and A Novel Group Expression Model for Group Happiness Intensity Analysis.
Proceedings of the British Machine Vision Conference 2015, 2015

A temporally piece-wise fisher vector approach for depression analysis.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Thermal spatio-temporal data for stress recognition.
EURASIP J. Image Video Process., 2014

A discriminative parts based model approach for fiducial points free and shape constrained head pose normalisation in the wild.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Automatic Prediction of Perceived Traits Using Visual Cues under Varied Situational Context.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Emotion Recognition In The Wild Challenge 2014: Baseline, Data and Protocol.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

Detecting self-stimulatory behaviours for autism diagnosis.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Dense body part trajectories for human action recognition.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Joint sparsity-based robust visual tracking.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Design of an Emotion Elicitation Framework for Arabic Speakers.
Proceedings of the Human-Computer Interaction. Advanced Interaction Modalities and Techniques, 2014

The Influence of Temporal Information on Human Action Recognition with Large Number of Classes.
Proceedings of the 2014 International Conference on Digital Image Computing: Techniques and Applications, 2014

Discriminative Multi-Task Sparse Learning for Robust Visual Tracking Using Conditional Random Field.
Proceedings of the 2014 International Conference on Digital Image Computing: Techniques and Applications, 2014

Robust Visual Tracking via Rank-Constrained Sparse Learning.
Proceedings of the 2014 International Conference on Digital Image Computing: Techniques and Applications, 2014

Enhanced Laplacian Group Sparse Learning with Lifespan Outlier Rejection for Visual Tracking.
Proceedings of the Computer Vision - ACCV 2014, 2014

Multimodal assistive technologies for depression diagnosis and monitoring.
J. Multimodal User Interfaces, 2013

Diagnosis of depression by behavioural signals: a multimodal approach.
Proceedings of the 3rd ACM international workshop on Audio/visual emotion challenge, 2013

R-norm: improving inter-speaker variability modelling at the score level via regression score normalisation.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Modeling spectral variability for the classification of depressed speech.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Characterising depressed speech for classification.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Adaptive Multiple Component Metric Learning for Robust Visual Tracking.
Proceedings of the Neural Information Processing - 20th International Conference, 2013

Emotion recognition in the wild challenge 2013.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Emotion recognition in the wild challenge (EmotiW) challenge and workshop summary.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Eye movement analysis for depression detection.
Proceedings of the IEEE International Conference on Image Processing, 2013

Self-Stimulatory Behaviours in the Wild for Autism Diagnosis.
Proceedings of the 2013 IEEE International Conference on Computer Vision Workshops, 2013

Ordered Trajectories for Large Scale Human Action Recognition.
Proceedings of the 2013 IEEE International Conference on Computer Vision Workshops, 2013

Monocular Image 3D Human Pose Estimation under Self-Occlusion.
Proceedings of the IEEE International Conference on Computer Vision, 2013

A comparative study of different classifiers for detecting depression from spontaneous speech.
Proceedings of the IEEE International Conference on Acoustics, 2013

Detecting depression: A comparison between spontaneous and read speech.
Proceedings of the IEEE International Conference on Acoustics, 2013

Can body expressions contribute to automatic depression analysis?
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

On the Effect of Human Body Parts in Large Scale Human Behaviour Recognition.
Proceedings of the 2013 International Conference on Digital Image Computing: Techniques and Applications, 2013

Robust Visual Vocabulary Tracking Using Hierarchical Model Fusion.
Proceedings of the 2013 International Conference on Digital Image Computing: Techniques and Applications, 2013

Voice source waveforms for utterance level speaker identification using support vector machines.
Proceedings of the 8th International Conference on Information Technology in Asia, 2013

AusTalk - The Australian speech database: Design framework, recording experience and localisation.
Proceedings of the 8th International Conference on Information Technology in Asia, 2013

Modeling Stress Using Thermal Facial Patterns: A Spatio-temporal Approach.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013

Relative Body Parts Movement for Automatic Depression Analysis.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013

Head Pose and Movement Analysis as an Indicator of Depression.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013

Facial Performance Transfer via Deformable Models and Parametric Correspondence.
IEEE Trans. Vis. Comput. Graph., 2012

Collecting Large, Richly Annotated Facial-Expression Databases from Movies.
IEEE Multim., 2012

Multi-view Multi-modal Gait Based Human Identity Recognition from Surveillance Videos.
Proceedings of the Multimodal Pattern Recognition of Social Signals in Human-Computer-Interaction, 2012

Correcting pose estimation with implicit occlusion detection and rectification.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Neural-net classification for spatio-temporal descriptor based depression analysis.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Group expression intensity estimation in videos via Gaussian Processes.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

An Improved NN Training Scheme Using Two-Stage LDA Features for Face Recognition.
Proceedings of the Neural Information Processing - 19th International Conference, 2012

Regression Based Pose Estimation with Automatic Occlusion Detection and Rectification.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

From Joyous to Clinically Depressed: Mood Detection Using Spontaneous Speech.
Proceedings of the Twenty-Fifth International Florida Artificial Intelligence Research Society Conference, 2012

Finding Happiest Moments in a Social Context.
Proceedings of the Computer Vision, 2012

Regression based automatic face annotation for deformable model building.
Pattern Recognit., 2011

An Investigation of Depressed Speech Detection: Features and Normalization.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Building an Audio-Visual Corpus of Australian English: Large Corpus Collection with an Economical Portable and Replicable Black Box.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Static facial expression analysis in tough conditions: Data, evaluation protocol and benchmark.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Emotion recognition using PHOG and LPQ features.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

A SSIM-based approach for finding similar facial expressions.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

Pose Normalization via Learned 2D Warping for Fully Automatic Face Recognition.
Proceedings of the British Machine Vision Conference, 2011

Illumination and Expression Invariant Recognition Using SSIM Based Sparse Representation.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Linear Facial Expression Transfer with Active Appearance Models.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Facial Expression Based Automatic Album Creation.
Proceedings of the Neural Information Processing. Models and Applications, 2010

Learning AAM fitting through simulation.
Pattern Recognit., 2009

Automatic frontal face annotation and AAM building for arbitrary expressions from a single frontal image only.
Proceedings of the International Conference on Image Processing, 2009

Biologically Inspired Contrast Enhancement Using Asymmetric Gain Control.
Proceedings of the DICTA 2009, 2009

Learning based automatic face annotation for arbitrary poses and expressions from frontal images only.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Learning-based Face Synthesis for Pose-Robust Recognition from Single Image.
Proceedings of the British Machine Vision Conference, 2009

Auditory-visual infant directed speech in Japanese and English.
Proceedings of the Auditory-Visual Speech Processing, 2009

Audio-visual mutual dependency models for biometric liveness checks.
Proceedings of the Auditory-Visual Speech Processing, 2009

An approach for automatically measuring facial activity in depressed subjects.
Proceedings of the Affective Computing and Intelligent Interaction, 2009

Evaluating AAM fitting methods for facial expression recognition.
Proceedings of the Affective Computing and Intelligent Interaction, 2009

The Composite Sensing of Affect.
Proceedings of the Affect and Emotion in Human-Computer Interaction, 2008

A composite framework for affective sensing.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A Hybrid Fuzzy Approach for Human Eye Gaze Pattern Recognition.
Proceedings of the Advances in Neuro-Information Processing, 15th International Conference, 2008

Image Reconstruction from Contrast Information.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2008

Optical flow estimation using Fourier Mellin Transform.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

A comparative study of 2d and 3d lip tracking methods for AV ASR.
Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

Automatic Parametrisation for an Image Completion Method Based on Markov Random Fields.
Proceedings of the International Conference on Image Processing, 2007

A Nonlinear Discriminative Approach to AAM Fitting.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

EREC-II in Use - Studies on Usability and Suitability of a Sensor System for Affect Detection and Human Performance Monitoring.
Proceedings of the Human-Computer Interaction. HCI Intelligent Multimodal Interaction Environments, 2007

Towards Affective Sensing.
Proceedings of the Human-Computer Interaction. HCI Intelligent Multimodal Interaction Environments, 2007

Quaternion Potential Functions for a Colour Image Completion Method Using Markov Random Fields.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2007

Monocular and Stereo Methods for AAM Learning from Video.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Iterative Error Bound Minimisation for AAM Alignment.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

A Wavelet-Based Approach to Image Feature Stability Assessment.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2006

Current trends in joint audio-video signal processing: a review.
Proceedings of the Eighth International Symposium on Signal Processing and Its Applications, 2005

3d lip tracking and co-inertia analysis for improved robustness of audio-video automatic speech recognition.
Proceedings of the Auditory-Visual Speech Processing 2005, 2005

Aspects of speaking-face data corpus design methodology.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

The audio-video Australian English speech data corpus AVOZES.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

A Freely Configurable, Multi-modal Sensor System for Affective Computing.
Proceedings of the Affective Dialogue Systems, Tutorial and Research Workshop, 2004

Statistical analysis of the relationship between audio and video speech parameters for Australian English.
Proceedings of the AVSP 2003, 2003

Noisy audio feature enhancement using audio-visual speech data.
Proceedings of the IEEE International Conference on Acoustics, 2002

Analysis of audio-video correlation in vowels in Australian English.
Proceedings of the Auditory-Visual Speech Processing, 2001

Fast voxel-based 2D/3D registration algorithm using a volume rendering method based on the shear-warp factorization.
Proceedings of the Medical Imaging 1999: Image Processing, 1999
