Angela Yao

Orcid: 0000-0001-7418-6141

Affiliations:
  • School of Computing, National University of Singapore, Singapore
  • University of Bonn, Institute of Computer Science, Germany (former)


According to our database1, Angela Yao authored at least 128 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Temporal Action Segmentation: An Analysis of Modern Techniques.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2024

A closer look at branch classifiers of multi-exit architectures.
Comput. Vis. Image Underst., February, 2024

Scene-Text Grounding for Text-Based Video Question Answering.
CoRR, 2024

Question-Answering Dense Video Events.
CoRR, 2024

VideoQA in the Era of LLMs: An Empirical Study.
CoRR, 2024

OfCaM: Global Human Mesh Recovery via Optimization-free Camera Motion Scale Calibration.
CoRR, 2024

InstructHumans: Editing Animated 3D Human Textures with Instructions.
CoRR, 2024

AID: Attention Interpolation of Text-to-Image Diffusion.
CoRR, 2024

Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects.
CoRR, 2024

Rethinking Visibility in Human Pose Estimation: Occluded Pose Reasoning via Transformers.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Learning to generate training datasets for robust semantic segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Deep Regression Representation Learning with Topology.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

On the Calibration of Human Pose Estimation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

RealViformer: Investigating Attention for Real-World Video Super-Resolution.
Proceedings of the Computer Vision - ECCV 2024, 2024

NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model.
Proceedings of the Computer Vision - ECCV 2024, 2024

Pairwise Distance Distillation for Unsupervised Real-World Image Super-Resolution.
Proceedings of the Computer Vision - ECCV 2024, 2024

On the Utility of 3D Hand Poses for Action Recognition.
Proceedings of the Computer Vision - ECCV 2024, 2024

Long-Tail Temporal Action Segmentation with Group-Wise Temporal Logit Adjustment.
Proceedings of the Computer Vision - ECCV 2024, 2024

WAVE: Warping DDIM Inversion Features for Zero-Shot Text-to-Video Editing.
Proceedings of the Computer Vision - ECCV 2024, 2024

Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects.
Proceedings of the Computer Vision - ECCV 2024, 2024

KITRO: Refining Human Mesh by 2D Clues and Kinematic-tree Rotation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Enhancing Video Super-Resolution via Implicit Resampling-based Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Deep Imbalanced Regression via Hierarchical Classification Adjustment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Can I Trust Your Answer? Visually Grounded Video Question Answering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Make Me a BNN: A Simple Strategy for Estimating Bayesian Uncertainty from Pre-trained Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Coherent Temporal Synthesis for Incremental Action Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Contrastive Video Question Answering via Video Graph Transformer.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

C2F-TCN: A Framework for Semi- and Fully-Supervised Temporal Action Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Bias-Compensated Integral Regression for Human Pose Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

Transferring Knowledge From Text to Video: Zero-Shot Anticipation for Procedural Actions.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Temporal Action Segmentation With High-Level Complex Activity Labels.
IEEE Trans. Multim., 2023

Learning Unorthogonalized Matrices for Rotation Estimation.
CoRR, 2023

On the Calibration of Human Pose Estimation.
CoRR, 2023

Every Mistake Counts in Assembly.
CoRR, 2023

Overcoming the Trade-off Between Accuracy and Plausibility in 3D Hand Shape Reconstruction.
CoRR, 2023

An Implicit Alignment for Video Super-Resolution.
CoRR, 2023

Synthetic-to-Real Pose Estimation with Geometric Reconstruction.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Opening the Vocabulary of Egocentric Actions.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Improving Deep Regression with Ordinal Entropy.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training.
Proceedings of the Eleventh International Conference on Learning Representations, 2023


MHEntropy: Entropy Meets Multiple Hypotheses for Pose and Shape Recovery.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

HiFiHR: Enhancing 3D Hand Reconstruction from a Single Image via High-Fidelity Texture.
Proceedings of the Pattern Recognition - 45th DAGM German Conference, 2023

Overcoming the TradeOff between Accuracy and Plausibility in 3D Hand Shape Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Cross-Domain 3D Hand Pose Estimation with Dual Modalities.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Analyzing and Diagnosing Pose Estimation with Attributions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Learning deep morphological networks with neural architecture search.
Pattern Recognit., 2022

Transformed ROIs for capturing visual transformations in videos.
Comput. Vis. Image Underst., 2022

Temporal Action Segmentation: An Analysis of Modern Technique.
CoRR, 2022

A Generalized & Robust Framework For Timestamp Supervision in Temporal Action Segmentation.
CoRR, 2022

DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training.
CoRR, 2022

Dive Deeper Into Integral Pose Regression.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Perception-Distortion Balanced ADMM Optimization for Single-Image Super-Resolution.
Proceedings of the Computer Vision - ECCV 2022, 2022

Discrete-Constrained Regression for Local Counting Models.
Proceedings of the Computer Vision, 2022

A Generalized and Robust Framework for Timestamp Supervision in Temporal Action Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Leveraging Action Affinity and Continuity for Semi-supervised Temporal Action Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Accelerating Video Object Segmentation with Compressed Video.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

TemporalUV: Capturing Loose Clothing with Temporally Coherent UV Coordinates.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Multi-Scale Memory-Based Video Deblurring.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

UV-Based 3D Hand-Object Reconstruction with Grasp Optimization.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Video as Conditional Graph Hierarchy for Multi-Granular Question Answering.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Iterative Contrast-Classify for Semi-supervised Temporal Action Segmentation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Comprehensive Regularization in a Bi-directional Predictive Network for Video Anomaly Detection.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Reliable Semantic Segmentation with Superpixel-Mix.
CoRR, 2021

Efficient Video Object Segmentation with Compressed Video.
CoRR, 2021

Learning Video Models from Text: Zero-Shot Anticipation for Procedural Actions.
CoRR, 2021

Technical Report: Temporal Aggregate Representations.
CoRR, 2021

Coarse to Fine Multi-Resolution Temporal Convolutional Network.
CoRR, 2021

Towards Compact Single Image Super-Resolution via Contrastive Self-distillation.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

SemiHand: Semi-supervised Hand Pose Estimation with Consistency.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Removing the Bias of Integral Pose Regression.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Weakly-Supervised Dense Action Anticipation.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Local and Global Point Cloud Reconstruction for 3D Hand Pose Estimation.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Robust Semantic Segmentation with Superpixel-Mix.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Deep morphological networks.
Pattern Recognit., 2020

Rethinking CNN Models for Audio Classification.
CoRR, 2020

Temporal Aggregate Representations for Long Term Video Understanding.
CoRR, 2020

Towards deep neural network compression via learnable wavelet transforms.
CoRR, 2020

Neural Network Compression via Learnable Wavelet Transforms.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2020, 2020

Sequence Prediction Using Spectral RNNs.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2020, 2020

Object-centered Fourier Motion Estimation and Segment-Transformation Prediction.
Proceedings of the 28th European Symposium on Artificial Neural Networks, 2020

Dual Grid Net: Hand Mesh Vertex Regression from Single Depth Maps.
Proceedings of the Computer Vision - ECCV 2020, 2020

Temporal Aggregate Representations for Long-Range Video Understanding.
Proceedings of the Computer Vision - ECCV 2020, 2020


Multi-stage Fusion for One-Click Segmentation.
Proceedings of the Pattern Recognition - 42nd DAGM German Conference, DAGM GCPR 2020, Tübingen, Germany, September 28, 2020

Two-in-One Refinement for Interactive Segmentation.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

2019
A two-streamed network for estimating fine-scaled depth maps from single RGB images.
Comput. Vis. Image Underst., 2019

Bonn Activity Maps: Dataset Description.
CoRR, 2019

Aligning Latent Spaces for 3D Hand Pose Estimation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Zero-Shot Anticipation for Instructional Activities.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Localized Interactive Instance Segmentation.
Proceedings of the Pattern Recognition, 2019

Disentangling Latent Hands for Image Synthesis and Pose Estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Self-Supervised 3D Hand Pose Estimation Through Training by Fitting.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Content-Aware Multi-Level Guidance for Interactive Instance Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Fourier RNNs for Sequence Analysis and Prediction.
CoRR, 2018

Scale-aware multi-level guidance for interactive instance segmentation.
CoRR, 2018

Gated Complex Recurrent Neural Networks.
CoRR, 2018

Complex Gated Recurrent Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

HANDS18: Methods, Techniques and Applications for Hand Observation.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Workshop on Interactive and Adaptive Learning in an Open World.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Supervised Deep Kriging for Single-Image Super-Resolution.
Proceedings of the Pattern Recognition - 40th German Conference, 2018

Learning Style Compatibility for Furniture.
Proceedings of the Pattern Recognition - 40th German Conference, 2018

Dense 3D Regression for Hand Pose Estimation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Unsupervised Learning and Segmentation of Complex Activities From Video.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Efficient Unsupervised Temporal Segmentation of Motion Data.
IEEE Trans. Multim., 2017

Crossing Nets: Dual Generative Models with a Shared Latent Space for Hand Pose Estimation.
CoRR, 2017

Data Driven Synthesis of Hand Grasps from 3-D Object Models.
Proceedings of the 22nd International Symposium on Vision, Modeling, and Visualization, 2017

A Two-Streamed Network for Estimating Fine-Scaled Depth Maps from Single RGB Images.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Crossing Nets: Combining GANs and VAEs with a Shared Latent Space for Hand Pose Estimation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Superpixel Optimization Using Higher Order Energy.
IEEE Trans. Circuits Syst. Video Technol., 2016

Direction matters: hand pose estimation from local surface normals.
CoRR, 2016

Learning Fine-Scaled Depth Maps from Single RGB Images.
CoRR, 2016

Hand Pose Estimation from Local Surface Normals.
Proceedings of the Computer Vision - ECCV 2016, 2016

2014
Gesture Recognition Portfolios for Personalization.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2012
Vision-Based Human Motion Analysis.
PhD thesis, 2012

Coupled Action Recognition and Pose Estimation from Multiple Views.
Int. J. Comput. Vis., 2012

Interactive object detection.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Hough Forests for Object Detection, Tracking, and Action Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2011

Learning Probabilistic Non-Linear Latent Variable Models for Tracking Complex Activities.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Does Human Action Recognition Benefit from Pose Estimation?.
Proceedings of the British Machine Vision Conference, 2011

2010
Variations of a Hough-Voting Action Recognition System.
Proceedings of the Recognizing Patterns in Signals, Speech, Images and Videos, 2010

2D Action Recognition Serves 3D Human Pose Estimation.
Proceedings of the Computer Vision, 2010

Hough Forest-Based Facial Expression Recognition from Video Sequences.
Proceedings of the Trends and Topics in Computer Vision, 2010

Tracking People in Broadcast Sports.
Proceedings of the Pattern Recognition, 2010

A Hough transform-based voting framework for action recognition.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010


  Loading...