Xu Zhao

ORCID: 0000-0002-8176-623X

Affiliations:
  • Shanghai Jiao Tong University, Department of Automation, Shanghai, China
  • Shanghai Jiao Tong University, Institute of Image Processing and Pattern Recognition, Shanghai, China (PhD 2011)


According to our database, Xu Zhao authored at least 104 papers between 2007 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Timeline

[Chart omitted: publications per year, broken down by type (Book, In proceedings, Article, PhD thesis, Dataset, Other).]

Bibliography

2025
RSB-Pose: Robust Short-Baseline Binocular 3D Human Pose Estimation With Occlusion Handling.
IEEE Trans. Image Process., 2025

2024
Joint-Limb Compound Triangulation With Co-Fixing for Stereoscopic Human Pose Estimation.
IEEE Trans. Multim., 2024

An Embeddable Implicit IUVD Representation for Part-Based 3D Human Surface Reconstruction.
IEEE Trans. Image Process., 2024

M3Net: Movement Enhancement with Multi-Relation toward Multi-Scale video representation for Temporal Action Detection.
Pattern Recognit., 2024

Disambiguating Monocular Reconstruction of 3D Clothed Human with Spatial-Temporal Transformer.
CoRR, 2024

Hybrid 3D Human Pose Estimation with Monocular Video and Sparse IMUs.
CoRR, 2024

Real-Time Industrial Anomaly Detection via Sparse Reconstruction.
Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2024

Part-aware Surface Slice and Polarization Implicit Function with Regular Discretization for 3D Human Surface Reconstruction.
Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2024

Self-Supervised Fast Texture Defect Detection Based on Salient Object Detection.
Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2024

High-Quality Talking Face Generation via Cross-Attention Transformer.
Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2024

MESA: Matching Everything by Segmenting Anything.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
View consistency aware holistic triangulation for 3D human pose estimation.
Comput. Vis. Image Underst., November, 2023

Temporally consistent reconstruction of 3D clothed human surface with warp field.
Image Vis. Comput., September, 2023

FusePose: IMU-Vision Sensor Fusion in Kinematic Space for Parametric Human Pose Estimation.
IEEE Trans. Multim., 2023

3DTRIP: A General Framework for 3D Trajectory Recovery Integrated With Prediction.
IEEE Robotics Autom. Lett., 2023

RSB-Pose: Robust Short-Baseline Binocular 3D Human Pose Estimation with Occlusion Handling.
CoRR, 2023

Searching from Area to Point: A Hierarchical Framework for Semantic-Geometric Combined Feature Matching.
CoRR, 2023

Movement Enhancement toward Multi-Scale Video Feature Representation for Temporal Action Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ETAD: Training Action Detection End to End on a Laptop.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Learning-Based Distortion Correction and Feature Detection for High Precision and Robust Camera Calibration.
IEEE Robotics Autom. Lett., 2022

ETAD: A Unified Framework for Efficient Temporal Action Detection.
CoRR, 2022

Learning-Based Framework for Camera Calibration with Distortion Correction and High Precision Feature Detection.
CoRR, 2022

Semi-supervised Learning for Multi-label Video Action Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

BACNet: Boundary-Anchor Complementary Network for Temporal Action Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Mr.CAN: Class-Aware Network with Multi-Relations for Temporal Action Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Spatio-Temporal Motion Aggregation Network for Video Action Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

Mask-Based Attention Parallel Network for in-the-Wild Facial Expression Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Structural Triangulation: A Closed-Form Solution to Constrained 3D Human Pose Estimation.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
CAT: Corner Aided Tracking With Deep Regression Network.
IEEE Trans. Multim., 2021

Transferable Knowledge-Based Multi-Granularity Fusion Network for Weakly Supervised Temporal Action Detection.
IEEE Trans. Multim., 2021

Joint Intention and Trajectory Prediction Based on Transformer.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

RGB-D Fusion for Point-Cloud-Based 3D Human Pose Estimation.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Human Carving: A Parsing-Based Framework for 3D Human Reconstruction.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

2020
Oriented Spatial Transformer Network for Pedestrian Detection Using Fish-Eye Camera.
IEEE Trans. Multim., 2020

Joint Learning of Local and Global Context for Temporal Action Proposal Generation.
IEEE Trans. Circuits Syst. Video Technol., 2020

EdgeStereo: An Effective Multi-task Learning Network for Stereo Matching and Edge Detection.
Int. J. Comput. Vis., 2020

An End to End Network Architecture for Fundamental Matrix Estimation.
CoRR, 2020

AEC-Net: Attention and Edge Constraint Network for Medical Image Segmentation.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020

TSI: Temporal Scale Invariant Network for Action Proposal Generation.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Anatomy and Geometry Constrained One-Stage Framework for 3D Human Pose Estimation.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
Attention-Based Multiview Re-Observation Fusion Network for Skeletal Action Recognition.
IEEE Trans. Multim., 2019

Discriminative representation combinations for accurate face spoofing detection.
Pattern Recognit., 2019

Small-objectness sensitive detection based on shifted single shot detector.
Multim. Tools Appl., 2019

Multi-Granularity Fusion Network for Proposal and Activity Localization: Submission to ActivityNet Challenge 2019 Task 1 and Task 2.
CoRR, 2019

EdgeStereo: An Effective Multi-Task Learning Network for Stereo Matching and Edge Detection.
CoRR, 2019

Temporal Regularized Spatial Attention for Video-Based Person Re-Identification.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

3D Body Pose and Shape Estimation from Multi-View Images With Limb Geometric Constraint.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

ISDNet: Importance Guided Semi-supervised Adversarial Learning for Medical Image Segmentation.
Proceedings of the Image and Graphics - 10th International Conference, 2019

Semantic Segmentation of Street Scenes Using Disparity Information.
Proceedings of the Image and Graphics - 10th International Conference, 2019

2018
Measuring Crowd Collectiveness by Macroscopic and Microscopic Motion Consistencies.
IEEE Trans. Multim., 2018

Skeleton Feature Fusion Based on Multi-Stream LSTM for Action Recognition.
IEEE Access, 2018

Weakly Supervised Temporal Action Detection with Shot-Based Temporal Pooling Network.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

LED: Localization-Quality Estimation Embedded Detector.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

BSN: Boundary Sensitive Network for Temporal Action Proposal Generation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Cascaded Pyramid Mining Network for Weakly Supervised Temporal Action Localization.
Proceedings of the Computer Vision - ACCV 2018, 2018

EdgeStereo: A Context Integrated Residual Pyramid Network for Stereo Matching.
Proceedings of the Computer Vision - ACCV 2018, 2018

Simultaneous Face Detection and Head Pose Estimation: A Fast and Unified Framework.
Proceedings of the Computer Vision - ACCV 2018, 2018

Putting the Anchors Efficiently: Geometric Constrained Pedestrian Detection.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Context-Associative Hierarchical Memory Model for Human Activity Recognition and Prediction.
IEEE Trans. Multim., 2017

Online learning of dynamic multi-view gallery for person Re-identification.
Multim. Tools Appl., 2017

Plate refractive camera model and its applications.
J. Electronic Imaging, 2017

Temporal Convolution Based Action Proposal: Submission to ActivityNet 2017.
CoRR, 2017

Single Shot Temporal Action Detection.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Face spoofing detection by fusing binocular depth and spatial pyramid coding micro-texture features.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Temporal action localization with two-stream segment-based RNN.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Cost efficient subcategory-aware CNN for object detection.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

An Online Approach for Gesture Recognition Toward Real-World Applications.
Proceedings of the Image and Graphics - 9th International Conference, 2017

2016
Person Re-identification by encoding free energy feature maps.
Multim. Tools Appl., 2016

Reduce false positives for object detection by a priori probability in videos.
Neurocomputing, 2016

Parallelized deformable part models with effective hypothesis pruning.
Comput. Vis. Media, 2016

Multiple-Branches Faster RCNN for Human Parts Detection and Pose Estimation.
Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

2015
Multiple-Shot Person Re-identification by Features Learned from Third-party Image Sets.
KSII Trans. Internet Inf. Syst., 2015

Detect coherent motions in crowd scenes based on tracklets association.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

A discriminative tracklets representation for crowd analysis.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Adaptive appearance learning for human pose estimation.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Underwater camera model and its use in calibration.
Proceedings of the IEEE International Conference on Information and Automation, 2015

Reduce false positives for human detection by a priori probability in videos.
Proceedings of the 3rd IAPR Asian Conference on Pattern Recognition, 2015

2014
Support-plane estimation for floor detection to understand regions' spatial organization.
Proceedings of the 2014 IEEE International Conference on Robotics and Biomimetics, 2014

Statistical background subtraction based on imbalanced learning.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Recognition by detection: Perceiving human motion through part-configured feature maps.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Person re-identification by free energy score space encoding.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013
Multiple Subcategories Parts-Based Representation for One Sample Face Identification.
IEEE Trans. Inf. Forensics Secur., 2013

Digitize Your Body and Action in 3-D at Over 10 FPS: Real Time Dense Voxel Reconstruction and Marker-less Motion Tracking via GPU Acceleration.
CoRR, 2013

Exploring discriminative pose sub-patterns for effective action classification.
Proceedings of the ACM Multimedia Conference, 2013

Motion pattern analysis in crowded scenes based on hybrid generative-discriminative feature maps.
Proceedings of the IEEE International Conference on Image Processing, 2013

Prototype based feature learning for face image set classification.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

2012
Unsupervised Motion Pattern Mining for Crowded Scenes Analysis.
KSII Trans. Internet Inf. Syst., 2012

Hybrid generative-discriminative recognition of human action in 3D joint space.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Parallelized Annealed Particle Filter for real-time marker-less motion tracking via heterogeneous computing.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

2011
Text From Corners: A Novel Approach to Detect Text and Caption in Videos.
IEEE Trans. Image Process., 2011

Human Motion Tracking by Temporal-Spatial Local Gaussian Process Experts.
IEEE Trans. Image Process., 2011

A Method for Detection and Classification of Glass Defects in Low Resolution Images.
Proceedings of the Sixth International Conference on Image and Graphics, 2011

Detecting Motion Patterns in Dynamic Crowd Scenes.
Proceedings of the Sixth International Conference on Image and Graphics, 2011

2010
Human Pose Regression Through Multiview Visual Fusion.
IEEE Trans. Circuits Syst. Video Technol., 2010

Weak Metric Learning for Feature Fusion towards Perception-Inspired Object Recognition.
Proceedings of the Advances in Multimedia Modeling, 2010

Multimodality gender estimation using Bayesian hierarchical model.
Proceedings of the IEEE International Conference on Acoustics, 2010

Bimodal gender recognition from face and fingerprint.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Sparse Coding on Local Spatial-Temporal Volumes for Human Action Recognition.
Proceedings of the Computer Vision - ACCV 2010, 2010

2009
Temporal-Spatial Local Gaussian Process Experts for Human Pose Estimation.
Proceedings of the Computer Vision, 2009

2008
Generative tracking of 3D human motion by hierarchical annealed genetic algorithm.
Pattern Recognit., 2008

Discriminative estimation of 3D human pose using Gaussian processes.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

2007
Tracking 3D Human Motion in Compact Base Space.
Proceedings of the 8th IEEE Workshop on Applications of Computer Vision (WACV 2007), 2007

Capturing 3D Human Motion from Monocular Images Using Orthogonal Locality Preserving Projection.
Proceedings of the Digital Human Modeling, 2007

Generative Estimation of 3D Human Pose Using Shape Contexts Matching.
Proceedings of the Computer Vision, 2007

