Hang Zhou

ORCID: 0000-0002-2616-923X

Affiliations:
  • Baidu Inc., Department of Computer Vision Technology (VIS), China
  • Chinese University of Hong Kong, Department of Electronics, Hong Kong (PhD 2021)


According to our database, Hang Zhou authored at least 45 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
ReliTalk: Relightable Talking Portrait Generation from a Single Video.
Int. J. Comput. Vis., August, 2024

Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation.
CoRR, 2024

AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation.
IEEE Access, 2024

TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model.
Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024

DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

ReSyncer: Rewiring Style-Based Generator for Unified Audio-Visually Synced Facial Performer.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Building an Invisible Shield for Your Portrait against Deepfakes.
CoRR, 2023

Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation.
CoRR, 2023

SeCo: Separating Unknown Musical Visual Sounds with Consistency Guidance.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Exploiting Visual Context Semantics for Sound Source Localization.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

ReEnFP: Detail-Preserving Face Reconstruction by Encoding Facial Priors.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Efficient Video Portrait Reenactment via Grid-based Codebook.
Proceedings of the ACM SIGGRAPH 2023 Conference Proceedings, 2023

Dual-Modality Co-Learning for Unveiling Deepfake in Spatio-Temporal Space.
Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023

Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-Based Generator.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Robust Video Portrait Reenactment via Personalized Representation Quantization.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition.
CoRR, 2022

StyleFaceV: Face Video Generation via Decomposing and Recomposing Pretrained StyleGAN3.
CoRR, 2022

Detecting Deepfake by Creating Spatio-Temporal Regularity Disruption.
CoRR, 2022

SeCo: Separating Unknown Musical Visual Sounds with Consistency Guidance.
CoRR, 2022

Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers.
Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022

EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model.
Proceedings of the SIGGRAPH '22: Special Interest Group on Computer Graphics and Interactive Techniques Conference, Vancouver, BC, Canada, August 7, 2022

Audio-Driven Co-Speech Gesture Video Generation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Delving into Sequential Patches for Deepfake Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

StyleSwap: Style-Based Generator Empowers Robust Face Swapping.
Proceedings of the Computer Vision - ECCV 2022, 2022

Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation.
Proceedings of the Computer Vision - ECCV 2022, 2022

TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

STC: Spatio-Temporal Contrastive Learning for Video Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Few-Shot Head Swapping in the Wild.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Expressive Talking Head Generation with Granular Audio-Visual Control.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SepFusion: Finding Optimal Fusion Structures for Visual Sound Separation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Visual Sound Localization in the Wild by Cross-Modal Interference Erasing.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Speech2Talking-Face: Inferring and Driving a Face with Synchronized Audio-Visual Representation.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Visually Informed Binaural Audio Generation without Binaural Audios.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Audio-Driven Emotional Video Portraits.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

PTeacher: a Computer-Aided Personalized Pronunciation Training System with Exaggerated Audio-Visual Corrective Feedback.
Proceedings of the CHI '21: CHI Conference on Human Factors in Computing Systems, 2021

2020
Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Discriminability Distillation in Group Representation Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

Rotate-and-Render: Unsupervised Photorealistic Face Rotation From Single-View Images.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Vision-Infused Deep Audio Inpainting.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

A Graph-Based Framework to Bridge Movies and Synopses.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Talking Face Generation by Adversarially Disentangled Audio-Visual Representation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

