2025
RepVideo: Rethinking Cross-Layer Representation for Video Generation.
CoRR, January, 2025
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, January, 2025
2024
Talk-to-Edit: Fine-Grained 2D and 3D Facial Editing via Dialog.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models.
CoRR, 2024
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
WiCV@CVPR2024: The Thirteenth Women In Computer Vision Workshop at the Annual CVPR Conference.
CoRR, 2024
ReVersion: Diffusion-Based Relation Inversion from Images.
Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024
FreeInit: Bridging Initialization Gap in Video Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024
FreeU: Free Lunch in Diffusion U-Net.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
VBench: Comprehensive Benchmark Suite for Video Generative Models.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
Hybrid learning-based digital twin for manufacturing process: Modeling framework and implementation.
Robotics Comput. Integr. Manuf., August, 2023
Sum Rate Analysis for Massive MIMO-NOMA Uplink System With Group-Level Successive Interference Cancellation.
IEEE Wirel. Commun. Lett., July, 2023
ArtiLock: Smartphone User Identification Based on Physiological and Behavioral Features of Monosyllable Articulation.
Sensors, February, 2023
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
WiCV@CVPR2023: The Eleventh Women In Computer Vision Workshop at the Annual CVPR Conference.
CoRR, 2023
Understanding Privacy Over-collection in WeChat Sub-app Ecosystem.
CoRR, 2023
Understanding the (In)Security of Cross-side Face Verification Systems in Mobile Apps: A System Perspective.
Proceedings of the 44th IEEE Symposium on Security and Privacy, 2023
Literal-Aware Knowledge Graph Embedding for Welding Quality Monitoring: A Bosch Case.
Proceedings of the Semantic Web - ISWC 2023, 2023
Load Change Assessment-Based Feedforward Compensation for FCS-MPCC Used in PMSMs Considering Load Disturbances.
Proceedings of the 49th Annual Conference of the IEEE Industrial Electronics Society, 2023
Collaborative Diffusion for Multi-Modal Face Generation and Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Leveraging audible and inaudible signals for pronunciation training by sensing articulation through a smartphone.
Speech Commun., 2022
DS3-Net: Difficulty-perceived Common-to-T1ce Semi-Supervised Multimodal MRI Synthesis Network.
CoRR, 2022
Multi-modal Brain Tumor Segmentation via Missing Modality Synthesis and Modality-level Attention Fusion.
CoRR, 2022
Leveraging speech and ultrasonic signals toward articulation-based smartphone user authentication.
Proceedings of the MobiSys '22: The 20th Annual International Conference on Mobile Systems, Applications and Services, Portland, Oregon, 27 June 2022, 2022
DS<sup>3</sup>-Net: Difficulty-Perceived Common-to-T1ce Semi-supervised Multimodal MRI Synthesis Network.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022
Unsupervised Domain Adaptation for Cross-Modality Retinal Vessel Segmentation via Disentangling Representation Style Transfer and Collaborative Consistency Learning.
Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022
Efficient and Accurate Text Detection Combining Differentiable Binarization with Semantic Segmentation.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2022, 2022
Transformer-Based T2-weighted MRI Synthesis from T1-weighted Images.
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
AI-Driven Digital Process Twin via Networked Digital Process Chain.
Proceedings of the IEEE Intl. Conf. on Dependable, 2022
2021
A Survey on AI-Driven Digital Twins in Industry 4.0: Smart Manufacturing and Advanced Robotics.
Sensors, 2021
Pronunciation Training through Articulation Motion Sensing.
Proceedings of the UIST '21: The Adjunct Publication of the 34th Annual ACM Symposium on User Interface Software and Technology, 2021
Muscle-Mind: towards the Strength Training Monitoring via the Neuro-Muscular Connection Sensing.
Proceedings of the SenSys '21: The 19th ACM Conference on Embedded Networked Sensor Systems, Coimbra, Portugal, November 15, 2021
A Diagnostic Study Of Visual Question Answering With Analogical Reasoning.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021
Talk-to-Edit: Fine-Grained Facial Editing via Dialog.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Numerical Simulation Study of Information Digitization Establishment and Convergence in Triangle System for Epidemic Prevention.
Proceedings of the AIAM 2021: 3rd International Conference on Artificial Intelligence and Advanced Manufacture, Manchester, United Kingdom, October 23, 2021
2019
BB-KBQA: BERT-Based Knowledge Base Question Answering.
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019
2018
Content Attention Model for Aspect Based Sentiment Analysis.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018
2017
Don't Just Swipe Left, Tell Me Why: Enhancing Gesture-based Feedback with Reason Bins.
Proceedings of the 22nd International Conference on Intelligent User Interfaces, 2017
2015
Interactive Tweaking of Text Analytics Dashboards.
Proceedings of the Databases in Networked Information Systems, 2015
Beyond Binary Gestures: Exploring Documents using Touch Interfaces.
Proceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems, 2015