2025

RepVideo: Rethinking Cross-Layer Representation for Video Generation.

[DOI]

,

,

,

,

,

CoRR, January, 2025

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, January, 2025

2024

Talk-to-Edit: Fine-Grained 2D and 3D Facial Editing via Dialog.

[DOI]

,

,

,

,

Chen Change Loy

,

IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models.

[DOI]

,

,

,

,

CoRR, 2024

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models.

[DOI]

,

,

,

,

,

,

,

Nattapol Chanpaisit

,

,

,

,

,

,

,

,

,

CoRR, 2024

WiCV@CVPR2024: The Thirteenth Women In Computer Vision Workshop at the Annual CVPR Conference.

[DOI]

,

,

,

Estefania Talavera

,

Deblina Bhattacharjee

,

,

Vanessa Staderini

,

,

CoRR, 2024

ReVersion: Diffusion-Based Relation Inversion from Images.

[DOI]

,

,

,

Kelvin C. K. Chan

,

Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024

FreeInit: Bridging Initialization Gap in Video Diffusion Models.

[DOI]

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

FreeU: Free Lunch in Diffusion U-Net.

[DOI]

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VBench: Comprehensive Benchmark Suite for Video Generative Models.

[DOI]

,

,

,

,

,

,

,

,

,

Nattapol Chanpaisit

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Hybrid learning-based digital twin for manufacturing process: Modeling framework and implementation.

[DOI]

,

,

,

,

,

Christian Brecher

Robotics Comput. Integr. Manuf., August, 2023

Sum Rate Analysis for Massive MIMO-NOMA Uplink System With Group-Level Successive Interference Cancellation.

[DOI]

,

,

,

IEEE Wirel. Commun. Lett., July, 2023

ArtiLock: Smartphone User Identification Based on Physiological and Behavioral Features of Monosyllable Articulation.

[DOI]

,

,

,

Sensors, February, 2023

LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Chen Change Loy

,

,

,

,

CoRR, 2023

WiCV@CVPR2023: The Eleventh Women In Computer Vision Workshop at the Annual CVPR Conference.

[DOI]

Doris Antensteiner

,

,

,

,

,

,

Sunnie S. Y. Kim

,

,

CoRR, 2023

Understanding Privacy Over-collection in WeChat Sub-app Ecosystem.

[DOI]

,

,

,

,

,

CoRR, 2023

Understanding the (In)Security of Cross-side Face Verification Systems in Mobile Apps: A System Perspective.

[DOI]

,

,

,

,

,

,

Proceedings of the 44th IEEE Symposium on Security and Privacy, 2023

Literal-Aware Knowledge Graph Embedding for Welding Quality Monitoring: A Bosch Case.

[DOI]

,

,

,

Ognjen Savkovic

,

,

Irlán Grangel-González

,

,

Evgeny Kharlamov

Proceedings of the Semantic Web - ISWC 2023, 2023

Load Change Assessment-Based Feedforward Compensation for FCS-MPCC Used in PMSMs Considering Load Disturbances.

[DOI]

,

,

,

,

,

Proceedings of the 49th Annual Conference of the IEEE Industrial Electronics Society, 2023

Collaborative Diffusion for Multi-Modal Face Generation and Editing.

[DOI]

,

Kelvin C. K. Chan

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Leveraging audible and inaudible signals for pronunciation training by sensing articulation through a smartphone.

[DOI]

,

,

Speech Commun., 2022

DS3-Net: Difficulty-perceived Common-to-T1ce Semi-Supervised Multimodal MRI Synthesis Network.

[DOI]

,

,

,

,

CoRR, 2022

Multi-modal Brain Tumor Segmentation via Missing Modality Synthesis and Modality-level Attention Fusion.

[DOI]

,

,

,

,

CoRR, 2022

Leveraging speech and ultrasonic signals toward articulation-based smartphone user authentication.

[DOI]

,

,

Proceedings of the MobiSys '22: The 20th Annual International Conference on Mobile Systems, Applications and Services, Portland, Oregon, 27 June 2022, 2022

DS<sup>3</sup>-Net: Difficulty-Perceived Common-to-T1ce Semi-supervised Multimodal MRI Synthesis Network.

[DOI]

,

,

,

,

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Unsupervised Domain Adaptation for Cross-Modality Retinal Vessel Segmentation via Disentangling Representation Style Transfer and Collaborative Consistency Learning.

[DOI]

,

,

,

,

Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022

Efficient and Accurate Text Detection Combining Differentiable Binarization with Semantic Segmentation.

[DOI]

,

,

,

,

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2022, 2022

Transformer-Based T2-weighted MRI Synthesis from T1-weighted Images.

[DOI]

,

,

,

,

Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022

AI-Driven Digital Process Twin via Networked Digital Process Chain.

[DOI]

,

,

,

Christian Brecher

Proceedings of the IEEE Intl. Conf. on Dependable, 2022

2021

A Survey on AI-Driven Digital Twins in Industry 4.0: Smart Manufacturing and Advanced Robotics.

[DOI]

,

,

,

,

Christian Brecher

Sensors, 2021

Pronunciation Training through Articulation Motion Sensing.

[DOI]

,

,

,

Proceedings of the UIST '21: The Adjunct Publication of the 34th Annual ACM Symposium on User Interface Software and Technology, 2021

Muscle-Mind: towards the Strength Training Monitoring via the Neuro-Muscular Connection Sensing.

[DOI]

,

,

,

,

,

Proceedings of the SenSys '21: The 19th ACM Conference on Embedded Networked Sensor Systems, Coimbra, Portugal, November 15, 2021

A Diagnostic Study Of Visual Question Answering With Analogical Reasoning.

[DOI]

,

,

,

,

,

Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Talk-to-Edit: Fine-Grained Facial Editing via Dialog.

[DOI]

,

,

,

Chen Change Loy

,

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Numerical Simulation Study of Information Digitization Establishment and Convergence in Triangle System for Epidemic Prevention.

[DOI]

,

Proceedings of the AIAM 2021: 3rd International Conference on Artificial Intelligence and Advanced Manufacture, Manchester, United Kingdom, October 23, 2021

2019

BB-KBQA: BERT-Based Knowledge Base Question Answering.

[DOI]

,

,

,

,

Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

2018

Content Attention Model for Aspect Based Sentiment Analysis.

[DOI]

,

,

,

,

Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

2017

Don't Just Swipe Left, Tell Me Why: Enhancing Gesture-based Feedback with Reason Bins.

[DOI]

Juan Felipe Beltran

,

,

,

Proceedings of the 22nd International Conference on Intelligent User Interfaces, 2017

2015

Interactive Tweaking of Text Analytics Dashboards.

[DOI]

,

,

,

,

,

Srinivasan Parthasarathy

,

Ramiya Venkatachalam

Proceedings of the Databases in Networked Information Systems, 2015

Beyond Binary Gestures: Exploring Documents using Touch Interfaces.

[DOI]

,

Juan Felipe Beltran

,

,

Proceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems, 2015