Kai Chen

Orcid: 0000-0001-8436-6533

Affiliations:
  • Huazhong University of Science and Technology, School of Automation, National Key Laboratory of Science and Technology on Multi-spectral Information Processing, Wuhan, China


According to our database1, Kai Chen authored at least 33 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control.
CoRR, 2024

Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models.
CoRR, 2024

PatchScaler: An Efficient Patch-Independent Diffusion Model for Super-Resolution.
CoRR, 2024

MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes.
CoRR, 2024

Mixture of insighTful Experts (MoTE): The Synergy of Thought Chains and Expert Mixtures in Self-Alignment.
CoRR, 2024

Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases.
CoRR, 2024

GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MagicDrive: Street View Generation with Diverse 3D Geometry Control.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Implicit Concept Removal of Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Eyes Closed, Safety on: Protecting Multimodal LLMs via Image-to-Text Transformation.
Proceedings of the Computer Vision - ECCV 2024, 2024

DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning.
CoRR, 2023

TrackDiffusion: Multi-object Tracking Data Generation via Diffusion Models.
CoRR, 2023

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis.
CoRR, 2023

Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models.
CoRR, 2023

Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt.
CoRR, 2023

Unfolding Once is Enough: A Deployment-Friendly Transformer Unit for Super-Resolution.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Task-customized Masked Autoencoder via Mixture of Cluster-conditional Experts.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Mixed Autoencoder for Self-Supervised Visual Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving.
Proceedings of the Computer Vision - ECCV 2022, 2022


GCFSR: a Generative and Controllable Face Super Resolution Method Without Facial and GAN Priors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Task-Customized Self-Supervised Pre-training with Scalable Dynamic Routing.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
SODA10M: Towards Large-Scale Object Detection Benchmark for Autonomous Driving.
CoRR, 2021

SODA10M: A Large-Scale 2D Self/Semi-Supervised Object Detection Dataset for Autonomous Driving.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

MultiSiam: Self-supervised Multi-instance Siamese Representation Learning for Autonomous Driving.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2019
Learning Linear Regression via Single-Convolutional Layer for Visual Object Tracking.
IEEE Trans. Multim., 2019

2018
Convolutional Regression for Visual Tracking.
IEEE Trans. Image Process., 2018

Once for All: A Two-Flow Convolutional Neural Network for Visual Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2018

2017
Visual object tracking via enhanced structural correlation filter.
Inf. Sci., 2017

The Visual Object Tracking VOT2017 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

2015
2D facial landmark model design by combining key points and inserted points.
Expert Syst. Appl., 2015


  Loading...