2025

Can Test-Time Scaling Improve World Foundation Model?

[DOI]

Wenyan Cong

Hanqing Zhu

CoRR, March, 2025

Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields.

[DOI]

CoRR, March, 2025

Copy or Not? Reference-Based Face Image Restoration with Fine Details.

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

4K4DGen: Panoramic 4D Generation at 4K Resolution.

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Atlas Gaussians Diffusion for 3D Generation.

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention.

[DOI]

CoRR, 2024

Atlas Gaussians Diffusion for 3D Generation with Infinite Number of Points.

[DOI]

CoRR, 2024

4K4DGen: Panoramic 4D Generation at 4K Resolution.

[DOI]

CoRR, 2024

CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation.

[DOI]

CoRR, 2024

LLMGeo: Benchmarking Large Language Models on Image Geolocation In-the-wild.

[DOI]

Zhiqiang Wang

Dejia Xu

Rana Muhammad Shahroz Khan

Yanbin Lin

Zhiwen Fan

Xingquan Zhu

CoRR, 2024

Comp4D: LLM-Guided Compositional 4D Scene Generation.

[DOI]

Konstantinos N. Plataniotis

Zhangyang Wang

CoRR, 2024

AGG: Amortized Generative 3D Gaussians for Single Image to 3D.

[DOI]

CoRR, 2024

VASE: Object-Centric Appearance and Shape Manipulation of Real Videos.

[DOI]

CoRR, 2024

SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity.

[DOI]

CoRR, 2024

Diffusion4D: Fast Spatial-temporal Consistent 4D generation via Video Diffusion Models.

[DOI]

Konstantinos N. Plataniotis

Yao Zhao

Yunchao Wei

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting.

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Taming Mode Collapse in Score Distillation for Text-to-3D Generation.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

POPE: 6-DoF Promptable Pose Estimation of Any Object, in Any Scene, with One Reference.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

OpenBias: Open-Set Bias Detection in Text-to-Image Generative Models.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

NeRF as Pretraining at Scale: Generalizable 3D-Aware Semantic Representation Learning from View Prediction.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Background Scene Recovery From an Image Looking Through Colored Glass.

[DOI]

IEEE Trans. Multim., 2023

4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency.

[DOI]

CoRR, 2023

Multi-Concept T2I-Zero: Tweaking Only The Text Embeddings and Nothing Else.

[DOI]

CoRR, 2023

Drag View: Generalizable Novel View Synthesis with Unposed Imagery.

[DOI]

CoRR, 2023

Reference-based Painterly Inpainting via Diffusion: Crossing the Wild Reference Domain Gap.

[DOI]

CoRR, 2023

POPE: 6-DoF Promptable Pose Estimation of Any Object, in Any Scene, with One Reference.

[DOI]

CoRR, 2023

PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models.

[DOI]

CoRR, 2023

CLE Diffusion: Controllable Light Enhancement Diffusion Model.

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation.

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

NeRF-SOS: Any-View Self-supervised Object Segmentation on Complex Scenes.

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

INR-Arch: A Dataflow Architecture and Compiler for Arbitrary-Order Gradient Computations in Implicit Neural Representation Processing.

[DOI]

Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

NeuralLift-360: Lifting an in-the-Wild 2D Photo to A 3D Object with 360° Views.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

AligNeRF: High-Fidelity Neural Radiance Fields via Alignment-Aware Training.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Signal Processing for Implicit Neural Representations.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ReCoRo: Region-Controllable Robust Light Enhancement with User-Specified Imprecise Masks.

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Cloud2Sketch: Augmenting Clouds with Imaginary Sketches.

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image.

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Unified Implicit Neural Stylization.

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

Benchmarking Low-Light Image Enhancement and Beyond.

[DOI]

Int. J. Comput. Vis., 2021

Is Label Smoothing Truly Incompatible with Knowledge Distillation: An Empirical Study.

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

AIM 2020 Challenge on Real Image Super-Resolution: Methods and Results.

[DOI]

Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

AIM 2020 Challenge on Image Extreme Inpainting.

[DOI]

Pranjal Singh Chauhan

Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

NTIRE 2020 Challenge on Image Demoireing: Methods and Results.

[DOI]

S. Mohamed Mansoor Roomi

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Moiré Pattern Removal via Attentive Fractal Network.

[DOI]

Dejia Xu

Yihao Chu

Qingyan Sun

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

NTIRE 2020 Challenge on Image and Video Deblurring.

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Towards Scale-Free Rain Streak Removal via Self-Supervised Fractal Band Learning.

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020