Zilong Dong

ORCID: 0000-0002-6833-9102

According to our database, Zilong Dong authored at least 48 papers between 2006 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2025

MWVOS: Mask-Free Weakly Supervised Video Object Segmentation via promptable foundation model.

Pattern Recognit., 2025

Text-video retrieval re-ranking via multi-grained cross attention and frozen image encoders.

Pattern Recognit., 2025

A chest imaging diagnosis report generation method based on dual-channel transmodal memory network.

Biomed. Signal Process. Control., 2025

2024

StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal.

ACM Trans. Graph., December, 2024

MVImgNet2.0: A Larger-scale Dataset of Multi-view Images.

ACM Trans. Graph., December, 2024

Open-Vocabulary Category-Level Object Pose and Size Estimation.

IEEE Robotics Autom. Lett., September, 2024

Guiding image inpainting via structure and texture features with dual encoder.

Vis. Comput., June, 2024

Learning Spherical Radiance Field for Efficient 360° Unbounded Novel View Synthesis.

IEEE Trans. Image Process., 2024

LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning.

CoRR, 2024

HIVE: HIerarchical Volume Encoding for Neural Implicit Surface Reconstruction.

CoRR, 2024

Gaussian-Informed Continuum for Physical Property Identification and Simulation.

CoRR, 2024

An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes Using Pre-Trained Text-to-Image Models.

CoRR, 2024

OV9D: Open-Vocabulary Category-Level 9D Object Pose and Size Estimation.

CoRR, 2024

VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model.

CoRR, 2024

Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation.

CoRR, 2024

Multifocal region-assisted cross-modality learning for chest X-ray report generation.

Comput. Biol. Medicine, 2024

High-Fidelity 3D Textured Shapes Generation by Sparse Encoding and Adversarial Decoding.

Proceedings of the Computer Vision - ECCV 2024, 2024

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance.

Proceedings of the Computer Vision - ECCV 2024, 2024

An Optimization Framework to Enforce Multi-view Consistency for Texturing 3D Meshes.

Proceedings of the Computer Vision - ECCV 2024, 2024

Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition.

Proceedings of the Computer Vision - ECCV 2024, 2024

IPoD: Implicit Field Learning with Point Diffusion for Generalizable 3D Object Reconstruction from Single RGB-D Images.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GPLD3D: Latent Diffusion of 3D Shape Generative Models by Enforcing Geometric and Physical Priors.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PanoContext-Former: Panoramic Total Scene Understanding with a Transformer.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

DRO: Deep Recurrent Optimizer for Video to Depth.

IEEE Robotics Autom. Lett., May, 2023

$\mathcal{S}^{2}$Net: Accurate Panorama Depth Estimation on Spherical Surface.

IEEE Robotics Autom. Lett., 2023

Fine-grained Text-Video Retrieval with Frozen Image Encoders.

CoRR, 2023

S²Net: Accurate Panorama Depth Estimation on Spherical Surface.

CoRR, 2023

Mirror-NeRF: Learning Neural Radiance Fields for Mirrors with Whitted-Style Ray Tracing.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Monocular Scene Reconstruction with 3D SDF Transformers.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Dense RGB SLAM with Neural Implicit Maps.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

RenderNet: Visual Relocalization Using Virtual Viewpoints in Large-Scale Indoor Environments.

CoRR, 2022

2021

UniFuse: Unidirectional Fusion for 360° Panorama Depth Estimation.

IEEE Robotics Autom. Lett., 2021

AR Mapping: Accurate and Efficient Mapping for Augmented Reality.

CoRR, 2021

UniFuse: Unidirectional Fusion for 360° Panorama Depth Estimation.

CoRR, 2021

Single-Shot is Enough: Panoramic Infrastructure Based Calibration of Multiple Cameras and 3D LiDARs.

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

2016

Efficient Non-Consecutive Feature Tracking for Robust Structure-From-Motion.

IEEE Trans. Image Process., 2016

2015

ENFT: Efficient Non-Consecutive Feature Tracking for Robust Structure-from-Motion.

CoRR, 2015

2014

Efficient keyframe-based real-time camera tracking.

Comput. Vis. Image Underst., 2014

2013

Robust monocular SLAM in dynamic environments.

Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2013

2012

Depth-Varying Human Video Sprite Synthesis.

Trans. Edutainment, 2012

2011

Interactive weathering of depth-inferred videos.

Proceedings of the 10th International Conference on Virtual Reality Continuum and its Applications in Industry, 2011

2010

Adaptive voxels: interactive rendering of massive 3D models.

Vis. Comput., 2010

Efficient Non-consecutive Feature Tracking for Structure-from-Motion.

Proceedings of the Computer Vision - ECCV 2010, 2010

2009

Refilming with Depth-Inferred Videos.

IEEE Trans. Vis. Comput. Graph., 2009

Live Video Montage with a Rotating Camera.

Comput. Graph. Forum, 2009

Keyframe-based real-time camera tracking.

Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

2006

Synthesizing trees by plantons.

Vis. Comput., 2006
