Zilong Dong

Orcid: 0000-0002-6833-9102

According to our database1, Zilong Dong authored at least 44 papers between 2006 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
A chest imaging diagnosis report generation method based on dual-channel transmodal memory network.
Biomed. Signal Process. Control., 2025

2024
Open-Vocabulary Category-Level Object Pose and Size Estimation.
IEEE Robotics Autom. Lett., September, 2024

Guiding image inpainting via structure and texture features with dual encoder.
Vis. Comput., June, 2024

Learning Spherical Radiance Field for Efficient 360° Unbounded Novel View Synthesis.
IEEE Trans. Image Process., 2024

LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning.
CoRR, 2024

HIVE: HIerarchical Volume Encoding for Neural Implicit Surface Reconstruction.
CoRR, 2024

StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal.
CoRR, 2024

Gaussian-Informed Continuum for Physical Property Identification and Simulation.
CoRR, 2024

Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition.
CoRR, 2024

An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes Using Pre-Trained Text-to-Image Models.
CoRR, 2024

OV9D: Open-Vocabulary Category-Level 9D Object Pose and Size Estimation.
CoRR, 2024

VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model.
CoRR, 2024

Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation.
CoRR, 2024

Multifocal region-assisted cross-modality learning for chest X-ray report generation.
Comput. Biol. Medicine, 2024

High-Fidelity 3D Textured Shapes Generation by Sparse Encoding and Adversarial Decoding.
Proceedings of the Computer Vision - ECCV 2024, 2024

An Optimization Framework to Enforce Multi-view Consistency for Texturing 3D Meshes.
Proceedings of the Computer Vision - ECCV 2024, 2024

IPoD: Implicit Field Learning with Point Diffusion for Generalizable 3D Object Reconstruction from Single RGB-D Images.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GPLD3D: Latent Diffusion of 3D Shape Generative Models by Enforcing Geometric and Physical Priors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PanoContext-Former: Panoramic Total Scene Understanding with a Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
DRO: Deep Recurrent Optimizer for Video to Depth.
IEEE Robotics Autom. Lett., May, 2023

$\mathcal {S}^{2}$Net: Accurate Panorama Depth Estimation on Spherical Surface.
IEEE Robotics Autom. Lett., 2023

Fine-grained Text-Video Retrieval with Frozen Image Encoders.
CoRR, 2023

S<sup>2</sup>Net: Accurate Panorama Depth Estimation on Spherical Surface.
CoRR, 2023

Mirror-NeRF: Learning Neural Radiance Fields for Mirrors with Whitted-Style Ray Tracing.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Monocular Scene Reconstruction with 3D SDF Transformers.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Dense RGB Slam with Neural Implicit Maps.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
RenderNet: Visual Relocalization Using Virtual Viewpoints in Large-Scale Indoor Environments.
CoRR, 2022

2021
UniFuse: Unidirectional Fusion for 360° Panorama Depth Estimation.
IEEE Robotics Autom. Lett., 2021

AR Mapping: Accurate and Efficient Mapping for Augmented Reality.
CoRR, 2021

UniFuse: Unidirectional Fusion for 360<sup>°</sup> Panorama Depth Estimation.
CoRR, 2021

Single-Shot is Enough: Panoramic Infrastructure Based Calibration of Multiple Cameras and 3D LiDARs.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

2016
Efficient Non-Consecutive Feature Tracking for Robust Structure-From-Motion.
IEEE Trans. Image Process., 2016

2015
ENFT: Efficient Non-Consecutive Feature Tracking for Robust Structure-from-Motion.
CoRR, 2015

2014
Efficient keyframe-based real-time camera tracking.
Comput. Vis. Image Underst., 2014

2013
Robust monocular SLAM in dynamic environments.
Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2013

2012
Depth-Varying Human Video Sprite Synthesis.
Trans. Edutainment, 2012

2011
Interactive weathering of depth-inferred videos.
Proceedings of the 10th International Conference on Virtual Reality Continuum and its Applications in Industry, 2011

2010
Adaptive voxels: interactive rendering of massive 3D models.
Vis. Comput., 2010

Efficient Non-consecutive Feature Tracking for Structure-from-Motion.
Proceedings of the Computer Vision - ECCV 2010, 2010

2009
Refilming with Depth-Inferred Videos.
IEEE Trans. Vis. Comput. Graph., 2009

Live Video Montage with a Rotating Camera.
Comput. Graph. Forum, 2009

Keyframe-based real-time camera tracking.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

2006
Synthesizing trees by plantons.
Vis. Comput., 2006


  Loading...