Xiaojuan Qi

Orcid: 0000-0002-4285-1626

Affiliations:
  • University of Hong Kong, Department of Electrical and Electronic Engineering, Pokfulam, Hong Kong
  • Chinese University of Hong Kong, Sha Tin, Hong Kong (PhD 2018)


According to our database, Xiaojuan Qi authored at least 155 papers between 2015 and 2024.

Bibliography

2024
TEXGen: a Generative Diffusion Model for Mesh Textures.
ACM Trans. Graph., December, 2024

3DGSR: Implicit Surface Reconstruction with 3D Gaussian Splatting.
ACM Trans. Graph., December, 2024

Object-Centric Representation Learning for Video Scene Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

MEN: Mutual Enhancement Networks for Sign Language Recognition and Education.
IEEE Trans. Neural Networks Learn. Syst., January, 2024

You Only Need One Thing One Click: Self-Training for Weakly Supervised 3D Scene Understanding.
World Sci. Annu. Rev. Artif. Intell., 2024

Granularity Matters in Long-Tail Learning.
CoRR, 2024

MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More.
CoRR, 2024

Topology Optimization of Random Memristors for Input-Aware Dynamic SNN.
CoRR, 2024

SNNGX: Securing Spiking Neural Networks with Genetic XOR Encryption on RRAM-based Neuromorphic Accelerator.
CoRR, 2024

Dynamic neural network with memristive CIM and CAM for 2D and 3D vision.
CoRR, 2024

SVG: 3D Stereoscopic Video Generation via Denoising Frame Matrix.
CoRR, 2024

What Matters in Detecting AI-Generated Videos like Sora?
CoRR, 2024

Splatter a Video: Video Gaussian Representation for Versatile Processing.
CoRR, 2024

Generalization Beyond Data Imbalance: A Controlled Study on CLIP for Transferable Insights.
CoRR, 2024

SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models.
CoRR, 2024

How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study.
CoRR, 2024

Efficient and accurate neural field reconstruction using resistive memory.
CoRR, 2024

Resistive Memory-based Neural Differential Equation Solver for Score-based Diffusion Model.
CoRR, 2024

DO3D: Self-supervised Learning of Decomposed Object-aware 3D Motion and Depth from Monocular Videos.
CoRR, 2024

Spec-Gaussian: Anisotropic View-Dependent Appearance for 3D Gaussian Splatting.
CoRR, 2024

Debiasing Text-to-Image Diffusion Models.
CoRR, 2024

GO-NeRF: Generating Virtual Objects in Neural Radiance Fields.
CoRR, 2024

BiLLM: Pushing the Limit of Post-Training Quantization for LLMs.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Text-to-3D with Classifier Score Distillation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

V-IRL: Grounding Virtual Intelligence in Real Life.
Proceedings of the Computer Vision - ECCV 2024, 2024

3D-Aware Text-Driven Talking Avatar Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

EA-VTR: Event-Aware Video-Text Retrieval.
Proceedings of the Computer Vision - ECCV 2024, 2024

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Can OOD Object Detectors Learn from Foundation Models?
Proceedings of the Computer Vision - ECCV 2024, 2024

UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SaCo Loss: Sample-Wise Affinity Consistency for Vision-Language Pre-Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

EscherNet: A Generative Model for Scalable View Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Classes Are Not Equal: An Empirical Study on Image Recognition Fairness.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
EXIM: A Hybrid Explicit-Implicit Representation for Text-Guided 3D Shape Generation.
ACM Trans. Graph., December, 2023

Vertical Layering of Quantized Neural Networks for Heterogeneous Inference.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

DreamStone: Image as a Stepping Stone for Text-Guided 3D Shape Generation.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

In-Situ Model Downloading to Realize Versatile Edge AI in 6G Mobile Networks.
IEEE Wirel. Commun., June, 2023

ST3D++: Denoised Self-Training for Unsupervised Domain Adaptation on 3D Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Fully Convolutional Networks for Panoptic Segmentation With Point-Based Supervision.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

The Liver Tumor Segmentation Benchmark (LiTS).
Medical Image Anal., 2023

Random resistive memory-based deep extreme point learning machine for unified visual processing.
CoRR, 2023

Pruning random resistive memory for optimizing analogue AI.
CoRR, 2023

SpikeMOT: Event-based Multi-Object Tracking with Sparse Motion Features.
CoRR, 2023

Resistive memory-based zero-shot liquid state machine for multimodal event data learning.
CoRR, 2023

Decoupled Kullback-Leibler Divergence Loss.
CoRR, 2023

RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding.
CoRR, 2023

One Thing One Click++: Self-Training for Weakly Supervised 3D Scene Understanding.
CoRR, 2023

Prior-free Category-level Pose Estimation with Implicit Space Transformation.
CoRR, 2023

CL-NeRF: Continual Learning of Neural Radiance Fields for Evolving Scene Representation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Data Pruning via Moving-one-Sample-out.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

CoDet: Co-occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Polar Eyeball Shape Net for 3D Posterior Ocular Shape Representation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Is Synthetic Data from Generative Models Ready for Image Recognition?
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Edge Guided GANs with Contrastive Learning for Semantic Image Synthesis.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Parametric Classification for Generalized Category Discovery: A Baseline Study.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning A Room with the Occ-SDF Hybrid: Signed Distance Function Mingled with Occupancy Aids Scene Representation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

IST-Net: Prior-free Category-level Pose Estimation with Implicit Space Transformation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Texture Generation on 3D Meshes with Point-UV Diffusion.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Understanding Imbalanced Semantic Segmentation Through Neural Collapse.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MarS3D: A Plug-and-Play Motion-Aware Model for Semantic Segmentation on Multi-Scan 3D Point Clouds.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PLA: Language-Driven Open-Vocabulary 3D Scene Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Hybrid Neural Rendering for Large-Scale Scenes with Motion Blur.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Command-driven Articulated Object Understanding and Manipulation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DPACS: Hardware Accelerated Dynamic Neural Network Pruning through Algorithm-Architecture Co-design.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

Learning Context-Aware Classifier for Semantic Segmentation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Context-Aware Transformer for 3D Point Cloud Automatic Annotation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

MGFN: Magnitude-Contrastive Glance-and-Focus Network for Weakly-Supervised Video Anomaly Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
GeoNet++: Iterative Geometric Neural Network with Edge-Aware Refinement for Joint Depth and Surface Normal Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Language-driven Open-Vocabulary 3D Scene Understanding.
CoRR, 2022

A Simple Parametric Classification Baseline for Generalized Category Discovery.
CoRR, 2022

SL3D: Self-supervised-Self-labeled 3D Recognition.
CoRR, 2022

Scaling up Kernels in 3D CNNs.
CoRR, 2022

DODA: Data-oriented Sim-to-Real Domain Adaptation for 3D Indoor Semantic Segmentation.
CoRR, 2022

Fast Classification and Action Recognition With Event-Based Imaging.
IEEE Access, 2022

Prototypical VoteNet for Few-Shot 3D Point Cloud Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards Efficient 3D Object Detection with Knowledge Distillation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Self-Supervised Visual Representation Learning with Semantic Grouping.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Rethinking Resolution in the Context of Efficient Video Recognition.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Spatial Pruned Sparse Convolution for Efficient 3D Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Unifying Voxel-based Representation with Transformer for 3D Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MAP-Gen: An Automated 3D-Box Annotation Flow with Multimodal Attention Point Generator.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Towards Efficient and Scale-Robust Ultra-High-Definition Image Demoiréing.
Proceedings of the Computer Vision - ECCV 2022, 2022

Multimodal Transformer for Automatic 3D Annotation and Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

DODA: Data-Oriented Sim-to-Real Domain Adaptation for 3D Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Progressive End-to-End Object Detection in Crowded Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

HINT: Hierarchical Neuron Concept Explainer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Towards Implicit Text-Guided 3D Shape Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Voxel Field Fusion for 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Stratified Transformer for 3D Point Cloud Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Knowledge Distillation as Efficient Pre-training: Faster Convergence, Higher Data-efficiency, and Better Transferability.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Video Demoiréing with Relation-Based Temporal Consistency.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

TWIST: Two-Way Inter-label Self-Training for Semi-supervised 3D Instance Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Rotation-Oriented Collaborative Self-Supervised Learning for Retinal Disease Diagnosis.
IEEE Trans. Medical Imaging, 2021

Gated Path Selection Network for Semantic Segmentation.
IEEE Trans. Image Process., 2021

Multilevel Knowledge Transfer for Cross-Domain Object Detection.
CoRR, 2021

AET-EFN: A Versatile Design for Static and Dynamic Event-Based Vision.
CoRR, 2021

Aggregation with Feature Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Re-distributing Biased Pseudo Labels for Semi-supervised Semantic Segmentation: A Baseline Investigation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ST3D: Self-Training for Unsupervised Domain Adaptation on 3D Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

PAConv: Position Adaptive Convolution With Dynamic Kernel Assembling on Point Clouds.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

3D-to-2D Distillation for Indoor Scene Parsing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Fully Convolutional Networks for Panoptic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Geometry-Disentangled Representation for Complementary Understanding of 3D Object Point Cloud.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Object-aware Feature Aggregation for Video Object Detection.
CoRR, 2020

Edge Guided GANs with Semantic Preserving for Semantic Image Synthesis.
CoRR, 2020

Image-to-Image Translation with Text Guidance.
CoRR, 2020

Global Texture Enhancement for Fake Face Detection in the Wild.
CoRR, 2020

Few-shot Action Recognition via Improved Attention with Self-supervision.
CoRR, 2020

Lightweight Generative Adversarial Networks for Text-Guided Image Manipulation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Few-Shot Action Recognition with Permutation-Invariant Attention.
Proceedings of the Computer Vision - ECCV 2020, 2020

Domain-Invariant Stereo Matching Networks.
Proceedings of the Computer Vision - ECCV 2020, 2020

CN: Channel Normalization for Point Cloud Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020

Memory Selection Network for Video Propagation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Global Texture Enhancement for Fake Face Detection in the Wild.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Unifying Training and Inference for Panoptic Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

ManiGAN: Text-Guided Image Manipulation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

An Adversarial Perturbation Oriented Domain Adaptation Approach for Semantic Segmentation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
The Liver Tumor Segmentation Benchmark (LiTS).
CoRR, 2019

Human Pose Estimation with Spatial Contextual Information.
CoRR, 2019

Controllable Text-to-Image Generation.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

AGSS-VOS: Attention Guided Single-Shot Video Object Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Improved Techniques for Training Adaptive Deep Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

3D Motion Decomposition for RGBD Future Dynamic Scene Synthesis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
H-DenseUNet: Hybrid Densely Connected UNet for Liver and Tumor Segmentation From CT Volumes.
IEEE Trans. Medical Imaging, 2018

3D multi-scale FCN with random modality voxel dropout learning for Intervertebral Disc Localization and Segmentation from Multi-modality MR Images.
Medical Image Anal., 2018

Image Inpainting via Generative Multi-column Convolutional Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Self-boosted Gesture Interactive System with ST-Net.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

ICNet for Real-Time Semantic Segmentation on High-Resolution Images.
Proceedings of the Computer Vision - ECCV 2018, 2018

GAL: Geometric Adversarial Loss for Single-View 3D-Object Reconstruction.
Proceedings of the Computer Vision - ECCV 2018, 2018

GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Semi-Parametric Image Synthesis.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Referring Image Segmentation via Recurrent Refinement Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Gland segmentation in colon histology images: The GlaS challenge contest.
Medical Image Anal., 2017

DCAN: Deep contour-aware networks for object instance segmentation from histology images.
Medical Image Anal., 2017

Semantically Consistent Image Completion with Fine-grained Details.
CoRR, 2017

H-DenseUNet: Hybrid Densely Connected UNet for Liver and Liver Tumor Segmentation from CT Volumes.
CoRR, 2017

3D Graph Neural Networks for RGBD Semantic Segmentation.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Pyramid Scene Parsing Network.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Augmented Feedback in Semantic Segmentation Under Image Level Supervision.
Proceedings of the Computer Vision - ECCV 2016, 2016

Multi-scale Patch Aggregation (MPA) for Simultaneous Detection and Segmentation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

DCAN: Deep Contour-Aware Networks for Accurate Gland Segmentation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Deep Contextual Networks for Neuronal Structure Segmentation.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Semantic Segmentation with Object Clique Potential.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

