Tao Chen

CoRR, 2024

MeshXL: Neural Coordinate Field for Generative 3D Foundation Models.

CoRR, 2024

EMR-Merging: Tuning-Free High-Performance Model Merging.

CoRR, 2024

MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies.

CoRR, 2024

ClipSAM: CLIP and SAM Collaboration for Zero-Shot Anomaly Segmentation.

CoRR, 2024

CSD3D: Cross-Scale Distillation via Dual-Consistency Learning for Semi-Supervised 3D Object Detection.

Proceedings of the International Joint Conference on Neural Networks, 2024

Multi-dimensional Search with Strip Convolution and R-Squared Loss for Lane Detection.

Proceedings of the International Joint Conference on Neural Networks, 2024

G-Former: A Grouping Transformer for Weakly Supervised Point Cloud Segmentation.

Proceedings of the International Joint Conference on Neural Networks, 2024

Spear: Evaluate the Adversarial Robustness of Compressed Neural Models.

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Through the Real World Haze Scenes: Navigating the Synthetic-to-Real Gap in Challenging Image Dehazing.

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Reg-TTA3D: Better Regression Makes Better Test-Time Adaptive 3D Object Detection.

Proceedings of the Computer Vision - ECCV 2024, 2024

Enhanced Sparsification via Stimulative Training.

Proceedings of the Computer Vision - ECCV 2024, 2024

M3DBench: Towards Omni 3D Assistant with Interleaved Multi-modal Instructions.

Proceedings of the Computer Vision - ECCV 2024, 2024

Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PM-INR: Prior-Rich Multi-Modal Implicit Large-Scale Scene Neural Representation.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Boosting Residual Networks with Group Knowledge.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Automatic Loss Function Search for Adversarial Unsupervised Domain Adaptation.

IEEE Trans. Circuits Syst. Video Technol., October, 2023

DCNet: Large-Scale Point Cloud Semantic Segmentation With Discriminative and Efficient Feature Aggregation.

IEEE Trans. Circuits Syst. Video Technol., August, 2023

Performance-Aware Approximation of Global Channel Pruning for Multitask CNNs.

IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Pull & Push: Leveraging Differential Knowledge Distillation for Efficient Unsupervised Anomaly Detection and Localization.

IEEE Trans. Circuits Syst. Video Technol., May, 2023

Rethinking Saliency Map: A Context-Aware Perturbation Method to Explain EEG-Based Deep Learning Model.

IEEE Trans. Biomed. Eng., May, 2023

A Closer Look at Few-Shot 3D Point Cloud Classification.

Int. J. Comput. Vis., March, 2023

An Efficient Multi-Task Network for Pedestrian Intrusion Detection.

IEEE Trans. Intell. Veh., January, 2023

Exploring Kernel-Based Texture Transfer for Pose-Guided Person Image Generation.

IEEE Trans. Multim., 2023

SpVOS: Efficient Video Object Segmentation With Triple Sparse Convolution.

Weihao Lin

Chong Yu

IEEE Trans. Image Process., 2023

Rethinking Cross-Domain Pedestrian Detection: A Background-Focused Distribution Alignment Framework for Instance-Free One-Stage Detectors.

IEEE Trans. Image Process., 2023

Merging Vision Transformers from Different Tasks and Domains.

CoRR, 2023

Partial Fine-Tuning: A Successor to Full Fine-Tuning for Vision Transformers.

CoRR, 2023

Efficient Architecture Search via Bi-level Data Pruning.

CoRR, 2023

Rethinking of Feature Interaction for Multi-task Learning on Dense Prediction.

CoRR, 2023

Towards an End-to-End Artificial Intelligence Driven Global Weather Forecasting System.

CoRR, 2023

M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts.

CoRR, 2023

ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model.

CoRR, 2023

VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations.

CoRR, 2023

Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering.

CoRR, 2023

Experts Weights Averaging: A New General Training Scheme for Vision Transformers.

CoRR, 2023

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation.

CoRR, 2023

When Hyperspectral Image Classification Meets Diffusion Models: An Unsupervised Feature Learning Framework.

CoRR, 2023

StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation.

CoRR, 2023

Stimulative Training++: Go Beyond The Performance Limits of Residual Networks.

CoRR, 2023

Multi-view Vision-Prompt Fusion Network: Can 2D Pre-trained Model Boost 3D Point Cloud Data-scarce Learning?

CoRR, 2023

A Large-Scale Outdoor Multi-modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene Reconstruction.

CoRR, 2023

β-DARTS++: Bi-level Regularization for Proxy-robust Differentiable Architecture Search.

CoRR, 2023

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MotionGPT: Human Motion as a Foreign Language.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Rethinking Pseudo-Label-Based Unsupervised Person Re-ID with Hierarchical Prototype-based Graph.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

RBGNet: Reliable Boundary-Guided Segmentation of Choroidal Neovascularization.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Adversarial Amendment is the Only Force Capable of Transforming an Enemy into a Friend.

Chong Yu

Parthasarathy Ranganathan

Zhongxue Gan

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Large-Scale Outdoor Multi-modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene Reconstruction.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Uni3D: A Unified Baseline for Multi-Dataset 3D Object Detection.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Bi3D: Bi-Domain Active Learning for Cross-Domain 3D Object Detection.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Boost Vision Transformer with GPU-Friendly Sparsity and Quantization.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

End-to-End 3D Dense Captioning with Vote2Cap-DETR.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Executing your Commands via Motion Diffusion in Latent Space.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Hyperscale Hardware Optimized Neural Architecture Search.

Norman P. Jouppi

Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

Boost Transformer-based Language Models with GPU-Friendly Sparsity and Quantization.

Chong Yu

Zhongxue Gan

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

Joint Distribution Alignment via Adversarial Learning for Domain Adaptive Object Detection.

IEEE Trans. Multim., 2022

Sample-Centric Feature Generation for Semi-Supervised Few-Shot Learning.

IEEE Trans. Image Process., 2022

SC-EADNet: A Self-Supervised Contrastive Efficient Asymmetric Dilated Network for Hyperspectral Image Classification.

IEEE Trans. Geosci. Remote. Sens., 2022

Curriculum-Style Local-to-Global Adaptation for Cross-Domain Remote Sensing Image Segmentation.

Bo Zhang

Bin Wang

IEEE Trans. Geosci. Remote. Sens., 2022

Densely Semantic Enhancement for Domain Adaptive Region-Free Detectors.

IEEE Trans. Circuits Syst. Video Technol., 2022

Point Cloud Instance Segmentation With Semi-Supervised Bounding-Box Mining.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Efficient Joint-Dimensional Search with Solution Space Regularization for Real-Time Semantic Segmentation.

Int. J. Comput. Vis., 2022

ADAS: A Simple Active-and-Adaptive Baseline for Cross-Domain 3D Semantic Segmentation.

CoRR, 2022

Executing your Commands via Motion Diffusion in Latent Space.

CoRR, 2022

Instance-aware Model Ensemble With Distillation For Unsupervised Domain Adaptation.

CoRR, 2022

Cross-Subject Emotion Recognition with Sparsely-Labeled Peripheral Physiological Data Using SHAP-Explained Tree Ensembles.

Feng Zhou

Baiying Lei

CoRR, 2022

Neural Architecture Ranker.

CoRR, 2022

What Makes for Effective Few-shot Point Cloud Classification?

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Coordinates Are NOT Lonely - Codebook Prior Helps Implicit Neural 3D representations.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Stimulative Training of Residual Networks: A Social Psychology Perspective of Loafing.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Cross-Image Object Semantic Relation in Transformer for Few-Shot Fine-Grained Image Classification.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Generalized Global Ranking-Aware Neural Architecture Ranker for Efficient Image Classifier Search.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

β-DARTS: Beta-Decay Regularization for Differentiable Architecture Search.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Sketch Me A Video.

CoRR, 2021

Fine-grained Identity Preserving Landmark Synthesis for Face Reenactment.

CoRR, 2021

Coarse-to-Fine Gaze Redirection with Numerical and Pictorial Guidance.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Object-aware Long-short-range Spatial Alignment for Few-Shot Fine-Grained Image Classification.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Spcr: semi-supervised point cloud instance segmentation with perturbation consistency regularization.

Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

HSEGAN: Hair Synthesis and Editing Using Structure-Adaptive Normalization on Generative Adversarial Network.

Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

EADNet: Efficient Asymmetric Dilated Network For Semantic Segmentation.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

2020

M$^3$Lung-Sys: A Deep Learning System for Multi-Class Lung Pneumonia Screening From CT Imaging.

IEEE J. Biomed. Health Informatics, 2020

BURSTS: A bottom-up approach for robust spotting of texts in scenes.

Feng Zhou

J. Vis. Commun. Image Represent., 2020

Fine-grained facial expression analysis using dimensional emotion model.

Neurocomputing, 2020

M3Lung-Sys: A Deep Learning System for Multi-Class Lung Pneumonia Screening from CT Imaging.

CoRR, 2020

Do not forget interaction: Predicting fatality of COVID-19 patients using logistic regression.

Feng Zhou

Baiying Lei

CoRR, 2020

MGGR: MultiModal-Guided Gaze Redirection with Coarse-to-Fine Learning.

CoRR, 2020

Learning to Augment Expressions for Few-shot Fine-grained Facial Expression Recognition.

CoRR, 2020

PIDNet: An Efficient Network for Dynamic Pedestrian Intrusion Detection.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Front-plane and Back-plane Bias Temperature Instability of 22 nm Gate-last FDSOI MOSFETs.

Proceedings of the 2020 IEEE International Reliability Physics Symposium, 2020

Cascade EF-GAN: Progressive Facial Expression Editing With Local Focuses.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

SS-HCNN: Semi-Supervised Hierarchical Convolutional Neural Network for Image Classification.

IEEE Trans. Image Process., 2019

A Fine-Grained Facial Expression Database for End-to-End Multi-Pose Facial Expression Recognition.

CoRR, 2019

Comp-GAN: Compositional Generative Adversarial Network in Synthesizing and Recognizing Facial Expression.

Proceedings of the 27th ACM International Conference on Multimedia, 2019

2018

Superpixel Guided Deep-Sparse-Representation Learning for Hyperspectral Image Classification.

IEEE Trans. Circuits Syst. Video Technol., 2018

S-CNN: Subcategory-Aware Convolutional Networks for Object Detection.

IEEE Trans. Pattern Anal. Mach. Intell., 2018

2017

Unsupervised Feature Learning for Land-Use Scene Recognition.

IEEE Trans. Geosci. Remote. Sens., 2017

Subcategory-Aware Feature Selection and SVM Optimization for Automatic Aerial Image-Based Oil Spill Inspection.

IEEE Trans. Geosci. Remote. Sens., 2017

Object-Level Motion Detection From Moving Cameras.

IEEE Trans. Circuits Syst. Video Technol., 2017

Robust Vehicle Detection and Viewpoint Estimation With Soft Discriminative Mixture Model.

IEEE Trans. Circuits Syst. Video Technol., 2017

EXIF-white balance recognition for image forensic analysis.

Alex ChiChung Kot

Multidimens. Syst. Signal Process., 2017

2016

Accurate and Efficient Traffic Sign Detection Using Discriminative AdaBoost and Support Vector Regression.

IEEE Trans. Veh. Technol., 2016

Landmark recognition with compact BoW histogram and ensemble ELM.

Jiuwen Cao

Multim. Tools Appl., 2016

2015

Context-aware vocabulary tree for mobile landmark recognition.

J. Vis. Commun. Image Represent., 2015

Scene text extraction based on edges and support vector regression.

Int. J. Document Anal. Recognit., 2015

Vegetation coverage detection from very high resolution satellite imagery.

Proceedings of the 2015 Visual Communications and Image Processing, 2015

Reversible watermarking using enhanced local prediction.

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Context-aware lane marking detection on urban roads.

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

DPM revisited: Utilizing root-part spatial distribution for vehicle viewpoint estimation.

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Image tampering detection using noise histogram features.

Jiuwen Cao

Proceedings of the 2015 IEEE International Conference on Digital Signal Processing, 2015

2014

Discriminative Soft Bag-of-Visual Phrase for Mobile Landmark Recognition.

Dajiang Zhang

IEEE Trans. Multim., 2014

Discriminative BoW Framework for Mobile Landmark Recognition.

IEEE Trans. Cybern., 2014

Context-aware codebook learning for mobile landmark recognition.

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013

Context-Aware Discriminative Vocabulary Learning for Mobile Landmark Recognition.

IEEE Trans. Circuits Syst. Video Technol., 2013

2012

Discriminative bag-of-visual phrase learning for landmark recognition.

Dajiang Zhang

Proceedings of the 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

2011

Integrated Content and Context Analysis for Mobile Landmark Recognition.

IEEE Trans. Circuits Syst. Video Technol., 2011

From universal bag-of-words to adaptive bag-of-phrases for mobile scene recognition.

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

A discriminative learning technique for mobile landmark recognition.

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Content and context information fusion for mobile landmark recognition.
