Tao Chen

Orcid: 0000-0002-0779-9818

Affiliations:
  • Fudan University, School of Information Science and Technology, Shanghai, China
  • Institute for Infocomm Research, Visual Computing Department, Singapore
  • Nanyang Technological University, School of Electrical and Electronic Engineering, Singapore (PhD 2013)


According to our database1, Tao Chen authored at least 145 papers between 2009 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2024

Few-Shot Cross-Domain Object Detection With Instance-Level Prototype-Based Meta-Learning.
IEEE Trans. Circuits Syst. Video Technol., October, 2024

Push-and-Pull: A General Training Framework With Differential Augmentor for Domain Generalized Point Cloud Classification.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

DeNKD: Decoupled Non-Target Knowledge Distillation for Complementing Transformer-Based Unsupervised Domain Adaptation.
IEEE Trans. Circuits Syst. Video Technol., May, 2024

Lightweight Model Pre-Training via Language Guided Knowledge Distillation.
IEEE Trans. Multim., 2024

Exploring Multi-Timestep Multi-Stage Diffusion Features for Hyperspectral Image Classification.
IEEE Trans. Geosci. Remote. Sens., 2024

U²ConvFormer: Marrying and Evolving Nested U-Net and Scale-Aware Transformer for Hyperspectral Image Classification.
IEEE Trans. Geosci. Remote. Sens., 2024

Joint Distribution Adaptive-Alignment for Cross-Domain Segmentation of High-Resolution Remote Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2024

Revisiting 3D visual grounding with Context-aware Feature Aggregation.
Neurocomputing, 2024

Instruct Pix-to-3D: Instructional 3D object generation from a single image.
Neurocomputing, 2024

BIFRÖST: 3D-Aware Image compositing with Language Instructions.
CoRR, 2024

Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy.
CoRR, 2024

S2HPruner: Soft-to-Hard Distillation Bridges the Discretization Gap in Pruning.
CoRR, 2024

HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction.
CoRR, 2024

DynaSurfGS: Dynamic Surface Reconstruction with Planar-based Gaussian Splatting.
CoRR, 2024

Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision.
CoRR, 2024

FNP: Fourier Neural Processes for Arbitrary-Resolution Data Assimilation.
CoRR, 2024

Δ-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers.
CoRR, 2024

MeshXL: Neural Coordinate Field for Generative 3D Foundation Models.
CoRR, 2024

EMR-Merging: Tuning-Free High-Performance Model Merging.
CoRR, 2024

MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies.
CoRR, 2024

ClipSAM: CLIP and SAM Collaboration for Zero-Shot Anomaly Segmentation.
CoRR, 2024

CSD3D: Cross-Scale Distillation via Dual-Consistency Learning for Semi-Supervised 3D Object Detection.
Proceedings of the International Joint Conference on Neural Networks, 2024

Multi-dimensional Search with Strip Convolution and R-Squared Loss for Lane Detection.
Proceedings of the International Joint Conference on Neural Networks, 2024

G-Former: A Grouping Transformer for Weakly Supervised Point Cloud Segmentation.
Proceedings of the International Joint Conference on Neural Networks, 2024

Spear: Evaluate the Adversarial Robustness of Compressed Neural Models.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Through the Real World Haze Scenes: Navigating the Synthetic-to-Real Gap in Challenging Image Dehazing.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Reg-TTA3D: Better Regression Makes Better Test-Time Adaptive 3D Object Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

Enhanced Sparsification via Stimulative Training.
Proceedings of the Computer Vision - ECCV 2024, 2024

M3DBench: Towards Omni 3D Assistant with Interleaved Multi-modal Instructions.
Proceedings of the Computer Vision - ECCV 2024, 2024

Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PM-INR: Prior-Rich Multi-Modal Implicit Large-Scale Scene Neural Representation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Boosting Residual Networks with Group Knowledge.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Automatic Loss Function Search for Adversarial Unsupervised Domain Adaptation.
IEEE Trans. Circuits Syst. Video Technol., October, 2023

DCNet: Large-Scale Point Cloud Semantic Segmentation With Discriminative and Efficient Feature Aggregation.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

Performance-Aware Approximation of Global Channel Pruning for Multitask CNNs.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Pull & Push: Leveraging Differential Knowledge Distillation for Efficient Unsupervised Anomaly Detection and Localization.
IEEE Trans. Circuits Syst. Video Technol., May, 2023

Rethinking Saliency Map: A Context-Aware Perturbation Method to Explain EEG-Based Deep Learning Model.
IEEE Trans. Biomed. Eng., May, 2023

A Closer Look at Few-Shot 3D Point Cloud Classification.
Int. J. Comput. Vis., March, 2023

An Efficient Multi-Task Network for Pedestrian Intrusion Detection.
IEEE Trans. Intell. Veh., January, 2023

Exploring Kernel-Based Texture Transfer for Pose-Guided Person Image Generation.
IEEE Trans. Multim., 2023

SpVOS: Efficient Video Object Segmentation With Triple Sparse Convolution.
IEEE Trans. Image Process., 2023

Rethinking Cross-Domain Pedestrian Detection: A Background-Focused Distribution Alignment Framework for Instance-Free One-Stage Detectors.
IEEE Trans. Image Process., 2023

Merging Vision Transformers from Different Tasks and Domains.
CoRR, 2023

Partial Fine-Tuning: A Successor to Full Fine-Tuning for Vision Transformers.
CoRR, 2023

Efficient Architecture Search via Bi-level Data Pruning.
CoRR, 2023

Rethinking of Feature Interaction for Multi-task Learning on Dense Prediction.
CoRR, 2023

Towards an End-to-End Artificial Intelligence Driven Global Weather Forecasting System.
CoRR, 2023

M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts.
CoRR, 2023

ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model.
CoRR, 2023

VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations.
CoRR, 2023

Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering.
CoRR, 2023

Experts Weights Averaging: A New General Training Scheme for Vision Transformers.
CoRR, 2023

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation.
CoRR, 2023

When Hyperspectral Image Classification Meets Diffusion Models: An Unsupervised Feature Learning Framework.
CoRR, 2023

StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation.
CoRR, 2023

Stimulative Training++: Go Beyond The Performance Limits of Residual Networks.
CoRR, 2023

Multi-view Vision-Prompt Fusion Network: Can 2D Pre-trained Model Boost 3D Point Cloud Data-scarce Learning?
CoRR, 2023

A Large-Scale Outdoor Multi-modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene Reconstruction.
CoRR, 2023

β-DARTS++: Bi-level Regularization for Proxy-robust Differentiable Architecture Search.
CoRR, 2023

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MotionGPT: Human Motion as a Foreign Language.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Rethinking Pseudo-Label-Based Unsupervised Person Re-ID with Hierarchical Prototype-based Graph.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

RBGNet: Reliable Boundary-Guided Segmentation of Choroidal Neovascularization.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Adversarial Amendment is the Only Force Capable of Transforming an Enemy into a Friend.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Large-Scale Outdoor Multi-modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene Reconstruction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Uni3D: A Unified Baseline for Multi-Dataset 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Bi3D: Bi-Domain Active Learning for Cross-Domain 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Boost Vision Transformer with GPU-Friendly Sparsity and Quantization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

End-to-End 3D Dense Captioning with Vote2Cap-DETR.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Executing your Commands via Motion Diffusion in Latent Space.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Hyperscale Hardware Optimized Neural Architecture Search.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

Boost Transformer-based Language Models with GPU-Friendly Sparsity and Quantization.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Joint Distribution Alignment via Adversarial Learning for Domain Adaptive Object Detection.
IEEE Trans. Multim., 2022

Sample-Centric Feature Generation for Semi-Supervised Few-Shot Learning.
IEEE Trans. Image Process., 2022

SC-EADNet: A Self-Supervised Contrastive Efficient Asymmetric Dilated Network for Hyperspectral Image Classification.
IEEE Trans. Geosci. Remote. Sens., 2022

Curriculum-Style Local-to-Global Adaptation for Cross-Domain Remote Sensing Image Segmentation.
IEEE Trans. Geosci. Remote. Sens., 2022

Densely Semantic Enhancement for Domain Adaptive Region-Free Detectors.
IEEE Trans. Circuits Syst. Video Technol., 2022

Point Cloud Instance Segmentation With Semi-Supervised Bounding-Box Mining.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Efficient Joint-Dimensional Search with Solution Space Regularization for Real-Time Semantic Segmentation.
Int. J. Comput. Vis., 2022

ADAS: A Simple Active-and-Adaptive Baseline for Cross-Domain 3D Semantic Segmentation.
CoRR, 2022

Executing your Commands via Motion Diffusion in Latent Space.
CoRR, 2022

Instance-aware Model Ensemble With Distillation For Unsupervised Domain Adaptation.
CoRR, 2022

Cross-Subject Emotion Recognition with Sparsely-Labeled Peripheral Physiological Data Using SHAP-Explained Tree Ensembles.
CoRR, 2022

Neural Architecture Ranker.
CoRR, 2022

What Makes for Effective Few-shot Point Cloud Classification?
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Coordinates Are NOT Lonely - Codebook Prior Helps Implicit Neural 3D representations.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Stimulative Training of Residual Networks: A Social Psychology Perspective of Loafing.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Cross-Image Object Semantic Relation in Transformer for Few-Shot Fine-Grained Image Classification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Generalized Global Ranking-Aware Neural Architecture Ranker for Efficient Image Classifier Search.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

β-DARTS: Beta-Decay Regularization for Differentiable Architecture Search.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Sketch Me A Video.
CoRR, 2021

Fine-grained Identity Preserving Landmark Synthesis for Face Reenactment.
CoRR, 2021

Coarse-to-Fine Gaze Redirection with Numerical and Pictorial Guidance.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Object-aware Long-short-range Spatial Alignment for Few-Shot Fine-Grained Image Classification.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Spcr: semi-supervised point cloud instance segmentation with perturbation consistency regularization.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

HSEGAN: Hair Synthesis and Editing Using Structure-Adaptive Normalization on Generative Adversarial Network.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

EADNet: Efficient Asymmetric Dilated Network For Semantic Segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
M$^3$Lung-Sys: A Deep Learning System for Multi-Class Lung Pneumonia Screening From CT Imaging.
IEEE J. Biomed. Health Informatics, 2020

BURSTS: A bottom-up approach for robust spotting of texts in scenes.
J. Vis. Commun. Image Represent., 2020

Fine-grained facial expression analysis using dimensional emotion model.
Neurocomputing, 2020

M3Lung-Sys: A Deep Learning System for Multi-Class Lung Pneumonia Screening from CT Imaging.
CoRR, 2020

Do not forget interaction: Predicting fatality of COVID-19 patients using logistic regression.
CoRR, 2020

MGGR: MultiModal-Guided Gaze Redirection with Coarse-to-Fine Learning.
CoRR, 2020

Learning to Augment Expressions for Few-shot Fine-grained Facial Expression Recognition.
CoRR, 2020

PIDNet: An Efficient Network for Dynamic Pedestrian Intrusion Detection.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Front-plane and Back-plane Bias Temperature Instability of 22 nm Gate-last FDSOI MOSFETs.
Proceedings of the 2020 IEEE International Reliability Physics Symposium, 2020

Cascade EF-GAN: Progressive Facial Expression Editing With Local Focuses.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
SS-HCNN: Semi-Supervised Hierarchical Convolutional Neural Network for Image Classification.
IEEE Trans. Image Process., 2019

A Fine-Grained Facial Expression Database for End-to-End Multi-Pose Facial Expression Recognition.
CoRR, 2019

Comp-GAN: Compositional Generative Adversarial Network in Synthesizing and Recognizing Facial Expression.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

2018
Superpixel Guided Deep-Sparse-Representation Learning for Hyperspectral Image Classification.
IEEE Trans. Circuits Syst. Video Technol., 2018

S-CNN: Subcategory-Aware Convolutional Networks for Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

2017
Unsupervised Feature Learning for Land-Use Scene Recognition.
IEEE Trans. Geosci. Remote. Sens., 2017

Subcategory-Aware Feature Selection and SVM Optimization for Automatic Aerial Image-Based Oil Spill Inspection.
IEEE Trans. Geosci. Remote. Sens., 2017

Object-Level Motion Detection From Moving Cameras.
IEEE Trans. Circuits Syst. Video Technol., 2017

Robust Vehicle Detection and Viewpoint Estimation With Soft Discriminative Mixture Model.
IEEE Trans. Circuits Syst. Video Technol., 2017

EXIF-white balance recognition for image forensic analysis.
Multidimens. Syst. Signal Process., 2017

2016
Accurate and Efficient Traffic Sign Detection Using Discriminative AdaBoost and Support Vector Regression.
IEEE Trans. Veh. Technol., 2016

Landmark recognition with compact BoW histogram and ensemble ELM.
Multim. Tools Appl., 2016

2015
Context-aware vocabulary tree for mobile landmark recognition.
J. Vis. Commun. Image Represent., 2015

Scene text extraction based on edges and support vector regression.
Int. J. Document Anal. Recognit., 2015

Vegetation coverage detection from very high resolution satellite imagery.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Reversible watermarking using enhanced local prediction.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Context-aware lane marking detection on urban roads.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

DPM revisited: Utilizing root-part spatial distribution for vehicle viewpoint estimation.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Image tampering detection using noise histogram features.
Proceedings of the 2015 IEEE International Conference on Digital Signal Processing, 2015

2014
Discriminative Soft Bag-of-Visual Phrase for Mobile Landmark Recognition.
IEEE Trans. Multim., 2014

Discriminative BoW Framework for Mobile Landmark Recognition.
IEEE Trans. Cybern., 2014

Context-aware codebook learning for mobile landmark recognition.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013
Context-Aware Discriminative Vocabulary Learning for Mobile Landmark Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2013

2012
Discriminative bag-of-visual phrase learning for landmark recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Integrated Content and Context Analysis for Mobile Landmark Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2011

From universal bag-of-words to adaptive bag-of-phrases for mobile scene recognition.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

A discriminative learning technique for mobile landmark recognition.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Content and context information fusion for mobile landmark recognition.
Proceedings of the 8th International Conference on Information, 2011

2010
A Comparative Study of Mobile-Based Landmark Recognition Techniques.
IEEE Intell. Syst., 2010

2009
A Survey on Mobile Landmark Recognition for Information Retrieval.
Proceedings of the MDM 2009, 2009


  Loading...