Gaoang Wang

Orcid: 0000-0002-8403-1538

According to our database1, Gaoang Wang authored at least 109 papers between 2015 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
DIVOTrack: A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes.
Int. J. Comput. Vis., April, 2024

Deep Learning Methods for Small Molecule Drug Discovery: A Survey.
IEEE Trans. Artif. Intell., February, 2024

Knowledge-guided pre-training and fine-tuning: Video representation learning for action recognition.
Neurocomputing, February, 2024

UniDCP: Unifying Multiple Medical Vision-Language Tasks via Dynamic Cross-Modal Learnable Prompts.
IEEE Trans. Multim., 2024

DiffFashion: Reference-Based Fashion Design With Structure-Aware Transfer by Diffusion Models.
IEEE Trans. Multim., 2024

Self-Paced Multi-Grained Cross-Modal Interaction Modeling for Referring Expression Comprehension.
IEEE Trans. Image Process., 2024

Ego3DT: Tracking Every 3D Object in Ego-centric Videos.
CoRR, 2024

Frame Order Matters: A Temporal Sequence-Aware Model for Few-Shot Action Recognition.
CoRR, 2024

STEVE Series: Step-by-Step Construction of Agent Systems in Minecraft.
CoRR, 2024

CityCraft: A Real Crafter for 3D City Generation.
CoRR, 2024

S4Fusion: Saliency-aware Selective State Space Model for Infrared Visible Image Fusion.
CoRR, 2024

FlexiFilm: Long Video Generation with Flexible Conditions.
CoRR, 2024

MovieChat+: Question-aware Sparse Memory for Long Video Question Answering.
CoRR, 2024

Do We Really Need a Complex Agent System? Distill Embodied Agent into a Single Model.
CoRR, 2024

VersaT2I: Improving Text-to-Image Models with Versatile Reward.
CoRR, 2024

Hierarchical Auto-Organizing System for Open-Ended Multi-Agent Navigation.
CoRR, 2024

MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant.
CoRR, 2024

Divide and Conquer for Large Language Models Reasoning.
CoRR, 2024

MPM: A Unified 2D-3D Human Pose Representation via Masked Pose Modeling.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Ego3DT: Tracking Every 3D Object in Ego-centric Videos.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SNAFusion: Distilling 2D Axial Plane Diffusion Priors for Sparse-View 3D Cone-Beam CT Imaging.
Proceedings of the Deep Generative Models - 4th MICCAI Workshop, 2024

Enhanced Multimodal Trajectory Prediction for Autonomous Vehicles Using Advanced Diffusion Model Techniques.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2024

Vision meets mmWave Radar: 3D Object Perception Benchmark for Autonomous Driving.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2024

Sam-Guided Enhanced Fine-Grained Encoding with Mixed Semantic Learning for Medical Image Captioning.
Proceedings of the IEEE International Conference on Acoustics, 2024

Blind Inpainting with Object-Aware Discrimination for Artificial Marker Removal.
Proceedings of the IEEE International Conference on Acoustics, 2024

See and Think: Embodied Agent in Virtual Environment.
Proceedings of the Computer Vision - ECCV 2024, 2024

MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-Based Roadside 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MovieChat: From Dense Token to Sparse Memory for Long Video Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

UniAP: Towards Universal Animal Perception in Vision via Few-Shot Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Multi-Step Denoising Scheduled Sampling: Towards Alleviating Exposure Bias for Diffusion Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Handwritten Chinese signature detection with simple Copy-Paste augmentation on power plants technical documents.
Serv. Oriented Comput. Appl., December, 2023

Deep learning-enabled 3D multimodal fusion of cone-beam CT and intraoral mesh scans for clinically applicable tooth-bone reconstruction.
Patterns, September, 2023

Hierarchical Self-Supervised Learning for 3D Tooth Segmentation in Intra-Oral Mesh Scans.
IEEE Trans. Medical Imaging, February, 2023

Split and Connect: A Universal Tracklet Booster for Multi-Object Tracking.
IEEE Trans. Multim., 2023

CityGen: Infinite and Controllable 3D City Layout Generation.
CoRR, 2023

See and Think: Embodied Agent in Virtual Environment.
CoRR, 2023

Devil in the Number: Towards Robust Multi-modality Data Filter.
CoRR, 2023

FrameRS: A Video Frame Compression Model Composed by Self supervised Video Frame Reconstructor and Key Frame Selector.
CoRR, 2023

Chasing Consistency in Text-to-3D Generation from a Single Image.
CoRR, 2023

Bridging Cross-task Protocol Inconsistency for Distillation in Dense Object Detection.
CoRR, 2023

MovieChat: From Dense Token to Sparse Memory for Long Video Understanding.
CoRR, 2023

A Survey of Deep Learning in Sports Applications: Perception, Comprehension, and Decision.
CoRR, 2023

DDMM-Synth: A Denoising Diffusion Model for Cross-modal Medical Image Synthesis with Sparse-view Measurement Embedding.
CoRR, 2023

DIVOTrack: A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes.
CoRR, 2023

AI Assisted Fashion Design: A Review.
IEEE Access, 2023

User-Aware Prefix-Tuning Is a Good Learner for Personalized Image Captioning.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

A CNN-based generative model for vehicle trajectory reconstruction in mixed traffic flow.
Proceedings of the 8th International Conference on Models and Technologies for Intelligent Transportation Systems, 2023

PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Debiasing Medical Visual Question Answering via Counterfactual Training.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Temporal Constrained Feasible Subspace Learning for Human Pose Forecasting.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Learning Discrimination from Contaminated Data: Multi-Instance Learning for Unsupervised Anomaly Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Bridging Cross-task Protocol Inconsistency for Distillation in Dense Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

StableVideo: Text-driven Consistency-aware Diffusion Video Editing.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TransLink: Transformer-Based Embedding for Tracklets' Global Link.
Proceedings of the IEEE International Conference on Acoustics, 2023

Language Adaptive Weight Generation for Multi-Task Visual Grounding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Image Reference-guided Fashion Design with Structure-aware Transfer by Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

A Class-Rebalancing Self-Training Framework for Distantly-Supervised Named Entity Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
pcnaDeep: a fast and robust single-cell tracking method using deep-learning mediated cell cycle profiling.
Bioinform., October, 2022

Forgery-Domain-Supervised Deepfake Detection With Non-Negative Constraint.
IEEE Signal Process. Lett., 2022

Unsupervised universal hierarchical multi-person 3D pose estimation for natural scenes.
Multim. Tools Appl., 2022

VGSC-DB: an online database of voltage-gated sodium channels.
J. Cheminformatics, 2022

STSC-SNN: Spatio-Temporal Synaptic Connection with Temporal Convolution and Attention for Spiking Neural Networks.
CoRR, 2022

Recent Advances in Embedding Methods for Multi-Object Tracking: A Survey.
CoRR, 2022

Preserve Pre-trained Knowledge: Transfer Learning With Self-Distillation For Action Recognition.
CoRR, 2022

MAP-SNN: Mapping Spike Activities with Multiplicity, Adaptability, and Plasticity into Bio-Plausible Spiking Neural Networks.
CoRR, 2022

Unsupervised Pre-training Improves Tooth Segmentation in 3-Dimensional Intraoral Mesh Scans.
Proceedings of the International Conference on Medical Imaging with Deep Learning, 2022

Modeling Freight-Sharing Platform Operations for Optimal Compensation Strategy Using Markov Decision Processes.
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2022

Missing Modality meets Meta Sampling (M3S): An Efficient Universal Approach for Multimodal Sentiment Analysis with Missing Modality.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

When Few-Shot Learning Meets Video Object Detection.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Human-Centered Prior-Guided and Task-Dependent Multi-Task Representation Learning for Action Recognition Pre-Training.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

ActiveMatch: End-To-End Semi-Supervised Active Representation Learning.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Handwritten Chinese Signature Detection on Scanned Technical Documents for Authenticity Verification.
Proceedings of the IEEE International Conference on e-Business Engineering, 2022

A Framework for Handwritten Date Recognition in Quality Documents.
Proceedings of the IEEE International Conference on e-Business Engineering, 2022

Hierarchical Semi-supervised Contrastive Learning for Contamination-Resistant Anomaly Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Signature Detection, Restoration, and Verification: A Novel Chinese Document Signature Forgery Detection Benchmark.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
ASFP (Artificial Intelligence based Scoring Function Platform): a web server for the development of customized scoring functions.
J. Cheminformatics, 2021

Weakly supervised instance segmentation using multi-prior fusion.
Comput. Vis. Image Underst., 2021

Disjoint Contrastive Regression Learning for Multi-Sourced Annotations.
CoRR, 2021

Few-Shot Learning for Video Object Detection in a Transfer-Learning Scheme.
CoRR, 2021

Can machine learning consistently improve the scoring power of classical scoring functions? Insights into the role of machine learning in scoring functions.
Briefings Bioinform., 2021

Beware of the generic machine learning-based scoring functions in structure-based virtual screening.
Briefings Bioinform., 2021

ROD2021 Challenge: A Summary for Radar Object Detection Challenge for Autonomous Driving Applications.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Track without Appearance: Learn Box and Tracklet Embedding with Local and Global Motion Patterns for Vehicle Tracking.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Rethinking of Radar's Role: A Camera-Radar Dataset and Systematic Annotator via Coordinate Alignment.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020
Bundle Adjustment for Monocular Visual Odometry Based on Detections of Traffic Signs.
IEEE Trans. Veh. Technol., 2020

Multi-Person Hierarchical 3D Pose Estimation in Natural Videos.
IEEE Trans. Circuits Syst. Video Technol., 2020

DAIL: Dataset-Aware and Invariant Learning for Face Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Exploring Severe Occlusion: Multi-Person 3D Pose Estimation with Gated Convolution.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

2019
Uncertainty-Based Active Learning via Sparse Modeling for Image Classification.
IEEE Trans. Image Process., 2019

Multi-Scale Fish Segmentation Refinement and Missing Shape Recovery.
IEEE Access, 2019

Eye in the Sky: Drone-Based Object Tracking and 3D Localization.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Exploit the Connectivity: Multi-Object Tracking with TrackletNet.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Efficient Multi-person Hierarchical 3D Pose Estimation for Autonomous Driving.
Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

Simultaneous Intracranial Artery Tracing and Segmentation from Magnetic Resonance Angiography by Joint Optimization from Multiplanar Reformation.
Proceedings of the Machine Learning and Medical Engineering for Cardiovascular Health and Intravascular Imaging and Computer Assisted Stenting, 2019


Anomaly Candidate Identification and Starting Time Estimation of Vehicles from Traffic Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Multi-Camera Tracking of Vehicles based on Deep Features Re-ID and Trajectory-Based Camera Link Models.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018
Coarse-To-Fine Segmentation Refinement and Missing Shape Recovery for Halibut Fish.
Proceedings of the 2018 IEEE Global Conference on Signal and Information Processing, 2018

Single-Camera and Inter-Camera Vehicle Tracking and 3D Speed Estimation Based on Fusion of Visual and Semantic Features.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017
Multiple-Kernel Based Vehicle Tracking Using 3D Deformable Model and Camera Self-Calibration.
CoRR, 2017

An Open-Source Platform for Underwater Image and Video Analytics.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Uncertainty sampling based active learning with diversity constraint by sparse selection.
Proceedings of the 19th IEEE International Workshop on Multimedia Signal Processing, 2017

2016
Shrinking Encoding with Two-Level Codebook Learning for Fine-Grained Fish Recognition.
Proceedings of the 2nd ICPR Workshop on Computer Vision for Analysis of Underwater Imagery, 2016

Closed-Loop Tracking-by-Detection for ROV-Based Multiple Fish Tracking.
Proceedings of the 2nd ICPR Workshop on Computer Vision for Analysis of Underwater Imagery, 2016

2015
Piecewise planar super-resolution for 3D scene.
Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015


  Loading...