Di Wang

Orcid: 0000-0001-8027-4287

Affiliations:
  • Xidian University, School of Computer Science and Technology, Xi'an, China


According to our database1, Di Wang authored at least 55 papers between 2015 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Global-aware Fragment Representation Aggregation Network for image-text retrieval.
Pattern Recognit., 2025

2024
Deep Hierarchical Multimodal Metric Learning.
IEEE Trans. Neural Networks Learn. Syst., November, 2024

Part-of-speech- and syntactic-aware graph convolutional network for aspect-level sentiment classification.
Multim. Tools Appl., March, 2024

Multimodal transformer with adaptive modality weighting for multimodal sentiment analysis.
Neurocomputing, March, 2024

Global semantic enhancement network for video captioning.
Pattern Recognit., January, 2024

Dual-Perspective Fusion Network for Aspect-Based Multimodal Sentiment Analysis.
IEEE Trans. Multim., 2024

Gist, Content, Target-Oriented: A 3-Level Human-Like Framework for Video Moment Retrieval.
IEEE Trans. Multim., 2024

VLDadaptor: Domain Adaptive Object Detection With Vision-Language Model Distillation.
IEEE Trans. Multim., 2024

Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2024

Multiscale Spectral-Spatial Attention Residual Fusion Network for Multisource Remote Sensing Data Classification.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024

GR-GAN: A unified adversarial framework for single image glare removal and denoising.
Pattern Recognit., 2024

DiagSWin: A multi-scale vision transformer with diagonal-shaped windows for object detection and segmentation.
Neural Networks, 2024

Visual Selection and Multistage Reasoning for RSVG.
IEEE Geosci. Remote. Sens. Lett., 2024

Candidate-Heuristic In-Context Learning: A new framework for enhancing medical visual question answering with LLMs.
Inf. Process. Manag., 2024

NIV-SSD: Neighbor IoU-voting single-stage object detector from point cloud.
Neurocomputing, 2024

Multi-object behavior recognition based on object detection for dense crowds.
Expert Syst. Appl., 2024

Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection.
CoRR, 2024

Divide and Conquer: Isolating Normal-Abnormal Attributes in Knowledge Graph-Enhanced Radiology Report Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Fine-grained Semantics-aware Representation Learning for Text-based Person Retrieval.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Leveraging Coarse-to-Fine Grained Representations in Contrastive Learning for Differential Medical Visual Question Answering.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Transferable Physical Adversarial Patch Attack for Remote Sensing Object Detection.
Proceedings of the IGARSS 2024, 2024

Kernel-Adaptive Change Detection Network in Remote Sensing Imagery.
Proceedings of the IGARSS 2024, 2024

Alignment and Multimodal Reasoning for Remote Sensing Visual Question Answering.
Proceedings of the IGARSS 2024, 2024

A Comprehensive Framework for Occluded Human Pose Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Unleashing Channel Potential: Space-Frequency Selection Convolution for SAR Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

TMFN: A Target-oriented Multi-grained Fusion Network for End-to-end Aspect-based Multimodal Sentiment Analysis.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Refining Latent Homophilic Structures over Heterophilic Graphs for Robust Graph Convolution Networks.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

SHaRPose: Sparse High-Resolution Representation for Human Pose Estimation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Bi-Attention enhanced representation learning for image-text matching.
Pattern Recognit., August, 2023

TETFN: A text enhanced transformer fusion network for multimodal sentiment analysis.
Pattern Recognit., April, 2023

Hierarchical Semantic Structure Preserving Hashing for Cross-Modal Retrieval.
IEEE Trans. Multim., 2023

Cross-Modal Enhancement Network for Multimodal Sentiment Analysis.
IEEE Trans. Multim., 2023

Mixing Self-Attention and Convolution: A Unified Framework for Multisource Remote Sensing Data Classification.
IEEE Trans. Geosci. Remote. Sens., 2023

Relation-Aware Multi-Positive Contrastive Knowledge Graph Completion with Embedding Dimension Scaling.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Enhancing CLIP-Based Text-Person Retrieval by Leveraging Negative Samples.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Language-Guided Visual Aggregation Network for Video Question Answering.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022
Pseudo-Label Guided Collective Matrix Factorization for Multiview Clustering.
IEEE Trans. Cybern., 2022

2021
DFER-Net: Recognizing Facial Expression In The Wild.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Saliency Region Detection in Complex Scenes Based on Multi-scale Cascaded Attention.
Proceedings of the 7th IEEE International Conference on Cloud Computing and Intelligent Systems, 2021

2020
LPR-Net: Recognizing Chinese license plate in complex environments.
Pattern Recognit. Lett., 2020

Joint and individual matrix factorization hashing for large-scale cross-modal retrieval.
Pattern Recognit., 2020

Online Adaptive Supervised Hashing for Large-Scale Cross-Modal Retrieval.
IEEE Access, 2020

Online Collective Matrix Factorization Hashing for Large-Scale Cross-Media Retrieval.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

2019
Label Consistent Matrix Factorization Hashing for Large-Scale Cross-Modal Similarity Search.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Semi-paired and semi-supervised multimodal hashing via cross-modality label propagation.
Multim. Tools Appl., 2019

Robust joint learning network: improved deep representation learning for person re-identification.
Multim. Tools Appl., 2019

High confidence detection for moving target in aerial video.
IET Image Process., 2019

2018
Robust and Flexible Discrete Hashing for Cross-Modal Similarity Search.
IEEE Trans. Circuits Syst. Video Technol., 2018

Online Matrix Factorization Hashing for Large-Scale Image Retrieval.
Proceedings of the Big Data - 6th CCF Conference, 2018

2016
Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval.
IEEE Trans. Image Process., 2016

Semi-Supervised Nonnegative Matrix Factorization via Constraint Propagation.
IEEE Trans. Cybern., 2016

Fast image quality assessment via supervised iterative quantization method.
Neurocomputing, 2016

2015
Semi-supervised constraints preserving hashing.
Neurocomputing, 2015

Semantic Topic Multimodal Hashing for Cross-Media Retrieval.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Fast Image Quality Assessment via Hash Code.
Proceedings of the Computer Vision - CCF Chinese Conference, 2015


  Loading...