Zhendong Mao

ORCID: 0000-0001-5739-8126

According to our database¹, Zhendong Mao authored at least 122 papers between 2009 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2024

Exploring Visual Relationships via Transformer-based Graphs for Enhanced Image Captioning.

ACM Trans. Multim. Comput. Commun. Appl., May, 2024

Sentiment-Oriented Transformer-Based Variational Autoencoder Network for Live Video Commenting.

ACM Trans. Multim. Comput. Commun. Appl., April, 2024

Enhanced Semantic Similarity Learning Framework for Image-Text Matching.

IEEE Trans. Circuits Syst. Video Technol., April, 2024

Curriculum Learning Driven Domain Adaptation for Low-Resource Machine Reading Comprehension.

IEEE Signal Process. Lett., 2024

CustomContrast: A Multilevel Contrastive Perspective For Subject-Driven Text-to-Image Customization.

CoRR, 2024

Enhance Lifelong Model Editing with Continuous Data-Adapter Association.

CoRR, 2024

RealCustom++: Representing Images as Real-Word for Real-Time Customization.

CoRR, 2024

USTC-BUPT at SemEval-2024 Task 8: Enhancing Machine-Generated Text Detection via Domain Adversarial Neural Networks and LLM Embeddings.

Proceedings of the 18th International Workshop on Semantic Evaluation, 2024

Dual-path Collaborative Generation Network for Emotional Video Captioning.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Neighborhood-Adaptive Context Enhancement Learning For Scene Graph Generation.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Improving Radiology Report Generation with D²-Net: When Diffusion Meets Discriminator.

Proceedings of the IEEE International Conference on Acoustics, 2024

FlipGuard: Defending Preference Alignment against Update Regression with Constrained Optimization.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

KNN-Instruct: Automatic Instruction Construction with K Nearest Neighbor Deduction.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Linguistic-Aware Patch Slimming Framework for Fine-Grained Cross-Modal Alignment.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

IDEATE: Detecting AI-Generated Text Using Internal and External Factual Structures.

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Visual-Linguistic Dependency Encoding for Image-Text Retrieval.

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

LIRE: listwise reward enhancement for preference alignment.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Benchmarking and Improving Compositional Generalization of Multi-aspect Controllable Text Generation.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Knowledge Context Modeling with Pre-trained Language Models for Contrastive Knowledge Graph Completion.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Chain-of-Question: A Progressive Question Decomposition Approach for Complex Knowledge Base Question Answering.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Feature-Adaptive and Data-Scalable In-Context Learning.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

RESEMO: A Benchmark Chinese Dataset for Studying Responsive Emotion from Social Media Content.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Disentangled Learning with Synthetic Parallel Data for Text Style Transfer.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Identification of Necessary Semantic Undertakers in the Causal View for Image-Text Matching.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Gradual Residuals Alignment: A Dual-Stream Framework for GAN Inversion and Image Attribute Editing.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Benchmarking Large Language Models on Controllable Generation under Diversified Instructions.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

DreamIdentity: Enhanced Editability for Efficient Face-Identity Preserved Image Generation.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting.

IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

GH-DDM: the generalized hybrid denoising diffusion model for medical image generation.

Multim. Syst., June, 2023

Multi-task hourglass network for online automatic diagnosis of developmental dysplasia of the hip.

World Wide Web (WWW), March, 2023

Unified Adaptive Relevance Distinguishable Attention Network for Image-Text Matching.

IEEE Trans. Multim., 2023

Intra-Class Adaptive Augmentation With Neighbor Correction for Deep Metric Learning.

IEEE Trans. Multim., 2023

DreamIdentity: Improved Editability for Efficient Face-identity Preserved Image Generation.

CoRR, 2023

ExpertPrompting: Instructing Large Language Models to be Distinguished Experts.

CoRR, 2023

kNN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference.

CoRR, 2023

Unlocking the Power of Cross-Dimensional Semantic Dependency for Image-Text Matching.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Difference-Aware Iterative Reasoning Network for Key Relation Detection.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

kNN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

SADE: A Self-Adaptive Expert for Multi-Dataset Question Answering.

Proceedings of the IEEE International Conference on Acoustics, 2023

Inductive Relation Prediction from Relational Paths and Context with Hierarchical Transformers.

Jiaang Li, Quan Wang, Zhendong Mao

Proceedings of the IEEE International Conference on Acoustics, 2023

Contour-Augmented Concept Prediction Network for Image Captioning.

Proceedings of the Artificial Neural Networks and Machine Learning, 2023

On the Calibration of Large Language Models and Alignment.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Air-Decoding: Attribute Distribution Reconstruction for Decoding-Time Controllable Text Generation.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Improving Image Captioning via Predicting Structured Concepts.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Grammatical Error Correction via Mixed-Grained Weighted Training.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Random Entity Quantization for Parameter-Efficient Compositional Knowledge Graph Representation.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

IAEval: A Comprehensive Evaluation of Instance Attribution on Natural Language Understanding.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

E-CORE: Emotion Correlation Enhanced Empathetic Dialogue Generation.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Crossing the Gap: Domain Generalization for Image Captioning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Semantic Relationship among Instances for Image-Text Matching.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

S2ynRE: Two-stage Self-training with Synthetic data for Low-resource Relation Extraction.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Text Style Transfer with Contrastive Transfer Pattern Mining.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Focus Your Attention: A Focal Attention for Multimodal Learning.

IEEE Trans. Multim., 2022

Semantically Similarity-Wise Dual-Branch Network for Scene Graph Generation.

IEEE Trans. Circuits Syst. Video Technol., 2022

Task-Adaptive Attention for Image Captioning.

IEEE Trans. Circuits Syst. Video Technol., 2022

Joint Local Correlation and Global Contextual Information for Unsupervised 3D Model Retrieval and Classification.

IEEE Trans. Circuits Syst. Video Technol., 2022

Self-Supervised Synthesis Ranking for Deep Metric Learning.

IEEE Trans. Circuits Syst. Video Technol., 2022

Joint Channel Estimation and Active-User Detection for Massive Access in Internet of Things - A Deep Learning Approach.

IEEE Internet Things J., 2022

Channel Estimation for Intelligent Reflecting Surface Assisted Massive MIMO Systems - A Deep Learning Approach.

Zhendong Mao, Xiqing Liu, Mugen Peng

IEEE Commun. Lett., 2022

EmRel: Joint Representation of Entities and Embedded Relations for Multi-triple Extraction.

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Fine-tuning with Multi-modal Entity Prompts for News Image Captioning.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Background Layout Generation and Object Knowledge Transfer for Text-to-Image Generation.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Weakly Supervised Pediatric Bone Age Assessment Using Ultrasonic Images via Automatic Anatomical RoI Detection.

Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

UniRel: Unified Representation and Interaction for Joint Relational Triple Extraction.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Improving Chinese Spelling Check by Character Pronunciation Prediction: The Effects of Adaptivity and Granularity.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Negative-Aware Attention Framework for Image-Text Matching.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

GroupDiff: Exploring A Unified Graph Structure and High-order Interactions for Group Recommendation.

Proceedings of the 8th International Conference on Big Data Computing and Communications, 2022

Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

M-GCN: Multi-Branch Graph Convolution Network for 2D Image-based on 3D Model Retrieval.

IEEE Trans. Multim., 2021

Multi-Scale Structure-Aware Network for Weakly Supervised Temporal Action Detection.

IEEE Trans. Image Process., 2021

Review and Arrange: Curriculum Learning for Natural Language Understanding.

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Evolution of ICTs-empowered-identification: A general re-ranking method for person re-identification.

Pattern Recognit. Lett., 2021

Object-difference drived graph convolutional networks for visual question answering.

Multim. Tools Appl., 2021

Hierarchical multi-view context modelling for 3D object classification and retrieval.

Inf. Sci., 2021

Mask and Predict: Multi-step Reasoning for Scene Graph Generation.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Lesion-Aware Transformers for Diabetic Retinopathy Grading.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Entity Structure Within and Throughout: Modeling Mention Dependencies for Document-Level Relation Extraction.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Image Captioning with Context-Aware Auxiliary Guidance.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Deep Metric Learning with Self-Supervised Ranking.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Misshapen Pelvis Landmark Detection With Local-Global Feature Learning for Diagnosing Developmental Dysplasia of the Hip.

IEEE Trans. Medical Imaging, 2020

Context propagation embedding network for weakly supervised semantic segmentation.

Multim. Tools Appl., 2020

SP-VITON: shape-preserving image-based virtual try-on network.

Multim. Tools Appl., 2020

Compact Position-Aware Attention Network for Image Semantic Segmentation.

Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

A Feature Generalization Framework for Social Media Popularity Prediction.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Domain-Specific Alignment Network for Multi-Domain Image-Based 3D Object Retrieval.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Learning Rich Attention for Pediatric Bone Age Assessment.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Overcoming Language Priors with Self-supervised Learning for Visual Question Answering.

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Graph Structured Network for Image-Text Matching.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Curriculum Learning for Natural Language Understanding.

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Double-Bit Quantization and Index Hashing for Nearest Neighbor Search.

IEEE Trans. Multim., 2019

MMJN: Multi-Modal Joint Networks for 3D Shape Recognition.

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching.

Proceedings of the 27th ACM International Conference on Multimedia, 2019

A Neighbor-aware Approach for Image-text Matching.

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Channel Matrix Sparsity With Imperfect Channel State Information in Cloud Radio Access Networks.

IEEE Trans. Veh. Technol., 2018

Pulmonary Vessel Segmentation via Stage-Wise Convolutional Networks With Orientation-Based Region Growing Optimization.

IEEE Access, 2018

Stacked Fully Convolutional Networks for Pulmonary Vessel Segmentation.

Proceedings of the IEEE Visual Communications and Image Processing, 2018

Post Tuned Hashing: A New Approach to Indexing High-dimensional Data.

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

On the Design of OFDM-Based Simultaneous Wireless Information and Power Transfer in Fog-Radio Access Networks.

Proceedings of the IEEE/CIC International Conference on Communications in China, 2018

Performance Analysis of Outage and Average Sum Rate of Sparse Code Division Multiple Access in Fog Radio Access Networks.

Proceedings of the IEEE/CIC International Conference on Communications in China, 2018

2017

Knowledge Graph Embedding: A Survey of Approaches and Applications.

IEEE Trans. Knowl. Data Eng., 2017

Uyghur Language Text Detection in Complex Background Images Using Enhanced MSERs.

Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Double-bit quantization and weighting for nearest neighbor search.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

Training Design for Channel Estimation in Uplink Cloud Radio Access Networks.

IEEE Trans. Signal Process., 2016

Recent Advances in Cloud Radio Access Networks: System Architectures, Key Techniques, and Open Issues.

IEEE Commun. Surv. Tutorials, 2016

Joint Design of Iterative Training-Based Channel Estimation and Cluster Formation in Cloud-Radio Access Networks.

IEEE Access, 2016

2015

Low-Complexity Segment Training Channel Estimation in Cloud Radio Access Networks.

Proceedings of the IEEE 82nd Vehicular Technology Conference, 2015

Hierarchical Encoding of Binary Descriptors for Image Matching.

Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

What is the next step of binary features?

Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

2014

Salient region detection for complex background images using integrated features.

Inf. Sci., 2014

2013

COGE: A Novel Binary Feature Descriptor Exploring Anisotropy and Non-uniformity.

Zhendong Mao, Yongdong Zhang, Qi Tian

Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

What are the distance metrics for local features?

Zhendong Mao, Yongdong Zhang, Qi Tian

Proceedings of the ACM Multimedia Conference, 2013

2012

Geometric context-preserving progressive transmission in mobile visual search.

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

A method for detecting salient regions using integrated features.

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

2009

TRECVID 2009 of MCG-ICT-CAS.

Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

C3M: A Classification Model for Multivariate Motion Time Series.

Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009

An adaptive ensemble classifier for concept drifting stream.

Proceedings of the IEEE Symposium on Computational Intelligence and Data Mining, 2009
