We stand with Ukraine

We stand with Ukraine

Fangxiang Feng

Orcid: 0000-0002-4798-4233

According to our database¹, Fangxiang Feng authored at least 46 papers between 2013 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

2014

2016

2018

2020

2022

2024

0

5

10

1

4

3

3

2

1

3

1

5

3

9

4

4

1

1

1

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

on orcid.org

On csauthors.net:

Bibliography

2025

Action-guided prompt tuning for video grounding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Inf. Fusion, 2025

2024

LGR-NET: Language Guided Reasoning Network for Referring Expression Comprehension.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Trans. Circuits Syst. Video Technol., August, 2024

DualGCN: Exploring Syntactic and Semantic Information for Aspect-Based Sentiment Analysis.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Neural Networks Learn. Syst., June, 2024

Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

Concept Conductor: Orchestrating Multiple Personalized Concepts in Text-to-Image Synthesis.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

DiffHarmony++: Enhancing Image Harmonization with Harmony-VAE and Inverse Harmonization Model.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Q-MoE: Connector for MLLMs with Text-Driven Routing.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Triple Alignment Strategies for Zero-shot Phrase Grounding under Weak Supervision.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

DiffHarmony: Latent Diffusion Model Meets Image Harmonization.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Visual Prompt Tuning for Weakly Supervised Phrase Grounding.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Whether you can locate or not? Interactive Referring Expression Generation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Referring Image Harmonization.

[BibT_eX]

[DOI]

,

,

Zhuangzhuang Li

,

,

Proceedings of the 9th International Conference on Communication and Information Processing, 2023

Speech Enhancement with Lip Perceptual Loss.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 9th IEEE International Conference on Cloud Computing and Intelligent Systems, 2023

2022

Modality Disentangled Discriminator for Text-to-Image Synthesis.

[BibT_eX]

[DOI]

,

,

,

IEEE Trans. Multim., 2022

A deformable CNN-based triplet model for fine-grained sketch-based image retrieval.

[BibT_eX]

[DOI]

,

,

,

Pattern Recognit., 2022

Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2022

Visual Dialog for Spotting the Differences between Pairs of Similar Images.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

A Region-based Document VQA.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

GR-GAN: Gradual Refinement Text-To-Image Generation.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Question-Driven Graph Fusion Network for Visual Question Answering.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Improving Image Paragraph Captioning with Dual Relations.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

COM-MRC: A COntext-Masked Machine Reading Comprehension Framework for Aspect Sentiment Triplet Extraction.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

A Simple Model for Distantly Supervised Relation Extraction.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Co-VQA : Answering by Interactive Sub Question Sequence.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Enhanced Multi-Channel Graph Convolutional Network for Aspect Sentiment Triplet Extraction.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

S2TD: A Tree-Structured Decoder for Image Paragraph Captioning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Category-Based Strategy-Driven Question Generator for Visual Dialogue.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Chinese Computational Linguistics - 20th China National Conference, 2021

Multi-stage Pre-training over Simplified Multimodal Pre-training Models.

[BibT_eX]

[DOI]

,

,

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Dual Graph Convolutional Networks for Aspect-based Sentiment Analysis.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Exploring Global and Local Linguistic Representations for Text-to-Image Synthesis.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Trans. Multim., 2020

Dual-CNN: A Convolutional language decoder for paragraph image captioning.

[BibT_eX]

[DOI]

,

,

,

,

Neurocomputing, 2020

Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue.

[BibT_eX]

[DOI]

,

,

,

,

,

Zhongyuan Ouyang

CoRR, 2020

Referring Expression Generation via Visual Dialogue.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Natural Language Processing and Chinese Computing, 2020

Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Learning Visual Features from Product Title for Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Image Synthesis from Locally Related Texts.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

2019

A survey on freehand sketch recognition and retrieval.

[BibT_eX]

[DOI]

,

,

,

Image Vis. Comput., 2019

Retrieving real world clothing images via multi-weight deep convolutional neural networks.

[BibT_eX]

[DOI]

,

,

,

Clust. Comput., 2019

2017

Personalized Image Annotation Using Deep Architecture.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Access, 2017

Improving deep convolutional neural networks for real-world clothing image.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 13th International Conference on Natural Computation, 2017

2015

Correspondence Autoencoders for Cross-Modal Retrieval.

[BibT_eX]

[DOI]

,

,

,

ACM Trans. Multim. Comput. Commun. Appl., 2015

Challenges in representation learning: A report on three machine learning contests.

[BibT_eX]

[DOI]

Neural Networks, 2015

Deep correspondence restricted Boltzmann machine for cross-modal retrieval.

[BibT_eX]

[DOI]

,

,

Neurocomputing, 2015

2014

Cross-modal Retrieval with Correspondence Autoencoder.

[BibT_eX]

[DOI]

,

,

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

2013

Constructing Hierarchical Image-tags Bimodal Representations for Word Tags Alternative Choice.

[BibT_eX]

[DOI]

,

,

CoRR, 2013

Challenges in Representation Learning: A Report on Three Machine Learning Contests.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 20th International Conference, 2013

Loading...