Jing Shi

Orcid: 0000-0002-4509-0535

Affiliations:

Adobe Research
University of Rochester, NY, USA

According to our database, Jing Shi authored at least 24 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

MAGNET: Augmenting Generative Decoders with Representation Learning and Infilling Capabilities.

CoRR, January, 2025

2024

GUI Agents: A Survey.

CoRR, 2024

FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity.

CoRR, 2024

AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation.

CoRR, 2024

Text-to-Audio Generation Synchronized with Videos.

Shentong Mo

Jing Shi

Yapeng Tian

CoRR, 2024

Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models.

Proceedings of the Computer Vision - ECCV 2024, 2024

FineMatch: Aspect-Based Fine-Grained Image and Text Mismatch Detection and Correction.

Proceedings of the Computer Vision - ECCV 2024, 2024

VIXEN: Visual Text Comparison Network for Image Difference Captioning.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

DiffAVA: Personalized Text-to-Audio Generation with Visual Alignment.

Shentong Mo

Jing Shi

Yapeng Tian

CoRR, 2023

2022

SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Color Editing.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Anomaly Crossing: A New Method for Video Anomaly Detection as Cross-domain Few-shot Learning.

CoRR, 2021

SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Editing.

CoRR, 2021

How to Make a BLT Sandwich? Learning VQA towards Understanding Web Instructional Videos.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Learning to Generate Scene Graph from Natural Language Supervision.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Language-Guided Global Image Editing via Cross-Modal Cyclic Mechanism.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

A Simple Baseline for Weakly-Supervised Scene Graph Generation.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning by Planning: Language-Guided Global Image Editing.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Cubic Spline Smoothing Compensation for Irregularly Sampled Sequences.

CoRR, 2020

Actor-Action Video Classification CSC 249/449 Spring 2020 Challenge Report.

CoRR, 2020

A Benchmark and Baseline for Language-Driven Image Editing.

Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019

GAN-EM: GAN Based EM Learning Framework.

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Audio-Visual Event Localization in the Wild.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Not All Frames Are Equal: Weakly-Supervised Video Grounding With Contextual Similarity and Visual Clustering Losses.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Audio-Visual Event Localization in Unconstrained Videos.

Proceedings of the Computer Vision - ECCV 2018, 2018
