Jing Shi

Orcid: 0000-0002-4509-0535

Affiliations:
  • Adobe Research
  • University of Rochester, NY, USA


According to our database1, Jing Shi authored at least 23 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
GUI Agents: A Survey.
CoRR, 2024

FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity.
CoRR, 2024

AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation.
CoRR, 2024

Text-to-Audio Generation Synchronized with Videos.
CoRR, 2024

Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

FineMatch: Aspect-Based Fine-Grained Image and Text Mismatch Detection and Correction.
Proceedings of the Computer Vision - ECCV 2024, 2024

VIXEN: Visual Text Comparison Network for Image Difference Captioning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
DiffAVA: Personalized Text-to-Audio Generation with Visual Alignment.
CoRR, 2023

2022
SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Color Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Anomaly Crossing: A New Method for Video Anomaly Detection as Cross-domain Few-shot Learning.
CoRR, 2021

SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Editing.
CoRR, 2021

How to Make a BLT Sandwich? Learning VQA towards Understanding Web Instructional Videos.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Learning to Generate Scene Graph from Natural Language Supervision.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Language-Guided Global Image Editing via Cross-Modal Cyclic Mechanism.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

A Simple Baseline for Weakly-Supervised Scene Graph Generation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning by Planning: Language-Guided Global Image Editing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Cubic Spline Smoothing Compensation for Irregularly Sampled Sequences.
CoRR, 2020

Actor-Action Video Classification CSC 249/449 Spring 2020 Challenge Report.
CoRR, 2020

A Benchmark and Baseline for Language-Driven Image Editing.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
GAN-EM: GAN Based EM Learning Framework.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Audio-Visual Event Localization in the Wild.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Not All Frames Are Equal: Weakly-Supervised Video Grounding With Contextual Similarity and Visual Clustering Losses.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Audio-Visual Event Localization in Unconstrained Videos.
Proceedings of the Computer Vision - ECCV 2018, 2018


  Loading...