Yin Cui

Orcid: 0000-0003-3936-9932

According to our database1, Yin Cui authored at least 65 papers between 2009 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Why Fine-grained Labels in Pretraining Benefit Generalization?
CoRR, 2024

Wolf: Captioning Everything with a World Summarization Framework.
CoRR, 2024

Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Stability Analysis and Group Consensus Tracking Predictive Control of Multi-Agent Systems.
J. Syst. Sci. Complex., October, 2023

Intelligent design of display space layout based on two-stage deep learning network.
J. Comput. Methods Sci. Eng., 2023

VideoGLUE: Video General Understanding Evaluation of Foundation Models.
CoRR, 2023

Towards Understanding the Effect of Pretraining Label Granularity.
CoRR, 2023

MovieCLIP: Visual Scene Recognition in Movies.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Module-wise Adaptive Distillation for Multimodality Foundation Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models.
Proceedings of the International Conference on Machine Learning, 2023

Open-Vocabulary Object Detection upon Frozen Vision and Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Unified Visual Relationship Detection with Vision and Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Research on the Influence of Artificial Intelligence Interactive Function on Youth Sports Training - Taking Tiantian Skipping Rope App as an Example.
Proceedings of the HCI International 2023 - Late Breaking Papers, 2023

Human Factors Based New Media Design: Methodology and Assessment.
Proceedings of the Human Aspects of IT for the Aged Population, 2023

Train-Once-for-All Personalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Global Consensus of High-Order Discrete-Time Multi-Agent Systems with Communication Delay and Saturation Constraint.
Sensors, 2022

F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models.
CoRR, 2022

Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models.
CoRR, 2022

Research on Anomaly Suppression Correlation Filtering Algorithm.
IEEE Access, 2022

Federated Multi-Target Domain Adaptation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Surrogate Gap Minimization Improves Sharpness-Aware Training.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Open-vocabulary Object Detection via Vision and Language Knowledge Distillation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

SITTA: Single Image Texture Translation for Data Augmentation.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Scaling Open-Vocabulary Image Segmentation with Image-Level Labels.
Proceedings of the Computer Vision - ECCV 2022, 2022

Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

On Temporal Granularity in Self-Supervised Video Representation Learning.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Open-Vocabulary Image Segmentation.
CoRR, 2021

Towards a Unified Foundation Model: Jointly Pre-Training Transformers on Unpaired Images and Text.
CoRR, 2021

Exploring Temporal Granularity in Self-Supervised Video Representation Learning.
CoRR, 2021

Revisiting 3D ResNets for Video Recognition.
CoRR, 2021

Single Image Texture Translation for Data Augmentation.
CoRR, 2021

Bridging the Gap Between Object Detection and User Intent via Query-Modulation.
CoRR, 2021

Zero-Shot Detection via Vision and Language Knowledge Distillation.
CoRR, 2021

Research on Visual Tracking Algorithm Based on Peak Sidelobe Ratio.
IEEE Access, 2021

VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Spatiotemporal Contrastive Video Representation Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Simple Copy-Paste Is a Strong Data Augmentation Method for Instance Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Transfer learning in computer vision tasks: Remember where you come from.
Image Vis. Comput., 2020

Small-floating Target Detection in Sea Clutter via Visual Feature Classifying in the Time-Doppler Spectra.
CoRR, 2020

Rethinking Pre-training and Self-training.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset.
Proceedings of the Computer Vision - ECCV 2020, 2020

Efficient Scale-Permuted Backbone with Learned Resource Distribution.
Proceedings of the Computer Vision - ECCV 2020, 2020

SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Learning from Fine-Grained and Long-Tailed Visual Data.
PhD thesis, 2019

Measuring Dataset Granularity.
CoRR, 2019

The iMaterialist Fashion Attribute Dataset.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Application of Agent in Security Platform.
Proceedings of the 2019 IEEE/CIC International Conference on Communications in China, 2019

Learning Single-View 3D Reconstruction with Limited Pose Supervision.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Class-Balanced Loss Based on Effective Number of Samples.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Unbiased offline recommender evaluation for missing-not-at-random implicit feedback.
Proceedings of the 12th ACM Conference on Recommender Systems, 2018

The INaturalist Species Classification and Detection Dataset.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning to Evaluate Image Captioning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Collaborative Metric Learning.
Proceedings of the 26th International Conference on World Wide Web, 2017

Kernel Pooling for Convolutional Neural Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Fine-Grained Categorization and Dataset Bootstrapping Using Deep Metric Learning with Humans in the Loop.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Learning deep representations for ground-to-aerial geolocalization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

PlateClick: Bootstrapping Food Preferences Through an Adaptive Visual Interface.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

2014
Building A Large Concept Bank for Representing Events in Video.
CoRR, 2014

A spatial-color layout feature for representing galaxy images.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Object-Based Visual Sentiment Concept Analysis and Application.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Event-Driven Semantic Concept Discovery by Exploiting Weakly Tagged Internet Images.
Proceedings of the International Conference on Multimedia Retrieval, 2014

2009
Group sequential testing of homogeneity in genetic linkage analysis.
Comput. Stat. Data Anal., 2009


  Loading...