Rui Qian

Orcid: 0000-0002-0378-6438

According to our database, Rui Qian authored at least 53 papers between 2013 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2024

Pimo: memory-efficient privacy protection in video streaming and analytics.

Multim. Syst., June, 2024

Controllable augmentations for video representation learning.

Vis. Intell., 2024

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree.

CoRR, 2024

Imagen 3.

Authors include: Jason Baldridge, Nicole Brichtova, Sander Dieleman, Zach Eaton-Rosen, Nando de Freitas, Evgeny Gladchenko, Sergio Gómez Colmenarejo, Tobenna Peter Igwe, Christos Kaplanis, Siavash Khodadadeh, Ksenia Konyushkova, Aäron van den Oord, Jordi Pont-Tuset, Deepak Ramachandran, Abdullah Rashwan, Hansa Srinivasan, Srivatsan Srinivasan, Isabela Albuquerque, Marco Andreetto, Christina Butterfield, Viral Carpenter, Norman Casagrande, Shamik Chaudhuri, Dmitry Churbanau, Mikhail Dektiarev, Shlomi Fruchter.

CoRR, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output.

Authors include: Xingcheng Zhang.

CoRR, 2024

Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models.

CoRR, 2024

Streaming Long Video Understanding with Large Language Models.

CoRR, 2024

SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation.

CoRR, 2024

VideoPrism: A Foundational Visual Encoder for Video Understanding.

Authors include: Nitesh Bharadwaj Gundavarapu, Jennifer J. Sun, Florian Schroff, Ming-Hsuan Yang, Mikhail Sirotenko.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Rethinking Image-to-Video Adaptation: An Object-Centric Perspective.

Proceedings of the Computer Vision - ECCV 2024, 2024

Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation.

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

LayerCFL: an efficient federated learning with layer-wised clustering.

Cybersecur., December, 2023

Deeper Exploiting Graph Structure Information by Discrete Ricci Curvature in a Graph Transformer.

Entropy, June, 2023

Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

3D Object Detection for Autonomous Driving: A Survey.

Pattern Recognit., 2022

BADet: Boundary-Aware 3D Object Detection from Point Clouds.

Pattern Recognit., 2022

Class-Aware Sounding Objects Localization via Audiovisual Correspondence.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Motion-inductive Self-supervised Object Discovery in Videos.

CoRR, 2022

Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models.

Authors include: Ming-Hsuan Yang, Serge J. Belongie.

CoRR, 2022

Dual Contrastive Learning for Spatio-temporal Representation.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Static and Dynamic Concepts for Self-supervised Video Representation Learning.

Proceedings of the Computer Vision - ECCV 2022, 2022

Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset.

Authors include: Kimberly Wilber, Oisin Mac Aodha, Serge J. Belongie.

Proceedings of the Computer Vision - ECCV 2022, 2022

Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision.

Authors include: Florian Schroff, Ming-Hsuan Yang.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Motion-aware Contrastive Video Representation Learning via Foreground-background Merging.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

On Temporal Granularity in Self-Supervised Video Representation Learning.

Authors include: Serge J. Belongie, Ming-Hsuan Yang.

Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Visual Sound Localization in the Wild by Cross-Modal Interference Erasing.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

TA2N: Two-Stage Action Alignment Network for Few-Shot Action Recognition.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Exploring Temporal Granularity in Self-Supervised Video Representation Learning.

Authors include: Serge J. Belongie, Ming-Hsuan Yang.

CoRR, 2021

Motion-aware Self-supervised Video Representation Learning via Foreground-background Merging.

CoRR, 2021

Revisiting 3D ResNets for Video Recognition.

CoRR, 2021

TTAN: Two-Stage Temporal Alignment Network for Few-shot Action Recognition.

CoRR, 2021

Boundary-Aware 3D Object Detection from Point Clouds.

CoRR, 2021

VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text.

Authors include: Wei-Hong Chuang.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Remote sensing identification of seasonal pasture based on Sentinel-2.

Proceedings of the 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, 2021

Spatiotemporal Contrastive Video Representation Learning.

Authors include: Ming-Hsuan Yang, Serge J. Belongie.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Simple Copy-Paste Is a Strong Data Augmentation Method for Instance Segmentation.

Authors include: Aravind Srinivas.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Enhanced Recovery Concept in Percutaneous Nephrolithotomy.

J. Medical Imaging Health Informatics, 2020

Human in Events: A Large-Scale Benchmark for Human-centric Video Analysis in Complex Events.

CoRR, 2020

Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

ATRW: A Benchmark for Amur Tiger Re-identification in the Wild.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multiple Sound Sources Localization from Coarse to Fine.

Authors include: Heinrich Dinkel.

Proceedings of the Computer Vision - ECCV 2020, 2020

End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection.

Authors include: Serge J. Belongie, Bharath Hariharan, Mark E. Campbell, Kilian Q. Weinberger.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Finding Action Tubes with a Sparse-to-Dense Framework.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Renmin University of China and Zhejiang Gongshang University at TRECVID 2019: Learn to Search and Describe Videos.

Proceedings of the 2019 TREC Video Retrieval Evaluation, 2019

Four Models for Automatic Recognition of Left and Right Eye in Fundus Images.

Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

Weakly Supervised Scene Parsing with Point-Based Distance Metric Learning.

Authors include: Thomas S. Huang.

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Attentive Generative Adversarial Network for Raindrop Removal From a Single Image.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

LFP functional network analysis of different states in hippocampus of pigeons.

Proceedings of the 10th International Congress on Image and Signal Processing, 2017

2013

URP: A unified routing protocol for heterogeneous wireless mesh networks.

Proceedings of the 2013 IEEE Wireless Communications and Networking Conference (WCNC), 2013
