Rui Qian

Orcid: 0000-0002-0378-6438

According to our database1, Rui Qian authored at least 53 papers between 2013 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Pimo: memory-efficient privacy protection in video streaming and analytics.
Multim. Syst., June, 2024

Controllable augmentations for video representation learning.
Vis. Intell., 2024

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree.
CoRR, 2024

Imagen 3.
CoRR, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output.
CoRR, 2024

Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models.
CoRR, 2024

Streaming Long Video Understanding with Large Language Models.
CoRR, 2024

SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation.
CoRR, 2024

VideoPrism: A Foundational Visual Encoder for Video Understanding.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Rethinking Image-to-Video Adaptation: An Object-Centric Perspective.
Proceedings of the Computer Vision - ECCV 2024, 2024

Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
LayerCFL: an efficient federated learning with layer-wised clustering.
Cybersecur., December, 2023

Deeper Exploiting Graph Structure Information by Discrete Ricci Curvature in a Graph Transformer.
Entropy, June, 2023

Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
3D Object Detection for Autonomous Driving: A Survey.
Pattern Recognit., 2022

BADet: Boundary-Aware 3D Object Detection from Point Clouds.
Pattern Recognit., 2022

Class-Aware Sounding Objects Localization via Audiovisual Correspondence.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Motion-inductive Self-supervised Object Discovery in Videos.
CoRR, 2022

Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models.
CoRR, 2022

Dual Contrastive Learning for Spatio-temporal Representation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Static and Dynamic Concepts for Self-supervised Video Representation Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset.
Proceedings of the Computer Vision - ECCV 2022, 2022

Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Motion-aware Contrastive Video Representation Learning via Foreground-background Merging.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

On Temporal Granularity in Self-Supervised Video Representation Learning.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Visual Sound Localization in the Wild by Cross-Modal Interference Erasing.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

TA2N: Two-Stage Action Alignment Network for Few-Shot Action Recognition.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Exploring Temporal Granularity in Self-Supervised Video Representation Learning.
CoRR, 2021

Motion-aware Self-supervised Video Representation Learning via Foreground-background Merging.
CoRR, 2021

Revisiting 3D ResNets for Video Recognition.
CoRR, 2021

TTAN: Two-Stage Temporal Alignment Network for Few-shot Action Recognition.
CoRR, 2021

Boundary-Aware 3D Object Detection from Point Clouds.
CoRR, 2021

VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Remote sensing identification of seasonal pasture based on Sentinel-2.
Proceedings of the 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, 2021

Spatiotemporal Contrastive Video Representation Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Simple Copy-Paste Is a Strong Data Augmentation Method for Instance Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Enhanced Recovery Concept in Percutaneous Nephrolithotomy.
J. Medical Imaging Health Informatics, 2020

Human in Events: A Large-Scale Benchmark for Human-centric Video Analysis in Complex Events.
CoRR, 2020

Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

ATRW: A Benchmark for Amur Tiger Re-identification in the Wild.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multiple Sound Sources Localization from Coarse to Fine.
Proceedings of the Computer Vision - ECCV 2020, 2020

End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Finding Action Tubes with a Sparse-to-Dense Framework.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Renmin University of China and Zhejiang Gongshang University at TRECVID 2019: Learn to Search and Describe Videos.
Proceedings of the 2019 TREC Video Retrieval Evaluation, 2019

Four Models for Automatic Recognition of Left and Right Eye in Fundus Images.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

Weakly Supervised Scene Parsing with Point-Based Distance Metric Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Attentive Generative Adversarial Network for Raindrop Removal From a Single Image.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
LFP functional network analysis of different states in hippocampus of pigeons.
Proceedings of the 10th International Congress on Image and Signal Processing, 2017

2013
URP: A unified routing protocol for heterogeneous wireless mesh networks.
Proceedings of the 2013 IEEE Wireless Communications and Networking Conference (WCNC), 2013


  Loading...