Shilei Wen

Orcid: 0009-0009-4746-6928

According to our database1, Shilei Wen authored at least 58 papers between 2011 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
360-VIO: A Robust Visual-Inertial Odometry Using a 360° Camera.
IEEE Trans. Ind. Electron., September, 2024

DiffusionGPT: LLM-Driven Text-to-Image Generation System.
CoRR, 2024

Outlier-aware Slicing for Post-Training Quantization in Vision Transformer.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

AffineQuant: Affine Transformation Quantization for Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
MeMaHand: Exploiting Mesh-Mano Interaction for Single Image Two-Hand Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Semi-Supervised Temporal Action Proposal Generation via Exploiting 2-D Proposal Map.
IEEE Trans. Multim., 2022

Purely Attention Based Local Feature Integration for Video Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

2021
VSRNet: End-to-end video segment retrieval with text query.
Pattern Recognit., 2021

Beyond Self-Supervision: A Simple Yet Effective Network Distillation Alternative to Improve Backbones.
CoRR, 2021

RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
TPM: Multiple object tracking with tracklet-plane matching.
Pattern Recognit., 2020

Coherent Loss: A Generic Framework for Stable Video Segmentation.
CoRR, 2020

PP-YOLO: An Effective and Efficient Implementation of Object Detector.
CoRR, 2020

PointTrack++ for Effective Online Multi-Object Tracking and Segmentation.
CoRR, 2020

Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Modularized Framework with Category-Sensitive Abnormal Filter for City Anomaly Detection.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Deep Concept-wise Temporal Convolutional Networks for Action Localization.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

HANet: Hybrid Attention-aware Network for Crowd Counting.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Monocular 3D Object Detection via Feature Domain Adaptation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Segment as Points for Efficient Online Multi-Object Tracking and Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement.
Proceedings of the Computer Vision - ECCV 2020, 2020


Going Beyond Real Data: A Robust Visual Representation for Vehicle Re-identification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020


Dynamic Inference: A New Approach Toward Efficient Video Action Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Robust Movement-Specific Vehicle Counting at Crowded Intersections.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Multi-Granularity Tracking with Modularlized Components for Unsupervised Vehicles Anomaly Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

NTIRE 2020 Challenge on Video Quality Mapping: Methods and Results.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Associate-3Ddet: Perceptual-to-Conceptual Association for 3D Point Cloud Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Multi-Label Classification with Label Graph Superimposing.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Dynamic Instance Normalization for Arbitrary Style Transfer.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
TruNet: Short Videos Generation from Long Videos via Story-Preserving Truncation.
CoRR, 2019

Perspective-Guided Convolution Networks for Crowd Counting.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Image Inpainting With Learnable Bidirectional Attention Maps.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

BMN: Boundary-Matching Network for Temporal Action Proposal Generation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Multi-camera vehicle tracking and re-identification based on visual and spatial-temporal features.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019


Adapting Image Super-Resolution State-Of-The-Arts and Learning Multi-Model Ensemble for Video Super-Resolution.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019


STGAN: A Unified Selective Transfer Network for Arbitrary Image Attribute Editing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Read, Watch, and Move: Reinforcement Learning for Temporally Grounding Natural Language Descriptions in Videos.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

StNet: Local and Global Spatial-Temporal Modeling for Action Recognition.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Solution for Large-Scale Hierarchical Object Detection Datasets with Incomplete Annotation and Data Imbalance.
CoRR, 2018

Exploiting Spatial-Temporal Modelling and Multi-Modal Fusion for Human Action Recognition.
CoRR, 2018

Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multimodal Keyless Attention Fusion for Video Classification.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Revisiting the Effectiveness of Off-the-shelf Temporal Modeling Approaches for Large-scale Video Classification.
CoRR, 2017

Dynamic Computational Time for Visual Attention.
CoRR, 2017

Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding.
CoRR, 2017

Dynamic Computational Time for Visual Attention.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Deep Metric Learning with Angular Loss.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Localizing by Describing: Attribute-Guided Attention Localization for Fine-Grained Recognition.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2011
Compounded Face Image Retrieval Based on Vertical Web Image Retrieval.
Proceedings of the Sixth Chinagrid Annual Conference, ChinaGrid 2011, Dalian, Liaoning, 2011


  Loading...