Can Zhang

Orcid: 0000-0001-9530-5218

Affiliations:
  • Peking University, School of Electrical and Computer Engineering, Shenzhen, China


According to our database1, Can Zhang authored at least 24 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
SpatioTemporal focus for skeleton-based action recognition.
Pattern Recognit., April, 2023

Improving Scene Graph Generation with Superpixel-Based Interaction Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Cross-Modality Time-Variant Relation Learning for Generating Dynamic Scene Graphs.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Iterative Proposal Refinement for Weakly-Supervised Video Grounding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Deep Motion Prior for Weakly-Supervised Temporal Action Localization.
IEEE Trans. Image Process., 2022

RR-Net: Relation Reasoning for End-to-End Human-Object Interaction Detection.
IEEE Trans. Circuits Syst. Video Technol., 2022

All You Need Is a Second Look: Towards Arbitrary-Shaped Text Detection.
IEEE Trans. Circuits Syst. Video Technol., 2022

LocVTP: Video-Text Pre-training for Temporal Localization.
Proceedings of the Computer Vision - ECCV 2022, 2022

Unsupervised Pre-training for Temporal Action Localization Tasks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
AFNet: Temporal Locality-Aware Network With Dual Structure for Accurate and Fast Action Detection.
IEEE Trans. Multim., 2021

EAR: Efficient action recognition with local-global temporal aggregation.
Image Vis. Comput., 2021

Synergic learning for noise-insensitive webly-supervised temporal action localization.
Image Vis. Comput., 2021

RR-Net: Injecting Interactive Semantics in Human-Object Interaction Detection.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Long-Short Temporal Modeling for Efficient Action Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

SRF-Net: Selective Receptive Field Network for Anchor-Free Temporal Action Detection.
Proceedings of the IEEE International Conference on Acoustics, 2021

On Pursuit of Designing Multi-modal Transformer for Video Grounding.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

CoLA: Weakly-Supervised Temporal Action Localization With Snippet Contrastive Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Non-Autoregressive Coarse-to-Fine Video Captioning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
PAN: Towards Fast Action Recognition via Learning Persistence of Appearance.
CoRR, 2020

2019
Hierarchical Temporal Pooling for Efficient Online Action Recognition.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

STMP: Spatial Temporal Multi-level Proposal Network for Activity Detection.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

PAN: Persistent Appearance Network with an Efficient Motion Cue for Fast Action Recognition.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Discriminative Feature Learning Using Two-Stage Training Strategy for Facial Expression Recognition.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2019: Image Processing, 2019


  Loading...