Jiashuo Yu

Orcid: 0000-0002-3094-6687

According to our database, Jiashuo Yu authored at least 24 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Learning Music-Dance Representations Through Explicit-Implicit Rhythm Synchronization.
IEEE Trans. Multim., 2024

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text.
CoRR, 2024

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding.
CoRR, 2024

TupleRadar: Accelerating Tuple Space Search in Packet Classification by Learned Index.
Proceedings of the 32nd IEEE/ACM International Symposium on Quality of Service, 2024

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

InternVideo2: Scaling Foundation Models for Multimodal Video Understanding.
Proceedings of the Computer Vision - ECCV 2024, 2024

VBench: Comprehensive Benchmark Suite for Video Generative Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models.
CoRR, 2023

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.
CoRR, 2023

InternGPT: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language.
CoRR, 2023

DTRadar: Accelerating Search Process of Decision Trees in Packet Classification.
Proceedings of the IEEE Symposium on Computers and Communications, 2023

Long-Term Rhythmic Video Soundtracker.
Proceedings of the International Conference on Machine Learning, 2023

MiCuts: Combing Bit-Based Cutting and Splitting for Efficient Packet Classification.
Proceedings of the IEEE International Conference on Communications, 2023

MINT: Empowering Multiple Flow Definition Query for Network-Wide Measurement.
Proceedings of the IEEE International Conference on Communications, 2023

2022
InternVideo: General Video Foundation Models via Generative and Discriminative Learning.
CoRR, 2022

InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges.
CoRR, 2022

Self-Supervised Learning of Music-Dance Representation through Explicit-Implicit Rhythm Synchronization.
CoRR, 2022

Modality-aware Contrastive Instance Learning with Self-Distillation for Weakly-Supervised Audio-Visual Violence Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

FROD: An Efficient Framework for Optimizing Decision Trees in Packet Classification.
Proceedings of the 30th IEEE/ACM International Symposium on Quality of Service, 2022

2021
Exploring Logical Reasoning for Referring Expression Comprehension.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

MPN: Multimodal Parallel Network for Audio-Visual Event Localization.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Improving Multimodal Speech Enhancement by Incorporating Self-Supervised and Curriculum Learning.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

