Songyang Zhang

Orcid: 0000-0002-2895-5728

According to our database1, Songyang Zhang authored at least 129 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
RadioGAT: A Joint Model-Based and Data-Driven Framework for Multi-Band Radiomap Reconstruction via Graph Attention Networks.
IEEE Trans. Wirel. Commun., November, 2024

Radiomap Inpainting for Restricted Areas Based on Propagation Priority and Depth Map.
IEEE Trans. Wirel. Commun., August, 2024

Physics-Inspired Machine Learning for Radiomap Estimation: Integration of Radio Propagation Models and Artificial Intelligence.
IEEE Commun. Mag., August, 2024

A digital speckle stereo matching algorithm based on epipolar line correction.
Signal Image Video Process., July, 2024

SGTR+: End-to-End Scene Graph Generation With Transformer.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2024

Signal Processing Over Multilayer Graphs: Theoretical Foundations and Practical Applications.
IEEE Internet Things J., January, 2024

Efficient Eigen-Decomposition for Low-Rank Symmetric Matrices in Graph Signal Processing: An Incremental Approach.
IEEE Trans. Signal Process., 2024

Cross Modality Bias in Visual Question Answering: A Causal View With Possible Worlds VQA.
IEEE Trans. Multim., 2024

PixMIM: Rethinking Pixel Reconstruction in Masked Image Modeling.
Trans. Mach. Learn. Res., 2024

PS-FedGAN: An Efficient Federated Learning Framework With Strong Data Privacy.
IEEE Internet Things J., 2024

Efficient cross-information fusion decoder for semantic segmentation.
Comput. Vis. Image Underst., 2024

CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution.
CoRR, 2024

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models.
CoRR, 2024

Cross: A Delay Based Congestion Control Method for RTP Media.
CoRR, 2024

UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios.
CoRR, 2024

NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?
CoRR, 2024

CIBench: Evaluating Your LLMs with a Code Interpreter Plugin.
CoRR, 2024

GTA: A Benchmark for General Tool Agents.
CoRR, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output.
CoRR, 2024

InternLM-Law: An Open Source Chinese Legal Large Language Model.
CoRR, 2024

Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs.
CoRR, 2024

FoundaBench: Evaluating Chinese Fundamental Knowledge Capabilities of Large Language Models.
CoRR, 2024

Adapting LLaMA Decoder to Vision Transformer.
CoRR, 2024

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD.
CoRR, 2024

InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning.
CoRR, 2024

InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model.
CoRR, 2024

HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance.
CoRR, 2024

Fake Alignment: Are LLMs Really Aligned Well?
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

FedSC: Provable Federated Self-supervised Learning with Spectral Contrastive Objective over Non-i.i.d. Data.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Diff-GO: Diffusion Goal-Oriented Communications with Ultra-High Spectrum Efficiency.
Proceedings of the IEEE International Conference on Communications Workshops, 2024

UFed-GAN: Secure Federated Learning over Wireless Sensor Networks with Unlabeled Data.
Proceedings of the IEEE International Conference on Communications Workshops, 2024

Split-FL: An Efficient Online Federated Learning Framework with Constrained Computation and Streaming Data.
Proceedings of the IEEE International Conference on Communications Workshops, 2024

ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

LawBench: Benchmarking Legal Knowledge of Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MMBench: Is Your Multi-modal Model an All-Around Player?
Proceedings of the Computer Vision - ECCV 2024, 2024

From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
RME-GAN: A Learning Framework for Radio Map Estimation Based on Conditional Generative Adversarial Network.
IEEE Internet Things J., October, 2023

T-Eval: Evaluating the Tool Utilization Capability Step by Step.
CoRR, 2023

Diff-GO: Diffusion Goal-Oriented Communications to Achieve Ultra-High Spectrum Efficiency.
CoRR, 2023

LawBench: Benchmarking Legal Knowledge of Large Language Models.
CoRR, 2023

InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition.
CoRR, 2023

The Cultural Psychology of Large Language Models: Is ChatGPT a Holistic or Analytic Thinker?
CoRR, 2023

PFL-GAN: When Client Heterogeneity Meets Generative Models in Personalized Federated Learning.
CoRR, 2023

UFed-GAN: A Secure Federated Learning Framework with Constrained Computation and Unlabeled Data.
CoRR, 2023

Learning Referring Video Object Segmentation from Weak Annotation.
CoRR, 2023

Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA.
CoRR, 2023

PS-FedGAN: An Efficient Federated Learning Framework Based on Partially Shared Generative Adversarial Networks For Data Privacy.
CoRR, 2023

Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation.
CoRR, 2023

RIFormer: Keep Your Vision Backbone Effective While Removing Token Mixer.
CoRR, 2023

Temporal Segment Transformer for Action Segmentation.
CoRR, 2023

TG-VQA: Ternary Game of Video Question Answering.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Make-A-Video: Text-to-Video Generation without Text-Video Data.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Improving Pixel-based MIM by Reducing Wasted Modeling Capability.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

To Work-Conserving Packet Scheduling by Load Balance for VOQ Switches.
Proceedings of the 15th IEEE International Conference on Advanced Infocomm Technology, 2023

RIFormer: Keep Your Vision Backbone Effective But Removing Token Mixer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
An Efficient Hypergraph Approach to Robust Point Cloud Resampling.
IEEE Trans. Image Process., 2022

The Vibroacoustic Characteristics Analysis of Transformer Core Faults Based on Multi-Physical Field Coupling.
Symmetry, 2022

Multi-Scale 2D Temporal Adjacency Networks for Moment Localization With Natural Language.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Multilayer graph spectral analysis for hyperspectral images.
EURASIP J. Adv. Signal Process., 2022

Budget-aware Few-shot Learning via Graph Convolutional Network.
CoRR, 2022

Robust Temporally-Coherent Strategy for Few-shot Video Instance Segmentation.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Exemplar-Based Radio Map Reconstruction of Missing Areas Using Propagation Priority.
Proceedings of the IEEE Global Communications Conference, 2022

Learning a Grammar Inducer from Massive Uncurated Instructional Videos.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Expanding Language-Image Pretrained Models for General Video Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

Learning Semantic Correspondence with Sparse Annotations.
Proceedings of the Computer Vision - ECCV 2022, 2022

MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration.
Proceedings of the Computer Vision - ECCV 2022, 2022

Action Quality Assessment with Temporal Parsing Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022

The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Rethinking the Evaluation of Unbiased Scene Graph Generation.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Hypergraph Spectral Analysis and Processing in 3D Point Cloud.
IEEE Trans. Image Process., 2021

Point Cloud Resampling via Hypergraph Signal Processing.
IEEE Signal Process. Lett., 2021

LIA-EN: enhancing the performance of multipath congestion control over lossy networks.
Int. J. Sens. Networks, 2021

An evaluation of bottleneck bandwidth and round trip time and its variants.
Int. J. Commun. Syst., 2021

LearningCC: An online learning approach for congestion control.
Trans. Emerg. Telecommun. Technol., 2021

Hyperspectral Image Segmentation based on Graph Processing over Multilayer Networks.
CoRR, 2021

Workshop on Autonomous Driving at CVPR 2021: Technical Report for Streaming Perception Challenge.
CoRR, 2021

Dynamic Grained Encoder for Vision Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Video-aided Unsupervised Grammar Induction.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

An EM Framework for Online Incremental Learning of Semantic Segmentation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Instance-wise or Class-wise? A Tale of Neighbor Shapley for Concept-based Explanation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Learning Implicit Temporal Alignment for Few-shot Video Classification.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

SAT: 2D Semantics Assisted Training for 3D Visual Grounding.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Distribution Alignment: A Unified Framework for Long-Tail Visual Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Bipartite Graph Network With Adaptive Message Passing for Unbiased Scene Graph Generation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Mi YouTube es Su YouTube? Analyzing the Cultures using YouTube Thumbnails of Popular Videos.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

Boundary Proposal Network for Two-stage Natural Language Video Localization.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Hypergraph Spectral Clustering for Point Cloud Segmentation.
IEEE Signal Process. Lett., 2020

Introducing Hypergraph Signal Processing: Theoretical Foundation and Practical Applications.
IEEE Internet Things J., 2020

Shared bottleneck detection based on trend line regression for multipath transmission.
Int. J. Commun. Syst., 2020

An online learning based path selection for multipath real-time video transmission in overlay network.
Trans. Emerg. Telecommun. Technol., 2020

Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language.
CoRR, 2020

LearningCC: An online learning approach for congestion control.
CoRR, 2020

From Spectrum Wavelet to Vertex Propagation: Graph Convolutional Networks Based on Taylor Approximation.
CoRR, 2020

An Online Learning Based Path Selection for Multipath Video Telephony Service in Overlay.
CoRR, 2020

A Multipath Transport Scheme for Real-Time Multimedia Services Based on Software-Defined Networking and Segment Routing.
IEEE Access, 2020

GPS Intelligent Solution of Aerial Image Target in State Grid EIA Survey.
Proceedings of the Parallel Architectures, Algorithms and Programming, 2020

Point Cloud Segmentation based on Hypergraph Spectral Clustering.
Proceedings of the Information Theory and Applications Workshop, 2020

Transformer with Bidirectional Decoder for Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Global Image Sentiment Transfer.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Hypergraph-Based Image Processing.
Proceedings of the IEEE International Conference on Image Processing, 2020

Part-Aware Prototype Network for Few-Shot Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Content-based Analysis of the Cultural Differences between TikTok and Douyin.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

Learning 2D Temporal Adjacent Networks for Moment Localization with Natural Language.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
A Multi-Dimension Spatial Method for Topology Awareness and Multipath Generating.
Symmetry, 2019

Explorations of skeleton features for LSTM-based action recognition.
Multim. Tools Appl., 2019

SGMR: A spatial geometry-based multipath routing method on overlay networks.
Int. J. Commun. Syst., 2019

Learning Sparse 2D Temporal Adjacent Networks for Temporal Action Localization.
CoRR, 2019

An Evaluation of BBR and its variants.
CoRR, 2019

An Optimized BBR for Multipath Real Time Video Streaming.
CoRR, 2019

Congestion Control and Packet Scheduling for Multipath Real Time Video Streaming.
IEEE Access, 2019

Congestion Control for RTP Media: A Comparison on Simulated Environment.
Proceedings of the Simulation Tools and Techniques - 11th International Conference, 2019

Exploiting Temporal Relationships in Video Moment Localization with Natural Language.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

LatentGNN: Learning Efficient Non-local Relations for Visual Recognition.
Proceedings of the 36th International Conference on Machine Learning, 2019

Dynamic Context Correspondence Network for Semantic Alignment.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

A Dual Attention Network with Semantic Embedding for Few-Shot Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Fusing Geometric Features for Skeleton-Based Action Recognition Using Multilayer LSTM Networks.
IEEE Trans. Multim., 2018

Shared Bottleneck Detecction Based on Trend Line Regression for Multipath Transmission.
CoRR, 2018

Congestion Control for RTP Media: a Comparison on Simulated Environment.
CoRR, 2018

Tensor-based Spectral Analysis of Cascading Failures over Multilayer Complex Systems.
Proceedings of the 56th Annual Allerton Conference on Communication, 2018

2017
Generalization Tower Network: A Novel Deep Neural Network Architecture for Multi-Task Learning.
CoRR, 2017

On Geometric Features for Skeleton-Based Action Recognition Using Multilayer LSTM Networks.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Predicting Salient Face in Multiple-Face Videos.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2015
SparkRDF: Elastic Discreted RDF Graph Processing Engine With Distributed Memory.
Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2015


  Loading...