2025
Privacy-preserved federated clustering with Non-IID data via GANs.
J. Supercomput., March, 2025
Physics-Inspired Distributed Radio Map Estimation.
CoRR, February, 2025
LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, January, 2025
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement.
CoRR, January, 2025
InternLM-Law: An Open-Sourced Chinese Legal Large Language Model.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
2024
RadioGAT: A Joint Model-Based and Data-Driven Framework for Multi-Band Radiomap Reconstruction via Graph Attention Networks.
IEEE Trans. Wirel. Commun., November, 2024
Radiomap Inpainting for Restricted Areas Based on Propagation Priority and Depth Map.
IEEE Trans. Wirel. Commun., August, 2024
Physics-Inspired Machine Learning for Radiomap Estimation: Integration of Radio Propagation Models and Artificial Intelligence.
IEEE Commun. Mag., August, 2024
A digital speckle stereo matching algorithm based on epipolar line correction.
Signal Image Video Process., July, 2024
SGTR+: End-to-End Scene Graph Generation With Transformer.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2024
Signal Processing Over Multilayer Graphs: Theoretical Foundations and Practical Applications.
IEEE Internet Things J., January, 2024
Efficient Eigen-Decomposition for Low-Rank Symmetric Matrices in Graph Signal Processing: An Incremental Approach.
IEEE Trans. Signal Process., 2024
Cross Modality Bias in Visual Question Answering: A Causal View With Possible Worlds VQA.
IEEE Trans. Multim., 2024
PixMIM: Rethinking Pixel Reconstruction in Masked Image Modeling.
Trans. Mach. Learn. Res., 2024
PS-FedGAN: An Efficient Federated Learning Framework With Strong Data Privacy.
IEEE Internet Things J., 2024
Efficient cross-information fusion decoder for semantic segmentation.
Comput. Vis. Image Underst., 2024
LaMI-GO: Latent Mixture Integration for Goal-Oriented Communications Achieving High Spectrum Efficiency.
CoRR, 2024
DualGFL: Federated Learning with a Dual-Level Coalition-Auction Game.
CoRR, 2024
Are Your LLMs Capable of Stable Reasoning?
CoRR, 2024
CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution.
CoRR, 2024
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models.
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Cross: A Delay Based Congestion Control Method for RTP Media.
CoRR, 2024
UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios.
CoRR, 2024
NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?
CoRR, 2024
CIBench: Evaluating Your LLMs with a Code Interpreter Plugin.
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
InternLM-Law: An Open Source Chinese Legal Large Language Model.
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
FoundaBench: Evaluating Chinese Fundamental Knowledge Capabilities of Large Language Models.
CoRR, 2024
Adapting LLaMA Decoder to Vision Transformer.
CoRR, 2024
InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance.
CoRR, 2024
GTA: A Benchmark for General Tool Agents.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Fake Alignment: Are LLMs Really Aligned Well?
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
FedSC: Provable Federated Self-supervised Learning with Spectral Contrastive Objective over Non-i.i.d. Data.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Diff-GO: Diffusion Goal-Oriented Communications with Ultra-High Spectrum Efficiency.
Proceedings of the IEEE International Conference on Communications Workshops, 2024
UFed-GAN: Secure Federated Learning over Wireless Sensor Networks with Unlabeled Data.
Proceedings of the IEEE International Conference on Communications Workshops, 2024
Split-FL: An Efficient Online Federated Learning Framework with Constrained Computation and Streaming Data.
Proceedings of the IEEE International Conference on Communications Workshops, 2024
ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
LawBench: Benchmarking Legal Knowledge of Large Language Models.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
MMBench: Is Your Multi-modal Model an All-Around Player?
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Computer Vision - ECCV 2024, 2024
From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step.
,
,
,
,
,
,
,
,
,
,
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
RME-GAN: A Learning Framework for Radio Map Estimation Based on Conditional Generative Adversarial Network.
IEEE Internet Things J., October, 2023
T-Eval: Evaluating the Tool Utilization Capability Step by Step.
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
Diff-GO: Diffusion Goal-Oriented Communications to Achieve Ultra-High Spectrum Efficiency.
CoRR, 2023
LawBench: Benchmarking Legal Knowledge of Large Language Models.
CoRR, 2023
InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
The Cultural Psychology of Large Language Models: Is ChatGPT a Holistic or Analytic Thinker?
CoRR, 2023
PFL-GAN: When Client Heterogeneity Meets Generative Models in Personalized Federated Learning.
CoRR, 2023
UFed-GAN: A Secure Federated Learning Framework with Constrained Computation and Unlabeled Data.
CoRR, 2023
Learning Referring Video Object Segmentation from Weak Annotation.
CoRR, 2023
Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA.
CoRR, 2023
PS-FedGAN: An Efficient Federated Learning Framework Based on Partially Shared Generative Adversarial Networks For Data Privacy.
CoRR, 2023
Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation.
CoRR, 2023
RIFormer: Keep Your Vision Backbone Effective While Removing Token Mixer.
CoRR, 2023
Temporal Segment Transformer for Action Segmentation.
CoRR, 2023
TG-VQA: Ternary Game of Video Question Answering.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Make-A-Video: Text-to-Video Generation without Text-Video Data.
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Improving Pixel-based MIM by Reducing Wasted Modeling Capability.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
To Work-Conserving Packet Scheduling by Load Balance for VOQ Switches.
Proceedings of the 15th IEEE International Conference on Advanced Infocomm Technology, 2023
RIFormer: Keep Your Vision Backbone Effective But Removing Token Mixer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
An Efficient Hypergraph Approach to Robust Point Cloud Resampling.
IEEE Trans. Image Process., 2022
The Vibroacoustic Characteristics Analysis of Transformer Core Faults Based on Multi-Physical Field Coupling.
Symmetry, 2022
Multi-Scale 2D Temporal Adjacency Networks for Moment Localization With Natural Language.
IEEE Trans. Pattern Anal. Mach. Intell., 2022
Multilayer graph spectral analysis for hyperspectral images.
EURASIP J. Adv. Signal Process., 2022
Budget-aware Few-shot Learning via Graph Convolutional Network.
CoRR, 2022
Robust Temporally-Coherent Strategy for Few-shot Video Instance Segmentation.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022
Exemplar-Based Radio Map Reconstruction of Missing Areas Using Propagation Priority.
Proceedings of the IEEE Global Communications Conference, 2022
Learning a Grammar Inducer from Massive Uncurated Instructional Videos.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Expanding Language-Image Pretrained Models for General Video Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022
Learning Semantic Correspondence with Sparse Annotations.
Proceedings of the Computer Vision - ECCV 2022, 2022
MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration.
Proceedings of the Computer Vision - ECCV 2022, 2022
Action Quality Assessment with Temporal Parsing Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022
The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Rethinking the Evaluation of Unbiased Scene Graph Generation.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022
2021
Hypergraph Spectral Analysis and Processing in 3D Point Cloud.
IEEE Trans. Image Process., 2021
Point Cloud Resampling via Hypergraph Signal Processing.
IEEE Signal Process. Lett., 2021
LIA-EN: enhancing the performance of multipath congestion control over lossy networks.
Int. J. Sens. Networks, 2021
An evaluation of bottleneck bandwidth and round trip time and its variants.
Int. J. Commun. Syst., 2021
LearningCC: An online learning approach for congestion control.
Trans. Emerg. Telecommun. Technol., 2021
Hyperspectral Image Segmentation based on Graph Processing over Multilayer Networks.
CoRR, 2021
Workshop on Autonomous Driving at CVPR 2021: Technical Report for Streaming Perception Challenge.
CoRR, 2021
Dynamic Grained Encoder for Vision Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Video-aided Unsupervised Grammar Induction.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
An EM Framework for Online Incremental Learning of Semantic Segmentation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Instance-wise or Class-wise? A Tale of Neighbor Shapley for Concept-based Explanation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Learning Implicit Temporal Alignment for Few-shot Video Classification.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
SAT: 2D Semantics Assisted Training for 3D Visual Grounding.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Distribution Alignment: A Unified Framework for Long-Tail Visual Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Bipartite Graph Network With Adaptive Message Passing for Unbiased Scene Graph Generation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Mi YouTube es Su YouTube? Analyzing the Cultures using YouTube Thumbnails of Popular Videos.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021
Boundary Proposal Network for Two-stage Natural Language Video Localization.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
Hypergraph Spectral Clustering for Point Cloud Segmentation.
IEEE Signal Process. Lett., 2020
Introducing Hypergraph Signal Processing: Theoretical Foundation and Practical Applications.
IEEE Internet Things J., 2020
Shared bottleneck detection based on trend line regression for multipath transmission.
Int. J. Commun. Syst., 2020
An online learning based path selection for multipath real-time video transmission in overlay network.
Trans. Emerg. Telecommun. Technol., 2020
Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language.
CoRR, 2020
LearningCC: An online learning approach for congestion control.
CoRR, 2020
From Spectrum Wavelet to Vertex Propagation: Graph Convolutional Networks Based on Taylor Approximation.
CoRR, 2020
An Online Learning Based Path Selection for Multipath Video Telephony Service in Overlay.
CoRR, 2020
A Multipath Transport Scheme for Real-Time Multimedia Services Based on Software-Defined Networking and Segment Routing.
IEEE Access, 2020
GPS Intelligent Solution of Aerial Image Target in State Grid EIA Survey.
Proceedings of the Parallel Architectures, Algorithms and Programming, 2020
Point Cloud Segmentation based on Hypergraph Spectral Clustering.
Proceedings of the Information Theory and Applications Workshop, 2020
Transformer with Bidirectional Decoder for Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Global Image Sentiment Transfer.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Hypergraph-Based Image Processing.
Proceedings of the IEEE International Conference on Image Processing, 2020
Part-Aware Prototype Network for Few-Shot Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020
Content-based Analysis of the Cultural Differences between TikTok and Douyin.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020
Learning 2D Temporal Adjacent Networks for Moment Localization with Natural Language.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
A Multi-Dimension Spatial Method for Topology Awareness and Multipath Generating.
Symmetry, 2019
Explorations of skeleton features for LSTM-based action recognition.
Multim. Tools Appl., 2019
SGMR: A spatial geometry-based multipath routing method on overlay networks.
Int. J. Commun. Syst., 2019
Learning Sparse 2D Temporal Adjacent Networks for Temporal Action Localization.
CoRR, 2019
An Evaluation of BBR and its variants.
CoRR, 2019
An Optimized BBR for Multipath Real Time Video Streaming.
CoRR, 2019
Congestion Control and Packet Scheduling for Multipath Real Time Video Streaming.
IEEE Access, 2019
Congestion Control for RTP Media: A Comparison on Simulated Environment.
Proceedings of the Simulation Tools and Techniques - 11th International Conference, 2019
Exploiting Temporal Relationships in Video Moment Localization with Natural Language.
Proceedings of the 27th ACM International Conference on Multimedia, 2019
LatentGNN: Learning Efficient Non-local Relations for Visual Recognition.
Proceedings of the 36th International Conference on Machine Learning, 2019
Dynamic Context Correspondence Network for Semantic Alignment.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
A Dual Attention Network with Semantic Embedding for Few-Shot Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Fusing Geometric Features for Skeleton-Based Action Recognition Using Multilayer LSTM Networks.
IEEE Trans. Multim., 2018
Shared Bottleneck Detecction Based on Trend Line Regression for Multipath Transmission.
CoRR, 2018
Congestion Control for RTP Media: a Comparison on Simulated Environment.
CoRR, 2018
Tensor-based Spectral Analysis of Cascading Failures over Multilayer Complex Systems.
Proceedings of the 56th Annual Allerton Conference on Communication, 2018
2017
Generalization Tower Network: A Novel Deep Neural Network Architecture for Multi-Task Learning.
CoRR, 2017
On Geometric Features for Skeleton-Based Action Recognition Using Multilayer LSTM Networks.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017
Predicting Salient Face in Multiple-Face Videos.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
2015
SparkRDF: Elastic Discreted RDF Graph Processing Engine With Distributed Memory.
Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2015