2025
BREEN: Bridge Data-Efficient Encoder-Free Multimodal Learning with Learnable Queries.
CoRR, March, 2025

On the Robustness of Transformers against Context Hijacking for Linear Classification.
CoRR, February, 2025

Prompt-to-Leaderboard.
CoRR, February, 2025

Chatbot Arena Estimate: towards a generalized performance benchmark for LLM capabilities.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

How to Evaluate Reward Models for RLHF.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
The Parameter Calibration of Social Force Model for Pedestrian Flow Simulation Based on YOLOv5.
Sensors, August, 2024

Gaussian distribution resampling via Chebyshev distance for food computing.
Appl. Soft Comput., January, 2024

Project MPG: towards a generalized performance benchmark for LLM capabilities.
CoRR, 2024

Y-Mol: A Multiscale Biomedical Knowledge-Guided Large Language Model for Drug Development.
CoRR, 2024

From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline.
CoRR, 2024

Long-context LLMs Struggle with Long In-context Learning.
CoRR, 2024

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

GenAI Arena: An Open Evaluation Platform for Generative Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Threshold-Based Automated Pest Detection System for Sustainable Agriculture.
Proceedings of the IEEE International Conference on Internet of Things and Intelligence Systems, 2024

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

ImagenHub: Standardizing the evaluation of conditional image generation models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

SWAG: Storytelling With Action Guidance.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Feature Fusion for Multi-Condition Controllable Image Generation.
Proceedings of the 2024 8th International Conference on Big Data and Internet of Things, 2024

A Privacy-Preserving Aggregation Method via Grouped Secure Multi-Party Computation.
Proceedings of the Proceeding of the 5th Asia Service Sciences and Software Engineering Conference, 2024

2023
Function2vec: A Geographic Knowledge Graph Model of Urban Function Evolution and Its Application.
ISPRS Int. J. Geo Inf., November, 2023

DreamEdit: Subject-driven Image Editing.
Trans. Mach. Learn. Res., 2023

A Masked Attention Network with Query Sparsity Measurement for Time Series Anomaly Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Rethinking graph anomaly detection: A self-supervised Group Discrimination paradigm with Structure-Aware.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Few-shot In-context Learning on Knowledge Base Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

AnaMeta: A Table Understanding Dataset of Field Metadata Knowledge Shared by Multi-dimensional Data Analysis Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Inferring Tabular Analysis Metadata by Infusing Distribution and Knowledge Information.
CoRR, 2022

ASTA: Learning Analytical Semantics over Tables for Intelligent Data Analysis and Visualization.
CoRR, 2022

A Simple Yet Efficient Method for Adversarial Word-Substitute Attack.
CoRR, 2022

Fake News Detection with Heterogeneous Transformer.
CoRR, 2022

2019
A Multi-Stage Model for the Electromagnetic Shielding Effectiveness Prediction of an Infinite Conductor Plane With Periodic Apertures.
IEEE Access, 2019