BREEN: Bridge Data-Efficient Encoder-Free Multimodal Learning with Learnable Queries.
CoRR, March, 2025
On the Robustness of Transformers against Context Hijacking for Linear Classification.
CoRR, February, 2025
Chatbot Arena Estimate: towards a generalized performance benchmark for LLM capabilities.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
How to Evaluate Reward Models for RLHF.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
The Parameter Calibration of Social Force Model for Pedestrian Flow Simulation Based on YOLOv5.
Sensors, August, 2024
Gaussian distribution resampling via Chebyshev distance for food computing.
Appl. Soft Comput., January, 2024
Project MPG: towards a generalized performance benchmark for LLM capabilities.
CoRR, 2024
Y-Mol: A Multiscale Biomedical Knowledge-Guided Large Language Model for Drug Development.
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline.
CoRR, 2024
Long-context LLMs Struggle with Long In-context Learning.
CoRR, 2024
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
GenAI Arena: An Open Evaluation Platform for Generative Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Threshold-Based Automated Pest Detection System for Sustainable Agriculture.
Proceedings of the IEEE International Conference on Internet of Things and Intelligence Systems, 2024
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference.
,
,
,
,
,
,
,
,
,
,
Proceedings of the Forty-first International Conference on Machine Learning, 2024
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset.
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Twelfth International Conference on Learning Representations, 2024
ImagenHub: Standardizing the evaluation of conditional image generation models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
SWAG: Storytelling With Action Guidance.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Feature Fusion for Multi-Condition Controllable Image Generation.
Proceedings of the 2024 8th International Conference on Big Data and Internet of Things, 2024
A Privacy-Preserving Aggregation Method via Grouped Secure Multi-Party Computation.
Proceedings of the Proceeding of the 5th Asia Service Sciences and Software Engineering Conference, 2024
Function2vec: A Geographic Knowledge Graph Model of Urban Function Evolution and Its Application.
ISPRS Int. J. Geo Inf., November, 2023
DreamEdit: Subject-driven Image Editing.
Trans. Mach. Learn. Res., 2023
A Masked Attention Network with Query Sparsity Measurement for Time Series Anomaly Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
Rethinking graph anomaly detection: A self-supervised Group Discrimination paradigm with Structure-Aware.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
Few-shot In-context Learning on Knowledge Base Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
AnaMeta: A Table Understanding Dataset of Field Metadata Knowledge Shared by Multi-dimensional Data Analysis Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Inferring Tabular Analysis Metadata by Infusing Distribution and Knowledge Information.
CoRR, 2022
ASTA: Learning Analytical Semantics over Tables for Intelligent Data Analysis and Visualization.
CoRR, 2022
A Simple Yet Efficient Method for Adversarial Word-Substitute Attack.
CoRR, 2022
Fake News Detection with Heterogeneous Transformer.
CoRR, 2022
A Multi-Stage Model for the Electromagnetic Shielding Effectiveness Prediction of an Infinite Conductor Plane With Periodic Apertures.
IEEE Access, 2019