Generative AI Act II: Test Time Scaling Drives Cognition Engineering.
CoRR, April, 2025
PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World.
CoRR, 2024
Benchmarking Benchmark Leakage in Large Language Models.
CoRR, 2024
OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Generative Judge for Evaluating Alignment.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
RIGHT: Retrieval-Augmented Generation for Mainstream Hashtag Recommendation.
Proceedings of the Advances in Information Retrieval, 2024
MerA: Merging Pretrained Adapters For Few-Shot Learning.
CoRR, 2023
Merging Experts into One: Improving Computational Efficiency of Mixture of Experts.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023