2025
BrowseComp: A Simple Yet Challenging Benchmark for Browsing Agents.
CoRR, April, 2025

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Self-Play Preference Optimization for Language Model Alignment.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for LLM Problem-Solving.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Lean-STaR: Learning to Interleave Thinking and Proving.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Micro-grid source-load storage energy minimization method based on improved competitive depth Q-network algorithm and digital twinning.
Energy Inform., December, 2024

Design of data center system for immersed liquid-cooled computer room with improved deep learning algorithm.
J. Intell. Fuzzy Syst., January, 2024

Improve Vision Language Model Chain-of-thought Reasoning.
CoRR, 2024

An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models.
CoRR, 2024

Dr-LLaVA: Visual Instruction Tuning with Symbolic Clinical Grounding.
CoRR, 2024

HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild.
CoRR, 2024

Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble.
CoRR, 2024

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

SALMON: Self-Alignment with Instructable Reward Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Enhancing Load Forecasting with VAE-GAN-Based Data Cleaning for Electric Vehicle Charging Loads.
Proceedings of the Database Systems for Advanced Applications. DASFAA 2024 International Workshops, 2024

Aligning Large Multimodal Models with Factually Augmented RLHF.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Instruction-tuned Language Models are Better Knowledge Learners.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Visual Chain-of-Thought Prompting for Knowledge-Based Visual Reasoning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Power grid user asset value evaluation method and application under the background of artificial intelligence and smart grid.
J. Comput. Methods Sci. Eng., 2023

SALMON: Self-Alignment with Principle-Following Reward Models.
CoRR, 2023

Accelerating Diffusion-based Combinatorial Optimization Solvers by Progressive Distillation.
CoRR, 2023

Distributed Optimal Scheduling of Integrated Energy Systems Based on Federated Reinforcement Learning.
Proceedings of the 5th International Conference on Robotics, 2023

A Lightweight-Model Based Federated Learning Approach for Fault Diagnosis in Intelligent Power Distribution Systems.
Proceedings of the 5th International Conference on Robotics, 2023

DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Neural PDE Solver with Temporal Stencil Modeling.
Proceedings of the International Conference on Machine Learning, 2023

Recitation-Augmented Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Demand-side data augmentation method based on generative adversarial networks.
Proceedings of the 6th IEEE International Conference on Information Systems and Computer Aided Education, 2023

Active Retrieval Augmented Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts.
CoRR, 2022

DIMES: A Differentiable Meta Solver for Combinatorial Optimization Problems.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sparse Attention with Learning to Hash.
Proceedings of the Tenth International Conference on Learning Representations, 2022

PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

2021
Multi-Model Fusion Short-Term Load Forecasting Based on Random Forest Feature Selection and Hybrid Neural Network.
IEEE Access, 2021

Real-Time Demand Response Strategy of Temperature-Controlled Load for High Elastic Distribution Network.
IEEE Access, 2021

Multi-Objective Optimal Configuration of Multiple Switchgear Considering Distribution Network Fault Reconfiguration.
IEEE Access, 2021

Hierarchical Probabilistic Ultrasound Image Inpainting via Variational Inference.
Proceedings of the Deep Generative Models, and Data Augmentation, Labelling, and Imperfections, 2021

Rethinking Transformer-based Set Prediction for Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
An EM Approach to Non-autoregressive Conditional Sequence Generation.
Proceedings of the 37th International Conference on Machine Learning, 2020

Dynamically Pruned Message Passing Networks for Large-scale Knowledge Graph Reasoning.
Proceedings of the 8th International Conference on Learning Representations, 2020

MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

A Re-evaluation of Knowledge Graph Completion Methods.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View.
CoRR, 2019

Neural Consciousness Flow.
CoRR, 2019

DivGraphPointer: A Graph Pointer Network for Extracting Diverse Keyphrases.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Fast Structured Decoding for Sequence Models.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

RotatE: Knowledge Graph Embedding by Relational Rotation in Complex Space.
Proceedings of the 7th International Conference on Learning Representations, 2019

Research on the Development Trend and Auditing Mode of High Security Enterprise Intranet Security Audit.
Proceedings of the 11th IEEE International Conference on Advanced Infocomm Technology, 2019

2018
Unsupervised Neural Word Segmentation for Chinese via Segmental Language Modeling.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
A Gap-Based Framework for Chinese Word Segmentation via Very Deep Convolutional Networks.
CoRR, 2017