2025
BrowseComp: A Simple Yet Challenging Benchmark for Browsing Agents.
CoRR, April, 2025
Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
Self-Play Preference Optimization for Language Model Alignment.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for LLM Problem-Solving.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Lean-STaR: Learning to Interleave Thinking and Proving.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
2024
Micro-grid source-load storage energy minimization method based on improved competitive depth Q-network algorithm and digital twinning.
Energy Inform., December, 2024
Design of data center system for immersed liquid-cooled computer room with improved deep learning algorithm.
J. Intell. Fuzzy Syst., January, 2024
Improve Vision Language Model Chain-of-thought Reasoning.
CoRR, 2024
An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models.
CoRR, 2024
Dr-LLaVA: Visual Instruction Tuning with Symbolic Clinical Grounding.
CoRR, 2024
HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild.
CoRR, 2024
Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble.
CoRR, 2024
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
SALMON: Self-Alignment with Instructable Reward Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Enhancing Load Forecasting with VAE-GAN-Based Data Cleaning for Electric Vehicle Charging Loads.
Proceedings of the Database Systems for Advanced Applications. DASFAA 2024 International Workshops, 2024
Aligning Large Multimodal Models with Factually Augmented RLHF.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Instruction-tuned Language Models are Better Knowledge Learners.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Visual Chain-of-Thought Prompting for Knowledge-Based Visual Reasoning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Power grid user asset value evaluation method and application under the background of artificial intelligence and smart grid.
J. Comput. Methods Sci. Eng., 2023
SALMON: Self-Alignment with Principle-Following Reward Models.
CoRR, 2023
Accelerating Diffusion-based Combinatorial Optimization Solvers by Progressive Distillation.
CoRR, 2023
Distributed Optimal Scheduling of Integrated Energy Systems Based on Federated Reinforcement Learning.
Proceedings of the 5th International Conference on Robotics, 2023
A Lightweight-Model Based Federated Learning Approach for Fault Diagnosis in Intelligent Power Distribution Systems.
Proceedings of the 5th International Conference on Robotics, 2023
DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
A Neural PDE Solver with Temporal Stencil Modeling.
Proceedings of the International Conference on Machine Learning, 2023
Recitation-Augmented Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Demand-side data augmentation method based on generative adversarial networks.
Proceedings of the 6th IEEE International Conference on Information Systems and Computer Aided Education, 2023
Active Retrieval Augmented Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
2022
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts.
CoRR, 2022
DIMES: A Differentiable Meta Solver for Combinatorial Optimization Problems.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Sparse Attention with Learning to Hash.
Proceedings of the Tenth International Conference on Learning Representations, 2022
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022
2021
Multi-Model Fusion Short-Term Load Forecasting Based on Random Forest Feature Selection and Hybrid Neural Network.
IEEE Access, 2021
Real-Time Demand Response Strategy of Temperature-Controlled Load for High Elastic Distribution Network.
IEEE Access, 2021
Multi-Objective Optimal Configuration of Multiple Switchgear Considering Distribution Network Fault Reconfiguration.
IEEE Access, 2021
Hierarchical Probabilistic Ultrasound Image Inpainting via Variational Inference.
Proceedings of the Deep Generative Models, and Data Augmentation, Labelling, and Imperfections, 2021
Rethinking Transformer-based Set Prediction for Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
2020
An EM Approach to Non-autoregressive Conditional Sequence Generation.
Proceedings of the 37th International Conference on Machine Learning, 2020
Dynamically Pruned Message Passing Networks for Large-scale Knowledge Graph Reasoning.
Proceedings of the 8th International Conference on Learning Representations, 2020
MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
A Re-evaluation of Knowledge Graph Completion Methods.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
2019
Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View.
CoRR, 2019
Neural Consciousness Flow.
CoRR, 2019
DivGraphPointer: A Graph Pointer Network for Extracting Diverse Keyphrases.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019
Fast Structured Decoding for Sequence Models.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
RotatE: Knowledge Graph Embedding by Relational Rotation in Complex Space.
Proceedings of the 7th International Conference on Learning Representations, 2019
Research on the Development Trend and Auditing Mode of High Security Enterprise Intranet Security Audit.
Proceedings of the 11th IEEE International Conference on Advanced Infocomm Technology, 2019
2018
Unsupervised Neural Word Segmentation for Chinese via Segmental Language Modeling.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
2017
A Gap-Based Framework for Chinese Word Segmentation via Very Deep Convolutional Networks.
CoRR, 2017