2025
OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas.
CoRR, January, 2025
2024
Knowledge-augmented Methods for Natural Language Processing
Springer Briefs in Computer Science, Springer, ISBN: 978-981-97-0749-2, 2024
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization.
CoRR, 2024
RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph.
CoRR, 2024
Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks.
CoRR, 2024
DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading Systems.
CoRR, 2024
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning.
CoRR, 2024
MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions.
CoRR, 2024
Describe-then-Reason: Improving Multimodal Mathematical Reasoning through Visual Comprehension Training.
CoRR, 2024
StarCoder 2 and The Stack v2: The Next Generation.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
et al.
CoRR, 2024
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Deep Multimodal Complementarity Learning.
IEEE Trans. Neural Networks Learn. Syst., December, 2023
StarCoder: may the source be with you!
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Trans. Mach. Learn. Res., 2023
Exploring Contrast Consistency of Open-Domain Question Answering Systems on Minimally Edited Questions.
Trans. Assoc. Comput. Linguistics, 2023
Improving Language Models via Plug-and-Play Retrieval Feedback.
CoRR, 2023
Knowledge-Augmented Methods for Natural Language Processing.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023
GraphPatcher: Mitigating Degree Bias for Graph Neural Networks via Test-time Augmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
The Second Workshop on Knowledge-Augmented Methods for Natural Language Processing.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023
Multi-task Self-supervised Graph Neural Networks Enable Stronger Task Generalization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Generate rather than Retrieve: Large Language Models are Strong Context Generators.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Pre-training Language Models for Comparative Reasoning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
A Survey of Multi-task Learning in Natural Language Processing: Regarding Task Relatedness and Training Methods.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023
Large Language Models are Built-in Autoregressive Search Engines.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
A Survey of Deep Learning for Mathematical Reasoning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
APOLLO: A Simple Approach for Adaptive Pretraining of Language Models for Logical Reasoning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
A Survey of Knowledge-enhanced Text Generation.
ACM Comput. Surv., January, 2022
Empowering Language Models with Knowledge Graph Reasoning for Question Answering.
CoRR, 2022
Enhancing Automated Software Traceability by Transfer Learning from Open-World Data.
CoRR, 2022
Retrieval-augmented Generation across Heterogeneous Knowledge.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop, 2022
Learning from Counterfactual Links for Link Prediction.
Proceedings of the International Conference on Machine Learning, 2022
A Unified Encoder-Decoder Framework with Entity Memory.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Grape: Knowledge Graph Enhanced Passage Reader for Open-domain Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Empowering Language Models with Knowledge Graph Reasoning for Open-Domain Question Answering.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Retrieval Augmentation for Commonsense Reasoning: A Unified Approach.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Task Compass: Scaling Multi-task Pre-training with Task Prefix.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
KG-FiD: Infusing Knowledge Graph in Fusion-in-Decoder for Open-Domain Question Answering.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Diversifying Content Generation for Commonsense Reasoning with Mixture of Knowledge Graph Experts.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
Dict-BERT: Enhancing Language Model Pre-training with Dictionary.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
2021
Counterfactual Graph Learning for Link Prediction.
CoRR, 2021
Validating Label Consistency in NER Data Annotation.
CoRR, 2021
Few-Shot Graph Learning for Molecular Property Prediction.
Proceedings of the WWW '21: The Web Conference 2021, 2021
Technical Question Answering across Tasks and Domains.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers, 2021
Enhancing Taxonomy Completion with Concept Generation via Fusing Relational Representations.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021
Validating Label Consistency in NER Data Annotation.
Proceedings of the 2nd Workshop on Evaluation and Comparison of NLP Systems, 2021
Sentence-Permuted Paragraph Generation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Injecting Entity Types into Entity-Guided Text Generation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Action Sequence Augmentation for Early Graph-based Anomaly Detection.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021
2020
Early Anomaly Detection by Learning and Forecasting Behavior.
CoRR, 2020
Identifying Referential Intention with Heterogeneous Contexts.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020
Experimental Evidence Extraction System in Data Science with Hybrid Table Features and Ensemble Learning.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020
Tri-Train: Automatic Pre-Fine Tuning between Pre-Training and Fine-Tuning for SciNER.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
A Technical Question Answering System with Transfer Learning.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020
GraSeq: Graph and Sequence Fusion Learning for Molecular Property Prediction.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020
Crossing Variational Autoencoders for Answer Retrieval.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
2019
Tablepedia: Automating PDF Table Reading in an Experimental Evidence Exploration and Analytic System.
Proceedings of the World Wide Web Conference, 2019
Faceted Hierarchy: A New Graph Type to Organize Scientific Concepts and a Construction Method.
Proceedings of the Thirteenth Workshop on Graph-Based Methods for Natural Language Processing, 2019