Shizhu He

Orcid: 0000-0001-9053-9517

According to our database1, Shizhu He authored at least 120 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Towards Better Quantity Representations for Solving Math Word Problems.
ACM Trans. Asian Low Resour. Lang. Inf. Process., July, 2024

Seq2Set2Seq: A Two-stage Disentangled Method for Reply Keyword Generation in Social Media.
ACM Trans. Asian Low Resour. Lang. Inf. Process., March, 2024

DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models.
CoRR, 2024

Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks.
CoRR, 2024

<i>SKIntern</i>: Internalizing Symbolic Knowledge for Distilling Better CoT Capabilities into Small Language Models.
CoRR, 2024

Beyond Instruction Following: Evaluating Rule Following of Large Language Models.
CoRR, 2024

From Instance Training to Instruction Learning: Task Adapters Generation from Instructions.
CoRR, 2024

Generate-on-Graph: Treat LLM as both Agent and KG in Incomplete Knowledge Graph Question Answering.
CoRR, 2024

Imagination Augmented Generation: Learning to Imagine Richer Context for Question Answering over Large Language Models.
CoRR, 2024

From Chain to Tree: Refining Chain-like Rules into Tree-like Rules on Knowledge Graphs.
CoRR, 2024

MoELoRA: Contrastive Learning Guided Mixture of Experts on Parameter-Efficient Fine-Tuning for Large Language Models.
CoRR, 2024

ControlLM: Crafting Diverse Personalities for Language Models.
CoRR, 2024

Large Language Models With Holistically Thought Could Be Better Doctors.
Proceedings of the Natural Language Processing and Chinese Computing, 2024

S3Eval: A Synthetic, Scalable, Systematic Evaluation Suite for Large Language Model.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Unsupervised Learning of Neural Semantic Mappings with the Hungarian Algorithm for Compositional Semantics.
Proceedings of the IEEE International Conference on Acoustics, 2024

Generate-on-Graph: Treat LLM as both Agent and KG for Incomplete Knowledge Graph Question Answering.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Instance-Level Dynamic LoRAs Composition for Cross-Task Generalization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Does Large Language Model Contain Task-Specific Neurons?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Towards Graph-hop Retrieval and Reasoning in Complex Question Answering over Textual Database.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

BP4ER: Bootstrap Prompting for Explicit Reasoning in Medical Dialogue Generation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

ItD: Large Language Models Can Teach Themselves Induction through Deduction.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Teaching Small Language Models to Reason for Knowledge-Intensive Multi-Hop Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Unsupervised Domain Adaptation on Sentence Matching Through Self-Supervision.
J. Comput. Sci. Technol., December, 2023

Unsupervised Dialogue State Tracking for End-to-End Task-Oriented Dialogue with a Multi-Span Prediction Network.
J. Comput. Sci. Technol., July, 2023

A Brief Overview of ChatGPT: The History, Status Quo and Potential Future Development.
IEEE CAA J. Autom. Sinica, May, 2023

Bidirectional Sentence Ordering with Interactive Decoding.
ACM Trans. Asian Low Resour. Lang. Inf. Process., January, 2023

S3Eval: A Synthetic, Scalable, Systematic Evaluation Suite for Large Language Models.
CoRR, 2023

TableQAKit: A Comprehensive and Practical Toolkit for Table-based Question Answering.
CoRR, 2023

HRoT: Hybrid prompt strategy and Retrieval of Thought for Table-Text Hybrid Question Answering.
CoRR, 2023

MMHQA-ICL: Multimodal In-context Learning for Hybrid Question Answering over Text, Tables and Images.
CoRR, 2023

LMTuner: An user-friendly and highly-integrable Training Framework for fine-tuning Large Language Models.
CoRR, 2023

Towards Graph-hop Retrieval and Reasoning in Complex Question Answering over Textual Database.
CoRR, 2023

S<sup>3</sup>HQA: A Three-Stage Approach for Multi-hop Text-Table Hybrid Question Answering.
CoRR, 2023

Large Language Models Need Holistically Thought in Medical Conversational QA.
CoRR, 2023

Neural Comprehension: Language Models with Compiled Neural Networks.
CoRR, 2023

Knowledge Reasoning via Jointly Modeling Knowledge Graphs and Soft Rules.
CoRR, 2023

Multi-Target Semantic Parsing with Collaborative Deliberation Network.
Proceedings of the Findings of the Association for Computational Linguistics: IJCNLP-AACL 2023, 2023

Learning to Build Reasoning Chains by Reliable Path Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2023

Query2Triple: Unified Query Encoding for Answering Diverse Complex Queries over Knowledge Graphs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Large Language Models are Better Reasoners with Self-Verification.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

ExpNote: Black-box Large Language Models are better Task Solvers with Experience Notebook.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Bipartite Graph Pre-training for Unsupervised Extractive Summarization with Graph Convolutional Auto-Encoders.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Efficient Data Learning for Open Information Extraction with Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Find Parent then Label Children: A Two-stage Taxonomy Completion Method with Pre-trained Language Model.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

On the Effects of Structural Modeling for Neural Semantic Parsing.
Proceedings of the 27th Conference on Computational Natural Language Learning, 2023

Prediction and Calibration: Complex Reasoning over Knowledge Graph with Bi-directional Directed Acyclic Graph Neural Network.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Multilingual Knowledge Graph Completion from Pretrained Language Models with Knowledge Constraints.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Class Lifelong Learning for Intent Detection via Structure Consolidation Networks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

S3HQA: A Three-Stage Approach for Multi-hop Text-Table Hybrid Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
Fact-Driven Abstractive Summarization by Utilizing Multi-Granular Multi-Relational Knowledge.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Using Pre-trained Language Model to Enhance Active Learning for Sentence Matching.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2022

Large Language Models are reasoners with Self-Verification.
CoRR, 2022

ReasonChainQA: Text-based Complex Question Answering with Explainable Evidence Chains.
CoRR, 2022

LingYi: Medical Conversational Question Answering System based on Multi-modal Knowledge Graphs.
CoRR, 2022

Example-guided stylized response generation in zero-shot setting.
Sci. China Inf. Sci., 2022

LingJing at SemEval-2022 Task 3: Applying DeBERTa to Lexical-level Presupposed Relation Taxonomy with Knowledge Transfer.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

LingJing at SemEval-2022 Task 1: Multi-task Self-supervised Pre-training for Multilingual Reverse Dictionary.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

MedConQA: Medical Conversational Question Answering System based on Knowledge Graphs.
Proceedings of the The 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Decoupling Mixture-of-Graphs: Unseen Relational Learning for Knowledge Graph Completion by Fusing Ontology and Textual Experts.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Answering Numerical Reasoning Questions in Table-Text Hybrid Contents with Graph-based Encoder and Tree-based Decoder.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Learning to Answer Complex Visual Questions from Multi-View Analysis.
Proceedings of the CCKS 2022 - Evaluation Track, 2022

Leveraging Explicit Lexico-logical Alignments in Text-to-SQL Parsing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

Incremental Intent Detection for Medical Domain with Contrast Replay Networks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

ANACONDA: Adversarial training with iNtrust loss in ACrONym DisambiguAtion.
Proceedings of the Workshop on Scientific Document Understanding co-located with 36th AAAI Conference on Artificial Inteligence, 2022

ADBCMM : Acronym Disambiguation by Building Counterfactuals and Multilingual Mixing.
Proceedings of the Workshop on Scientific Document Understanding co-located with 36th AAAI Conference on Artificial Inteligence, 2022

2021
Path-based knowledge reasoning with textual semantic information for medical knowledge graph completion.
BMC Medical Informatics Decis. Mak., 2021

Heterogeneous Relational Graph Neural Networks with Adaptive Objective for End-to-End Task-Oriented Dialogue.
Knowl. Based Syst., 2021

A Unified Shared-Private Network with Denoising for Dialogue State Tracking.
J. Comput. Sci. Technol., 2021

ADBCMM : Acronym Disambiguation by Building Counterfactuals and Multilingual Mixing.
CoRR, 2021

Lifelong Intent Detection via Multi-Strategy Rebalancing.
CoRR, 2021

Domain-Lifelong Learning for Dialogue State Tracking via Knowledge Preservation Networks.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Multi-strategy Knowledge Distillation Based Teacher-Student Framework for Machine Reading Comprehension.
Proceedings of the Chinese Computational Linguistics - 20th China National Conference, 2021

Does BERT Know Which Answer Beyond the Question?
Proceedings of the CCKS 2021 - Evaluation Track, 2021

Toward a Better Text Data Augmentation via Filtering and Transforming Augmented Instances.
Proceedings of the Knowledge Graph and Semantic Computing: Knowledge Graph Empowers New Infrastructure Construction, 2021

2020
Pre-trained Language Model Based Active Learning for Sentence Matching.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
Copy-Enhanced Heterogeneous Information Learning for Dialogue State Tracking.
CoRR, 2019

Variational Attention for Commonsense Knowledge Aware Conversation Generation.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

Learning the Extraction Order of Multiple Relational Facts in a Sentence with Reinforcement Learning.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Generating Questions for Knowledge Bases via Incorporating Diversified Contexts and Answer-Aware Loss.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Incorporating Interlocutor-Aware Context into Response Generation on Multi-Party Chatbots.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Utterance Alignment in Custom Service by Integer Programming.
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

AdaNSP: Uncertainty-driven Adaptive Decoding in Neural Semantic Parsing.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Vocabulary Pyramid Network: Multi-Pass Encoding and Decoding with Multi-Level Vocabularies for Response Generation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Learning to Align Question and Answer Utterances in Customer Service Conversation with Recurrent Pointer Networks.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Curriculum Learning for Natural Answer Generation.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Pattern-revising Enhanced Simple Question Answering over Knowledge Bases.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Learning to Detect Verbose Expressions in Spoken Texts.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2018

Extracting Relational Facts by an End-to-End Neural Model with Copy Mechanism.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Large Scaled Relation Extraction With Reinforcement Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
IJCNLP-2017 Task 5: Multi-choice Question Answering in Examinations.
Proceedings of the IJCNLP 2017, Shared Tasks, Taipei, Taiwan, November 27, 2017

Which is the Effective Way for Gaokao: Information Retrieval or Neural Networks?
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Unsupervised Joint Entity Linking over Question Answering Pair with Global Knowledge.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2017

Generating Natural Answers by Incorporating Copying and Retrieving Mechanisms in Sequence-to-Sequence Learning.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

An End-to-End Model for Question Answering over Knowledge Base with Cross-Attention Combining Global Knowledge.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Distant Supervision for Relation Extraction with Sentence-Level Attention and Entity Descriptions.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
How to Generate a Good Word Embedding.
IEEE Intell. Syst., 2016

Question Answering over Knowledge Base with Neural Attention Combining Global Knowledge Information.
CoRR, 2016

Employing External Rich Knowledge for Machine Comprehension.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Learning to Represent Review with Tensor Decomposition for Spam Detection.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Event Extraction via Bidirectional Long Short-Term Memory Tensor Neural Networks.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2016

Link Prediction via Mining Markov Logic Formulas to Improve Social Recommendation.
Proceedings of the Knowledge Graph and Semantic Computing: Semantic, Knowledge, and Linked Big Data, 2016

A Joint Embedding Method for Entity Alignment of Knowledge Bases.
Proceedings of the Knowledge Graph and Semantic Computing: Semantic, Knowledge, and Linked Big Data, 2016

Leveraging FrameNet to Improve Automatic Event Detection.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

A Joint Model for Question Answering over Multiple Knowledge Bases.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

A Probabilistic Soft Logic Based Approach to Exploiting Latent and Global Information in Event Classification.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Knowledge Graph Completion with Adaptive Sparse Transfer Matrix.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Question Answering over Knowledge Bases.
IEEE Intell. Syst., 2015

Learning to Represent Knowledge Graphs with Gaussian Embedding.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

Knowledge Graph Embedding via Dynamic Mapping Matrix.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Question Answering over Linked Data Using First-order Logic.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Ontology Matching with Word Embeddings.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2014

CASIA@V2: A MLN-based Question Answering System over Linked Data.
Proceedings of the Working Notes for CLEF 2014 Conference, 2014

2013
The CASIA Entity linking System at TAC 2013.
Proceedings of the Sixth Text Analysis Conference, 2013

IAMA results for OAEI 2013.
Proceedings of the 8th International Workshop on Ontology Matching co-located with the 12th International Semantic Web Conference (ISWC 2013), 2013

CASIA@QALD-3: A Question Answering System over Linked Data.
Proceedings of the Working Notes for CLEF 2013 Conference , 2013

Open Relation Mapping Based on Instances and Semantics Expansion.
Proceedings of the Information Retrieval Technology, 2013

Statistical Machine Translation Improves Question Retrieval in Community Question Answering via Matrix Factorization.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013


  Loading...