Hai Zhao

Orcid: 0000-0001-7290-0487

Affiliations:
  • Shanghai Jiao Tong University, Department of Computer Science and Engineering, China


According to our database1, Hai Zhao authored at least 313 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Enhancing Lyrics Rewriting with Weak Supervision from Grammatical Error Correction Pre-training and Reference Knowledge Fusion.
ACM Trans. Asian Low Resour. Lang. Inf. Process., November, 2024

Channel-Aware Decoupling Network for Multiturn Dialog Comprehension.
IEEE Trans. Neural Networks Learn. Syst., June, 2024

Multimodal Chain-of-Thought Reasoning in Language Models.
Trans. Mach. Learn. Res., 2024

Entity-Relation Extraction as Full Shallow Semantic Dependency Parsing.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Multi-turn dialogue comprehension from a topic-aware perspective.
Neurocomputing, 2024

KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing.
CoRR, 2024

Instruction-Driven Game Engine: A Poker Case Study.
CoRR, 2024

Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models.
CoRR, 2024

Nothing in Excess: Mitigating the Exaggerated Safety for LLMs via Safety-Conscious Activation Steering.
CoRR, 2024

MEGen: Generative Backdoor in Large Language Models via Model Editing.
CoRR, 2024

BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction.
CoRR, 2024

Self-Directed Turing Test for Large Language Models.
CoRR, 2024

Game Development as Human-LLM Interaction.
CoRR, 2024

Evolving Virtual World with Delta-Engine.
CoRR, 2024

Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions.
CoRR, 2024

Keep the Cost Down: A Review on Methods to Optimize LLM' s KV-Cache Consumption.
CoRR, 2024

DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading Systems.
CoRR, 2024

Venturing into Uncharted Waters: The Navigation Compass from Transformer to Mamba.
CoRR, 2024

The Music Maestro or The Musically Challenged, A Massive Music Evaluation Benchmark for Large Language Models.
CoRR, 2024

Vript: A Video Is Worth Thousands of Words.
CoRR, 2024

From Role-Play to Drama-Interaction: An LLM Solution.
CoRR, 2024

Instruction-Driven Game Engines on Large Language Models.
CoRR, 2024

Multi-modal Auto-regressive Modeling via Visual Words.
CoRR, 2024

Hypertext Entity Extraction in Webpage.
CoRR, 2024

Comprehensive Cognitive LLM Agent for Smartphone GUI Automation.
CoRR, 2024

Is it Possible to Edit Large Language Models Robustly?
CoRR, 2024

GLaPE: Gold Label-agnostic Prompt Evaluation and Optimization for Large Language Model.
CoRR, 2024

GoT: Effective Graph-of-Thought Reasoning in Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Self-Prompting Large Language Models for Zero-Shot Open-Domain QA.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Multi-modal Auto-regressive Modeling via Visual Tokens.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Sparse is Enough in Fine-tuning Pre-trained Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Generative Judge for Evaluating Alignment.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Semantics-Preserved Distortion for Personal Privacy Protection in Information Management.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2024, 2024

GLaPE: Gold Label-agnostic Prompt Evaluation for Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

LaCo: Large Language Model Pruning via Layer Collapse.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Are LLMs Aware that Some Questions are not Open-ended?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

On the Robustness of Editing Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

VHASR: A Multimodal Speech Recognition System With Vision Hotwords.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Head-wise Shareable Attention for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

A Coin Has Two Sides: A Novel Detector-Corrector Framework for Chinese Spelling Correction.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

CSPRD: A Financial Policy Retrieval Dataset for Chinese Stock Market.
Proceedings of the Database and Expert Systems Applications, 2024

AuRoRA: A One-for-all Platform for Augmented Reasoning and Refining with Task-Adaptive Chain-of-Thought Prompting.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Attack Named Entity Recognition by Entity Boundary Interference.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Mitigating Misleading Chain-of-Thought Reasoning with Selective Filtering.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

PROM: A Phrase-level Copying Mechanism with Pre-training for Abstractive Summarization.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Unveiling Vulnerability of Self-Attention.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Selective Prefix Tuning for Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

GKT: A Novel Guidance-Based Knowledge Transfer Framework For Efficient Cloud-edge Collaboration LLM Deployment.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

SirLLM: Streaming Infinite Retentive LLM.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

From Role-Play to Drama-Interaction: An LLM Solution.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

CoCo-Agent: A Comprehensive Cognitive MLLM Agent for Smartphone GUI Automation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Dissecting Human and LLM Preferences.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

The Music Maestro or The Musically Challenged, A Massive Music Evaluation Benchmark for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Chinese Spelling Corrector Is Just a Language Learner.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

CMMLU: Measuring massive multitask language understanding in Chinese.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Hypergraph based Understanding for Document Semantic Entity Recognition.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

A Novel Energy Based Model Mechanism for Multi-Modal Aspect-Based Sentiment Analysis.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Fact-Driven Logical Reasoning for Machine Reading Comprehension.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Chinese Spelling Correction as Rephrasing Language Model.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Cross-Lingual Universal Dependency Parsing Only From One Monolingual Treebank.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Universal Multimodal Representation for Language Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Semi-Supervised Semantic Role Labeling with Bidirectional Language Models.
ACM Trans. Asian Low Resour. Lang. Inf. Process., June, 2023

Multi-Grained Evidence Inference for Multi-Choice Reading Comprehension.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Enhanced Speaker-Aware Multi-Party Multi-Turn Dialogue Comprehension.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Syntax-Aware Data Augmentation for Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Dialogue-adaptive language model pre-training from quality estimation<sup>☆</sup>.
Neurocomputing, 2023

Sparse is Enough in Fine-tuning Pre-trained Large Language Model.
CoRR, 2023

Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents.
CoRR, 2023

Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models.
CoRR, 2023

AutoHall: Automated Hallucination Dataset Generation for Large Language Models.
CoRR, 2023

BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained Transformer.
CoRR, 2023

Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Large Language Models.
CoRR, 2023

RefGPT: Reference -> Truthful & Customized Dialogues Generation by GPTs and for GPTs.
CoRR, 2023

Channel-aware Decoupling Network for Multi-turn Dialogue Comprehension.
CoRR, 2023

Enhancing Visually-Rich Document Understanding via Layout Structure Modeling.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

iRe2f: Rethinking Effective Refinement in Language Structure Prediction via Efficient Iterative Retrospecting and Reasoning.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Bidirectional Looking with A Novel Double Exponential Moving Average to Adaptive and Non-adaptive Momentum Optimizers.
Proceedings of the International Conference on Machine Learning, 2023

Heat Up The Sentiment Learning With ICE.
Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Toward Adversarial Training on Contextualized Language Representation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Contrastive Learning of Functionality-Aware Code Embeddings.
Proceedings of the IEEE International Conference on Acoustics, 2023

RefGPT: Dialogue Generation of GPT, by GPT, and for GPT.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Empower Nested Boolean Logic via Self-Supervised Curriculum Learning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Query Rewriting in Retrieval-Augmented Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Towards End-to-End Open Conversational Machine Reading.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Decker: Double Check with Heterogeneous Knowledge for Commonsense Fact Verification.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Learning Event-aware Measures for Event Coreference Resolution.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Learning Better Masking for Better Language Model Pre-training.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Rethinking Masked Language Modeling for Chinese Spelling Correction.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Contextualized Semantic Distance between Highly Overlapped Texts.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

FSUIE: A Novel Fuzzy Span Mechanism for Universal Information Extraction.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Encoder and Decoder, Not One Less for Pre-trained Language Model Sponsored NMT.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

EM Pre-training for Multi-party Dialogue Response Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Pre-training Multi-party Dialogue Models with Latent Discourse Inference.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Adversarial Self-Attention for Language Understanding.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Language Model Pre-training on True Negatives.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Dependency and Span, Cross-Style Semantic Role Labeling on PropBank and NomBank.
ACM Trans. Asian Low Resour. Lang. Inf. Process., November, 2022

Open Named Entity Modeling From Embedding Distribution.
IEEE Trans. Knowl. Data Eng., 2022

DUMA: Reading Comprehension With Transposition Thinking.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Reference Knowledgeable Network for Machine Reading Comprehension.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Syntax-Aware Multi-Spans Generation for Reading Comprehension.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Which Apple Keeps Which Doctor Away? Colorful Word Representations With Visual Oracles.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Rethinking Textual Adversarial Defense for Pre-Trained Language Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

HPSG-Inspired Joint Neural Constituent and Dependency Parsing in O($n^3$) Time Complexity.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Memorizing All for Implicit Discourse Relation Recognition.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2022

Tri-training for Dependency Parsing Domain Adaptation.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2022

SG-Net: Syntax Guided Transformer for Language Representation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Text Compression-Aided Transformer Encoding.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Neural Character-Level Syntactic Parsing for Chinese.
J. Artif. Intell. Res., 2022

Incorporating rich syntax information in Grammatical Error Correction.
Inf. Process. Manag., 2022

Self-Prompting Large Language Models for Open-Domain QA.
CoRR, 2022

Evaluate Confidence Instead of Perplexity for Zero-shot Commonsense Reasoning.
CoRR, 2022

Adversarial Self-Attention for Language Understanding.
CoRR, 2022

Solution of DeBERTaV3 on CommonsenseQA.
CoRR, 2022

Nested Named Entity Recognition as Holistic Structure Parsing.
CoRR, 2022

Semantics-Preserved Distortion for Personal Privacy Protection.
CoRR, 2022

A Fuzzy Training Framework for Controllable Sequence-to-Sequence Generation.
IEEE Access, 2022

Explicit Alignment Learning for Neural Machine Translation.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Forging Multiple Training Objectives for Pre-trained Language Models via Meta-Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Reorder and then Parse, Fast and Accurate Discontinuous Constituency Parsing.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Back to the Future: Bidirectional Information Decoupling Network for Multi-turn Dialogue Modeling.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Instance Regularization for Discriminative Language Model Pre-training.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Task Compass: Scaling Multi-task Pre-training with Task Prefix.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Aspect-based Sentiment Analysis as Machine Reading Comprehension.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Nested Named Entity Recognition as Corpus Aware Holistic Structure Parsing.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

BiBL: AMR Parsing and Generation with Bidirectional Bayesian Learning.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Modeling Hierarchical Reasoning Chains by Linking Discourse Units and Key Phrases for Reading Comprehension.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Semantic-Preserving Adversarial Code Comprehension.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

EventBERT: Incorporating Event-Based Semantics for Natural Language Understanding.
Proceedings of the Chinese Computational Linguistics - 21st China National Conference, 2022

Lite Unified Modeling for Discriminative Reading Comprehension.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Distinguishing Non-natural from Natural Adversarial Samples for More Robust Pre-trained Language Model.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Structural Characterization for Dialogue Disentanglement.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

What Works and Doesn't Work, A Deep Decoder for Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Restricted or Not: A General Training Framework for Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2022

Sentence-aware Contrastive Learning for Open-Domain Passage Retrieval.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Tracing Origins: Coreference-aware Machine Reading Comprehension.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Robust Neural Relation Extraction via Multi-Granularity Noises Reduction.
IEEE Trans. Knowl. Data Eng., 2021

Multi-Turn Dialogue Reading Comprehension With Pivot Turns and Knowledge.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Learning Context-Aware Convolutional Filters for Implicit Discourse Relation Classification.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Adaptive Convolution for Semantic Role Labeling.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Neural Unsupervised Semantic Role Labeling.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2021

Document-Level Neural Machine Translation with Associated Memory Network.
IEICE Trans. Inf. Syst., 2021

Unsupervised Full Constituency Parsing with Neighboring Distribution Divergence.
CoRR, 2021

Structural Modeling for Dialogue Disentanglement.
CoRR, 2021

Tracing Origins: Coref-aware Machine Reading Comprehension.
CoRR, 2021

Representation Decoupling for Open-Domain Passage Retrieval.
CoRR, 2021

A Novel Metric for Evaluating Semantics Preservation.
CoRR, 2021

Sparse Fuzzy Attention for Structured Sentiment Analysis.
CoRR, 2021

Unsupervised Open-Domain Question Answering.
CoRR, 2021

Cross-lingual Transferring of Pre-trained Contextualized Language Models.
CoRR, 2021

Graph-free Multi-hop Reading Comprehension: A Select-to-Guide Strategy.
CoRR, 2021

Defending Pre-trained Language Models from Adversarial Word Substitutions Without Performance Sacrifice.
CoRR, 2021

Fact-driven Logical Reasoning.
CoRR, 2021

Head-driven Phrase Structure Parsing in O($n^3$) Time Complexity.
CoRR, 2021

Not All Attention Is All You Need.
CoRR, 2021

Advances in Multi-turn Dialogue Comprehension: A Survey.
CoRR, 2021

To Understand Representation of Layer-aware Sequence Encoders as Multi-order-graph.
CoRR, 2021

Syntax Role for Neural Semantic Role Labeling.
Comput. Linguistics, 2021

When SMILES Smiles, Practicality Judgment and Yield Prediction of Chemical Reaction via Deep Chemical Language Processing.
IEEE Access, 2021

MiSS@WMT21: Contrastive Learning-reinforced Domain Adaptation in Neural Machine Translation.
Proceedings of the Sixth Conference on Machine Translation, 2021

Multi-tasking Dialogue Comprehension with Discourse Parsing.
Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation, 2021

Multilingual Pre-training with Universal Dependency Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Unsupervised Neural Machine Translation with Universal Grammar.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

MiSS: An Assistant for Multi-Style Simultaneous Translation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2021

Seeking Common but Distinguishing Difference, A Joint Aspect-based Sentiment Analysis Model.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Span Fine-tuning for Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Self- and Pseudo-self-supervised Prediction of Speaker and Key-utterance for Multi-party Dialogue Reading Comprehension.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Smoothing Dialogue States for Open Conversational Machine Reading.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Advances and Challenges in Unsupervised Neural Machine Translation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Tutorial Abstracts, 2021

NICT's Neural Machine Translation Systems for the WAT21 Restricted Translation Task.
Proceedings of the 8th Workshop on Asian Translation, 2021

Code Summarization with Structure-induced Transformer.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Grammatical Error Correction as GAN-like Sequence Labeling.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Dialogue Graph Modeling for Conversational Machine Reading.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Defending Pre-trained Language Models from Adversarial Word Substitution Without Performance Sacrifice.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Structural Pre-training for Dialogue Comprehension.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Semantics-Aware Inferential Network for Natural Language Understanding.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Multi-span Style Extraction for Generative Reading Comprehension.
Proceedings of the Workshop on Scientific Document Understanding co-located with 35th AAAI Conference on Artificial Inteligence, 2021

Topic-Aware Multi-turn Dialogue Modeling.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Filling the Gap of Utterance-aware and Speaker-aware Representation for Multi-turn Dialogue.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Retrospective Reader for Machine Reading Comprehension.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Memory Network for Linguistic Structure Parsing.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Towards More Diverse Input Representation for Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Accurate Word Representations with Universal Visual Guidance.
CoRR, 2020

Enhancing Pre-trained Language Model with Lexical Simplification.
CoRR, 2020

SIT3: Code Summarization with Structure-Induced Transformer.
CoRR, 2020

Cross-lingual Dependency Parsing as Domain Adaptation.
CoRR, 2020

Document-level Neural Machine Translation with Document Embeddings.
CoRR, 2020

Graph-to-Sequence Neural Machine Translation.
CoRR, 2020

Task-specific Objectives of Pre-trained Language Models for Dialogue Adaptation.
CoRR, 2020

Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond.
CoRR, 2020

Capsule-Transformer for Neural Machine Translation.
CoRR, 2020

Syntax-aware Data Augmentation for Neural Machine Translation.
CoRR, 2020

Multi-choice Dialogue-Based Reading Comprehension with Knowledge and Key Turns.
CoRR, 2020

Semantics-Aware Inferential Network for Natural Language Understanding.
CoRR, 2020

Dual Multi-head Co-attention for Multi-choice Reading Comprehension.
CoRR, 2020

SJTU-NICT's Supervised and Unsupervised Neural Machine Translation Systems for the WMT20 News Translation Task.
Proceedings of the Fifth Conference on Machine Translation, 2020

Data-dependent Gaussian Prior Objective for Language Generation.
Proceedings of the 8th International Conference on Learning Representations, 2020

Neural Machine Translation with Universal Visual Representation.
Proceedings of the 8th International Conference on Learning Representations, 2020

Grammatical Error Correction: More Data with More Context.
Proceedings of the International Conference on Asian Language Processing, 2020

Parsing All: Syntax and Semantics, Dependencies and Spans.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

LIMIT-BERT : Linguistics Informed Multi-Task BERT.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Named Entity Recognition Only from Word Embeddings.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Reference Language based Unsupervised Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

High-order Semantic Role Labeling.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Attention Is All You Need for Chinese Word Segmentation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Bipartite Flat-Graph Network for Nested Named Entity Recognition.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

DCMN+: Dual Co-Matching Network for Multi-Choice Reading Comprehension.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Hierarchical Contextualized Representation for Named Entity Recognition.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Global Greedy Dependency Parsing.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Explicit Sentence Compression for Neural Machine Translation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Semantics-Aware BERT for Language Understanding.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

SG-Net: Syntax-Guided Machine Reading Comprehension.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Effective Subword Segmentation for Text Comprehension.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Probing Contextualized Sentence Representations with Visual Awareness.
CoRR, 2019

Dependency and Span, Cross-Style Semantic Role Labeling on PropBank and NomBank.
CoRR, 2019

Document-level Neural Machine Translation with Inter-Sentence Attention.
CoRR, 2019

LIMIT-BERT : Linguistic Informed Multi-Task BERT.
CoRR, 2019

Subword ELMo.
CoRR, 2019

Modeling Named Entity Embedding Distribution into Hypersphere.
CoRR, 2019

Dual Skew Divergence Loss for Neural Machine Translation.
CoRR, 2019

Concurrent Parsing of Constituency and Dependency.
CoRR, 2019

SG-Net: Syntax-Guided Machine Reading Comprehension.
CoRR, 2019

Judging Chemical Reaction Practicality From Positive Sample only Learning.
CoRR, 2019

Dual Co-Matching Network for Multi-choice Reading Comprehension.
CoRR, 2019

Chinese Word Segmentation: Another Decade Review (2007-2017).
CoRR, 2019

Effective Representation for Easy-First Dependency Parsing.
Proceedings of the PRICAI 2019: Trends in Artificial Intelligence, 2019

Cross-Domain Transfer Learning for Dependency Parsing.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

GAN Driven Semi-distant Supervision for Relation Extraction.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Examination-Style Reading Comprehension with Neural augmented Retrieval.
Proceedings of the International Conference on Asian Language Processing, 2019

Syntax-aware Transformer Encoder for Neural Machine Translation.
Proceedings of the International Conference on Asian Language Processing, 2019

Syntax-aware Multilingual Semantic Role Labeling.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

SJTU-NICT at MRP 2019: Multi-Task Learning for End-to-End Uniform Semantic Graph Parsing.
Proceedings of the Shared Task on Cross-Framework Meaning Representation Parsing at the 2019 Conference on Natural Language Learning, 2019

Head-Driven Phrase Structure Grammar Parsing on Penn Treebank.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Open Vocabulary Learning for Neural Chinese Pinyin IME.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Lattice-Based Transformer Encoder for Neural Machine Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Multi-Labeled Relation Extraction with Attentive Capsule Network.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Unsupervised Learning Helps Supervised Neural Word Segmentation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Dependency or Span, End-to-End Uniform Semantic Role Labeling.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Graph-Based Bilingual Word Embedding for Statistical Machine Translation.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2018

Neural-based Pinyin-to-Character Conversion with Adaptive Vocabulary.
CoRR, 2018

Fast Neural Chinese Word Segmentation for Long Sentences.
CoRR, 2018

Effective Subword Segmentation for Text Comprehension.
CoRR, 2018

Attentive Semantic Role Labeling with Boundary Indicator.
CoRR, 2018

Explicit Contextual Semantics for Text Comprehension.
CoRR, 2018

A Full End-to-End Semantic Role Labeler, Syntax-agnostic Over Syntax-aware?
CoRR, 2018

SJTU-NLP at SemEval-2018 Task 9: Neural Hypernym Discovery with Term Embeddings.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

Effective Character-Augmented Word Embedding for Machine Reading Comprehension.
Proceedings of the Natural Language Processing and Chinese Computing, 2018

Prediction of MicroRNA Subcellular Localization by Using a Sequence-to-Sequence Model.
Proceedings of the IEEE International Conference on Data Mining, 2018

Exploring Recombination for Efficient Decoding of Neural Machine Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

A Unified Syntax-aware Framework for Semantic Role Labeling.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Joint Learning of POS and Dependencies for Multilingual Universal Dependency Parsing.
Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, Brussels, Belgium, October 31, 2018

Lingke: a Fine-grained Multi-turn Chatbot for Customer Service.
Proceedings of the COLING 2018, 2018

One-shot Learning for Question-Answering in Gaokao History Challenge.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Modeling Multi-turn Conversation with Deep Utterance Aggregation.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Subword-augmented Embedding for Cloze Reading Comprehension.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Seq2seq Dependency Parsing.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

A Full End-to-End Semantic Role Labeler, Syntactic-agnostic Over Syntactic-aware?
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Syntax for Semantic Role Labeling, To Be, Or Not To Be.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Moon IME: Neural-based Chinese Pinyin Aided Input Method with Customizable Association.
Proceedings of ACL 2018, Melbourne, Australia, July 15-20, 2018, System Demonstrations, 2018

Neural Character-level Dependency Parsing for Chinese.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
A Hybrid Model for Chinese Spelling Check.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2017

Pair-Aware Neural Sentence Modeling for Implicit Discourse Relation Classification.
Proceedings of the Advances in Artificial Intelligence: From Theory to Practice, 2017

Fast and Accurate Neural Word Segmentation for Chinese.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Converting Continuous-Space Language Models into <i>N</i>-gram Language Models with Efficient Bilingual Pruning for Statistical Machine Translation.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2016

Learning local word reorderings for hierarchical phrase-based statistical machine translation.
Mach. Transl., 2016

A Novel Bilingual Word Embedding Method for Lexical Translation Using Bilingual Sense Clique.
CoRR, 2016

A Bilingual Graph-Based Semantic Model for Statistical Machine Translation.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Connecting Phrase based Statistical Machine Translation Adaptation.
Proceedings of the COLING 2016, 2016

Neural Word Segmentation Learning for Chinese.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Bilingual Continuous-Space Language Model Growing for Statistical Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Probabilistic Models for High-Order Projective Dependency Parsing.
CoRR, 2015

English to Chinese Translation: How Chinese Character Matters.
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, 2015

A Machine Learning Method to Distinguish Machine Translation from Human Translation.
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, 2015

Neural Network Language Model for Chinese Pinyin Input Method Engine.
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, 2015

Deceptive Opinion Spam Detection Using Deep Level Linguistic Features.
Proceedings of the Natural Language Processing and Chinese Computing - 4th CCF Conference, 2015

A novel word reordering method for statistical machine translation.
Proceedings of the 12th International Conference on Fuzzy Systems and Knowledge Discovery, 2015

Learning Word Reorderings for Hierarchical Phrase-based Statistical Machine Translation.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
A Meta-Top-Down Method for Large-Scale Hierarchical Classification.
IEEE Trans. Knowl. Data Eng., 2014

Parallelized extreme learning machine ensemble based on min-max modular network.
Neurocomputing, 2014

Learning Hierarchical Translation Spans.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Neural Network Based Bilingual Language Model Growing for Statistical Machine Translation.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

An Improved Graph Model for Chinese Spell Checking.
Proceedings of The Third CIPS-SIGHAN Joint Conference on Chinese Language Processing, 2014

2013
An Improved Patent Machine Translation System Using Adaptive Enhancement for NTCIR-10 PatentMT Task.
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

BCMI-NLP Labeled-Alignment-Based Entailment System for NTCIR-10 RITE-2 Task.
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

Labeled Alignment for Recognizing Textual Entailment.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

Converting Continuous-Space Language Models into N-Gram Language Models for Statistical Machine Translation.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

An Empirical Study on Word Segmentation for Chinese Machine Translation.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2013

2012
Towards a Semantic Annotation of English Television News - Building and Evaluating a Constraint Grammar FrameNet.
Proceedings of the 26th Pacific Asia Conference on Language, Information and Computation, 2012

Spell Checking for Chinese.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Parallel learning of large-scale multi-label classification problems with min-max modular LIBLINEAR.
Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN), 2012

Automated quality assessment of web pages from textual content.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2012

Fourth-Order Dependency Parsing.
Proceedings of the COLING 2012, 2012

2011
An Empirical Comparative Study on Two Large-Scale Hierarchical Text Classification Approaches.
Int. J. Comput. Process. Orient. Lang., 2011

Redundancy Removal to Selectively Diversify Information Retrieval Results.
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011

GeoTime Retrieval through Passage-based Learning to Rank.
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011

Enhance Top-down method with Meta-Classification for Very Large-scale Hierarchical Classification.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

2010
A Unified Character-Based Tagging Framework for Chinese Word Segmentation.
ACM Trans. Asian Lang. Inf. Process., 2010

An Empirical Study on Development Set Selection Strategy for Machine Translation Learning.
Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010

Multiple Strategies for NTCIR-8 Patent Mining at BCMI.
Proceedings of the 8th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2010

Pruning Training Samples Using a Supervised Clustering Algorithm.
Proceedings of the Advances in Neural Networks, 2010

Adaptive Ensemble Learning Strategy Using an Assistant Classifier for Large-Scale Imbalanced Patent Categorization.
Proceedings of the Neural Information Processing. Theory and Algorithms, 2010

A comparative study on two large-scale hierarchical text classification tasks' solutions.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2010

Hedge Detection and Scope Finding by Sequence Labeling with Procedural Feature Selection.
Proceedings of the Fourteenth Conference on Computational Natural Language Learning: Shared Task, 2010

Dependency Parser for Chinese Constituent Parsing.
Proceedings of the CIPS-SIGHAN Joint Conference on Chinese Language Processing, 2010

2006
Effective Tag Set Selection in Chinese Word Segmentation via Conditional Random Field Modeling.
Proceedings of the 20st Pacific Asia Conference on Language, Information and Computation, 2006

A Modular Reduction Method for <i>k</i>-NN Algorithm with Self-recombination Learning.
Proceedings of the Advances in Neural Networks - ISNN 2006, Third International Symposium on Neural Networks, Chengdu, China, May 28, 2006

An Improved Chinese Word Segmentation System with Conditional Random Field.
Proceedings of the Fifth Workshop on Chinese Language Processing, 2006

2005
Improvement on Response Performance of Min-Max Modular Classifier by Symmetric Module Selection.
Proceedings of the Advances in Neural Networks - ISNN 2005, Second International Symposium on Neural Networks, Chongqing, China, May 30, 2005

Task Decomposition Using Geometric Relation for Min-Max Modular SVMs.
Proceedings of the Advances in Neural Networks - ISNN 2005, Second International Symposium on Neural Networks, Chongqing, China, May 30, 2005

A General Procedure for Combining Binary Classifiers and Its Performance Analysis.
Proceedings of the Advances in Natural Computation, First International Conference, 2005

2004
Analysis of Fault Tolerance of a Combining Classifier.
Proceedings of the Advances in Neural Networks, 2004

A Modular k-Nearest Neighbor Classification Method for Massively Parallel Text Categorization.
Proceedings of the Computational and Information Science, First International Symposium, 2004


  Loading...