Jing Liu

Orcid: 0000-0003-1727-6321

Affiliations:
  • Baidu Inc., Beijing, China
  • Microsoft Research Asia, Beijing, China (2014 - 2017)
  • Harbin Institute of Technology, Harbin, China (PhD 2014)


According to our database1, Jing Liu authored at least 48 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Dense Text Retrieval Based on Pretrained Language Models: A Survey.
ACM Trans. Inf. Syst., July, 2024

REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

BASES: Large-scale Web Search User Simulation with Large Language Model based Agents.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue.
CoRR, 2023

Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation.
CoRR, 2023

Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

A Thorough Examination on Zero-shot Dense Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

TOME: A Two-stage Approach for Model-based Retrieval.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
An Interpretability Evaluation Benchmark for Pre-trained Language Models.
CoRR, 2022

Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation.
CoRR, 2022

DuReader_retrieval: A Large-scale Chinese Benchmark for Passage Retrieval from Web Search Engine.
CoRR, 2022

DuQM: A Chinese Dataset of Linguistically Perturbed Natural Questions for Evaluating the Robustness of Question Matching Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

DuReader-Retrieval: A Large-scale Chinese Benchmark for Passage Retrieval from Web Search Engine.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

DuReader<sub>vis</sub>: A Chinese Dataset for Open-domain Document Visual Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

RocketQAv2: A Joint Training Method for Dense Passage Retrieval and Passage Re-ranking.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

DuReader_robust: A Chinese Dataset Towards Evaluating Robustness and Generalization of Machine Reading Comprehension in Real-World Applications.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense Passage Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
DuReaderrobust: A Chinese Dataset Towards Evaluating the Robustness of Machine Reading Comprehension Models.
CoRR, 2020

A Robust Adversarial Training Approach to Machine Reading Comprehension.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
CoKE: Contextualized Knowledge Graph Embedding.
CoRR, 2019

Towards Time-Aware Distant Supervision for Relation Extraction.
CoRR, 2019

Towards Robust Neural Machine Reading Comprehension via Question Paraphrases.
Proceedings of the International Conference on Asian Language Processing, 2019

Enhancing Pre-Trained Language Representations with Rich Knowledge for Machine Reading Comprehension.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

D-NET: A Pre-Training and Fine-Tuning Framework for Improving the Generalization of Machine Reading Comprehension.
Proceedings of the 2nd Workshop on Machine Reading for Question Answering, 2019

2018
Revisiting Distant Supervision for Relation Extraction.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Answer-focused and Position-aware Neural Question Generation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Aggregated Semantic Matching for Short Text Entity Linking.
Proceedings of the 22nd Conference on Computational Natural Language Learning, 2018

Neural Math Word Problem Solver with Reinforcement Learning.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Adaptations of ROUGE and BLEU to Better Evaluate Machine Reading Comprehension Task.
Proceedings of the Workshop on Machine Reading for Question Answering@ACL 2018, 2018

Multi-Passage Machine Reading Comprehension with Cross-Passage Answer Verification.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

DuReader: a Chinese Machine Reading Comprehension Dataset from Real-world Applications.
Proceedings of the Workshop on Machine Reading for Question Answering@ACL 2018, 2018

2017
A Statistical Framework for Product Description Generation.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

2016
A computational approach to measuring the correlation between expertise and social media influence for celebrities on microblogs.
World Wide Web, 2016

Tagging Users' Social Circles via Multiple Linear Regression.
Informatics, 2016

Knowledge Base Completion via Coupled Path Ranking.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

RBPB: Regularization-Based Pattern Balancing Method for Event Extraction.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

News Citation Recommendation with Implicit and Explicit Semantics.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Improving Ranking Consistency for Web Search by Leveraging a Knowledge Base and Search Logs.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

2014
A Regularized Competition Model for Question Difficulty Estimation in Community Question Answering Services.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

2013
What's in a name?: an unsupervised approach to link users across communities.
Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, 2013

Question Difficulty Estimation in Community Question Answering Services.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

A Hierarchical Entity-Based Approach to Structuralize User Generated Content in Social Media: A Case of Yahoo! Answers.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

2012
An unsupervised method for author extraction from web pages containing user-generated content.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Competition-based user expertise score estimation.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Nonlinear Evidence Fusion and Propagation for Hyponymy Relation Mining.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2010
Microsoft Research Asia with Redmond at the NTCIR-8 Community QA Pilot Task.
Proceedings of the 8th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2010

Automatic extraction of web data records containing user-generated content.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010


  Loading...