Wei Wu

Orcid: 0000-0001-6079-7697

Affiliations:
  • Meituan Inc, Beijing, China
  • Microsoft Research Asia, Beijing, China (former)
  • Peking University, Beijing, China (former, PhD 2012)


According to our database1, Wei Wu authored at least 155 papers between 2011 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Multi-Task Multi-Attention Transformer for Generative Named Entity Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

2D-TPE: Two-Dimensional Positional Encoding Enhances Table Understanding for Large Language Models.
CoRR, 2024

CodePlan: Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning.
CoRR, 2024

Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules.
CoRR, 2024

AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback.
CoRR, 2024

CAMLO: Cross-Attentive Multi-View Network for Long-Term Origin-Destination Flow Prediction.
Proceedings of the 2024 SIAM International Conference on Data Mining, 2024

Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

"In-Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Minimal Distillation Schedule for Extreme Language Model Compression.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

2023
Popularity Bias is not Always Evil: Disentangling Benign and Harmful Bias for Recommendation.
IEEE Trans. Knowl. Data Eng., October, 2023

AOG-LSTM: An adaptive attention neural network for visual storytelling.
Neurocomputing, October, 2023

FFHR: Fully and Flexible Hyperbolic Representation for Knowledge Graph Completion.
CoRR, 2023

M2GNN: Metapath and Multi-interest Aggregated Graph Neural Network for Tag-based Cross-domain Recommendation.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Intent-aware Recommendation via Disentangled Graph Contrastive Learning.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Multi-Task Transformer with Relation-Attention and Type-Attention for Named Entity Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Guide and Select: A Transformer-Based Multimodal Fusion Method for Points of Interest Description Generation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Time-Aware Multiway Adaptive Fusion Network for Temporal Knowledge Graph Question Answering.
Proceedings of the IEEE International Conference on Acoustics, 2023

T5-SR: A Unified Seq-to-Seq Decoding Strategy for Semantic Parsing.
Proceedings of the IEEE International Conference on Acoustics, 2023

MAPO: Boosting Large Language Model Performance with Model-Adaptive Prompt Optimization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Seen to Unseen: Exploring Compositional Generalization of Multi-Attribute Controllable Dialogue Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Let Me Check the Examples: Enhancing Demonstration Learning via Explicit Imitation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Fusion or Defusion? Flexible Vision-and-Language Pre-Training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

RankCSE: Unsupervised Sentence Representations Learning via Learning to Rank.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

PreQuant: A Task-agnostic Quantization Approach for Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Robust Lottery Tickets for Pre-trained Language Models.
CoRR, 2022

A Curriculum Learning Approach for Multi-domain Text Classification Using Keyword weight Ranking.
CoRR, 2022

Focus Is What You Need For Chinese Grammatical Error Correction.
CoRR, 2022

Semi-Supervised Knowledge-Grounded Pre-training for Task-Oriented Dialog Systems.
CoRR, 2022

CLOWER: A Pre-trained Language Model with Contrastive Learning over Word and Character Representations.
CoRR, 2022

Long Short-Term Preference Modeling for Continuous-Time Sequential Recommendation.
CoRR, 2022

Searching for Optimal Subword Tokenization in Cross-domain NER.
CoRR, 2022

AutoDisc: Automatic Distillation Schedule for Large Language Model Compression.
CoRR, 2022

Making Pre-trained Language Models Good Long-tailed Learners.
CoRR, 2022

GNN-encoder: Learning a Dual-encoder Architecture via Graph Neural Networks for Passage Retrieval.
CoRR, 2022

TANet: Thread-Aware Pretraining for Abstractive Conversational Summarization.
CoRR, 2022

InstructionNER: A Multi-Task Instruction-Based Generative Framework for Few-shot NER.
CoRR, 2022

ADPL: Adversarial Prompt-based Domain Adaptation for Dialogue Summarization with Knowledge Disentanglement.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Personalized Abstractive Opinion Tagging.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Graph Adaptive Semantic Transfer for Cross-domain Sentiment Classification.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

S<sup>2</sup>QL: Retrieval Augmented Zero-Shot Question Answering over Knowledge Graph.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2022

Moderate-fitting as a Natural Backdoor Defender for Pre-trained Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Domain-Oriented Prefix-Tuning: Towards Efficient and Generalizable Fine-tuning for Zero-Shot Dialogue Summarization.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Learning to Express in Knowledge-Grounded Conversation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

TANet: Thread-Aware Pretraining for Abstractive Conversational Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Revisit Overconfidence for OOD Detection: Reassigned Contrastive Learning with Adaptive Class-dependent Threshold.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Mask and Reason: Pre-Training Knowledge Graph Transformers for Complex Logical Queries.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Pay More Attention to History: A Context Modeling Strategy for Conversational Text-to-SQL.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Complex Question Answering over Incomplete Knowledge Graph as N-ary Link Prediction.
Proceedings of the International Joint Conference on Neural Networks, 2022

AMR-to-Text Generation with Graph Structure Reconstruction and Coverage Mechanism.
Proceedings of the International Joint Conference on Neural Networks, 2022

Ensemble Multi-Relational Graph Neural Networks.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Searching for Optimal Subword Tokenization in Cross-domain NER.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Learning What You Need from What You Did: Product Taxonomy Expansion with User Behaviors Supervision.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Retrieval Enhanced Segment Generation Neural Network for Task-Oriented Dialogue Systems.
Proceedings of the IEEE International Conference on Acoustics, 2022

PATS: Sensitivity-aware Noisy Learning for Pretrained Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Making Pretrained Language Models Good Long-tailed Learners.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Improving Semantic Matching through Dependency-Enhanced Pre-trained Model with Adaptive Fusion.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

UniNL: Aligning Representation Learning with Scoring Function for OOD Detection via Unified Neighborhood Learning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Watch the Neighbors: A Unified K-Nearest Neighbor Contrastive Learning Framework for OOD Intent Discovery.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

XPrompt: Exploring the Extreme of Prompt Tuning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

GNN-encoder: Learning a Dual-encoder Architecture via Graph Neural Networks for Dense Passage Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

VIRT: Improving Representation-based Text Matching via Virtual Interaction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Visualizable or Non-visualizable? Exploring the Visualizability of Concepts in Multi-modal Knowledge Graph.
Proceedings of the Database Systems for Advanced Applications, 2022

Making Parameter-efficient Tuning More Efficient: A Unified Framework for Classification Tasks.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

PlugAT: A Plug and Play Module to Defend against Textual Adversarial Attack.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Structural Bias for Aspect Sentiment Triplet Extraction.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

DABERT: Dual Attention Enhanced BERT for Semantic Matching.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Generalized Intent Discovery: Learning from Open World Dialogue System.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

CLOWER: A Pre-trained Language Model with Contrastive Learning over Word and Character Representations.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Knowledge Enhanced Multi-Interest Network for the Generation of Recommendation Candidates.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Unified Knowledge Prompt Pre-training for Customer Service Dialogues.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Robust Lottery Tickets for Pre-trained Language Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Incorporating Dynamic Semantics into Pre-Trained Language Model for Aspect-based Sentiment Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Disentangled Knowledge Transfer for OOD Intent Discovery with Unified Contrastive Learning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

CQG: A Simple and Effective Controlled Generation Framework for Multi-hop Question Generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Response Ranking with Multi-types of Deep Interactive Representations in Retrieval-based Dialogues.
ACM Trans. Inf. Syst., 2021

Conditional Text Generation for Harmonious Human-Machine Interaction.
ACM Trans. Intell. Syst. Technol., 2021

Towards information-rich, logical dialogue systems with knowledge-enhanced neural models.
Neurocomputing, 2021

VIRT: Improving Representation-based Models for Text Matching through Virtual Interaction.
CoRR, 2021

TODSum: Task-Oriented Dialogue Summarization with State Tracking.
CoRR, 2021

ASAP: A Chinese Review Dataset Towards Aspect Category Sentiment Analysis and Rating Prediction.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

An End-to-end Beacon Placement Optimization System for Indoor Positioning.
Proceedings of the International Conference on Indoor Positioning and Indoor Navigation, 2021

A Survey on Response Selection for Retrieval-based Dialogues.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Dance Revolution: Long-Term Dance Generation with Music via Curriculum Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

Improving Event Detection by Exploiting Label Hierarchy.
Proceedings of the IEEE International Conference on Acoustics, 2021

Virtual Data Augmentation: A Robust and General Framework for Fine-tuning Pre-trained Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Large-Scale Relation Learning for Question Answering over Knowledge Bases with Pre-trained Language Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Task-Oriented Clustering for Dialogues.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

DisenKGAT: Knowledge Graph Embedding with Disentangled Graph Attention Network.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Capturing Event Argument Interaction via A Bi-Directional Entity-Level Recurrent Decoder.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Slot Transferability for Cross-domain Slot Filling.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Open Domain Dialogue Generation with Latent Images.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Empowering Conversational AI is a Trip to Mars: Progress and Future of Open Domain Human-Computer Dialogues.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Open Domain Dialogue Generation with Latent Images.
CoRR, 2020

Towards information-rich, logical text generation with knowledge-enhanced neural models.
CoRR, 2020

Improving Matching Models with Hierarchical Contextualized Representations for Multi-turn Response Selection.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Zero-Resource Knowledge-Grounded Dialogue Generation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

What is that Building?: An End-to-end System for Building Recognition from Streetside Images.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

7.2 A 12nm Programmable Convolution-Efficient Neural-Processing-Unit Chip Achieving 825TOPS.
Proceedings of the 2020 IEEE International Solid- State Circuits Conference, 2020

Low-Resource Knowledge-Grounded Dialogue Generation.
Proceedings of the 8th International Conference on Learning Representations, 2020

Knowledge-Grounded Dialogue Generation with Pre-trained Language Models.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

StyleDGPT: Stylized Response Generation with Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

2019
A Sequential Matching Framework for Multi-Turn Response Selection in Retrieval-Based Chatbots.
Comput. Linguistics, 2019

Deep Chit-Chat: Deep Learning for Chatbots.
Proceedings of the Companion of The 2019 World Wide Web Conference, 2019

Multi-Representation Fusion Network for Multi-Turn Response Selection in Retrieval-Based Chatbots.
Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, 2019

Predicting user routines with masked dilated convolutions.
Proceedings of the 13th ACM Conference on Recommender Systems, 2019

Evaluating and Enhancing the Robustness of Retrieval-Based Dialogue Systems with Adversarial Examples.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

A Document-grounded Matching Network for Response Selection in Retrieval-based Chatbots.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Routines - A System for Inference, Analysis and Prediction of Users Daily Location Visits: Industrial Paper.
Proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2019

Scaling Address Parsing Sequence Models through Active Learning.
Proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2019

Read, Attend and Comment: A Deep Architecture for Automatic News Comment Generation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Low-Resource Response Generation with Template Prior.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Sampling Matters! An Empirical Study of Negative Sampling Strategies for Learning of Matching Models in Retrieval-based Dialogue Systems.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Neural Response Generation with Meta-words.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

One Time of Interaction May Not Be Enough: Go Deep with an Interaction-over-Interaction Network for Response Selection in Dialogues.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Learning a Matching Model with Co-teaching for Multi-turn Response Selection in Retrieval-based Dialogue Systems.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Response selection with topic clues for retrieval-based chatbots.
Neurocomputing, 2018

Improving Matching Models with Contextualized Word Representations for Multi-turn Response Selection in Retrieval-based Chatbots.
CoRR, 2018

Towards Explainable and Controllable Open Domain Dialogue Generation with Dialogue Acts.
CoRR, 2018

Get The Point of My Utterance! Learning Towards Effective Responses with Multi-Head Attention Mechanism.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Playing 20 Question Game with Policy-Based Reinforcement Learning.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Learning Matching Models with Weak Supervision for Response Selection in Retrieval-based Chatbots.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Hierarchical Recurrent Attention Network for Response Generation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Neural Response Generation With Dynamic Vocabularies.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Knowledge Enhanced Hybrid Neural Network for Text Matching.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Named entity disambiguation for questions in community question answering.
Knowl. Based Syst., 2017

Neural Response Generation with Dynamic Vocabularies.
CoRR, 2017

Hierarchical Recurrent Attention Network for Response Generation.
CoRR, 2017

LiveMaps: Converting Map Images into Interactive Maps.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Beihang-MSRA at SemEval-2017 Task 3: A Ranking System with Neural Matching Features for Community Question Answering.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

Beihang at the NTCIR-13 STC-2 Task.
Proceedings of the 13th NTCIR Conference, 2017

LiveMaps: Learning Geo-Intent from Images of Maps on a Large Scale.
Proceedings of the 25th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2017

An Unsupervised Approach for Low-Quality Answer Detection in Community Question-Answering.
Proceedings of the Database Systems for Advanced Applications, 2017

Sequential Matching Network: A New Architecture for Multi-turn Response Selection in Retrieval-Based Chatbots.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Topic Aware Neural Response Generation.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Topic Augmented Neural Response Generation with a Joint Attention Mechanism.
CoRR, 2016

Sequential Match Network: A New Architecture for Multi-turn Response Selection in Retrieval-based Chatbots.
CoRR, 2016

Knowledge Enhanced Hybrid Neural Network for Text Matching.
CoRR, 2016

Topic Augmented Neural Network for Short Text Conversation.
CoRR, 2016

Learning Distributed Representations of Data in Community Question Answering for Question Retrieval.
Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, 2016

Detecting Context Dependent Messages in a Conversational Environment.
Proceedings of the COLING 2016, 2016

Improving Recommendation of Tail Tags for Questions in Community Question Answering.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
A new approach to geocoding: BingGC.
Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2015

Mining Query Subtopics from Questions in Community Question Answering.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Improving search relevance for short queries in community question answering.
Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, 2014

Question Retrieval with High Quality Answers in Community Question Answering.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

2013
Learning bilinear model for matching queries and documents.
J. Mach. Learn. Res., 2013

Learning query and document similarities from click-through bipartite graph with metadata.
Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, 2013

2012
A Kernel Approach to Multi-Task Learning with Task-Specific Kernels.
J. Comput. Sci. Technol., 2012

2011
Learning a Robust Relevance Model for Search Using Kernel Methods.
J. Mach. Learn. Res., 2011

A kernel approach to addressing term mismatch.
Proceedings of the 20th International Conference on World Wide Web, 2011

Multi-Task Learning in Square Integrable Space.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011


  Loading...