Furu Wei

Orcid: 0000-0002-7810-5852

According to our database1, Furu Wei authored at least 408 papers between 2005 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
DeepNet: Scaling Transformers to 1,000 Layers.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2024

Generic-to-Specific Distillation of Masked Autoencoders.
IEEE Trans. Circuits Syst. Video Technol., September, 2024

Fine-tuning pretrained transformer encoders for sequence-to-sequence learning.
Int. J. Mach. Learn. Cybern., May, 2024

VatLM: Visual-Audio-Text Pre-Training With Unified Masked Prediction for Speech Representation Learning.
IEEE Trans. Multim., 2024

SpeechLM: Enhanced Speech Pre-Training With Unpaired Textual Data.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

VioLA: Conditional Language Models for Speech Recognition, Synthesis, and Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation.
CoRR, 2024

ShifCon: Enhancing Non-Dominant Language Capabilities with a Shift-based Contrastive Framework.
CoRR, 2024

Little Giants: Synthesizing High-Quality Embedding Data at Scale.
CoRR, 2024

1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs.
CoRR, 2024

One Language, Many Gaps: Evaluating Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks.
CoRR, 2024

Data Selection via Optimal Control for Language Models.
CoRR, 2024

Self-Boosting Large Language Models with Synthetic Preference Data.
CoRR, 2024

Differential Transformer.
CoRR, 2024

Scaling Optimal LR Across Token Horizon.
CoRR, 2024

Q-Sparse: All Large Language Models can be Fully Sparsely-Activated.
CoRR, 2024

Autoregressive Speech Synthesis without Vector Quantization.
CoRR, 2024

Enhancing Language Model Rationality with Bi-Directional Deliberation Reasoning.
CoRR, 2024

Direct Preference Knowledge Distillation for Large Language Models.
CoRR, 2024

Meta Reasoning for Large Language Models.
CoRR, 2024

VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment.
CoRR, 2024

VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers.
CoRR, 2024

xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token.
CoRR, 2024

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning.
CoRR, 2024

You Only Cache Once: Decoder-Decoder Architectures for Language Models.
CoRR, 2024

Semi-Parametric Retrieval via Binary Token Index.
CoRR, 2024

Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation.
CoRR, 2024

Multi-Head Mixture-of-Experts.
CoRR, 2024

Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models.
CoRR, 2024

LLM as a Mastermind: A Survey of Strategic Reasoning with Large Language Models.
CoRR, 2024

WavLLM: Towards Robust and Adaptive Speech Large Language Model.
CoRR, 2024

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits.
CoRR, 2024

Towards Optimal Learning of Language Models.
CoRR, 2024

Se<sup>2</sup>: Sequential Example Selection for In-Context Learning.
CoRR, 2024

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models.
CoRR, 2024

Generative Representational Instruction Tuning.
CoRR, 2024

Multilingual E5 Text Embeddings: A Technical Report.
CoRR, 2024

K-Level Reasoning with Large Language Models.
CoRR, 2024

Improving Domain Adaptation through Extended-Text Reading Comprehension.
CoRR, 2024

Boosting Large Language Model for Speech Synthesis: An Empirical Study.
CoRR, 2024

Fine-Tuning LLaMA for Multi-Stage Text Retrieval.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Not All Metrics Are Guilty: Improving NLG Evaluation by Diversifying References.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Low-code LLM: Graphical User Interface over Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: System Demonstrations, 2024

MathScale: Scaling Instruction Tuning for Mathematical Reasoning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Mixture of LoRA Experts.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Grounding Multimodal Large Language Models to the World.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Kosmos-G: Generating Images in Context with Multimodal Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MiniLLM: Knowledge Distillation of Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Adapting Large Language Models via Reading Comprehension.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

In-context Autoencoder for Context Compression in a Large Language Model.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

LongEmbed: Extending Embedding Models for Long Context Retrieval.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Chain-of-Dictionary Prompting Elicits Translation in Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

WavLLM: Towards Robust and Adaptive Speech Large Language Model.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Instruction Pre-Training: Language Models are Supervised Multitask Learners.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering.
Proceedings of the Computer Vision - ECCV 2024, 2024

Language Models as Inductive Reasoners.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Learning to Retrieve In-Context Examples for Large Language Models.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Revamping Multilingual Agreement Bidirectionally via Switched Back-translation for Multilingual Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Calibrating LLM-Based Evaluator.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Respond in my Language: Mitigating Language Inconsistency in Response Generation based on Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Improving Text Embeddings with Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ResLoRA: Identity Residual Mapping in Low-Rank Adaption.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Se²: Sequential Example Selection for In-Context Learning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

SCALE: Synergized Collaboration of Asymmetric Language Translation Engines.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Text Diffusion with Reinforced Conditioning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Learning to Rank in Generative Retrieval.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Large Search Model: Redefining Search Stack in the Era of LLMs.
SIGIR Forum, December, 2023

Generative retrieval for conversational question answering.
Inf. Process. Manag., September, 2023

A Unified View of Masked Image Modeling.
Trans. Mach. Learn. Res., 2023

GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

When an Image is Worth 1, 024 x 1, 024 Words: A Case Study in Computational Pathology.
CoRR, 2023

Auto-ICL: In-Context Learning without Human Supervision.
CoRR, 2023

ALYMPICS: Language Agents Meet Game Theory.
CoRR, 2023

BitNet: Scaling 1-bit Transformers for Large Language Models.
CoRR, 2023

EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation.
CoRR, 2023

Kosmos-2.5: A Multimodal Literate Model.
CoRR, 2023

Large Language Model for Science: A Study on P vs. NP.
CoRR, 2023

WavMark: Watermarking for Audio Generation.
CoRR, 2023

Retentive Network: A Successor to Transformer for Large Language Models.
CoRR, 2023

In-context Autoencoder for Context Compression in a Large Language Model.
CoRR, 2023

Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration.
CoRR, 2023

LongNet: Scaling Transformers to 1, 000, 000, 000 Tokens.
CoRR, 2023

Kosmos-2: Grounding Multimodal Large Language Models to the World.
CoRR, 2023

Knowledge Distillation of Large Language Models.
CoRR, 2023

VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation.
CoRR, 2023

Not All Metrics Are Guilty: Improving NLG Evaluation with LLM Paraphrasing.
CoRR, 2023

One-stop Training of Multiple Capacity Models.
CoRR, 2023

Chain-of-Dictionary Prompting Elicits Translation in Large Language Models.
CoRR, 2023

Inference with Reference: Lossless Acceleration of Large Language Models.
CoRR, 2023

UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation.
CoRR, 2023

Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling.
CoRR, 2023

Semiparametric Language Models Are Scalable Continual Learners.
CoRR, 2023

Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers.
CoRR, 2023

Augmenting Language Models with Long-Term Memory.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Language Is Not All You Need: Aligning Perception with Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Optimizing Prompts for Text-to-Image Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On the Pareto Front of Multilingual Neural Machine Translation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

TextDiffuser: Diffusion Models as Text Painters.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Extensible Prompts for Language Models on Zero-shot Language Style Customization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Magneto: A Foundation Transformer.
Proceedings of the International Conference on Machine Learning, 2023

BEATs: Audio Pre-Training with Acoustic Tokenizers.
Proceedings of the International Conference on Machine Learning, 2023

Are More Layers Beneficial to Graph Transformers?
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Visually-Augmented Language Modeling.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Prototypical Calibration for Few-shot Learning of Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Corrupted Image Modeling for Self-Supervised Visual Pre-Training.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Query2doc: Query Expansion with Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Democratizing Reasoning Ability: Tailored Learning from Large Language Model.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

TRIP: Accelerating Document-level Multilingual Pre-training via Triangular Document-level Pre-training on Parallel Data Triplets.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Tuna: Instruction Tuning using Feedback from Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Syllogistic Reasoning for Legal Judgment Analysis.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

HanoiT: Enhancing Context-aware Translation via Selective Context.
Proceedings of the Database Systems for Advanced Applications, 2023

Non-Contrastive Learning Meets Language-Image Pre-Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Image as a Foreign Language: BEIT Pretraining for Vision and Vision-Language Tasks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

SimLM: Pre-training with Representation Bottleneck for Dense Passage Retrieval.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

A Length-Extrapolatable Transformer.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Beyond English-Centric Bitexts for Better Multilingual Language Representation Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Dual-Alignment Pre-training for Cross-lingual Sentence Embedding.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Pre-Training to Learn in Context.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

On the Off-Target Problem of Zero-Shot Multilingual Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Pre-training Language Model as a Multi-perspective Course Learner.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Multiview Identifiers Enhanced Generative Retrieval.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

MoEC: Mixture of Expert Clusters.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

TrOCR: Transformer-Based Optical Character Recognition with Pre-trained Models.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Transforming Wikipedia Into Augmented Data for Query-Focused Summarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing.
IEEE J. Sel. Top. Signal Process., 2022

Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers.
CoRR, 2022

Pay Attention to Your Tone: Introducing a New Dataset for Polite Language Rewrite.
CoRR, 2022

BEATs: Audio Pre-Training with Acoustic Tokenizers.
CoRR, 2022

TRIP: Triangular Document-level Pre-training for Multilingual Language Models.
CoRR, 2022

Structured Prompting: Scaling In-Context Learning to 1, 000 Examples.
CoRR, 2022

Momentum Calibration for Text Generation.
CoRR, 2022

Text Embeddings by Weakly-Supervised Contrastive Pre-training.
CoRR, 2022

Extensible Prompts for Language Models.
CoRR, 2022

TorchScale: Transformers at Scale.
CoRR, 2022

Latent Prompt Tuning for Text Summarization.
CoRR, 2022

LVP-M3: Language-aware Visual Prompt for Multilingual Multimodal Machine Translation.
CoRR, 2022

Foundation Transformers.
CoRR, 2022

Towards Multilingual Transitivity and Bidirectional Multilingual Agreement for Multilingual Document-level Machine Translation.
CoRR, 2022

Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks.
CoRR, 2022

BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers.
CoRR, 2022

Learning Diverse Document Representations with Deep Query Interactions for Dense Retrieval.
CoRR, 2022

GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation.
CoRR, 2022

HLT-MT: High-resource Language-specific Training for Multilingual Neural Machine Translation.
CoRR, 2022

Language Models are General-Purpose Interfaces.
CoRR, 2022

The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task.
CoRR, 2022

VL-BEiT: Generative Vision-Language Pretraining.
CoRR, 2022

Task-Specific Expert Pruning for Sparse Mixture-of-Experts.
CoRR, 2022

Lossless Acceleration for Seq2seq Generation with Aggressive Decoding.
CoRR, 2022

Prototypical Calibration for Few-shot Learning of Language Models.
CoRR, 2022

On the Representation Collapse of Sparse Mixture of Experts.
CoRR, 2022

Lossless Speedup of Autoregressive Translation with Generalized Aggressive Decoding.
CoRR, 2022

DeepNet: Scaling Transformers to 1, 000 Layers.
CoRR, 2022

A Survey of Knowledge-Intensive NLP with Pre-Trained Language Models.
CoRR, 2022

EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq Generation.
CoRR, 2022

PromptBERT: Improving BERT Sentence Embeddings with Prompts.
CoRR, 2022

Phrase-level Adversarial Example Generation for Neural Machine Translation.
CoRR, 2022

SMDT: Selective Memory-Augmented Neural Document Translation.
CoRR, 2022

Kformer: Knowledge Injection in Transformer Feed-Forward Layers.
Proceedings of the Natural Language Processing and Chinese Computing, 2022

On the Representation Collapse of Sparse Mixture of Experts.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

DiT: Self-supervised Pre-training for Document Image Transformer.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Separating Long-Form Speech with Group-wise Permutation Invariant Training.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Supervision-Guided Codebooks for Masked Prediction in Speech Pre-training.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Speech Pre-training with Acoustic Piece.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Plug and Play Knowledge Distillation for kNN-LM with External Logits.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

UM4: Unified Multilingual Multiple Teacher-Student Model for Zero-Resource Neural Machine Translation.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

High-resource Language-specific Training for Multilingual Neural Machine Translation.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

A Unified Strategy for Multilingual Grammatical Error Correction with Pre-trained Cross-Lingual Language Model.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

BEiT: BERT Pre-Training of Image Transformers.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Unispeech-Sat: Universal Speech Representation Learning With Speaker Aware Pre-Training.
Proceedings of the IEEE International Conference on Acoustics, 2022

SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Distilled Dual-Encoder Model for Vision-Language Understanding.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

PromptBERT: Improving BERT Sentence Embeddings with Prompts.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Zero-shot Cross-lingual Transfer of Prompt-based Tuning with a Unified Multilingual Prompt.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Snapshot-Guided Domain Adaptation for ELECTRA.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

XDoc: Unified Pre-training for Cross-Format Document Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Swin Transformer V2: Scaling Up Capacity and Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Attention Temperature Matters in Abstractive Summarization Distillation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

XFUND: A Benchmark Dataset for Multilingual Visually Rich Form Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Controllable Natural Language Generation with Contrastive Prefixes.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document Understanding.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Neural Label Search for Zero-Shot Multi-Lingual Extractive Summarization.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Knowledge Neurons in Pretrained Transformers.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

StableMoE: Stable Routing Strategy for Mixture of Experts.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

XLM-E: Cross-lingual Language Model Pre-training via ELECTRA.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Towards Making the Most of Cross-Lingual Transfer for Zero-Shot Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

THE-X: Privacy-Preserving Transformer Inference with Homomorphic Encryption.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

CLIP Models are Few-Shot Learners: Empirical Studies on VQA and Visual Entailment.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Sequence Level Contrastive Learning for Text Summarization.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Document AI: Benchmarks, Models and Applications.
CoRR, 2021

VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts.
CoRR, 2021

WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing.
CoRR, 2021

s2s-ft: Fine-Tuning Pretrained Transformer Encoders for Sequence-to-Sequence Learning.
CoRR, 2021

Improving Non-autoregressive Generation with Mixup Training.
CoRR, 2021

Towards Making the Most of Multilingual Pretraining for Zero-Shot Neural Machine Translation.
CoRR, 2021

SpeechT5: Unified-Modal Encoder-Decoder Pre-training for Spoken Language Processing.
CoRR, 2021

TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models.
CoRR, 2021

XLM-E: Cross-lingual Language Model Pre-training via ELECTRA.
CoRR, 2021

DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation by Augmenting Pretrained Multilingual Encoders.
CoRR, 2021

BEiT: BERT Pre-Training of Image Transformers.
CoRR, 2021

VT-SSum: A Benchmark Dataset for Video Transcript Segmentation and Summarization.
CoRR, 2021

LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding.
CoRR, 2021

Knowledge Neurons in Pretrained Transformers.
CoRR, 2021

mT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs.
CoRR, 2021

Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting.
CoRR, 2021

Multilingual Machine Translation Systems from Microsoft for WMT21 Shared Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

Learning to Select Relevant Knowledge for Neural Machine Translation.
Proceedings of the Natural Language Processing and Chinese Computing, 2021

Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

InfoXLM: An Information-Theoretic Framework for Cross-Lingual Language Model Pre-Training.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data.
Proceedings of the 38th International Conference on Machine Learning, 2021

Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Allocating Large Vocabulary Capacity for Cross-Lingual Language Model Pre-Training.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Beyond Preserved Accuracy: Evaluating Loyalty and Robustness of BERT Compression.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

LayoutReader: Pre-training of Text and Layout for Reading Order Detection.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

mT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Zero-Shot Cross-Lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Jointly Learning to Repair Code and Generate Commit Message.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Consistency Regularization for Cross-Lingual Fine-Tuning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Memory-Efficient Differentiable Transformer Architecture Search.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Adapt-and-Distill: Developing Small, Fast and Effective Pretrained Language Models for Domains.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Multilingual Agreement for Multilingual Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

xMoCo: Cross Momentum Contrastive Learning for Open-Domain Question Answering.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

MiniLMv2: Multi-Head Self-Attention Relation Distillation for Compressing Pretrained Transformers.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Grammar-Based Patches Generation for Automated Program Repair.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Instantaneous Grammatical Error Correction with Shallow Aggressive Decoding.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

SemFace: Pre-training Encoder and Decoder with a Semantic Interface for Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Learning to Sample Replacements for ELECTRA Pre-Training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Self-Attention Attribution: Interpreting Information Interactions Inside Transformer.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
A Joint Sentence Scoring and Selection Framework for Neural Extractive Document Summarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

XLM-T: Scaling up Multilingual Machine Translation with Pretrained Cross-lingual Transformer Encoders.
CoRR, 2020

At Which Level Should We Extract? An Empirical Study on Extractive Document Summarization.
CoRR, 2020

Learning to Summarize Passages: Mining Passage-Summary Pairs from Wikipedia Revision Histories.
CoRR, 2020

STEP: Sequence-to-Sequence Transformer Pre-training for Document Summarization.
CoRR, 2020

BERT Loses Patience: Fast and Robust Inference with Early Exit.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

TableBank: Table Benchmark for Image-based Table Detection and Recognition.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

LayoutLM: Pre-training of Text and Layout for Document Image Understanding.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

UnihanLM: Coarse-to-Fine Chinese-Japanese Language Model Pretraining with the Unihan Database.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Generating Commonsense Explanation by Extracting Bridge Concepts from Reasoning Paths.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Investigating Learning Dynamics of BERT Fine-Tuning.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Can Monolingual Pretrained Models Help Cross-Lingual Classification?
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training.
Proceedings of the 37th International Conference on Machine Learning, 2020

Self-Adversarial Learning with Comparative Discrimination for Text Generation.
Proceedings of the 8th International Conference on Learning Representations, 2020

VL-BERT: Pre-training of Generic Visual-Linguistic Representations.
Proceedings of the 8th International Conference on Learning Representations, 2020

Pre-training for Abstractive Document Summarization by Reinstating Source Text.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Scheduled DropHead: A Regularization Method for Transformer Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Improving Grammatical Error Correction with Machine Translation Pairs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Unsupervised Extractive Summarization by Pre-training Hierarchical Transformers.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

BERT-of-Theseus: Compressing BERT by Progressive Module Replacing.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Language Generation with Multi-Hop Reasoning on Commonsense Knowledge Graph.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Improving the Efficiency of Grammatical Error Correction with Erroneous Span Detection and Correction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks.
Proceedings of the Computer Vision - ECCV 2020, 2020

Multimodal Matching Transformer for Live Commenting.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, August 29 - September 8, 2020, 2020

At Which Level Should We Extract? An Empirical Analysis on Extractive Document Summarization.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

DocBank: A Benchmark Dataset for Document Layout Analysis.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Unsupervised Fine-tuning for Text Clustering.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Harvesting and Refining Question-Answer Pairs for Unsupervised QA.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Fact-Aware Sentence Split and Rephrase with Permutation Invariant Training.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Cross-Lingual Natural Language Generation via Pre-Training.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Sequence-to-sequence Pre-training with Data Augmentation for Sentence Rewriting.
CoRR, 2019

Formality Style Transfer with Hybrid Textual Annotations.
CoRR, 2019

Document-Based Question Answering Improves Query-Focused Multi-document Summarization.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

Neural Melody Composition from Lyrics.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

Unified Language Model Pre-training for Natural Language Understanding and Generation.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Video Dialog via Progressive Inference and Cross-Transformer.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Visualizing and Understanding the Effectiveness of BERT.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Learning to Ask Unanswerable Questions for Machine Reading Comprehension.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Retrieval-Enhanced Adversarial Training for Neural Response Generation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

BERT-based Lexical Substitution.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

HIBERT: Document Level Pre-training of Hierarchical Bidirectional Transformers for Document Summarization.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Automatic Grammatical Error Correction for Sequence-to-sequence Text Generation: An Empirical Study.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Inspecting Unification of Encoding and Matching with Transformer: A Case Study of Machine Reading Comprehension.
Proceedings of the 2nd Workshop on Machine Reading for Question Answering, 2019

LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Dictionary-Guided Editing Networks for Paraphrase Generation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Read + Verify: Machine Reading Comprehension with Unanswerable Questions.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Response Generation by Context-Aware Prototype Editing.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Sentence Relations for Extractive Summarization with Deep Neural Networks.
ACM Trans. Inf. Syst., 2018

Context-Aware Answer Sentence Selection With Hierarchical Gated Recurrent Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Text Morphing.
CoRR, 2018

Unsupervised Machine Commenting with Neural Variational Topic Model.
CoRR, 2018

Retrieval-Enhanced Adversarial Training for Neural Response Generation.
CoRR, 2018

Read + Verify: Machine Reading Comprehension with Unanswerable Questions.
CoRR, 2018

Reaching Human-level Performance in Automatic Grammatical Error Correction: An Empirical Study.
CoRR, 2018

Dictionary-Guided Editing Networks for Paraphrase Generation.
CoRR, 2018

Response Generation by Context-aware Prototype Editing.
CoRR, 2018

Improv Chat: Second Response Generation for Chatbot.
CoRR, 2018

I Know There Is No Answer: Modeling Answer Validation for Machine Reading Comprehension.
Proceedings of the Natural Language Processing and Chinese Computing, 2018

SeRI: A Dataset for Sub-event Relation Inference from an Encyclopedia.
Proceedings of the Natural Language Processing and Chinese Computing, 2018

EventWiki: A Knowledge Base of Major Events.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

XiaoIce Band: A Melody and Arrangement Generation Framework for Pop Music.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Multiway Attention Networks for Modeling Sentence Pairs.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Reinforced Mnemonic Reader for Machine Reading Comprehension.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Attention-Fused Deep Matching Network for Natural Language Inference.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Neural Latent Extractive Document Summarization.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Attention-Guided Answer Distillation for Machine Reading Comprehension.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Fine-grained Coordinated Cross-lingual Text Stream Alignment for Endless Language Knowledge Acquisition.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Fluency Boost Learning and Inference for Neural Grammatical Error Correction.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Neural Document Summarization by Jointly Learning to Score and Select Sentences.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Retrieve, Rerank and Rewrite: Soft Template Based Neural Summarization.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Neural Open Information Extraction.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Hierarchical Attention Flow for Multiple-Choice Reading Comprehension.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Sequential Copying Networks.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

S-Net: From Answer Extraction to Answer Synthesis for Machine Reading Comprehension.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Faithful to the Original: Fact Aware Neural Abstractive Summarization.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
S-Net: From Answer Extraction to Answer Generation for Machine Reading Comprehension.
CoRR, 2017

Leveraging Contextual Sentence Relations for Extractive Summarization Using a Neural Attention Model.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Neural Question Generation from Text: A Preliminary Study.
Proceedings of the Natural Language Processing and Chinese Computing, 2017

Entity Linking for Queries by Searching Wikipedia Sentences.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Learning to Generate Product Reviews from Attributes.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Selective Encoding for Abstractive Sentence Summarization.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Gated Self-Matching Networks for Reading Comprehension and Question Answering.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

SuperAgent: A Customer Service Chatbot for E-commerce Websites.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Improving Multi-Document Summarization via Text Classification.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
An Uncertainty-Aware Approach for Exploratory Microblog Retrieval.
IEEE Trans. Vis. Comput. Graph., 2016

Sentiment Embeddings with Applications to Sentiment Analysis.
IEEE Trans. Knowl. Data Eng., 2016

Relation Classification Via Modeling Augmented Dependency Paths.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Adaptive Multi-Compositionality for Recursive Neural Network Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

AttSum: Joint Learning of Focusing and Summarization with Neural Attention.
CoRR, 2016

Unsupervised Word and Dependency Path Embeddings for Aspect Term Extraction.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Solving and Generating Chinese Character Riddles.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

A Redundancy-Aware Sentence Regression Framework for Extractive Summarization.
Proceedings of the COLING 2016, 2016

AttSum: Joint Learning of Focusing and Summarization with Neural Attention.
Proceedings of the COLING 2016, 2016

TGSum: Build Tweet Guided Multi-Document Summarization Dataset.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
A Joint Segmentation and Classification Framework for Sentence Level Sentiment Classification.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Multi-Document Summarization via Discriminative Summary Reranking.
CoRR, 2015

Cross-lingual Sentiment Lexicon Learning With Bilingual Word Graph Label Propagation.
Comput. Linguistics, 2015

A Statistical Parsing Framework for Sentiment Classification.
Comput. Linguistics, 2015

Splusplus: A Feature-Rich Two-stage Classifier for Sentiment Analysis of Tweets.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

A Hybrid Neural Model for Type Classification of Entity Mentions.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Crowd-Selection Query Processing in Crowdsourcing Databases: A Task-Driven Approach.
Proceedings of the 18th International Conference on Extending Database Technology, 2015

Cold-Start Expert Finding in Community Question Answering via Graph Regularization.
Proceedings of the Database Systems for Advanced Applications, 2015

A Dependency-Based Neural Network for Relation Classification.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Question Answering over Freebase with Multi-Column Convolutional Neural Networks.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Learning Summary Prior Representation for Extractive Summarization.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Ranking with Recursive Neural Networks and Its Application to Multi-Document Summarization.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Coooolll: A Deep Learning System for Twitter Sentiment Classification.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

Answer Extraction with Multiple Extraction Engines for Web-Based Question Answering.
Proceedings of the Natural Language Processing and Chinese Computing, 2014

A Joint Segmentation and Classification Framework for Sentiment Analysis.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Building Large-Scale Twitter-Specific Sentiment Lexicon : A Representation Learning Approach.
Proceedings of the COLING 2014, 2014

SocialTransfer: Transferring Social Knowledge for Cold-Start Cowdsourcing.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

Learning Sentiment-Specific Word Embedding for Twitter Sentiment Classification.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Adaptive Recursive Neural Network for Target-dependent Twitter Sentiment Classification.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Adaptive Multi-Compositionality for Recursive Neural Models with Applications to Sentiment Analysis.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Constrained Text Coclustering with Supervised and Unsupervised Constraints.
IEEE Trans. Knowl. Data Eng., 2013

Named entity recognition for tweets.
ACM Trans. Intell. Syst. Technol., 2013

Exploring hypergraph-based semi-supervised ranking for query-oriented summarization.
Inf. Sci., 2013

Entity Linking for Tweets.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Co-Training Based Bilingual Sentiment Lexicon Learning.
Proceedings of the Late-Breaking Developments in the Field of Artificial Intelligence, 2013

The Automated Acquisition of Suggestions from Tweets.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Entity-centric topic-oriented opinion summarization in twitter.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Lost in Translations? Building Sentiment Lexicons using Context Based Machine Translation.
Proceedings of the COLING 2012, 2012

Graph-Based Multi-Tweet Summarization using Social Signals.
Proceedings of the COLING 2012, 2012

Twitter Topic Summarization by Ranking Tweets using Social Influence and Content Quality.
Proceedings of the COLING 2012, 2012

Graph-based collective classification for tweets.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Breaking news on twitter.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2012

Cross-Lingual Mixture Model for Sentiment Classification.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

Joint Inference of Named Entity Recognition and Normalization for Tweets.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

QuickView: NLP-based Tweet Search.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, 2012

Exacting Social Events for Tweets Using a Factor Graph.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Collective Nominal Semantic Role Labeling for Tweets.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
Document-Aware Graph Models for Query-Oriented Multi-document Summarization.
Proceedings of the Multimedia Analysis, Processing and Communications, 2011

Semantic-Preserving Word Clouds by Seam Carving.
Comput. Graph. Forum, 2011

QuickView: advanced search of tweets.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Topic sentiment analysis in twitter: a graph-based hashtag sentiment classification approach.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Recognizing Named Entities in Tweets.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2010
OpinionSeer: Interactive Visualization of Hotel Customer Feedback.
IEEE Trans. Vis. Comput. Graph., 2010

A document-sensitive graph model for multi-document summarization.
Knowl. Inf. Syst., 2010

iRANK: A rank-learn-combine framework for unsupervised ensemble ranking.
J. Assoc. Inf. Sci. Technol., 2010

Context-Preserving, Dynamic Word Cloud Visualization.
IEEE Computer Graphics and Applications, 2010

TIARA: a visual exploratory text analytic system.
Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2010

Modeling Document Summarization as Multi-objective Optimization.
Proceedings of the Third International Symposium on Intelligent Information Technology and Security Informatics, 2010

Understanding text corpora with multiple facets.
Proceedings of the 5th IEEE Conference on Visual Analytics Science and Technology, 2010

VisWorks text and network visual analytics: VAST 2010 Mini Challenge 1 award: "Effective interactive visualization of document contents".
Proceedings of the 5th IEEE Conference on Visual Analytics Science and Technology, 2010

Towards Building an Integrated Information Platform for Eco-city.
Proceedings of the IEEE 7th International Conference on e-Business Engineering, 2010

Constrained Coclustering for Textual Documents.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

2009
Applying two-level reinforcement ranking in query-oriented multidocument summarization.
J. Assoc. Inf. Sci. Technol., 2009

Query-Oriented Summarization Based on Neighborhood Graph Model.
Proceedings of the Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy, 2009

Learning Similarity Functions in Graph-Based Document Summarization.
Proceedings of the Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy, 2009

iRANK: an interactive ranking framework and its application in query-focused summarization.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

HyperSum: hypergraph based semi-supervised sentence ranking for query-oriented summarization.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Co-Feedback Ranking for Query-Focused Summarization.
Proceedings of the ACL 2009, 2009

2008
PolyU at TAC 2008.
Proceedings of the First Text Analysis Conference, 2008

Query-sensitive mutual reinforcement chain and its application in query-oriented multi-document summarization.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

Exploiting the Role of Named Entities in Query-Oriented Document Summarization.
Proceedings of the PRICAI 2008: Trends in Artificial Intelligence, 2008

Exploiting the Role of Position Feature in Chinese Relation Extraction.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

A Query-Sensitive Graph-Based Sentence Ranking Algorithm for Query-Oriented Multi-document Summarization.
Proceedings of the International Symposium on Information Processing, 2008

A Cluster-Sensitive Graph Model for Query-Oriented Multi-document Summarization.
Proceedings of the Advances in Information Retrieval , 2008

PNR2: Ranking Sentences with Positive and Negative Reinforcement for Query-Oriented Update Summarization.
Proceedings of the COLING 2008, 2008

A Novel Feature-based Approach to Chinese Entity Relation Extraction.
Proceedings of the ACL 2008, 2008

2005
An Application of FIPA-ACL Based Agent Communication in Mobile-GPS System.
Proceedings of the 2005 International Conference on Wireless Networks, 2005


  Loading...