Lei Li

Orcid: 0000-0003-3095-9776

Affiliations:
  • Carnegie Mellon University, School of Computer Science, Language Technologies Institute, PA, USA
  • University of California Santa Barbara, CA, USA (former)
  • ByteDance AI Lab, Beijing, China (former)
  • University of California, Berkeley, Computer Science Division, CA, USA (former)
  • Carnegie Mellon University, Computer Science Department, Pittsburgh, PA, USA (PhD 2011)
  • Shanghai Jiao Tong University, Department of Computer Science and Engineering, APEX Data and Knowledge Management Lab, Shanghai, China (2004 - 2006)


According to our database1, Lei Li authored at least 245 papers between 2006 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Scaling LLM Inference with Optimized Sample Compute Allocation.
CoRR, 2024

Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models.
CoRR, 2024

CA*: Addressing Evaluation Pitfalls in Computation-Aware Latency for Simultaneous Speech Translation.
CoRR, 2024

Understanding the Role of LLMs in Multimodal Evaluation Benchmarks.
CoRR, 2024

Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling.
CoRR, 2024

Translation Canvas: An Explainable Interface to Pinpoint and Analyze Translation Systems.
CoRR, 2024

Efficiently Identifying Watermarked Segments in Mixed-Source Texts.
CoRR, 2024

TypedThinker: Typed Thinking Improves Large Language Model Reasoning.
CoRR, 2024

Global Human-guided Counterfactual Explanations for Molecular Properties via Reinforcement Learning.
CoRR, 2024

BPO: Supercharging Online Preference Learning by Adhering to the Proximity of Behavior LLM.
CoRR, 2024

Evaluating Durability: Benchmark Insights into Multimodal Watermarking.
CoRR, 2024

Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition.
CoRR, 2024

Hire a Linguist!: Learning Endangered Languages with In-Context Linguistic Descriptions.
CoRR, 2024

Structure-Based Drug Design via 3D Molecular Generative Pre-training and Sampling.
CoRR, 2024

Perils of Self-Feedback: Self-Bias Amplifies in Large Language Models.
CoRR, 2024

Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs.
CoRR, 2024

KS-Lottery: Finding Certified Lottery Tickets for Multilingual Language Models.
CoRR, 2024

Weak-to-Strong Jailbreaking on Large Language Models.
CoRR, 2024

Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

LLMRefine: Pinpointing and Refining Large Language Models via Fine-Grained Actionable Feedback.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Global Human-guided Counterfactual Explanations for Molecular Properties via Reinforcement Learning.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Generative Enzyme Design Guided by Functionally Important Sites and Small-Molecule Substrates.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

SurfPro: Functional Protein Design Based on Continuous Surface.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

DE-COP: Detecting Copyrighted Content in Language Models Training Data.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Provable Robust Watermarking for AI-Generated Text.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

BPO: Staying Close to the Behavior LLM Creates Better Online LLM Alignment.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Learning Personalized Alignment for Evaluating Open-ended Text Generation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

LumberChunker: Long-Form Narrative Document Segmentation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Hire a Linguist!: Learning Endangered Languages in LLMs with In-Context Linguistic Descriptions.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Pride and Prejudice: LLM Amplifies Self-Bias in Self-Refinement.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Where It Really Matters: Few-Shot Environmental Conservation Media Monitoring for Low-Resource Languages.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Pinpoint, Not Criticize: Refining Large Language Models via Fine-Grained Actionable Feedback.
CoRR, 2023

How Multilingual is Multilingual LLM?
CoRR, 2023

Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding.
CoRR, 2023

Functional Geometry Guided Protein Sequence and Backbone Structure Co-Design.
CoRR, 2023

Learning Personalized Story Evaluation.
CoRR, 2023

Joint Design of Protein Sequence and Structure based on Motifs.
CoRR, 2023

Extrapolating Large Language Models to Non-English by Aligning Languages.
CoRR, 2023

Generative Autoencoders as Watermark Attackers: Analyses of Vulnerabilities and Threats.
CoRR, 2023

Prompt Optimization of Large Language Model for Interactive Tasks without Gradient and Demonstrations.
CoRR, 2023

Learn from Mistakes through Cooperative Interaction with Study Assistant.
CoRR, 2023

Statistical Knowledge Assessment for Generative Language Models.
CoRR, 2023

HIORE: Leveraging High-order Interactions for Unified Entity Relation Extraction.
CoRR, 2023

A Survey for In-context Learning.
CoRR, 2023

ALGO: Synthesizing Algorithmic Programs with Generated Oracle Verifiers.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Statistical Knowledge Assessment for Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Accelerating Antimicrobial Peptide Discovery with Latent Structure.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Protecting Language Generation Models via Invisible Watermarking.
Proceedings of the International Conference on Machine Learning, 2023

ReDi: Efficient Learning-Free Diffusion Inference via Trajectory Retrieval.
Proceedings of the International Conference on Machine Learning, 2023

Importance Weighted Expectation-Maximization for Protein Sequence Design.
Proceedings of the International Conference on Machine Learning, 2023

INSTRUCTSCORE: Towards Explainable Text Generation Evaluation with Automatic Feedback.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Extrapolating Multilingual Understanding Models as Multilingual Generators.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Learning from Mistakes via Cooperative Study Assistant for Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

AutoPlan: Automatic Planning of Interactive Decision-Making Tasks With Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Pre-trained Language Models Can be Fully Zero-Shot Learners.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Lego-MT: Learning Detachable Models for Massively Multilingual Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

SESCORE2: Learning Text Generation Evaluation via Synthesizing Realistic Mistakes.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

WACO: Word-Aligned Contrastive Learning for Speech Translation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Say What You Mean! Large Language Models Speak Too Positively about Negative Commonsense Knowledge.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Converge to the Truth: Factual Error Correction via Iterative Constrained Editing.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
ICM-3D: Instantiated Category Modeling for 3D Instance Segmentation.
IEEE Robotics Autom. Lett., 2022

SOLO: A Simple Framework for Instance Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Lego-MT: Towards Detachable Models in Massively Multilingual Machine Translation.
CoRR, 2022

Go-tuning: Improving Zero-shot Learning Abilities of Smaller Language Models.
CoRR, 2022

Accelerating Antimicrobial Peptide Discovery with Latent Sequence-Structure Model.
CoRR, 2022

SEScore2: Retrieval Augmented Pretraining for Text Generation Evaluation.
CoRR, 2022

PARAGEN : A Parallel Generation Toolkit.
CoRR, 2022

LightSeq2: Accelerated Training for Transformer-Based Models on GPUs.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Provably Confidential Language Modelling.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Cross-modal Contrastive Learning for Speech Translation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

MTG: A Benchmark Suite for Multilingual Text Generation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Uncovering the Heterogeneous Effects of Preference Diversity on User Activeness: A Dynamic Mixture Model.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

On the Impact of Noises in Crowd-Sourced Data for Speech Translation.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022

Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

On the Learning of Non-Autoregressive Transformers.
Proceedings of the International Conference on Machine Learning, 2022

Enhancing Cross-lingual Transfer by Manifold Mixup.
Proceedings of the Tenth International Conference on Learning Representations, 2022

switch-GLAT: Multilingual Parallel Machine Translation Via Code-Switch Decoder.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Distillation-Resistant Watermarking for Model Protection in NLP.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Not All Errors are Equal: Learning Text Generation Metrics using Stratified Error Synthesis.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Calibrating Factual Knowledge in Pretrained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Compressing Sentence Representation for Semantic Retrieval via Homomorphic Projective Distillation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Rethinking Document-level Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Contextual Representation Learning beyond Masked Language Modeling.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Learning When to Translate for Streaming Speech.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

E-KAR: A Benchmark for Rationalizing Natural Language Analogical Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

latent-GLAT: Glancing at Latent Variables for Parallel Text Generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Unsupervised Editing for Counterfactual Stories.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

LOREN: Logic-Regularized Reasoning for Interpretable Fact Verification.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Non-autoregressive Translation with Layer-Wise Prediction and Deep Supervision.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
A Survey on Green Deep Learning.
CoRR, 2021

CNewSum: A Large-scale Chinese News Summarization Dataset with Human-annotated Adequacy and Deducibility Level.
CoRR, 2021

LightSeq: Accelerated Training for Transformer-based Models on GPUs.
CoRR, 2021

UniST: Unified End-to-end Model for Streaming and Non-streaming Speech Translation.
CoRR, 2021

MTG: A Benchmarking Suite for Multilingual Text Generation.
CoRR, 2021

Serial or Parallel? Plug-able Adapter for multilingual machine translation.
CoRR, 2021

Auto Correcting in the Process of Translation - Multi-task Learning Improves Dialogue Machine Translation.
CoRR, 2021

Triangular Bidword Generation for Sponsored Search Auction.
Proceedings of the WSDM '21, 2021

The Volctrans GLAT System: Non-autoregressive Translation Meets WMT21.
Proceedings of the Sixth Conference on Machine Translation, 2021

Follow Your Path: A Progressive Method for Knowledge Distillation.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2021

CNewSum: A Large-Scale Summarization Dataset with Human-Annotated Adequacy and Deducibility Level.
Proceedings of the Natural Language Processing and Chinese Computing, 2021

Duplex Sequence-to-Sequence Learning for Reversible Machine Translation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Autocorrect in the Process of Translation - Multi-task Learning Improves Dialogue Machine Translation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers, 2021

LightSeq: A High Performance Inference Library for Transformers.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers, 2021

Cross-lingual Supervision Improves Unsupervised Neural Machine Translation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers, 2021

Generative Imagination Elevates Machine Translation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

The Volctrans Neural Speech Translation System for IWSLT 2021.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Learning to Design and Construct Bridge without Blueprint.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Simultaneous Semantic and Collision Learning for 6-DoF Grasp Pose Estimation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

End-to-End Speech Translation via Cross-Modal Progressive Training.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Adversarial Option-Aware Hierarchical Imitation Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

MARS: Markov Molecular Sampling for Multi-objective Drug Discovery.
Proceedings of the 9th International Conference on Learning Representations, 2021

Counter-Interference Adapter for Multilingual Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Gradient-Based Adversarial Factual Consistency Evaluation for Abstractive Summarization.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Secoco: Self-Correcting Encoding for Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Multilingual Translation via Grafting Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Learning Logic Rules for Document-Level Relation Extraction.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Learning Kernel-Smoothed Machine Translation with Retrieved Examples.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

ENPAR: Enhancing Entity and Entity Pair Representations for Joint Entity Relation Extraction.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Dense Contrastive Learning for Self-Supervised Visual Pre-Training.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Sparse R-CNN: End-to-End Object Detection With Learnable Proposals.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Locate Then Segment: A Strong Pipeline for Referring Image Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Scale-Aware Automatic Augmentation for Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

NeurST: Neural Speech Translation Toolkit.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Vocabulary Learning via Optimal Transport for Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Document-level Event Extraction via Heterogeneous Graph-based Interaction Model with a Tracker.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Language Tags Matter for Zero-Shot Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

UniRE: A Unified Label Space for Entity Relation Extraction.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Contrastive Aligned Joint Learning for Multilingual Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Probabilistic Graph Reasoning for Natural Proof Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Glancing Transformer for Non-Autoregressive Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Contrastive Learning for Many-to-many Multilingual Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Learning Language Specific Sub-network for Multilingual Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Learning Shared Semantic Space for Speech-to-Text Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Taxonomy Completion via Triplet Matching Network.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

TextGAIL: Generative Adversarial Imitation Learning for Text Generation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Finding Sparse Structures for Domain Specific Neural Machine Translation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

ACMo: Angle-Calibrated Moment Methods for Stochastic Optimization.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Listen, Understand and Translate: Triple Supervision Decouples End-to-end Speech-to-text Translation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Consecutive Decoding for Speech-to-text Translation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
FoveaBox: Beyound Anchor-Based Object Detection.
IEEE Trans. Image Process., 2020

Towards a new generation of artificial intelligence in China.
Nat. Mach. Intell., 2020

VOLT: Improving Vocabularization via Optimal Transport for Machine Translation.
CoRR, 2020

LOREN: Logic Enhanced Neural Reasoning for Fact Verification.
CoRR, 2020

Finding Sparse Structure for Domain Specific Neural Machine Translation.
CoRR, 2020

NeurST: Neural Speech Translation Toolkit.
CoRR, 2020

Reciprocal Supervised Learning Improves Neural Machine Translation.
CoRR, 2020

Language Generation via Combinatorial Constraint Satisfaction: A Tree Search Enhanced Monte-Carlo Approach.
CoRR, 2020

LightSeq: A High Performance Inference Library for Sequence Processing and Generation.
CoRR, 2020

Capturing Longer Context for Document-level Neural Machine Translation: A Multi-resolutional Approach.
CoRR, 2020

SDST: Successive Decoding for Speech-to-text Translation.
CoRR, 2020

TED: Triple Supervision Decouples End-to-end Speech-to-text Translation.
CoRR, 2020

Adaptive Gradient Methods Can Be Provably Faster than SGD after Finite Epochs.
CoRR, 2020

Active Sentence Learning by Adversarial Uncertainty Sampling in Discrete Space.
CoRR, 2020

Unsupervised Neural Machine Translation with Indirect Supervision.
CoRR, 2020

SOLOv2: Dynamic, Faster and Stronger.
CoRR, 2020

SPAN: A Stochastic Projected Approximate Newton Method.
CoRR, 2020

Volctrans Parallel Corpus Filtering System for WMT 2020.
Proceedings of the Fifth Conference on Machine Translation, 2020

The Volctrans Machine Translation System for WMT20.
Proceedings of the Fifth Conference on Machine Translation, 2020

QuAChIE: Question Answering based Chinese Information Extraction System.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

SOLOv2: Dynamic and Fast Instance Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Dispersed Exponential Family Mixture VAEs for Interpretable Text Generation.
Proceedings of the 37th International Conference on Machine Learning, 2020

Mirror-Generative Neural Machine Translation.
Proceedings of the 8th International Conference on Learning Representations, 2020

Variational Template Machine for Data-to-Text Generation.
Proceedings of the 8th International Conference on Learning Representations, 2020

Constraint Satisfaction Driven Natural Language Generation: A Tree Search Embedded MCMC Approach.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Double Graph Based Reasoning for Document-level Relation Extraction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Active Sentence Learning by Adversarial Uncertainty Sampling in Discrete Space.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Pre-training Multilingual Neural Machine Translation by Leveraging Alignment Information.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

On the Sentence Embeddings from Pre-trained Language Models.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

SOLO: Segmenting Objects by Locations.
Proceedings of the Computer Vision - ECCV 2020, 2020

XREF: Entity Linking for Chinese News Comments with Supplementary Article Reference.
Proceedings of the Conference on Automated Knowledge Base Construction, 2020

Improving Maximum Likelihood Training for Text Generation with Density Ratio Estimation.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Xiaomingbot: A Multilingual Robot News Reporter.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

Do you have the right scissors? Tailoring Pre-trained Language Models via Monte-Carlo Methods.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Towards Making the Most of BERT in Neural Machine Translation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Importance-Aware Learning for Neural Headline Editing.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Task-Aware Monocular Depth Estimation for 3D Object Detection.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

SPAN: A Stochastic Projected Approximate Newton Method.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Non-autoregressive Transformer by Position Learning.
CoRR, 2019

Cross-Lingual Vision-Language Navigation.
CoRR, 2019

Effective Domain Knowledge Transfer with Soft Fine-tuning.
CoRR, 2019

Fixing Gaussian Mixture VAEs for Interpretable Text Generation.
CoRR, 2019

UniVSE: Robust Visual Semantic Embeddings via Structured Semantic Representations.
CoRR, 2019

Kernelized Bayesian Softmax for Text Generation.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Uncovering the Co-driven Mechanism of Social and Content Links in User Churn Phenomena.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Rethinking Text Attribute Transfer: A Lexical Analysis.
Proceedings of the 12th International Conference on Natural Language Generation, 2019

Correct-and-Memorize: Learning to Translate from Interactive Revisions.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

GraspSnooker: Automatic Chinese Commentary Generation for Snooker Videos.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

SVD: A Large-Scale Short Video Dataset for Near-Duplicate Video Retrieval.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Towards Linear Time Neural Machine Translation with Capsule Networks.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Discreteness in Neural Natural Language Processing.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Unified Visual-Semantic Embeddings: Bridging Vision and Language With Structured Meaning Representations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

What You Look Matters?: Offline Evaluation of Advertising Creatives for Cold-start Problem.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Generating Fluent Adversarial Examples for Natural Languages.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Dynamically Fused Graph Network for Multi-hop Reasoning.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Generating Sentences from Disentangled Syntactic and Semantic Spaces.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

CGMH: Constrained Sentence Generation by Metropolis-Hastings Sampling.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Overview of the NLPCC 2018 Shared Task: Single Document Summarization.
Proceedings of the Natural Language Processing and Chinese Computing, 2018

BRITS: Bidirectional Recurrent Imputation for Time Series.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Reinforced Co-Training.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

On Tree-Based Neural Sentence Modeling.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Jersey Number Recognition With Semi-Supervised Spatial Transformer Network.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017
Nonlinear Dynamics of Information Diffusion in Social Networks.
ACM Trans. Web, 2017

DAPs: Mining using Change-Point Detection of Epileptic Activity Time Series Data.
J. Inf. Sci. Eng., 2017

SAM: Semantic Attribute Modulated Language Modeling.
CoRR, 2017

Overview of the NLPCC 2017 Shared Task: Single Document Summarization.
Proceedings of the Natural Language Processing and Chinese Computing, 2017

A Nearly-Black-Box Online Algorithm for Joint Parameter and State Estimation in Temporal Models.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Concept over time: the combination of probabilistic topic model with wikipedia knowledge.
Expert Syst. Appl., 2016

Towards Practical Bayesian Parameter and State Estimation.
CoRR, 2016

Swift: Compiled Inference for Probabilistic Programming Languages.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

CFO: Conditional Focused Neural Question Answering with Large-scale Knowledge Bases.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2014
Beyond Poisson: Modeling Inter-Arrival Time of Requests in a Datacenter.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2014

Multimodal Information Fusion for Robust Heart Beat Detection.
Proceedings of the Computing in Cardiology, CinC 2014, 2014

2013
F-Trail: Finding Patterns in Taxi Trajectories.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2013

Multilinear Dynamical Systems for Tensor Time Series.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Why people hate your app: making sense of user feedback in a mobile app store.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

The Extended Parameter Filter.
Proceedings of the 30th International Conference on Machine Learning, 2013

Hibernating Process: Modelling Mobile Calls at Multiple Scales.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

Writing and sketching in the air, recognizing and controlling on the fly.
Proceedings of the 2013 ACM SIGCHI Conference on Human Factors in Computing Systems, 2013

Dynamic Scaled Sampling for Deterministic Constraints.
Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics, 2013

2012
A Novel Violent Videos Classification Scheme Based on the Bag of Audio Words Features.
Int. J. Comput. Intell. Appl., 2012

Rise and fall patterns of information diffusion: model and implications.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

RolX: structural role extraction & mining in large graphs.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

2011
Fast Algorithms for Mining Co-evolving Time Series.
PhD thesis, 2011

WindMine: Fast and Effective Mining of Web-click Sequences.
Proceedings of the Eleventh SIAM International Conference on Data Mining, 2011

ThermoCast: a cyber-physical forecasting model for datacenters.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

It's who you know: graph mining using recursive structural features.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

Time Series Clustering: Complex is Simpler!
Proceedings of the 28th International Conference on Machine Learning, 2011

Mobile Phone Graph Evolution: Findings, Model and Interpretation.
Proceedings of the Data Mining Workshops (ICDMW), 2011

2010
Parsimonious Linear Fingerprinting for Time Series.
Proc. VLDB Endow., 2010

Efficient Parallel Learning of Hidden Markov Chain Models on SMPs.
IEICE Trans. Inf. Syst., 2010

BoLeRO: A Principled Technique for Including Bone Length Constraints in Motion Capture Occlusion Filling.
Proceedings of the 2010 Eurographics/ACM SIGGRAPH Symposium on Computer Animation, 2010

Metric forensics: a multi-level approach for mining volatile graphs.
Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2010

Fast algorithms for time series mining.
Proceedings of the Workshops Proceedings of the 26th International Conference on Data Engineering, 2010

2009
Tailoring click models to user goals.
Proceedings of the 2009 workshop on Web Search Click Data, 2009

DynaMMo: mining and summarization of coevolving sequences with missing values.
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28, 2009

2008
C-DEM: a multi-modal query system for Drosophila Embryo databases.
Proc. VLDB Endow., 2008

Efficient Distribution Mining and Classification.
Proceedings of the SIAM International Conference on Data Mining, 2008

Cut-and-stitch: efficient parallel learning of linear dynamical systems on smps.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

Inferring privacy information via social relations.
Proceedings of the 24th International Conference on Data Engineering Workshops, 2008

Laziness is a Virtue: Motion Stitching Using Effort Minimization.
Proceedings of the 29th Annual Conference of the European Association for Computer Graphics, 2008

2006
Providing an Uncertainty Reasoning Service for Semantic Web Application.
Proceedings of the Frontiers of WWW Research and Development, 2006


  Loading...