Jian Jiao

Orcid: 0000-0003-4779-9588

Affiliations:
  • Microsoft, Redmond, WA, USA


According to our database1, Jian Jiao authored at least 43 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
CROSS-JEM: Accurate and Efficient Cross-encoders for Short-text Ranking Tasks.
CoRR, 2024

On the Necessity of World Knowledge for Mitigating Missing Labels in Extreme Classification.
CoRR, 2024

Task Oriented In-Domain Data Augmentation.
CoRR, 2024

Scaling the Vocabulary of Non-autoregressive Models for Efficient Generative Retrieval.
CoRR, 2024

Rho-1: Not All Tokens Are What You Need.
CoRR, 2024

Graph Regularized Encoder Training for Extreme Classification.
CoRR, 2024

AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track, 2024

Extreme Meta-Classification for Large-Scale Zero-Shot Retrieval.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Optimizing Novelty of Top-k Recommendations using Large Language Models and Reinforcement Learning.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

OAK: Enriching Document Representations using Auxiliary Knowledge for Extreme Classification.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
AutoHint: Automatic Prompt Optimization with Hint Generation.
CoRR, 2023

Pre-training Transformers for Knowledge Graph Completion.
CoRR, 2023

PROD: Progressive Distillation for Dense Retrieval.
Proceedings of the ACM Web Conference 2023, 2023

NGAME: Negative Mining-aware Mini-batching for Extreme Classification.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Renee: End-To-End Training of Extreme Classification Models.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

Deep Encoders with Auxiliary Parameters for Extreme Classification.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

CAPSTONE: Curriculum Sampling for Dense Retrieval with Document Expansion.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

DeepTagger: Knowledge Enhanced Named Entity Recognition for Web-Based Ads Queries.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Dual-Alignment Pre-training for Cross-lingual Sentence Embedding.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Curriculum Sampling for Dense Retrieval with Document Expansion.
CoRR, 2022

Efficient Long Sequence Modeling via State Space Augmented Transformer.
CoRR, 2022

LEAD: Liberal Feature-based Distillation for Dense Retrieval.
CoRR, 2022

A Self-Paced Mixed Distillation Method for Non-Autoregressive Generation.
CoRR, 2022

CULG: Commercial Universal Language Generation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track, 2022

Taming Sparsely Activated Transformer with Stochastic Experts.
Proceedings of the Tenth International Conference on Learning Representations, 2022

PromptBERT: Improving BERT Sentence Embeddings with Prompts.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Metric-guided Distillation: Distilling Knowledge from the Metric to Ranker and Retriever for Generative Commonsense Reasoning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
KFCNet: Knowledge Filtering and Contrastive Learning Network for Generative Commonsense Reasoning.
CoRR, 2021

GalaXC: Graph Neural Networks with Labelwise Attention for Extreme Classification.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Mask Attention Networks: Rethinking and Strengthen Transformer.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale Pretraining.
Proceedings of the 38th International Conference on Machine Learning, 2021

SiameseXML: Siamese Networks meet Extreme Classifiers with 100M Labels.
Proceedings of the 38th International Conference on Machine Learning, 2021

HittER: Hierarchical Transformers for Knowledge Graph Embeddings.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

KFCNet: Knowledge Filtering and Contrastive Learning for Generative Commonsense Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

GLGE: A New General Language Generation Evaluation Benchmark.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
TwinBERT: Distilling Knowledge to Twin-Structured BERT Models for Efficient Retrieval.
CoRR, 2020

ProphetNet-Ads: A Looking Ahead Strategy for Generative Retrieval Models in Sponsored Search Engine.
Proceedings of the Natural Language Processing and Chinese Computing, 2020

An Enhanced Knowledge Injection Model for Commonsense Generation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

TwinBERT: Distilling Knowledge to Twin-Structured Compressed BERT Models for Large-Scale Retrieval.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

2018
Recurrent Binary Embedding for GPU-Enabled Exhaustive Retrieval from Billion-Scale Semantic Vectors.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

2016
Deep Crossing: Web-Scale Modeling without Manually Crafted Combinatorial Features.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016


  Loading...