2024
Wisdom of Committee: Distilling from Foundation Model to Specialized Application Model.
CoRR, 2024
Co-optimize Content Generation and Consumption in a Large Scale Video Recommendation System.
,
,
,
,
,
,
,
,
,
,
Proceedings of the 18th ACM Conference on Recommender Systems, 2024
Learned Ranking Function: From Short-term Behavior Predictions to Long-term User Satisfaction.
Proceedings of the 18th ACM Conference on Recommender Systems, 2024
Bridging the Gap: Unpacking the Hidden Challenges in Knowledge Distillation for Online Ranking Systems.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 18th ACM Conference on Recommender Systems, 2024
LEVI: Generalizable Fine-tuning via Layer-wise Ensemble of Different Views.
,
,
,
,
,
,
,
,
,
,
Proceedings of the Forty-first International Conference on Machine Learning, 2024
2023
Talking Models: Distill Pre-trained Knowledge to Downstream Models via Interactive Communication.
CoRR, 2023
COMET: Learning Cardinality Constrained Mixture of Experts with Trees and Local Search.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023
Fast as CHITA: Neural Network Pruning with Combinatorial Optimization.
Proceedings of the International Conference on Machine Learning, 2023
Multitask Ranking System for Immersive Feed and No More Clicks: A Case Study of Short-Form Video Recommendation.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023
2022
Can Small Heads Help? Understanding and Improving Multi-Task Generalization.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022
Transformer Memory as a Differentiable Search Index.
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Improving Multi-Task Generalization via Regularizing Spurious Correlation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
HyperPrompt: Prompt-based Task-Conditioning of Transformers.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the International Conference on Machine Learning, 2022
2021
DSelect-k: Differentiable Selection in the Mixture of Experts with Applications to Multi-Task Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Efficiently Identifying Task Groupings for Multi-Task Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Synthesizer: Rethinking Self-Attention for Transformer Models.
Proceedings of the 38th International Conference on Machine Learning, 2021
HyperGrid Transformers: Towards A Single Model for Multiple Tasks.
Proceedings of the 9th International Conference on Learning Representations, 2021
Learning-to-Rank with Partitioned Preference: Fast Estimation for the Plackett-Luce Model.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021
2020
Measuring and Harnessing Transference in Multi-Task Learning.
CoRR, 2020
Small Towers Make Big Differences.
CoRR, 2020
HyperGrid: Efficient Multi-Task Transformers with Grid-wise Decomposable Hyper Projections.
CoRR, 2020
Understanding and Improving Knowledge Distillation.
CoRR, 2020
Off-policy Learning in Two-stage Recommender Systems.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020
Multitask Mixture of Sequential Experts for User Activity Streams.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020
2019
Recommending what video to watch next: a multitask ranking system.
Proceedings of the 13th ACM Conference on Recommender Systems, 2019
Sampling-bias-corrected neural modeling for large corpus item recommendations.
Proceedings of the 13th ACM Conference on Recommender Systems, 2019
Fairness in Recommendation Ranking through Pairwise Comparisons.
,
,
,
,
,
,
,
,
,
,
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019
SNR: Sub-Network Routing for Flexible Parameter Sharing in Multi-Task Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Identify Shifts of Word Semantics through Bayesian Surprise.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018
Modeling Task Relationships in Multi-task Learning with Multi-gate Mixture-of-Experts.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018
2017
Data Decisions and Theoretical Implications when Adversarially Learning Fair Representations.
CoRR, 2017
2016
Detecting Social Media Icebergs by Their Tips: Rumors, Persuasion Campaigns, and Information Needs.
Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, 2016
2015
Towards the prediction problems of bursting hashtags on Twitter.
J. Assoc. Inf. Sci. Technol., 2015
Enquiring Minds: Early Detection of Rumors in Social Media from Enquiry Posts.
Proceedings of the 24th International Conference on World Wide Web, 2015
Improving User Topic Interest Profiles by Behavior Factorization.
Proceedings of the 24th International Conference on World Wide Web, 2015
2014
On the Real-time Prediction Problems of Bursting Hashtags in Twitter.
CoRR, 2014
Real-Time Predicting Bursting Hashtags on Twitter.
Proceedings of the Web-Age Information Management - 15th International Conference, 2014
Predicting bursts and popularity of hashtags in real-time.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014
2013
Questions about questions: an empirical analysis of information needs on Twitter.
Proceedings of the 22nd International World Wide Web Conference, 2013
2012
A Framework for Similarity Search of Time Series Cliques with Natural Relations.
IEEE Trans. Knowl. Data Eng., 2012
Extracting representative motion flows for effective video retrieval.
Multim. Tools Appl., 2012
Recommending Flickr groups with social topic model.
Inf. Retr., 2012
2010
Multiple feature fusion for social media applications.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010
Efficient similarity matching of Time Series Cliques with natural relations.
Proceedings of the 26th International Conference on Data Engineering, 2010