Jingren Zhou

Orcid: 0000-0002-4220-2634

Affiliations:
  • Microsoft Research


According to our database1, Jingren Zhou authored at least 318 papers between 2002 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
A Survey of Distributed Graph Algorithms on Massive Graphs.
ACM Comput. Surv., February, 2025

2024
Towards Efficient Graph Processing in Geo-Distributed Data Centers.
IEEE Trans. Parallel Distributed Syst., November, 2024

Lero: applying learning-to-rank in query optimizer.
VLDB J., September, 2024

Lindorm-UWC: An Ultra-Wide-Column Database for Internet of Vehicles.
Proc. VLDB Endow., August, 2024

Towards Millions of Database Transmission Services in the Cloud.
Proc. VLDB Endow., August, 2024

Ingress: an automated incremental graph processing system.
VLDB J., May, 2024

PASS: Patch Automatic Skip Scheme for Efficient On-Device Video Perception.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Performance-Based Pricing of Federated Learning via Auction.
Proc. VLDB Endow., February, 2024

PilotScope: Steering Databases with Machine Learning Drivers.
Proc. VLDB Endow., January, 2024

Eraser: Eliminating Performance Regression on Learned Query Optimizer.
Proc. VLDB Endow., January, 2024

XGNN: Boosting Multi-GPU GNN Training via Global GNN Memory Store.
Proc. VLDB Endow., January, 2024

Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation.
Proc. VLDB Endow., January, 2024

Learned Query Optimizers.
Found. Trends Databases, 2024

Language Models can Self-Lengthen to Generate Long Texts.
CoRR, 2024

In-Context LoRA for Diffusion Transformers.
CoRR, 2024

Aligning Large Language Models via Self-Steering Optimization.
CoRR, 2024

Group Diffusion Transformers are Unsupervised Multitask Learners.
CoRR, 2024

AsymKV: Enabling 1-Bit Quantization of KV Cache with Layer-Wise Asymmetric Quantization Configurations.
CoRR, 2024

GenSim: A General Social Simulation Platform with Large Language Model based Agents.
CoRR, 2024

ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer.
CoRR, 2024

Rotated Runtime Smooth: Training-Free Activation Smoother for accurate INT4 inference.
CoRR, 2024

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution.
CoRR, 2024

Qwen2.5-Coder Technical Report.
CoRR, 2024

Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement.
CoRR, 2024

Hierarchical Reinforcement Learning for Temporal Abstraction of Listwise Recommendation.
CoRR, 2024

mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding.
CoRR, 2024

Towards a Converged Relational-Graph Optimization Framework.
CoRR, 2024

On the Element-Wise Representation and Reasoning in Zero-Shot Image Recognition: A Systematic Survey.
CoRR, 2024

mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models.
CoRR, 2024

Very Large-Scale Multi-Agent Simulation in AgentScope.
CoRR, 2024

On the Design and Analysis of LLM-Based Algorithms.
CoRR, 2024

Data-Juicer Sandbox: A Comprehensive Suite for Multimodal Data-Model Co-development.
CoRR, 2024

Qwen2-Audio Technical Report.
CoRR, 2024

Qwen2 Technical Report.
CoRR, 2024

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models.
CoRR, 2024

PRICE: A Pretrained Model for Cross-Database Cardinality Estimation.
CoRR, 2024

A Survey on Self-Evolution of Large Language Models.
CoRR, 2024

RoleInteract: Evaluating the Social Interaction of Role-Playing Agents.
CoRR, 2024

mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding.
CoRR, 2024

AgentScope: A Flexible yet Robust Multi-Agent Platform.
CoRR, 2024

AI Hospital: Interactive Evaluation and Collaboration of LLMs as Intern Doctors for Clinical Diagnosis.
CoRR, 2024

EE-Tuning: An Economical yet Scalable Solution for Tuning Early-Exit Large Language Models.
CoRR, 2024

A Graph-Native Query Optimization Framework.
CoRR, 2024

Fine-Grained Zero-Shot Learning: Advances, Challenges, and Prospects.
CoRR, 2024

WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope.
CoRR, 2024

Unicron: Economizing Self-Healing LLM Training at Scale.
CoRR, 2024

Learned Query Optimizer: What is New and What is Next.
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024

GraphScope Flex: LEGO-like Graph Computing Stack.
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024

Data-Juicer: A One-Stop Data Processing System for Large Language Models.
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024

Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

UniDM: A Unified Framework for Data Manipulation with Large Language Models.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024

FederatedScope-LLM: A Comprehensive Package for Fine-tuning Large Language Models in Federated Learning.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Lipschitz Singularities in Diffusion Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Bw<sup>e</sup>-tree: An Evolution of Bw-tree on Fast Storage.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Dream Video: Composing Your Dream Videos with Customized Subject and Motion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SMARTFEAT: Efficient Feature Construction through Feature-Level Foundation Model Interactions.
Proceedings of the 14th Conference on Innovative Data Systems Research, 2024

AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Tempura: a general cost-based optimizer framework for incremental data processing (Journal Version).
VLDB J., November, 2023

RAGraph: A Region-Aware Framework for Geo-Distributed Graph Processing.
Proc. VLDB Endow., November, 2023

Towards Data-Independent Knowledge Transfer in Model-Heterogeneous Federated Learning.
IEEE Trans. Computers, October, 2023

Edge-Cloud Polarization and Collaboration: A Comprehensive Survey for AI.
IEEE Trans. Knowl. Data Eng., July, 2023

GraphNAS++: Distributed Architecture Search for Graph Neural Networks.
IEEE Trans. Knowl. Data Eng., July, 2023

Application-driven graph partitioning.
VLDB J., January, 2023

Contrastive Attraction and Contrastive Repulsion for Representation Learning.
Trans. Mach. Learn. Res., 2023

CogKR: Cognitive Graph for Multi-Hop Knowledge Reasoning.
IEEE Trans. Knowl. Data Eng., 2023

Lero: A Learning-to-Rank Query Optimizer.
Proc. VLDB Endow., 2023

FederatedScope: A Flexible Federated Learning Platform for Heterogeneity.
Proc. VLDB Endow., 2023

ALECE: An Attention-based Learned Cardinality Estimator for SPJ Queries on Dynamic Workloads.
Proc. VLDB Endow., 2023

FS-Real: A Real-World Cross-Device Federated Learning Platform.
Proc. VLDB Endow., 2023

BASE: Bridging the Gap between Cost and Latency for Query Optimization.
Proc. VLDB Endow., 2023

Vineyard: Optimizing Data Sharing in Data-Intensive Analytics.
Proc. ACM Manag. Data, 2023

GraphScope Flex: LEGO-like Graph Computing Stack.
CoRR, 2023

Enhancing Data Lakes with GraphAr: Efficient Graph Data Management with a Specialized Storage Scheme.
CoRR, 2023

DreamVideo: Composing Your Dream Videos with Customized Subject and Motion.
CoRR, 2023

Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models.
CoRR, 2023

mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration.
CoRR, 2023

I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models.
CoRR, 2023

WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models.
CoRR, 2023

OccuQuest: Mitigating Occupational Bias for Inclusive Large Language Models.
CoRR, 2023

ALECE: An Attention-based Learned Cardinality Estimator for SPJ Queries on Dynamic Workloads (Extended).
CoRR, 2023

Qwen Technical Report.
CoRR, 2023

ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models.
CoRR, 2023

TouchStone: Evaluating Vision-Language Models by Language Models.
CoRR, 2023

Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities.
CoRR, 2023

CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility.
CoRR, 2023

Matching in the Wild: Learning Anatomical Embeddings for Multi-Modality Images.
CoRR, 2023

Eliminating Lipschitz Singularities in Diffusion Models.
CoRR, 2023

On Knowledge Editing in Federated Learning: Perspectives, Challenges, and Future Directions.
CoRR, 2023

Cones 2: Customizable Image Synthesis with Multiple Subjects.
CoRR, 2023

ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities.
CoRR, 2023

ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human.
CoRR, 2023

VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation.
CoRR, 2023

Rethinking Efficient Tuning Methods from a Unified Perspective.
CoRR, 2023

Continual Segment: Towards a Single, Unified and Accessible Continual Segmentation Model of 143 Whole-body Organs in CT Scans.
CoRR, 2023

Towards a Single Unified Model for Effective Detection, Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection of CT Scans.
CoRR, 2023

Path-specific Causal Fair Prediction via Auxiliary Graph Structure Learning.
Proceedings of the ACM Web Conference 2023, 2023

Legion: Automatically Pushing the Envelope of Multi-GPU System for Billion-Scale GNN Training.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023

Bridging the Gap between Relational OLTP and Graph-based OLAP.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023

GLogS: Interactive Graph Pattern Matching Query At Large Scale.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023

FaceComposer: A Unified Model for Versatile Facial Content Creation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

VideoComposer: Compositional Video Synthesis with Motion Controllability.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Customizable Image Synthesis with Multiple Subjects.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from Backbone.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Deformable Medical Image Registration Under Distribution Shifts with Neural Instance Optimization.
Proceedings of the Machine Learning in Medical Imaging - 14th International Workshop, 2023

Parse and Recall: Towards Accurate Lung Nodule Malignancy Prediction Like Radiologists.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Cluster-Induced Mask Transformers for Effective Opportunistic Gastric Cancer Screening on Non-contrast CT Scans.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Liver Tumor Screening and Diagnosis in CT with Pixel-Lesion-Patient Network.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

M<sup>2</sup>Fusion: Bayesian-Based Multimodal Multi-level Fusion on Colorectal Cancer Microsatellite Instability Prediction.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023 Workshops, 2023

SAMConvex: Fast Discrete Optimization for CT Registration Using Self-supervised Anatomical Embedding and Correlation Pyramid.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Improved Prognostic Prediction of Pancreatic Cancer Using Multi-phase CT by Integrating Neural Distance and Texture-Aware Transformer.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

SLPT: Selective Labeling Meets Prompt Tuning on Label-Limited Lesion Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

FS-REAL: Towards Real-World Cross-Device Federated Learning.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Meta-information-Aware Dual-path Transformer for Differential Diagnosis of Multi-type Pancreatic Lesions in Multi-phase CT.
Proceedings of the Information Processing in Medical Imaging, 2023

MetaViT: Metabolism-Aware Vision Transformer for Differential Diagnosis of Parkinsonism with <sup>18</sup>F-FDG PET.
Proceedings of the Information Processing in Medical Imaging, 2023

MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for speech recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

RLEG: Vision-Language Representation Learning with Diffusion-based Embedding Generation.
Proceedings of the International Conference on Machine Learning, 2023

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video.
Proceedings of the International Conference on Machine Learning, 2023

Cones: Concept Neurons in Diffusion Models for Customized Generation.
Proceedings of the International Conference on Machine Learning, 2023

Composer: Creative and Controllable Image Synthesis with Composable Conditions.
Proceedings of the International Conference on Machine Learning, 2023

Learned Index with Dynamic $\epsilon$.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Layph: Making Change Propagation Constraint in Incremental Graph Processing by Layering Graph.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Efficient Multi-GPU Graph Processing with Remote Work Stealing.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Flash: A Framework for Programming Distributed Graph Processing Algorithms.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

ViM: Vision Middleware for Unified Downstream Transferring.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CancerUniT: Towards a Single Unified Model for Effective Detection, Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection of CT Scans.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Dimensionality-Varying Diffusion Process.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Devil is in the Queries: Advancing Mask Transformers for Real-world Medical Image Segmentation and Out-of-Distribution Localization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

RA-CLIP: Retrieval Augmented Contrastive Language-Image Pre-Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

LipFormer: High-fidelity and Generalizable Talking Face Generation with A Pre-learned Facial Codebook.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Neural Dependencies Emerging from Learning Massive Categories.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PASS: Patch Automatic Skip Scheme for Efficient Real-Time Video Perception on Edge Devices.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Maximum and top-k diversified biclique search at scale.
VLDB J., 2022

Banyan: A Scoped Dataflow Engine for Graph Query Service.
Proc. VLDB Endow., 2022

Fine-Grained Modeling and Optimization for Intelligent Resource Management in Big Data Processing.
Proc. VLDB Endow., 2022

OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models.
CoRR, 2022

Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese.
CoRR, 2022

Knowledge Distillation of Transformer-based Language Models Revisited.
CoRR, 2022

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections.
CoRR, 2022

M6-Rec: Generative Pretrained Language Models are Open-Ended Recommender Systems.
CoRR, 2022

FederatedScope: A Comprehensive and Flexible Federated Learning Platform via Message Passing.
CoRR, 2022

Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework.
CoRR, 2022

Uncovering Causal Effects of Online Short Videos on Consumer Behaviors.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

One Size Does Not Fit All: A Bandit-Based Sampler Combination Framework with Theoretical Guarantees.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

AD-AUG: Adversarial Data Augmentation for Counterfactual Recommendation.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

In-N-Out Generative Learning for Dense Unsupervised Video Segmentation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Device-cloud Collaborative Recommendation via Meta Controller.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

FederatedScope-GNN: Towards a Unified, Comprehensive and Efficient Package for Federated Graph Learning.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Sampling-based Estimation of the Number of Distinct Values in Distributed Environment.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

A Practical Introduction to Federated Learning.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework.
Proceedings of the International Conference on Machine Learning, 2022

Principled Knowledge Extrapolation with GANs.
Proceedings of the International Conference on Machine Learning, 2022

Reliable Adversarial Distillation with Unreliable Teachers.
Proceedings of the Tenth International Conference on Learning Representations, 2022

iFlood: A Stable and Effective Regularizer.
Proceedings of the Tenth International Conference on Learning Representations, 2022

GNNLab: a factored system for sample-based GNN training over GPUs.
Proceedings of the EuroSys '22: Seventeenth European Conference on Computer Systems, Rennes, France, April 5, 2022

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Learned Query Optimizer: At the Forefront of AI-Driven Databases.
Proceedings of the 25th International Conference on Extending Database Technology, 2022

A Unified Transferable Model for ML-Enhanced DBMS.
Proceedings of the 12th Conference on Innovative Data Systems Research, 2022

Strengthening Order Preserving Encryption with Differential Privacy.
Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security, 2022

2021
Efficient Hop-constrained s-t Simple Path Enumeration.
VLDB J., 2021

Accelerating Large-Scale Heterogeneous Interaction Graph Embedding Learning via Importance Sampling.
ACM Trans. Knowl. Discov. Data, 2021

FLAT: Fast, Lightweight and Accurate Method for Cardinality Estimation.
Proc. VLDB Endow., 2021

FlashP: An Analytical Pipeline for Real-time Forecasting of Time-Series Relational Data.
Proc. VLDB Endow., 2021

GraphScope: A One-Stop Large Graph Processing System.
Proc. VLDB Endow., 2021

Learning to be a Statistician: Learned Estimator for Number of Distinct Values.
Proc. VLDB Endow., 2021

VolcanoML: Speeding up End-to-End AutoML via Scalable Search Space Decomposition.
Proc. VLDB Endow., 2021

Federated Matrix Factorization with Privacy Guarantee.
Proc. VLDB Endow., 2021

Cardinality Estimation in DBMS: A Comprehensive Benchmark Evaluation.
Proc. VLDB Endow., 2021

Automating Incremental Graph Processing with Flexible Memoization.
Proc. VLDB Endow., 2021

GraphScope: A Unified Engine For Big Graph Processing.
Proc. VLDB Endow., 2021

Fangorn: Adaptive Execution Framework for Heterogeneous Workloads on Shared Clusters.
Proc. VLDB Endow., 2021

Baihe: SysML Framework for AI-driven Databases.
CoRR, 2021

Glue: Adaptively Merging Single Table Cardinality to Estimate Join Query Size.
CoRR, 2021

Edge-Cloud Polarization and Collaboration: A Comprehensive Survey.
CoRR, 2021

M6-10T: A Sharing-Delinking Paradigm for Efficient Multi-Trillion Parameter Pretraining.
CoRR, 2021

Exploring Sparse Expert Models and Beyond.
CoRR, 2021

Rethinking Lifelong Sequential Recommendation with Incremental Multi-Interest Attention.
CoRR, 2021

TCL: Transformer-based Dynamic Graph Modelling via Contrastive Learning.
CoRR, 2021

Contrastive Conditional Transport for Representation Learning.
CoRR, 2021

M6: A Chinese Multimodal Pretrainer.
CoRR, 2021

A Pluggable Learned Index Method via Sampling and Gap Insertion.
CoRR, 2021

Improving Search Engine Efficiency through Contextual Factor Selection.
AI Mag., 2021

Linear-Time Self Attention with Codeword Histogram for Efficient Recommendation.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Sparse-Interest Network for Sequential Recommendation.
Proceedings of the WSDM '21, 2021

Octo: INT8 Training with Loss-aware Compensation and Backward Quantization for Tiny On-device Learning.
Proceedings of the 2021 USENIX Annual Technical Conference, 2021

Incrementalizing Graph Algorithms.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

GAIA: A System for Interactive Analysis on Distributed Graphs Using a High-Level Language.
Proceedings of the 18th USENIX Symposium on Networked Systems Design and Implementation, 2021

Low-Rank Subspaces in GANs.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

UFC-BERT: Unifying Multi-Modal Controls for Conditional Image Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Large-scale Multi-Modality Pretrained Models: Applications and Experiences.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

MicroRec: Efficient Recommendation Inference by Hardware and Data Structure Solutions.
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

Contrastive Learning for Debiased Candidate Generation in Large-Scale Recommender Systems.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Device-Cloud Collaborative Learning for Recommendation.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

FIVES: Feature Interaction Via Edge Search for Large-Scale Tabular Data.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

M6: Multi-Modality-to-Multi-Modality Multitask Mega-transformer for Unified Pretraining.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

FleetRec: Large-Scale Recommendation Inference on Hybrid GPU-FPGA Clusters.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Learning to Rehearse in Long Sequence Memorization.
Proceedings of the 38th International Conference on Machine Learning, 2021

Uncertainty Principles of Encoding GANs.
Proceedings of the 38th International Conference on Machine Learning, 2021

Efficient and Scalable Structure Learning for Bayesian Networks: Algorithms and Applications.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

ATNN: Adversarial Two-Tower Neural Network for New Item's Popularity Prediction in E-commerce.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Large-scale Fake Click Detection for E-commerce Recommendation Systems.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Towards an Integral System for Processing Big Graphs at Scale.
Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021

Distributed Recommendation Inference on FPGA Clusters.
Proceedings of the 31st International Conference on Field-Programmable Logic and Applications, 2021

FlexGraph: a flexible and efficient distributed framework for GNN training.
Proceedings of the EuroSys '21: Sixteenth European Conference on Computer Systems, 2021

Sketch and Refine: Towards Faithful and Informative Table-to-Text Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Learning Relation Alignment for Calibrated Cross-modal Retrieval.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Enhancing E-commerce Recommender System Adaptability with Online Deep Controllable Learning-To-Rank.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Dynamic Memory based Attention Network for Sequential Recommendation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Efficient (α, β)-core computation in bipartite graphs.
VLDB J., 2020

Adaptive Asynchronous Parallelization of Graph Algorithms.
ACM Trans. Database Syst., 2020

Collecting and Analyzing Data Jointly from Multiple Services under Local Differential Privacy.
Proc. VLDB Endow., 2020

Tempura: A General Cost-Based Optimizer Framework for Incremental Data Processing.
Proc. VLDB Endow., 2020

Improving Utility and Security of the Shuffler-based Differential Privacy.
Proc. VLDB Endow., 2020

Maximum Biclique Search at Billion Scale.
Proc. VLDB Endow., 2020

Incrementalization of Graph Partitioning Algorithms.
Proc. VLDB Endow., 2020

Capturing Associations in Graphs.
Proc. VLDB Endow., 2020

FSPN: A New Class of Probabilistic Graphical Model.
CoRR, 2020

MicroRec: Accelerating Deep Recommendation Systems to Microseconds by Hardware and Data Structure Solutions.
CoRR, 2020

Tempura: A General Cost Based Optimizer Framework for Incremental Data Processing (Extended Version).
CoRR, 2020

Intertwining Order Preserving Encryption and Differential Privacy.
CoRR, 2020

Interactive Feature Generation via Learning Adjacency Tensor of Feature Graph.
CoRR, 2020

Contrastive Learning for Debiased Candidate Generation at Scale.
CoRR, 2020

Taming the Expressiveness and Programmability of Graph Analytical Queries.
CoRR, 2020

InterBERT: Vision-and-Language Interaction for Multi-modal Pretraining.
CoRR, 2020

Learning to Hash with Graph Neural Networks for Recommender Systems.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Grosbeak: A Data Warehouse Supporting Resource-Aware Incremental Computing.
Proceedings of the 2020 International Conference on Management of Data, 2020

Application Driven Graph Partitioning.
Proceedings of the 2020 International Conference on Management of Data, 2020

Extending Graph Patterns with Conditions.
Proceedings of the 2020 International Conference on Management of Data, 2020

Sentiment Analysis of Online Reviews with a Hierarchical Attention Network.
Proceedings of the 32nd International Conference on Software Engineering and Knowledge Engineering, 2020

Learning to Mutate with Hypergradient Guided Population.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Poet: Product-oriented Video Captioner for E-commerce.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Comprehensive Information Integration Modeling Framework for Video Titling.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Understanding Negative Sampling in Graph Representation Learning.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Linear and Range Counting under Metric-based Local Differential Privacy.
Proceedings of the IEEE International Symposium on Information Theory, 2020

AdaBERT: Task-Adaptive BERT Compression with Differentiable Neural Architecture Search.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Learning Efficient Parameter Server Synchronization Policies for Distributed SGD.
Proceedings of the 8th International Conference on Learning Representations, 2020

Inductive Granger Causal Modeling for Multivariate Time Series.
Proceedings of the 20th IEEE International Conference on Data Mining, 2020

Effective Sentiment Analysis for Multimodal Review Data on the Web.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2020

2019
Guest Editors' Introduction: Special Issue on Big Data Systems on Emerging Architectures.
IEEE Trans. Big Data, 2019

AliGraph: A Comprehensive Graph Neural Network Platform.
Proc. VLDB Endow., 2019

DPSAaS: Multi-Dimensional Data Sharing and Analytics as Services under Local Differential Privacy.
Proc. VLDB Endow., 2019

Hop-constrained s-t Simple Path Enumeration: Towards Bridging Theory and Practice.
Proc. VLDB Endow., 2019

Distributed Subgraph Matching on Timely Dataflow.
Proc. VLDB Endow., 2019

Yugong: Geo-Distributed Data and Job Placement at Scale.
Proc. VLDB Endow., 2019

Deducing Certain Fixes to Graphs.
Proc. VLDB Endow., 2019

Dynamic Scaling for Parallel Graph Computations.
Proc. VLDB Endow., 2019

Design of Algorithms under Policy-Aware Local Differential Privacy: Utility-Privacy Trade-offs.
CoRR, 2019

Practical and Robust Privacy Amplification with Multi-Party Differential Privacy.
CoRR, 2019

A Survey and Experimental Analysis of Distributed Subgraph Matching.
CoRR, 2019

Efficient (a,β)-core Computation: an Index-based Approach.
Proceedings of the World Wide Web Conference, 2019

Answering Multi-Dimensional Analytical Queries under Local Differential Privacy.
Proceedings of the 2019 International Conference on Management of Data, 2019

A Minimax Game for Instance based Selective Transfer Learning.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Is a Single Vector Enough?: Exploring Node Polysemy for Network Embedding.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Sequential Scenario-Specific Meta Learner for Online Recommendation.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Towards Knowledge-Based Personalized Product Description Generation in E-commerce.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Representation Learning for Attributed Multiplex Heterogeneous Network.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Towards self-managing cloud-scale computing platforms: Experiences and challenges.
Proceedings of the 35th IEEE International Conference on Data Engineering Workshops, 2019

Bayes EMbedding (BEM): Refining Representation by Integrating Knowledge Graphs and Behavior-specific Networks.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Cross-domain Attention Network with Wasserstein Regularizers for E-commerce Search.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018
Sort-Merge Join.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Nested Loop Join.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Join Order.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Index Join.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Hash Join.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Evaluation of Relational Operators.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Parallelizing Sequential Graph Computations.
ACM Trans. Database Syst., 2018

Real-time Constrained Cycle Detection in Large Dynamic Graphs.
Proc. VLDB Endow., 2018

PANDA: Facilitating Usable AI Development.
CoRR, 2018

Deep Graph Embedding for Ranking Optimization in E-commerce.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

2017
Big Data Analytics and Intelligence at Alibaba Cloud.
Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, 2017

2016
StreamScope: Continuous Reliable Distributed Processing of Big Data Streams.
Proceedings of the 13th USENIX Symposium on Networked Systems Design and Implementation, 2016

2015
Spotting Code Optimizations in Data-Parallel Pipelines through PeriSCOPE.
IEEE Trans. Parallel Distributed Syst., 2015

JetScope: Reliable and Interactive Analytics at Cloud Scale.
Proc. VLDB Endow., 2015

2014
Apollo: Scalable and Coordinated Scheduling for Cloud-Scale Computing.
Proceedings of the 11th USENIX Symposium on Operating Systems Design and Implementation, 2014

Large-scale L-BFGS using MapReduce.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

2013
Continuous Cloud-Scale Query Optimization and Processing.
Proc. VLDB Endow., 2013

Recurring Job Optimization for Massively Distributed Query Processing.
IEEE Data Eng. Bull., 2013

2012
SCOPE: parallel databases meet MapReduce.
VLDB J., 2012

Advanced partitioning techniques for massively distributed computation.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Scope playback: self-validation in the cloud.
Proceedings of the Fifth International Workshop on Testing Database Systems, 2012

Recurring job optimization in scope.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Spotting Code Optimizations in Data-Parallel Pipelines through PeriSCOPE.
Proceedings of the 10th USENIX Symposium on Operating Systems Design and Implementation, 2012

Optimizing Data Shuffling in Data-Parallel Computation by Understanding User-Defined Functions.
Proceedings of the 9th USENIX Symposium on Networked Systems Design and Implementation, 2012

Reoptimizing Data Parallel Computing.
Proceedings of the 9th USENIX Symposium on Networked Systems Design and Implementation, 2012

Exploiting Common Subexpressions for Cloud Query Processing.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

2010
Incorporating partitioning and parallel plans into the SCOPE optimizer.
Proceedings of the 26th International Conference on Data Engineering, 2010

2009
Sort-Merge Join.
Proceedings of the Encyclopedia of Database Systems, 2009

Nested Loop Join.
Proceedings of the Encyclopedia of Database Systems, 2009

Index Join.
Proceedings of the Encyclopedia of Database Systems, 2009

Hash Join.
Proceedings of the Encyclopedia of Database Systems, 2009

Evaluation of Relational Operators.
Proceedings of the Encyclopedia of Database Systems, 2009

Join Order.
Proceedings of the Encyclopedia of Database Systems, 2009

2008
SCOPE: easy and efficient parallel processing of massive data sets.
Proc. VLDB Endow., 2008

2007
View matching for outer-join views.
VLDB J., 2007

Lazy Maintenance of Materialized Views.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Efficient exploitation of similar subexpressions for query processing.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

Exploiting self-monitoring sample views for cardinality estimation.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

Cardinality estimation using sample views with quality assurance.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

Dynamic Materialized Views.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Efficient Maintenance of Materialized Outer-Join Views.
Proceedings of the 23rd International Conference on Data Engineering, 2007

2005
Reminiscences on influential papers.
SIGMOD Rec., 2005

Architecture Sensitive Database Design: Examples from the Columbia Group.
IEEE Data Eng. Bull., 2005

Improving Database Performance on Simultaneous Multithreading Processors.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

Stacked indexed views in microsoft SQL server.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2005

2004
MTCache: Mid-Tier Database Caching for SQL Server.
IEEE Data Eng. Bull., 2004

Buffering Database Operations for Enhanced Instruction Cache Performance.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2004

MTCache: Transparent Mid-Tier Database Caching in SQL Server.
Proceedings of the 20th International Conference on Data Engineering, 2004

2003
Buffering Accesses to Memory-Resident Index Structures.
Proceedings of 29th International Conference on Very Large Data Bases, 2003

Transparent Mid-Tier Database Caching in SQL Server.
Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, 2003

A Multi-Resolution Block Storage Model for Database Design.
Proceedings of the 7th International Database Engineering and Applications Symposium (IDEAS 2003), 2003

2002
Implementing database operations using SIMD instructions.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002


  Loading...