Sai Wu

Orcid: 0000-0002-1866-9197

According to our database, Sai Wu authored at least 126 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Predictive and Near-Optimal Sampling for View Materialization in Video Databases.
Proc. ACM Manag. Data, February, 2024

Pre-Trained Model Recommendation for Downstream Fine-tuning.
CoRR, 2024

FL-GUARD: A Holistic Framework for Run-Time Detection and Recovery of Negative Federated Learning.
CoRR, 2024

Towards Cross-Table Masked Pretraining for Web Data Mining.
Proceedings of the ACM on Web Conference 2024, 2024

Sampling-Resilient Multi-Object Tracking.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Rethinking Learned Cost Models: Why Start from Scratch?
Proc. ACM Manag. Data, December, 2023

Distributional constraint discovery for intelligent auditing.
Knowl. Inf. Syst., December, 2023

SmartLite: A DBMS-based Serving System for DNN Inference in Resource-constrained Environments.
Proc. VLDB Endow., November, 2023

DoveDB: A Declarative and Low-Latency Video Database.
Proc. VLDB Endow., 2023

A Demonstration of DLBD: Database Logic Bug Detection System.
Proc. VLDB Endow., 2023

Detecting Logic Bugs of Join Optimizations in DBMS.
Proc. ACM Manag. Data, 2023

A Survey on Mapping Semi-Structured Data and Graph Data to Relational Data.
ACM Comput. Surv., 2023

CT-BERT: Learning Better Tabular Representations Through Cross-Table Pre-training.
CoRR, 2023

Assessing Hidden Risks of LLMs: An Empirical Study on Robustness, Consistency, and Credibility.
CoRR, 2023

Controllable Textual Inversion for Personalized Text-to-Image Generation.
CoRR, 2023

GAIN: Enhancing Byzantine Robustness in Federated Learning with Gradient Decomposition.
CoRR, 2023

Textile pattern recommendations with convolutional neural networks and autoencoder.
Concurr. Comput. Pract. Exp., 2023

Quantum Computing for Databases: A Short Survey and Vision.
Proceedings of the Joint Proceedings of Workshops at the 49th International Conference on Very Large Data Bases (VLDB 2023), Vancouver, Canada, August 28, 2023

Towards Controlled Data Augmentations for Active Learning.
Proceedings of the International Conference on Machine Learning, 2023

Byzantine-Robust Learning on Heterogeneous Data via Gradient Splitting.
Proceedings of the International Conference on Machine Learning, 2023

Learning a Data-Driven Policy Network for Pre-Training Automated Feature Engineering.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

ModelGiF: Gradient Fields for Model Functional Distance.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Text Labels Classification Model based on BERT Algorithm.
Proceedings of the 4th International Conference on Big Data & Artificial Intelligence & Software Engineering, 2023

2022
Continuous Trajectory Similarity Search for Online Outlier Detection.
IEEE Trans. Knowl. Data Eng., 2022

A Stack-Centric Processing Model for Iterative Processing.
IEEE Trans. Big Data, 2022

Dynamic Index Construction with Deep Reinforcement Learning.
Data Sci. Eng., 2022

A Sampling-based Learning Framework for Big Databases.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

PreQR: Pre-training Representation for SQL Understanding.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

IceBerg: Deep Generative Modeling for Constraint Discovery and Anomaly Detection.
Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2022

Comparison Knowledge Translation for Generalizable Image Classification.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Continuous Trajectory Similarity Search for Online Outlier Detection (Extended Abstract).
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

A Comparative Study of in-Database Inference Approaches.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Towards Unifying the Label Space for Aspect- and Sentence-based Sentiment Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
NEIST: A Neural-Enhanced Index for Spatio-Temporal Queries.
IEEE Trans. Knowl. Data Eng., 2021

AQUA+: Query Optimization for Hybrid Database-MapReduce System.
Knowl. Inf. Syst., 2021

BERT-JAM: Maximizing the utilization of BERT for neural machine translation.
Neurocomputing, 2021

Joining datasets via data augmentation in the label space for neural networks.
CoRR, 2021

A critical look at the current train/test split in machine learning.
CoRR, 2021

Trade-off analysis between delay and throughput of RAN slicing for smart grid.
Comput. Commun., 2021

Metric Learning via Penalized Optimization.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Joining datasets via data augmentation in the label space for neural networks.
Proceedings of the 38th International Conference on Machine Learning, 2021

A Learning to Tune Framework for LSH.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

An Interactive NL2SQL Approach with Reuse Strategy.
Proceedings of the Database Systems for Advanced Applications, 2021

Learning to Label with Active Learning and Reinforcement Learning.
Proceedings of the Database Systems for Advanced Applications, 2021

Effective Slot Filling via Weakly-Supervised Dual-Model Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
A two-phase approach for unexpected pattern mining.
Expert Syst. Appl., 2020

LINDT: Tackling Negative Federated Learning with Local Adaptation.
CoRR, 2020

BERT-JAM: Boosting BERT-Enhanced Neural Machine Translation with Joint Attention.
CoRR, 2020

Multi-Context Attention for Entity Matching.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020

AUDITOR: A System Designed for Automatic Discovery of Complex Integrity Constraints in Relational Databases.
Proceedings of the 2020 International Conference on Management of Data, 2020

Uplink Resource Allocation Based on Short Block-Length Regime in Heterogeneous Cellular Networks for Smart Grid.
Proceedings of the Innovative Mobile and Internet Services in Ubiquitous Computing, 2020

Self-Attention and Dynamic Convolution Hybrid Model for Neural Machine Translation.
Proceedings of the 2020 IEEE International Conference on Knowledge Graph, 2020

From Code to Natural Language: Type-Aware Sketch-Based Seq2Seq Learning.
Proceedings of the Database Systems for Advanced Applications, 2020

2019
Interpretable Partitioned Embedding for Intelligent Multi-item Fashion Outfit Composition.
ACM Trans. Multim. Comput. Commun. Appl., 2019

S3: A Scalable In-memory Skip-List Index for Key-Value Store.
Proc. VLDB Endow., 2019

Information fusion in visual question answering: A Survey.
Inf. Fusion, 2019

Progressive Neural Index Search for Database System.
CoRR, 2019

Semi-Supervised Few-Shot Learning for Dual Question-Answer Extraction.
CoRR, 2019

E2E Delay Optimization for Smart Grids Mission-critical Slices in Core Networks.
Proceedings of the 11th International Conference on Wireless Communications and Signal Processing, 2019

Integration Application of 5G and Smart Grid.
Proceedings of the 11th International Conference on Wireless Communications and Signal Processing, 2019

Service-Aware Resource Allocation Based on RAN Slicing for Smart Grid.
Proceedings of the Innovative Mobile and Internet Services in Ubiquitous Computing, 2019

DBSCAN-MS: Distributed Density-Based Clustering in Metric Spaces.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

RAN Slice Strategy Based on Deep Reinforcement Learning for Smart Grid.
Proceedings of the Computing, Communications and IoT Applications, ComComAp 2019, Shenzhen, 2019

NAD: Neural Network Aided Design for Textile Pattern Generation.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018
NoSQL Stores.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

MapReduce.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

A Graph-Theoretic Fusion Framework for Unsupervised Entity Resolution.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

A Base Station Location Planning Method Based on Artificial Bee Colony Algorithm.
Proceedings of the IEEE International Conference on Communication Systems, 2018

2017
Targeted Advertising in Public Transportation Systems with Quantitative Evaluation.
ACM Trans. Inf. Syst., 2017

FlashView: An Interactive Visual Explorer for Raw Data.
Proc. VLDB Endow., 2017

CSIR4G: An effective and efficient cross-scenario image retrieval model for glasses.
Inf. Sci., 2017

Tuning the granularity of parallelism for distributed graph processing.
Distributed Parallel Databases, 2017

A New Approach to Compute CNNs for Extremely Large Images.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

2016
epiC: an extensible and scalable system for processing Big Data.
VLDB J., 2016

KISS: Knowing Camera Prototype System for Recognizing and Annotating Places-of-Interest.
IEEE Trans. Knowl. Data Eng., 2016

LazyLSH: Approximate Nearest Neighbor Search for Multiple Distance Functions with a Single Index.
Proceedings of the 2016 International Conference on Management of Data, 2016

Adaptive Logging: Optimizing Logging and Recovery Costs in Distributed In-memory Databases.
Proceedings of the 2016 International Conference on Management of Data, 2016

iGlasses: A Novel Recommendation System for Best-fit Glasses.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

Personal recommendation using deep recurrent neural networks in NetEase.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

2015
HM: A Column-Oriented MapReduce System on Hybrid Storage.
IEEE Trans. Knowl. Data Eng., 2015

Progressive online aggregation in a distributed stream system.
J. Syst. Softw., 2015

Adaptive Logging for Distributed In-memory Databases.
CoRR, 2015

The Evolvement of Big Data Systems: From the Perspective of an Information Security Application.
Big Data Res., 2015

Cross-Scenario Eyeglasses Retrieval via EGYPT Model.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

PABIRS: A data access middleware for distributed file systems.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015

Toward the New Item Problem: Context-Enhanced Event Recommendation in Event-Based Social Networks.
Proceedings of the Advances in Information Retrieval, 2015

2014
K-Anonymity for Crowdsourcing Database.
IEEE Trans. Knowl. Data Eng., 2014

BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform.
IEEE Trans. Knowl. Data Eng., 2014

Automatic Itinerary Planning for Traveling Services.
IEEE Trans. Knowl. Data Eng., 2014

yzBigData: Provisioning Customizable Solution for Big Data.
Proc. VLDB Endow., 2014

ScalaGiST: Scalable Generalized Search Trees for MapReduce Systems [Innovative Systems Paper].
Proc. VLDB Endow., 2014

epiC: an Extensible and Scalable System for Processing Big Data.
Proc. VLDB Endow., 2014

Real-time recommendation for microblogs.
Inf. Sci., 2014

kNN processing with co-space distance in SoLoMo systems.
Expert Syst. Appl., 2014

Distributed data management using MapReduce.
ACM Comput. Surv., 2014

The knowing camera 2: recognizing and annotating places-of-interest in smartphone photos.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

2013
Online Aggregation.
Proceedings of the Advanced Query Processing, Volume 1: Issues and Trends, 2013

Supporting Efficient Social Media Search in Cyber-Physical Web.
IEEE Data Eng. Bull., 2013

The knowing camera: recognizing places-of-interest in smartphone photos.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

Crowd-answering system via microblogging.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

An efficient and compact indexing scheme for large-scale data store.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

2012
CDAS: A Crowdsourcing Data Analytics System.
Proc. VLDB Endow., 2012

E3: an Elastic Execution Engine for Scalable Data Processing.
J. Inf. Process., 2012

Cross Domain Search by Exploiting Wikipedia.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Parallelizing stateful operators in a distributed stream processing system: how, should you and how much?
Proceedings of the Sixth ACM International Conference on Distributed Event-Based Systems, 2012

2011
Approximate Aggregations in Structured P2P Networks.
IEEE Trans. Knowl. Data Eng., 2011

A Framework for Supporting DBMS-like Indexes in the Cloud.
Proc. VLDB Endow., 2011

Llama: leveraging columnar storage for scalable join processing in the MapReduce framework.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

TI: an efficient indexing mechanism for real-time search on tweets.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

ES²: A cloud data storage system for supporting both OLTP and OLAP.
Proceedings of the 27th International Conference on Data Engineering, 2011

Query optimization for massively parallel data processing.
Proceedings of the ACM Symposium on Cloud Computing in conjunction with SOSP 2011, 2011

2010
Efficient B-tree Based Indexing for Cloud Data Processing.
Proc. VLDB Endow., 2010

The Performance of MapReduce: An In-depth Study.
Proc. VLDB Endow., 2010

Providing Scalable Database Services on the Cloud.
Proceedings of the Web Information Systems Engineering - WISE 2010, 2010

Continuous sampling for online aggregation over multiple queries.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Indexing multi-dimensional data in a cloud system.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

2009
Distributed Online Aggregation.
Proc. VLDB Endow., 2009

An Indexing Framework for Efficient Retrieval on the Cloud.
IEEE Data Eng. Bull., 2009

Adaptive Multi-join Query Processing in PDBMS.
Proceedings of the 25th International Conference on Data Engineering, 2009

SiMPSON: Efficient Similarity Search in Metric Spaces over P2P Structured Overlay Networks.
Proceedings of the Euro-Par 2009 Parallel Processing, 2009

2008
Just-in-time query retrieval over partially indexed data on structured P2P overlays.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

Skyline-join in distributed databases.
Proceedings of the 24th International Conference on Data Engineering Workshops, 2008

2004
PKUSpace: A Collaborative Platform for Scientific Researching.
Proceedings of the Advances in Web-Based Learning, 2004

WebGIS-RBDL - A Rare Book Digital Library Supporting Spatio-Temporary Retrieval.
Proceedings of the Digital Libraries: International Collaboration and Cross-Fertilization, 2004
