Guoliang Li

Orcid: 0000-0002-1398-0621

Affiliations:
  • Tsinghua University, Department of Computer Science, TNList, Beijing, China


According to our database1, Guoliang Li authored at least 359 papers between 2005 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Automatic Index Tuning: A Survey.
IEEE Trans. Knowl. Data Eng., December, 2024

Cloud-Native Databases: A Survey.
IEEE Trans. Knowl. Data Eng., December, 2024

HTAP Databases: A Survey.
IEEE Trans. Knowl. Data Eng., November, 2024

Fast and Scalable Ridesharing Search.
IEEE Trans. Knowl. Data Eng., November, 2024

Stochastic gradient descent without full data shuffle: with applications to in-database machine learning and deep learning systems.
VLDB J., September, 2024

Survey of vector database management systems.
VLDB J., September, 2024

Chat2Data: An Interactive Data Analysis System with RAG, Vector Databases and LLMs.
Proc. VLDB Endow., August, 2024

Vector Databases: What's Really New and What's Next?
Proc. VLDB Endow., August, 2024

LLM for Data Management.
Proc. VLDB Endow., August, 2024

GaussDB: A Cloud-Native Multi-Primary Database with Compute-Memory-Storage Disaggregation.
Proc. VLDB Endow., August, 2024

HAIChart: Human and AI Paired Visualization System.
Proc. VLDB Endow., July, 2024

The Dawn of Natural Language to SQL: Are We Fully Ready? [Experiment, Analysis \u0026 Benchmark ].
Proc. VLDB Endow., July, 2024

Combining Small Language Models and Large Language Models for Zero-Shot NL2SQL.
Proc. VLDB Endow., July, 2024

G-Learned Index: Enabling Efficient Learned Index on GPU.
IEEE Trans. Parallel Distributed Syst., June, 2024

Breaking It Down: An In-depth Study of Index Advisors.
Proc. VLDB Endow., June, 2024

D-Bot: Database Diagnosis System using Large Language Models.
Proc. VLDB Endow., June, 2024

Contributions Estimation in Federated Learning: A Comprehensive Experimental Evaluation.
Proc. VLDB Endow., April, 2024

Unicorn: A Unified Multi-Tasking Matching Model.
SIGMOD Rec., March, 2024

Nuhuo: An Effective Estimation Model for Traffic Speed Histogram Imputation on A Road Network.
Proc. VLDB Endow., March, 2024

Leveraging Dynamic and Heterogeneous Workload Knowledge to Boost the Performance of Index Advisors.
Proc. VLDB Endow., March, 2024

DB-GPT: Large Language Model Meets Database.
Data Sci. Eng., March, 2024

Robustness of Updatable Learning-based Index Advisors against Poisoning Attack.
Proc. ACM Manag. Data, February, 2024

PACE: Poisoning Attacks on Learned Cardinality Estimation.
Proc. ACM Manag. Data, February, 2024

Controllable Tabular Data Synthesis Using Diffusion Models.
Proc. ACM Manag. Data, February, 2024

RITA: Group Attention is All You Need for Timeseries Analytics.
Proc. ACM Manag. Data, February, 2024

HyBench: A New Benchmark for HTAP Databases.
Proc. VLDB Endow., January, 2024

Tabular data synthesis with generative adversarial networks: design space and optimizations.
VLDB J., 2024

Cardinality estimation using normalizing flow.
VLDB J., 2024

Revolutionizing Database Q&A with Large Language Models: Comprehensive Benchmark and Evaluation.
CoRR, 2024

A Survey of NL2SQL with Large Language Models: Where are we, and where are we going?
CoRR, 2024

The Dawn of Natural Language to SQL: Are We Fully Ready?
CoRR, 2024

TablePuppet: A Generic Framework for Relational Federated Learning.
CoRR, 2024

LLM-Enhanced Data Management.
CoRR, 2024

Vector Database Management Techniques and Systems.
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024

TRAP: Tailored Robustness Assessment for Index Advisors via Adversarial Perturbation.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

Fast, Robust and Interpretable Participant Contribution Estimation for Federated Learning.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

Separation Is for Better Reunion: Data Lake Storage at Huawei.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

GaussML: An End-to-End In-Database Machine Learning System.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

A CXL- Powered Database System: Opportunities and Challenges.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

Cost-Effective In-Context Learning for Entity Resolution: A Design Space Exploration.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

2023
Automatic Database Knob Tuning: A Survey.
IEEE Trans. Knowl. Data Eng., December, 2023

HOFD: An Outdated Fact Detector for Knowledge Bases.
IEEE Trans. Knowl. Data Eng., October, 2023

From Large Language Models to Databases and Back: A Discussion on Research and Education.
SIGMOD Rec., September, 2023

Road-Aware Indexing for Trajectory Range Queries.
IEEE Trans. Knowl. Data Eng., August, 2023

$\mathtt {AutoView}$: An Autonomous Materialized View Management System With Encoder-Reducer.
IEEE Trans. Knowl. Data Eng., June, 2023

Data Management for Machine Learning: A Survey.
IEEE Trans. Knowl. Data Eng., May, 2023

A Learned Query Rewrite System.
Proc. VLDB Endow., 2023

FEBench: A Benchmark for Real-Time Relational Data Feature Extraction.
Proc. VLDB Endow., 2023

Learned Index: A Comprehensive Experimental Evaluation.
Proc. VLDB Endow., 2023

GeoGauss: Strongly Consistent and Light-Coordinated OLTP for Geo-Replicated SQL Database.
Proc. ACM Manag. Data, 2023

Grep: A Graph Learning Based Database Partitioning System.
Proc. ACM Manag. Data, 2023

Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration.
Proc. ACM Manag. Data, 2023

Learned Data-aware Image Representations of Line Charts for Similarity Search.
Proc. ACM Manag. Data, 2023

HAIPipe: Combining Human-generated and Machine-generated Pipelines for Data Preparation.
Proc. ACM Manag. Data, 2023

GoodCore: Data-effective and Data-efficient Machine Learning through Coreset Selection over Incomplete Data.
Proc. ACM Manag. Data, 2023

Towards Transaction as a Service.
CoRR, 2023

LLM As DBA.
CoRR, 2023

Interleaving Pre-Trained Language Models and Large Language Models for Zero-Shot NL2SQL Generation.
CoRR, 2023

Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs.
CoRR, 2023

Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Database Meets Artificial Intelligence: A Survey (Extended Abstract).
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

AutoCE: An Accurate and Efficient Model Advisor for Learned Cardinality Estimation.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

DBAugur: An Adversarial-based Trend Forecasting System for Diversified Workloads.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

A Topic-Aware Data Generation Framework for Math Word Problems.
Proceedings of the Database Systems for Advanced Applications, 2023

ACR-Tree: Constructing R-Trees Using Deep Reinforcement Learning.
Proceedings of the Database Systems for Advanced Applications, 2023

Demonstration of ViTA: Visualizing, Testing and Analyzing Index Advisors.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022
RNE: computing shortest paths using road network embedding.
VLDB J., 2022

Interactively discovering and ranking desired tuples by data exploration.
VLDB J., 2022

Adapative algorithms for crowd-aided categorization.
VLDB J., 2022

Natural Language to Visualization by Neural Machine Translation.
IEEE Trans. Vis. Comput. Graph., 2022

Database Meets Artificial Intelligence: A Survey.
IEEE Trans. Knowl. Data Eng., 2022

Representation Learning From Limited Educational Data With Crowdsourced Labels.
IEEE Trans. Knowl. Data Eng., 2022

Evaluating Public Anxiety for Topic-Based Communities in Social Networks.
IEEE Trans. Knowl. Data Eng., 2022

Steerable Self-Driving Data Visualization.
IEEE Trans. Knowl. Data Eng., 2022

GPSC: A Grid-Based Privacy-Reserving Framework for Online Spatial Crowdsourcing.
IEEE Trans. Knowl. Data Eng., 2022

Cost-based or Learning-based? A Hybrid Query Optimizer for Query Plan Selection.
Proc. VLDB Endow., 2022

Coresets over Multiple Tables for Feature-rich and Data-efficient Machine Learning.
Proc. VLDB Endow., 2022

A Deep Generative Model for Trajectory Modeling and Utilization.
Proc. VLDB Endow., 2022

DADER: Hands-Off Entity Resolution with Domain Adaptation.
Proc. VLDB Endow., 2022

Cloud Databases: New Techniques, Challenges, and Opportunities.
Proc. VLDB Endow., 2022

Selective Data Acquisition in the Wild for Model Charging.
Proc. VLDB Endow., 2022

Route Travel Time Estimation on A Road Network Revisited: Heterogeneity, Proximity, Periodicity and Dynamicity.
Proc. VLDB Endow., 2022

Preface.
J. Comput. Sci. Technol., 2022

AlphaQO: Robust Learned Query Optimizer.
Int. J. Softw. Informatics, 2022

Preface to Special Issue on New Technologies of Database Systems.
Int. J. Softw. Informatics, 2022

Stochastic Gradient Descent without Full Data Shuffle.
CoRR, 2022

Perspectives from the second Global Forum on Development of Computer Science.
Sci. China Inf. Sci., 2022

LearnedSQLGen: Constraint-aware SQL Generation using Reinforcement Learning.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

In-Database Machine Learning with CorgiPile: Stochastic Gradient Descent without Full Data Shuffle.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Domain Adaptation for Deep Entity Resolution.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Sevi: Speech-to-Visualization through Neural Machine Translation.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Unsupervised Contextual Anomaly Detection for Database Systems.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

HTAP Databases: What is New and What is Next.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Synthesizing Privacy Preserving Entity Resolution Datasets.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Adaptive Code Learning for Spark Configuration Tuning.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Machine Learning for Data Management: A System View.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Message from Data Science and Systems 2022 General Chairs.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

Learned Query Optimizer: At the Forefront of AI-Driven Databases.
Proceedings of the 25th International Conference on Extending Database Technology, 2022

2021
$\hbox {CDBTune}^{+}$: An efficient deep reinforcement learning-based automatic cloud database tuning system.
VLDB J., 2021

Mis-categorized entities detection.
VLDB J., 2021

CrowdChart: Crowdsourced Data Extraction From Visualization Charts.
IEEE Trans. Knowl. Data Eng., 2021

A Learned Query Rewrite System using Monte Carlo Tree Search.
Proc. VLDB Endow., 2021

FACE: A Normalizing Flow based Cardinality Estimator.
Proc. VLDB Endow., 2021

RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation.
Proc. VLDB Endow., 2021

Learned Cardinality Estimation: A Design Space Exploration and A Comparative Evaluation.
Proc. VLDB Endow., 2021

Adaptive Data Augmentation for Supervised Learning over Missing Data.
Proc. VLDB Endow., 2021

ATLANTIC: Making Database Differentially Private and Faster with Accuracy Guarantee.
Proc. VLDB Endow., 2021

openGauss: An Autonomous Database System.
Proc. VLDB Endow., 2021

Machine Learning for Databases.
Proc. VLDB Endow., 2021

Preface.
J. Comput. Sci. Technol., 2021

A Survey of Traffic Prediction: from Spatio-Temporal Data to Intelligent Transportation.
Data Sci. Eng., 2021

nvBench: A Large-Scale Synthesized Dataset for Cross-Domain Natural Language to Visualization Task.
CoRR, 2021

Learned Cardinality Estimation for Similarity Queries.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Synthesizing Natural Language to Visualization (NL2VIS) Benchmarks from NL2SQL Benchmarks.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

AI Meets Database: AI4DB and DB4AI.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Ranking Desired Tuples by Database Exploration.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

CrowdRL: An End-to-End Reinforcement Learning Framework for Data Labelling.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

A Learning-based Method for Computing Shortest Path Distances on Road Networks.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

An Autonomous Materialized View Management System with Deep Reinforcement Learning.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

An Effective Joint Prediction Model for Travel Demands and Traffic Flows.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

2020
A framework for image dark data assessment.
World Wide Web, 2020

A game-based framework for crowdsourced data labeling.
VLDB J., 2020

Making data visualization more efficient and effective: a survey.
VLDB J., 2020

A Crowdsourcing Framework for Collecting Tabular Data.
IEEE Trans. Knowl. Data Eng., 2020

Towards an Optimal Outdoor Advertising Placement: When a Budget Constraint Meets Moving Trajectories.
ACM Trans. Knowl. Discov. Data, 2020

PriRadar: A Privacy-Preserving Framework for Spatial Crowdsourcing.
IEEE Trans. Inf. Forensics Secur., 2020

Query Performance Prediction for Concurrent Queries using Graph Embedding.
Proc. VLDB Endow., 2020

Evaluating Ridesharing Algorithms using the Jargo Real-Time Stochastic Simulator.
Proc. VLDB Endow., 2020

DeepTrack: Monitoring and Exploring Spatio-Temporal Data - A Case of Tracking COVID-19 -.
Proc. VLDB Endow., 2020

VisClean: Interactive Cleaning for Progressive Visualization.
Proc. VLDB Endow., 2020

Efficient Algorithms for Crowd-Aided Categorization.
Proc. VLDB Endow., 2020

Relational Data Synthesis using Generative Adversarial Networks: A Design Space Exploration.
Proc. VLDB Endow., 2020

DeepEye: A Data Science System for Monitoring and Exploring COVID-19 Data.
IEEE Data Eng. Bull., 2020

Human-in-the-loop Techniques in Machine Learning.
IEEE Data Eng. Bull., 2020

Relational Pretrained Transformers towards Democratizing Data Preparation [Vision].
CoRR, 2020

Designing a double auction mechanism for the re-allocation of emission permits.
Ann. Oper. Res., 2020

An Active Learning Based Hybrid Neural Network for Joint Information Extraction.
Proceedings of the Web Information Systems Engineering - WISE 2020, 2020

Effective Travel Time Estimation: When Historical Trajectories over Road Networks Matter.
Proceedings of the 2020 International Conference on Management of Data, 2020

Interactively Discovering and Ranking Desired Tuples without Writing SQL Queries.
Proceedings of the 2020 International Conference on Management of Data, 2020

Towards Interpretable and Learnable Risk Analysis for Entity Resolution.
Proceedings of the 2020 International Conference on Management of Data, 2020

Human-in-the-loop Outlier Detection.
Proceedings of the 2020 International Conference on Management of Data, 2020

Automatic Detection of Solar Radio Spectrum Based on Codebook Model.
Proceedings of the Data Science, 2020

Automatic View Generation with Deep Learning and Reinforcement Learning.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Reinforcement Learning with Tree-LSTM for Join Order Selection.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

An End-to-End Deep RL Framework for Task Arrangement in Crowdsourcing Platforms.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Interactive Cleaning for Progressive Visualization through Composite Questions.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Temporal Network Representation Learning via Historical Neighborhoods Aggregation.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Outdated Fact Detection in Knowledge Bases.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Crowdsourcing-based Data Extraction from Visualization Charts.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020


Neural Multi-task Learning for Teacher Question Detection in Online Classrooms.
Proceedings of the Artificial Intelligence in Education - 21st International Conference, 2020

2019
Bounded Approximate Query Processing.
IEEE Trans. Knowl. Data Eng., 2019

Querying Shortest Paths on Time Dependent Road Networks.
Proc. VLDB Endow., 2019

Balance-Aware Distributed String Similarity-Based Query Processing System.
Proc. VLDB Endow., 2019

An End-to-End Learning-based Cost Estimator.
Proc. VLDB Endow., 2019

Ridesharing: Simulator, Benchmark, and Evaluation.
Proc. VLDB Endow., 2019

QTune: A Query-Aware Database Tuning System with Deep Reinforcement Learning.
Proc. VLDB Endow., 2019

Location and Trajectory Identification from Microblogs.
J. Comput. Sci. Technol., 2019

XuanYuan: An AI-Native Database.
IEEE Data Eng. Bull., 2019

Letter from the Special Issue Editor.
IEEE Data Eng. Bull., 2019

Special Issue of DASFAA 2019.
Data Sci. Eng., 2019

Towards Interpretable and Learnable Risk Analysis for Entity Resolution.
CoRR, 2019

Technical Report: Optimizing Human Involvement for Entity Matching and Consolidation.
CoRR, 2019

An End-to-End Automatic Cloud Database Tuning System Using Deep Reinforcement Learning.
Proceedings of the 2019 International Conference on Management of Data, 2019

Interactive Graph Search.
Proceedings of the 2019 International Conference on Management of Data, 2019

Towards Democratizing Relational Data Visualization.
Proceedings of the 2019 International Conference on Management of Data, 2019

CrowdGame: A Game-Based Crowdsourcing System for Cost-Effective Data Labeling.
Proceedings of the 2019 International Conference on Management of Data, 2019

Ratel: Interactive Analytics for Large Scale Trajectories.
Proceedings of the 2019 International Conference on Management of Data, 2019

Distributed In-memory Trajectory Similarity Search and Join on Road Network.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

ServeDB: Secure, Verifiable, and Efficient Range Queries on Outsourced Database.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Unsupervised String Transformation Learning for Entity Consolidation.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Crowdsourcing Database Systems: Overview and Challenges.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Data Management at Huawei: Recent Accomplishments and Future Challenges.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

VizNet: Towards A Large-Scale Visualization Learning and Benchmarking Repository.
Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, 2019

2018
Distilling relations using knowledge bases.
VLDB J., 2018

A partial-order-based framework for cost-effective crowdsourced entity resolution.
VLDB J., 2018

Privacy-Preserving Top-k Spatial Keyword Queries in Untrusted Cloud Environments.
IEEE Trans. Serv. Comput., 2018

An Efficient Ride-Sharing Framework for Maximizing Shared Route.
IEEE Trans. Knowl. Data Eng., 2018

Cost-Effective Data Annotation using Game-Based Crowdsourcing.
Proc. VLDB Endow., 2018

TOAIN: A Throughput Optimizing Adaptive Index for Answering Dynamic kNN Queries on Road Networks.
Proc. VLDB Endow., 2018

CDB: A Crowd-Powered Database System.
Proc. VLDB Endow., 2018

Context-aware result inference in crowdsourcing.
Inf. Sci., 2018

Human-in-the-loop Rule Learning for Data Integration.
IEEE Data Eng. Bull., 2018

Approximate Query Processing: What is New and Where to Go? - A Survey on Approximate Query Processing.
Data Sci. Eng., 2018

Crowd-Powered Data Mining.
CoRR, 2018

PANDA: Facilitating Usable AI Development.
CoRR, 2018

DeepEye: An automatic big data visualization framework.
Big Data Min. Anal., 2018

DITA: A Distributed In-Memory Trajectory Analytics System.
Proceedings of the 2018 International Conference on Management of Data, 2018

DITA: Distributed In-Memory Trajectory Analytics.
Proceedings of the 2018 International Conference on Management of Data, 2018

DeepEye: Creating Good Data Visualizations by Keyword Search.
Proceedings of the 2018 International Conference on Management of Data, 2018

A Rating-Ranking Method for Crowdsourced Top-k Computation.
Proceedings of the 2018 International Conference on Management of Data, 2018

Overlap Set Similarity Joins with Theoretical Guarantees.
Proceedings of the 2018 International Conference on Management of Data, 2018

Trajectory-driven Influential Billboard Placement.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

CrowdOTA: An Online Task Assignment System in Crowdsourcing.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

An Efficient Ride-Sharing Framework for Maximizing Shared Routes.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

T-Crowd: Effective Crowdsourcing for Tabular Data.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

DeepEye: Towards Automatic Data Visualization.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Cleaning Your Wrong Google Scholar Entries.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Discovering Mis-Categorized Entities.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

OCTOPUS: An Online Topic-Aware Influence Analysis System for Social Networks.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Incentive-Based Entity Collection Using Crowdsourcing.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

DeepEye: Visualizing Your Data by Keyword Search.
Proceedings of the 21st International Conference on Extending Database Technology, 2018

Web Table Understanding by Collective Inference.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Map Matching Algorithms: An Experimental Evaluation.
Proceedings of the Web and Big Data - Second International Joint Conference, 2018

2017
A unified framework for string similarity search with edit-distance constraint.
VLDB J., 2017

Signature-Based Trajectory Similarity Join.
IEEE Trans. Knowl. Data Eng., 2017

A Novel Cost-Based Model for Data Repairing.
IEEE Trans. Knowl. Data Eng., 2017

Truth Inference in Crowdsourcing: Is the Problem Solved?
Proc. VLDB Endow., 2017

Dima: A Distributed In-Memory Similarity-Based Query Processing System.
Proc. VLDB Endow., 2017

Human-in-the-loop Data Integration.
Proc. VLDB Endow., 2017

Thinking and Modeling for Big Data from the Perspective of the I Ching.
Int. J. Inf. Technol. Decis. Mak., 2017

Towards a Unified Graph Model for Supporting Data Management and Usable Machine Learning.
IEEE Data Eng. Bull., 2017

Crowdsourced Entity Alignment: A Decision Theory Based Approach.
Proceedings of the Web Information Systems Engineering - WISE 2017, 2017

Crowdsourced Data Management: Overview and Challenges.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

CDB: Optimizing Queries with Crowd-Based Selections and Joins.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

V-Tree: Efficient kNN Search on Moving Objects with Road-Network Constraints.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

Cleaning Relations Using Knowledge Bases.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

Hike: A Hybrid Human-Machine Method for Entity Alignment in Large-Scale Knowledge Bases.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Sybil Defense in Crowdsourcing Platforms.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Crowdsourced Selection on Multi-Attribute Data.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

2016
K-Join: Knowledge-Aware Similarity Join.
IEEE Trans. Knowl. Data Eng., 2016

Crowdsourced Data Management: A Survey.
IEEE Trans. Knowl. Data Eng., 2016

Top-k Spatio-Textual Similarity Join.
IEEE Trans. Knowl. Data Eng., 2016

DOCS: Domain-Aware Crowdsourcing System.
Proc. VLDB Endow., 2016

Crowdsourced Top-k Algorithms: An Experimental Evaluation.
Proc. VLDB Endow., 2016

META: An Efficient Matching-Based Method for Error-Tolerant Autocompletion.
Proc. VLDB Endow., 2016

String similarity search and join: a survey.
Frontiers Comput. Sci., 2016

Preface.
Big Data Res., 2016

ELAN: An Efficient Location-Aware Analytics System.
Big Data Res., 2016

Semantic-Aware Trajectory Compression with Urban Road Network.
Proceedings of the Web-Age Information Management - 17th International Conference, 2016

Interactive and Deterministic Data Cleaning.
Proceedings of the 2016 International Conference on Management of Data, 2016

Cost-Effective Crowdsourced Entity Resolution: A Partial-Order Approach.
Proceedings of the 2016 International Conference on Management of Data, 2016

Crowdsourced POI labelling: Location-aware result inference and Task Assignment.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

Crowdsourcing-based real-time urban traffic speed estimation: From trends to speeds.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

NXgraph: An efficient graph processing system on a single machine.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

PBA: Partition and Blocking Based Alignment for Large Knowledge Bases.
Proceedings of the Database Systems for Advanced Applications, 2016

CrowdAidRepair: A Crowd-Aided Interactive Data Repairing Method.
Proceedings of the Database Systems for Advanced Applications, 2016

Effective Result Inference for Context-Sensitive Tasks in Crowdsourcing.
Proceedings of the Database Systems for Advanced Applications, 2016

2015
A unified framework for approximate dictionary-based entity extraction.
VLDB J., 2015

A general framework to resolve the MisMatch problem in XML keyword search.
VLDB J., 2015

G-Tree: An Efficient and Scalable Index for Spatial Search on Road Networks.
IEEE Trans. Knowl. Data Eng., 2015

Efficient Filtering Algorithms for Location-Aware Publish/Subscribe.
IEEE Trans. Knowl. Data Eng., 2015

An Efficient Partition Based Method for Exact Set Similarity Joins.
Proc. VLDB Endow., 2015

Online Topic-Aware Influence Maximization.
Proc. VLDB Endow., 2015

RFP: A Remote Fetching Paradigm for RDMA-Accelerated Systems.
CoRR, 2015

QASCA: A Quality-Aware Task Assignment System for Crowdsourcing Applications.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

Efficient Similarity Join and Search on Multi-Attribute Data.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

Location-Aware Pub/Sub System: When Continuous Moving Queries Meet Dynamic Event Streams.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

iCrowd: An Adaptive Crowdsourcing Framework.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

Two birds with one stone: An efficient hierarchical framework for top-k and threshold-based string similarity search.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015

Real time personalized search on social networks.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015

A location-aware publish/subscribe framework for parameterized spatio-textual subscriptions.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015

Elaps: An efficient location-aware pub/sub system.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015

A self-aware data compression system on FPGA in Hadoop.
Proceedings of the 2015 International Conference on Field Programmable Technology, 2015

Privacy-Preserving Top-k Spatial Keyword Queries over Outsourced Database.
Proceedings of the Database Systems for Advanced Applications, 2015

A Cost-based Method for Location-Aware Publish/Subscribe Services.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

2014
Extending string similarity join to tolerant fuzzy token matching.
ACM Trans. Database Syst., 2014

Adaptive Database Schema Design for Multi-Tenant Data Management.
IEEE Trans. Knowl. Data Eng., 2014

A Prefix-Filter based Method for Spatio-Textual Similarity Join.
IEEE Trans. Knowl. Data Eng., 2014

\(\mathsf{B}^{p}\) - \(\mathsf{Tree}\) : A Predictive \(\mathsf{B}^{+}\) - \(\mathsf{Tree}\) for Reducing Writes on Phase Change Memory.
IEEE Trans. Knowl. Data Eng., 2014

R3: A Real-Time Route Recommendation System.
Proc. VLDB Endow., 2014

Efficient Top-K SimRank-based Similarity Join.
Proc. VLDB Endow., 2014

String Similarity Joins: An Experimental Evaluation.
Proc. VLDB Endow., 2014

A survey of address translation technologies for flash memories.
ACM Comput. Surv., 2014

The Expected Optimal Labeling Order Problem for Crowdsourced Joins and Entity Resolution.
CoRR, 2014

Efficient top-K SimRank-based similarity join.
Proceedings of the International Conference on Management of Data, 2014

Efficient location-aware influence maximization.
Proceedings of the International Conference on Management of Data, 2014

A pivotal prefix based filtering algorithm for string similarity search.
Proceedings of the International Conference on Management of Data, 2014

Breaking out of the MisMatch trap.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

Effective location identification from microblogs.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

MassJoin: A mapreduce-based method for scalable string similarity joins.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

Incremental Quality Inference in Crowdsourcing.
Proceedings of the Database Systems for Advanced Applications, 2014

Group-Based Personalized Location Recommendation on Social Networks.
Proceedings of the Web Technologies and Applications - 16th Asia-Pacific Web Conference, 2014

2013
A partition-based method for string similarity joins with edit-distance constraints.
ACM Trans. Database Syst., 2013

Supporting Search-As-You-Type Using SQL in Databases.
IEEE Trans. Knowl. Data Eng., 2013

A User-Friendly Patent Search Paradigm.
IEEE Trans. Knowl. Data Eng., 2013

Scalable Column Concept Determination for Web Tables Using Large Knowledge Bases.
Proc. VLDB Endow., 2013

A Human-Machine Method for Web Table Understanding.
Proceedings of the Web-Age Information Management - 14th International Conference, 2013

Leveraging transitive relations for crowdsourced joins.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

TsingNUS: a location-based service system towards live city.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Removing the mismatch headache in XML keyword search.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

Location-aware publish/subscribe.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Top-k string similarity search with edit-distance constraints.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

Efficient parallel partition-based algorithms for similarity search and join with edit distance constraints.
Proceedings of the Joint 2013 EDBT/ICDT Conferences, 2013

Efficient XML Keyword Search: From Graph Model to Tree Model.
Proceedings of the Database and Expert Systems Applications, 2013

G-tree: an efficient index for KNN search on road networks.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Exploring XML data is as easy as using maps.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Fast Similar Subgraph Search with Maximum Common Connected Subgraph Constraints.
Proceedings of the IEEE International Congress on Big Data, 2013

2012
Trie-join: a trie-based method for efficient string similarity joins.
VLDB J., 2012

Efficient Fuzzy Type-Ahead Search in XML Data.
IEEE Trans. Knowl. Data Eng., 2012

SEAL: Spatio-Textual Similarity Search.
Proc. VLDB Endow., 2012

Can we beat the prefix filtering?: an adaptive framework for similarity join and search.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Supporting efficient top-k queries in type-ahead search.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

DESKS: Direction-Aware Spatial Keyword Search.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

An Efficient Trie-based Method for Approximate Entity Extraction with Edit-Distance Constraints.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Location-aware instant search.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Adapt: adaptive database schema design for multi-tenant applications.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

A positional access method for relational databases.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Star-Join: spatio-textual similarity join.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Keyword-based k-nearest neighbor search in spatial databases.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Efficient safe-region construction for moving top-K spatial keyword queries.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Efficient fuzzy full-text type-ahead search.
VLDB J., 2011

Providing built-in keyword search capabilities in RDBMS.
VLDB J., 2011

KEMB: A Keyword-Based XML Message Broker.
IEEE Trans. Knowl. Data Eng., 2011

Finding Top-k Answers in Keyword Search over Relational Databases Using Tuple Units.
IEEE Trans. Knowl. Data Eng., 2011

Entity Matching: How Similar Is Similar.
Proc. VLDB Endow., 2011

PASS-JOIN: A Partition-based Method for Similarity Joins.
Proc. VLDB Endow., 2011

An effective 3-in-1 keyword search method over heterogeneous data sources.
Inf. Syst., 2011

LazyFTL: a page-level flash translation layer optimized for NAND flash memory.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

Faerie: efficient filtering algorithms for approximate dictionary-based entity extraction.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

Efficient Algorithms for Top-k Keyword Queries on Spatial Databases.
Proceedings of the 12th IEEE International Conference on Mobile Data Management, 2011

Fast-join: An efficient method for fuzzy token matching based string similarity join.
Proceedings of the 27th International Conference on Data Engineering, 2011

Interactive SQL query suggestion: Making databases user-friendly.
Proceedings of the 27th International Conference on Data Engineering, 2011

An Effective Approach for Searching Closest Sentence Translations from the Web.
Proceedings of the Database Systems for Advanced Applications, 2011

DBease: Making Databases User-Friendly and Easily Accessible.
Proceedings of the Fifth Biennial Conference on Innovative Data Systems Research, 2011

2010
Seaform: Search-As-You-Type in Forms.
Proc. VLDB Endow., 2010

Trie-Join: Efficient Trie-based String Similarity Joins with Edit-Distance Constraints.
Proc. VLDB Endow., 2010

An Effective Semantic Cache for Exploiting XPath Query/View Answerability.
J. Comput. Sci. Technol., 2010

Finding and ranking compact connected trees for effective keyword proximity search in XML documents.
Inf. Syst., 2010

Search-As-You-Type: Opportunities and Challenges.
IEEE Data Eng. Bull., 2010

Interactive and fuzzy search: a dynamic way to explore MEDLINE.
Bioinform., 2010

Efficient fuzzy type-ahead search in TASTIER.
Proceedings of the 26th International Conference on Data Engineering, 2010

An Efficient Parallel PathStack Algorithm for Processing XML Twig Queries on Multi-core Systems.
Proceedings of the Database Systems for Advanced Applications, 2010

Extending dictionary-based entity extraction to tolerate errors.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Suggesting Topic-Based Query Terms as You Type.
Proceedings of the Advances in Web Technologies and Applications, 2010

2009
SAIL: Structure-aware indexing for effective and progressive top-k keyword search over XML documents.
Inf. Sci., 2009

Incremental sequence-based frequent query pattern mining from XML queries.
Data Min. Knowl. Discov., 2009

Interactive search in XML data.
Proceedings of the 18th International Conference on World Wide Web, 2009

Efficient interactive fuzzy keyword search.
Proceedings of the 18th International Conference on World Wide Web, 2009

Efficient type-ahead search on relational data: a TASTIER approach.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2009

Automatic URL completion and prediction using fuzzy type-ahead search.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Progressive Keyword Search in Relational Databases.
Proceedings of the 25th International Conference on Data Engineering, 2009

Supporting Database Applications as a Service.
Proceedings of the 25th International Conference on Data Engineering, 2009

Effective Fuzzy Keyword Search over Uncertain Data.
Proceedings of the Database Systems for Advanced Applications, 2009

Structure-aware indexing for keyword search in databases.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
An effective and versatile keyword search engine on heterogenous data sources.
Proc. VLDB Endow., 2008

A Semantic Cache Framework for Secure XML Queries.
J. Comput. Sci. Technol., 2008

Race: finding and ranking compact connected trees for keyword proximity search over xml documents.
Proceedings of the 17th International Conference on World Wide Web, 2008

Sailer: an effective search engine for unified retrieval of heterogeneous xml and web documents.
Proceedings of the 17th International Conference on World Wide Web, 2008

Efficient vectorial operators for processing xml twig queries.
Proceedings of the 17th International Conference on World Wide Web, 2008

LCA-Based Keyword Search for Effectively Retrieving "Information Unit" from Web Pages.
Proceedings of the Ninth International Conference on Web-Age Information Management, 2008

Effective Indices for Efficient Approximate String Search and Similarity Join.
Proceedings of the Ninth International Conference on Web-Age Information Management, 2008

Parallel Structural Join Algorithm on Shared-Memory Multi-Core Systems.
Proceedings of the Ninth International Conference on Web-Age Information Management, 2008

Efficient Similarity Search for Tree-Structured Data.
Proceedings of the Scientific and Statistical Database Management, 2008

EASE: an effective 3-in-1 keyword search method for unstructured, semi-structured and structured data.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

Retune: Retrieving and Materializing Tuple Units for Effective Keyword Search over Relational Databases.
Proceedings of the Conceptual Modeling, 2008

Progressive Ranking for Efficient Keyword Search over Relational Databases.
Proceedings of the Sharing Data, 2008

2007
Exploit sequencing views in semantic cache to accelerate xpath query evaluation.
Proceedings of the 16th International Conference on World Wide Web, 2007

Efficient Keyword Search over Data-Centric XML Documents.
Proceedings of the Advances in Data and Web Management, 2007

Effective keyword-based selection of relational databases.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

A framework of semantic cache for secure XML query answering: an interesting joint and novel perspective.
Proceedings of the 2nf International Conference on Scalable Information Systems, 2007

Schema Mapping in P2P Networks Based on Classification and Probing.
Proceedings of the Advances in Databases: Concepts, 2007

Efficient Holistic Twig Joins in Leaf-to-Root Combining with Root-to-Leaf Way.
Proceedings of the Advances in Databases: Concepts, 2007

Effective keyword search for valuable lcas over xml documents.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007

One table stores all: Enabling painless free-and-easy data publishing and sharing.
Proceedings of the Third Biennial Conference on Innovative Data Systems Research, 2007

2006
SCEND: An Efficient Semantic Cache to Adequately Explore Answerability of Views.
Proceedings of the Web Information Systems, 2006

Incremental Mining of Frequent Query Patterns from XML Queries for Caching.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

2005
DMT: A Flexible and Versatile Selectivity Estimation Approach for Graph Query.
Proceedings of the Advances in Web-Age Information Management, 2005

BBTC: A New Update-Supporting Coding Scheme for XML Documents.
Proceedings of the Advances in Web-Age Information Management, 2005


  Loading...