2025
Shift guided active learning.
Mach. Learn., January, 2025
AIGT: AI Generative Table Based on Prompt.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
2024
When Quantum Computing Meets Database: A Hybrid Sampling Framework for Approximate Query Processing.
IEEE Trans. Knowl. Data Eng., December, 2024
FL-GUARD: A Holistic Framework for Run-Time Detection and Recovery of Negative Federated Learning.
Data Sci. Eng., June, 2024
Predictive and Near-Optimal Sampling for View Materialization in Video Databases.
Proc. ACM Manag. Data, February, 2024
Mining Platoon Patterns from Traffic Videos.
CoRR, 2024
Revisiting CNNs for Trajectory Similarity Learning.
CoRR, 2024
Quantum Computing for Databases: Overview and Challenges.
CoRR, 2024
A Comprehensive Survey of Dynamic Graph Neural Networks: Models, Frameworks, Benchmarks, Experiments and Challenges.
CoRR, 2024
Pre-Trained Model Recommendation for Downstream Fine-tuning.
CoRR, 2024
Towards Cross-Table Masked Pretraining for Web Data Mining.
Proceedings of the ACM on Web Conference 2024, 2024
Locating What You Need: Towards Adapting Diffusion Models to OOD Concepts In-the-Wild.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Enhancing LLM Reasoning via Vision-Augmented Prompting.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
In Situ Neural Relational Schema Matcher.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024
Sampling-Resilient Multi-Object Tracking.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Rethinking Learned Cost Models: Why Start from Scratch?
Proc. ACM Manag. Data, December, 2023
Distributional constraint discovery for intelligent auditing.
Knowl. Inf. Syst., December, 2023
SmartLite: A DBMS-based Serving System for DNN Inference in Resource-constrained Environments.
Proc. VLDB Endow., November, 2023
DoveDB: A Declarative and Low-Latency Video Database.
Proc. VLDB Endow., 2023
A Demonstration of DLBD: Database Logic Bug Detection System.
Proc. VLDB Endow., 2023
Detecting Logic Bugs of Join Optimizations in DBMS.
Proc. ACM Manag. Data, 2023
A Survey on Mapping Semi-Structured Data and Graph Data to Relational Data.
ACM Comput. Surv., 2023
CT-BERT: Learning Better Tabular Representations Through Cross-Table Pre-training.
CoRR, 2023
Assessing Hidden Risks of LLMs: An Empirical Study on Robustness, Consistency, and Credibility.
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
Controllable Textual Inversion for Personalized Text-to-Image Generation.
CoRR, 2023
GAIN: Enhancing Byzantine Robustness in Federated Learning with Gradient Decomposition.
CoRR, 2023
Textile pattern recommendations with convolutional neural networks and autoencoder.
Concurr. Comput. Pract. Exp., 2023
Quantum Computing for Databases: A Short Survey and Vision.
Proceedings of the Joint Proceedings of Workshops at the 49th International Conference on Very Large Data Bases (VLDB 2023), Vancouver, Canada, August 28, 2023
Towards Controlled Data Augmentations for Active Learning.
Proceedings of the International Conference on Machine Learning, 2023
Byzantine-Robust Learning on Heterogeneous Data via Gradient Splitting.
Proceedings of the International Conference on Machine Learning, 2023
Learning a Data-Driven Policy Network for Pre-Training Automated Feature Engineering.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
ModelGiF: Gradient Fields for Model Functional Distance.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Text Labels Classification Model based on BERT Algorithm.
Proceedings of the 4th International Conference on Big Data & Artificial Intelligence & Software Engineering, 2023
2022
Continuous Trajectory Similarity Search for Online Outlier Detection.
IEEE Trans. Knowl. Data Eng., 2022
A Stack-Centric Processing Model for Iterative Processing.
IEEE Trans. Big Data, 2022
Dynamic Index Construction with Deep Reinforcement Learning.
Data Sci. Eng., 2022
A Sampling-based Learning Framework for Big Databases.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022
PreQR: Pre-training Representation for SQL Understanding.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022
IceBerg: Deep Generative Modeling for Constraint Discovery and Anomaly Detection.
Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2022
Comparison Knowledge Translation for Generalizable Image Classification.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
Continuous Trajectory Similarity Search for Online Outlier Detection (Extended Abstract).
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022
A Comparative Study of in-Database Inference Approaches.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022
Towards Unifying the Label Space for Aspect- and Sentence-based Sentiment Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
NEIST: A Neural-Enhanced Index for Spatio-Temporal Queries.
IEEE Trans. Knowl. Data Eng., 2021
AQUA+: Query Optimization for Hybrid Database-MapReduce System.
Knowl. Inf. Syst., 2021
BERT-JAM: Maximizing the utilization of BERT for neural machine translation.
Neurocomputing, 2021
Joining datasets via data augmentation in the label space for neural networks.
CoRR, 2021
A critical look at the current train/test split in machine learning.
CoRR, 2021
Trade-off analysis between delay and throughput of RAN slicing for smart grid.
Comput. Commun., 2021
Metric Learning via Penalized Optimization.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021
Joining datasets via data augmentation in the label space for neural networks.
Proceedings of the 38th International Conference on Machine Learning, 2021
A Learning to Tune Framework for LSH.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021
An Interactive NL2SQL Approach with Reuse Strategy.
Proceedings of the Database Systems for Advanced Applications, 2021
Learning to Label with Active Learning and Reinforcement Learning.
Proceedings of the Database Systems for Advanced Applications, 2021
Effective Slot Filling via Weakly-Supervised Dual-Model Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
A two-phase approach for unexpected pattern mining.
Expert Syst. Appl., 2020
LINDT: Tackling Negative Federated Learning with Local Adaptation.
CoRR, 2020
BERT-JAM: Boosting BERT-Enhanced Neural Machine Translation with Joint Attention.
CoRR, 2020
Multi-Context Attention for Entity Matching.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020
AUDITOR: A System Designed for Automatic Discovery of Complex Integrity Constraints in Relational Databases.
Proceedings of the 2020 International Conference on Management of Data, 2020
Uplink Resource Allocation Based on Short Block-Length Regime in Heterogeneous Cellular Networks for Smart Grid.
Proceedings of the Innovative Mobile and Internet Services in Ubiquitous Computing, 2020
Self-Attention and Dynamic Convolution Hybrid Model for Neural Machine Translation.
Proceedings of the 2020 IEEE International Conference on Knowledge Graph, 2020
From Code to Natural Language: Type-Aware Sketch-Based Seq2Seq Learning.
Proceedings of the Database Systems for Advanced Applications, 2020
2019
Interpretable Partitioned Embedding for Intelligent Multi-item Fashion Outfit Composition.
ACM Trans. Multim. Comput. Commun. Appl., 2019
S3: A Scalable In-memory Skip-List Index for Key-Value Store.
Proc. VLDB Endow., 2019
Information fusion in visual question answering: A Survey.
Inf. Fusion, 2019
Progressive Neural Index Search for Database System.
CoRR, 2019
Semi-Supervised Few-Shot Learning for Dual Question-Answer Extraction.
CoRR, 2019
E2E Delay Optimization for Smart Grids Mission-critical Slices in Core Networks.
Proceedings of the 11th International Conference on Wireless Communications and Signal Processing, 2019
Integration Application of 5G and Smart Grid.
Proceedings of the 11th International Conference on Wireless Communications and Signal Processing, 2019
Service-Aware Resource Allocation Based on RAN Slicing for Smart Grid.
Proceedings of the Innovative Mobile and Internet Services in Ubiquitous Computing, 2019
DBSCAN-MS: Distributed Density-Based Clustering in Metric Spaces.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019
RAN Slice Strategy Based on Deep Reinforcement Learning for Smart Grid.
Proceedings of the Computing, Communications and IoT Applications, ComComAp 2019, Shenzhen, 2019
NAD: Neural Network Aided Design for Textile Pattern Generation.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019
2018
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018
A Graph-Theoretic Fusion Framework for Unsupervised Entity Resolution.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018
A Base Station Location Planning Method Based on Artificial Bee Colony Algorithm.
Proceedings of the IEEE International Conference on Communication Systems, 2018
2017
Targeted Advertising in Public Transportation Systems with Quantitative Evaluation.
ACM Trans. Inf. Syst., 2017
FlashView: An Interactive Visual Explorer for Raw Data.
Proc. VLDB Endow., 2017
CSIR4G: An effective and efficient cross-scenario image retrieval model for glasses.
Inf. Sci., 2017
Tuning the granularity of parallelism for distributed graph processing.
Distributed Parallel Databases, 2017
A New Approach to Compute CNNs for Extremely Large Images.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017
2016
epiC: an extensible and scalable system for processing Big Data.
VLDB J., 2016
KISS: Knowing Camera Prototype System for Recognizing and Annotating Places-of-Interest.
IEEE Trans. Knowl. Data Eng., 2016
LazyLSH: Approximate Nearest Neighbor Search for Multiple Distance Functions with a Single Index.
Proceedings of the 2016 International Conference on Management of Data, 2016
Adaptive Logging: Optimizing Logging and Recovery Costs in Distributed In-memory Databases.
Proceedings of the 2016 International Conference on Management of Data, 2016
iGlasses: A Novel Recommendation System for Best-fit Glasses.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016
Personal recommendation using deep recurrent neural networks in NetEase.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016
2015
HM: A Column-Oriented MapReduce System on Hybrid Storage.
IEEE Trans. Knowl. Data Eng., 2015
Progressive online aggregation in a distributed stream system.
J. Syst. Softw., 2015
Adaptive Logging for Distributed In-memory Databases.
CoRR, 2015
The Evolvement of Big Data Systems: From the Perspective of an Information Security Application.
Big Data Res., 2015
Cross-Scenario Eyeglasses Retrieval via EGYPT Model.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015
PABIRS: A data access middleware for distributed file systems.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015
Toward the New Item Problem: Context-Enhanced Event Recommendation in Event-Based Social Networks.
Proceedings of the Advances in Information Retrieval, 2015
2014
K-Anonymity for Crowdsourcing Database.
IEEE Trans. Knowl. Data Eng., 2014
BestPeer++: A Peer-to-Peer BasedLarge-Scale Data Processing Platform.
IEEE Trans. Knowl. Data Eng., 2014
Automatic Itinerary Planning for Traveling Services.
IEEE Trans. Knowl. Data Eng., 2014
yzBigData: Provisioning Customizable Solution for Big Data.
Proc. VLDB Endow., 2014
ScalaGiST: Scalable Generalized Search Trees for MapReduce Systems [Innovative Systems Paper].
Proc. VLDB Endow., 2014
epiC: an Extensible and Scalable System for Processing Big Data.
Proc. VLDB Endow., 2014
Real-time recommendation for microblogs.
Inf. Sci., 2014
kNN processing with co-space distance in SoLoMo systems.
Expert Syst. Appl., 2014
Distributed data management using MapReduce.
ACM Comput. Surv., 2014
The knowing camera 2: recognizing and annotating places-of-interest in smartphone photos.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014
2013
Proceedings of the Advanced Query Processing, Volume 1: Issues and Trends, 2013
Supporting Efficient Social Media Search in Cyber-Physical Web.
IEEE Data Eng. Bull., 2013
The knowing camera: recognizing places-of-interest in smartphone photos.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013
Crowd-answering system via microblogging.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013
An efficient and compact indexing scheme for large-scale data store.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013
2012
CDAS: A Crowdsourcing Data Analytics System.
Proc. VLDB Endow., 2012
E3: an Elastic Execution Engine for Scalable Data Processing.
J. Inf. Process., 2012
E<sup>3</sup>: an Elastic Execution Engine for Scalable Data Processing.
Inf. Media Technol., 2012
Cross Domain Search by Exploiting Wikipedia.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012
BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012
Parallelizing stateful operators in a distributed stream processing system: how, should you and how much?
Proceedings of the Sixth ACM International Conference on Distributed Event-Based Systems, 2012
2011
Approximate Aggregations in Structured P2P Networks.
IEEE Trans. Knowl. Data Eng., 2011
A Framework for Supporting DBMS-like Indexes in the Cloud.
Proc. VLDB Endow., 2011
Llama: leveraging columnar storage for scalable join processing in the MapReduce framework.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011
TI: an efficient indexing mechanism for real-time search on tweets.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011
ES<sup>2</sup>: A cloud data storage system for supporting both OLTP and OLAP.
Proceedings of the 27th International Conference on Data Engineering, 2011
Query optimization for massively parallel data processing.
Proceedings of the ACM Symposium on Cloud Computing in conjunction with SOSP 2011, 2011
2010
Efficient B-tree Based Indexing for Cloud Data Processing.
Proc. VLDB Endow., 2010
The Performance of MapReduce: An In-depth Study.
Proc. VLDB Endow., 2010
Providing Scalable Database Services on the Cloud.
Proceedings of the Web Information Systems Engineering - WISE 2010, 2010
Continuous sampling for online aggregation over multiple queries.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010
Indexing multi-dimensional data in a cloud system.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010
2009
Distributed Online Aggregation.
Proc. VLDB Endow., 2009
An Indexing Framework for Efficient Retrieval on the Cloud.
IEEE Data Eng. Bull., 2009
Adaptive Multi-join Query Processing in PDBMS.
Proceedings of the 25th International Conference on Data Engineering, 2009
SiMPSON: Efficient Similarity Search in Metric Spaces over P2P Structured Overlay Networks.
Proceedings of the Euro-Par 2009 Parallel Processing, 2009
2008
Just-in-time query retrieval over partially indexed data on structured P2P overlays.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008
Skyline-join in distributed databases.
Proceedings of the 24th International Conference on Data Engineering Workshops, 2008
2004
PKUSpace: A Collaborative Platform for Scientific Researching.
Proceedings of the Advances in Web-Based Learning, 2004
WebGIS-RBDL - A Rare Book Digital Library Supporting Spatio-Temporary Retrieval.
Proceedings of the Digital Libraries: International Collaboration and Cross-Fertilization, 2004