Hongzhi Wang

Orcid: 0000-0002-7521-2871

Affiliations:
  • Harbin Institute of Technology, China


According to our database1, Hongzhi Wang authored at least 374 papers between 2002 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Approximation algorithms for finding maximum containing circle and sphere.
Theor. Comput. Sci., 2025

ACDRL: A constraint-guided dynamic reinforcement learning for index selection.
Inf. Sci., 2025

2024
AutoSR: Automatic Sequential Recommendation System Design.
IEEE Trans. Knowl. Data Eng., November, 2024

FedST: secure federated shapelet transformation for time series classification.
VLDB J., September, 2024

Fair Federated Learning with Multi-Objective Hyperparameter Optimization.
ACM Trans. Knowl. Discov. Data, September, 2024

TimeCSL: Unsupervised Contrastive Learning of General Shapelets for Explorable Time Series Analysis.
Proc. VLDB Endow., August, 2024

Clean4TSDB: A Data Cleaning Tool for Time Series Databases.
Proc. VLDB Endow., August, 2024

DAFDiscover: Robust Mining Algorithm for Dynamic Approximate Functional Dependencies on Dirty Data.
Proc. VLDB Endow., July, 2024

CUBE: Causal Intervention-Based Counterfactual Explanation for Prediction Models.
IEEE Trans. Knowl. Data Eng., June, 2024

DIDS: Double Indices and Double Summarizations for Fast Similarity Search.
Proc. VLDB Endow., May, 2024

One Seed, Two Birds: A Unified Learned Structure for Exact and Approximate Counting.
Proc. ACM Manag. Data, February, 2024

Analytical framework for end-to-end channel capacity in molecular communication system.
Multim. Tools Appl., January, 2024

GFedKG: GNN-based federated embedding model for knowledge graph completion.
Knowl. Based Syst., 2024

TodyNet: Temporal dynamic graph neural network for multivariate time series classification.
Inf. Sci., 2024

Multimodal Data Modeling Technology and Its App-19lication for Cloud-edge-device Collaboration.
Int. J. Softw. Informatics, 2024

Updateable Data-Driven Cardinality Estimator with Bounded Q-error.
CoRR, 2024

KnobCF: Uncertainty-aware Knob Tuning.
CoRR, 2024

RTFormer: Re-parameter TSBN Spiking Transformer.
CoRR, 2024

IntraMix: Intra-Class Mixup Generation for Accurate Labels and Neighbors.
CoRR, 2024

Automatic time series forecasting model design based on pruning.
Appl. Soft Comput., 2024

UniTS: A Universal Time Series Analysis Framework Powered by Self-Supervised Representation Learning.
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024

RHKH: Relational Hypergraph Neural Network for Link Prediction on N-ary Knowledge Hypergraph.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

An Unsupervised Learning Framework Combined with Heuristics for the Maximum Minimal Cut Problem.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Duet: Efficient and Scalable Hybrid Neural Relation Understanding.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

QCFE: An Efficient Feature Engineering for Query Cost Estimation.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

AutoMC: Automated Model Compression Based on Domain Knowledge and Progressive Search.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

Efficient Semi-External SCC Computation (Extended Abstract).
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

Explainable Database Management System Configuration Tuning through Counterfactuals.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

CUBE: Causal Intervention-based Counterfactual Explanation for Prediction Models (Extended Abstract).
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

ShrinkHPO: Towards Explainable Parallel Hyperparameter Optimization.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

Efficient Relaxed Functional Dependency Discovery with Minimal Set Cover.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

Time Series Data Cleaning Under Expressive Constraints on Both Rows and Columns.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

TSDDISCOVER: Discovering Data Dependency for Time Series Data.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

Towards Real-Time Data Ingestion for Industrial Internet of Things.
Proceedings of the Database Systems for Advanced Applications, 2024

iMonitor: A Real-Time Monitoring Platform for Industrial Internet of Things.
Proceedings of the Database Systems for Advanced Applications, 2024

Cnos-Connector: Enabling Seamless Connection with CnosDB to Facilitate Large-Scale Time-Series Data Management and Analytics.
Proceedings of the Database Systems for Advanced Applications, 2024

RocolSys: An Automatic Row-Column Data Storage System for HTAP.
Proceedings of the Web and Big Data - 8th International Joint Conference, 2024

Access Structure Selection for Knowledge Graphs Based on Machine Learning.
Proceedings of the ACM Turing Award Celebration Conference 2024, 2024

DCLP: Neural Architecture Predictor with Curriculum Contrastive Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Dirty Data Processing for Machine Learning, 2
Springer, ISBN: 978-981-99-7656-0, 2024

2023
Discovering time series motifs of all lengths using dynamic time warping.
World Wide Web (WWW), November, 2023

Data quality model for assessing public COVID-19 big datasets.
J. Supercomput., November, 2023

A Shapelet-based Framework for Unsupervised Multivariate Time Series Representation Learning.
Proc. VLDB Endow., November, 2023

Automatic single table storage structure selection for hybrid workload.
Knowl. Inf. Syst., November, 2023

Time series compression based on reinforcement learning.
Inf. Sci., November, 2023

Search for deep graph neural networks.
Inf. Sci., November, 2023

Automated Graph Neural Network Search Under Federated Learning Framework.
IEEE Trans. Knowl. Data Eng., October, 2023

SAT: sampling acceleration tree for adaptive database repartition.
World Wide Web (WWW), September, 2023

TENSILE: A Tensor Granularity Dynamic GPU Memory Scheduling Method Toward Multiple Dynamic Workloads System.
IEEE Trans. Knowl. Data Eng., August, 2023

Identifying effective trajectory predictions under the guidance of trajectory anomaly detection model.
Pattern Recognit., August, 2023

Reachability Queries With Label and Substructure Constraints on Knowledge Graphs.
IEEE Trans. Knowl. Data Eng., June, 2023

Auto-STGCN: Autonomous Spatial-Temporal Graph Convolutional Network Search.
ACM Trans. Knowl. Discov. Data, June, 2023

HyGGE: Hyperbolic graph attention network for reasoning over knowledge graphs.
Inf. Sci., June, 2023

Efficient Semi-External SCC Computation.
IEEE Trans. Knowl. Data Eng., April, 2023

End-to-end multivariate time series classification via hybrid deep learning architectures.
Pers. Ubiquitous Comput., April, 2023

AAE: An active auto-estimator for improving graph storage.
Inf. Sci., April, 2023

EFFECT: Explainable framework for meta-learning in automatic classification algorithm selection.
Inf. Sci., April, 2023

PreKar: A learned performance predictor for knowledge graph stores.
World Wide Web (WWW), January, 2023

A Dual-Store Structure for Knowledge Graphs.
IEEE Trans. Knowl. Data Eng., 2023

Dynamic Relation Repairing for Knowledge Enhancement.
IEEE Trans. Knowl. Data Eng., 2023

SoftStep relaxation for mining optimal convolution kernel.
Knowl. Based Syst., 2023

TransFusion Model Fusion Mechanism Based on Transformer for Traffic Flow Prediction.
J. Database Manag., 2023

Unsupervised Multi-modal Feature Alignment for Time Series Representation Learning.
CoRR, 2023

IWEK: An Interpretable What-If Estimator for Database Knobs.
CoRR, 2023

Contrastive Shapelet Learning for Unsupervised Multivariate Time Series Representation Learning.
CoRR, 2023

One stone, two birds: A lightweight multidimensional learned index with cardinality support.
CoRR, 2023

TodyNet: Temporal Dynamic Graph Neural Network for Multivariate Time Series Classification.
CoRR, 2023

UniTS: A Universal Time Series Analysis Framework with Self-supervised Representation Learning.
CoRR, 2023

TS-Cabinet: Hierarchical Storage for Cloud-Edge-End Time-series Database.
CoRR, 2023

FedST: Federated Shapelet Transformation for Interpretable Time Series Classification.
CoRR, 2023

Prediction of Time Series Data with Low Latitude Features.
Proceedings of the Data Science, 2023

Complex Time Series Analysis Based on Conditional Random Fields.
Proceedings of the Data Science, 2023

Dimension Reduction Based on Sampling.
Proceedings of the Data Science, 2023

Reachability Queries with Label and Substructure Constraints on Knowledge Graphs (Extended abstract).
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

TSC-AutoML: Meta-learning for Automatic Time Series Classification Algorithm Selection.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Approximate Query Processing Based on Approximate Materialized View.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2023

Schema Integration on Massive Data Sources.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2023

CnosDB: A Flexible Distributed Time-Series Database for Large-Scale Data.
Proceedings of the Database Systems for Advanced Applications, 2023

Auto-TSA: An Automatic Time Series Analysis System Based on Meta-learning.
Proceedings of the Database Systems for Advanced Applications. DASFAA 2023 International Workshops, 2023

PFKMaster: A Knowledge-Driven Flow Control System for Large-Scale Power Grid.
Proceedings of the Database Systems for Advanced Applications, 2023

Cleanits-MEDetect: Multiple Errors Detection for Time Series Data in Cleanits.
Proceedings of the Database Systems for Advanced Applications, 2023

SNN-AAD: Active Anomaly Detection Method for Multivariate Time Series with Sparse Neural Network.
Proceedings of the Database Systems for Advanced Applications, 2023

A Novel Approximation Algorithm for Max-Covering Circle Problem.
Proceedings of the Combinatorial Optimization and Applications, 2023

RDBlab: An Artificial Simulation System for RDBMSs.
Proceedings of the Web and Big Data - 7th International Joint Conference, 2023

ANSWER: Automatic Index Selector for Knowledge Graphs.
Proceedings of the Web and Big Data - 7th International Joint Conference, 2023

2022
Knowledge Discovery from Multi-Sourced Data
Springer Briefs in Computer Science, Springer, ISBN: 978-981-19-1878-0, 2022

GSBRL : Efficient RDF graph storage based on reinforcement learning.
World Wide Web, 2022

Constrained Truth Discovery.
IEEE Trans. Knowl. Data Eng., 2022

Leveraging Currency for Repairing Inconsistent and Incomplete Data.
IEEE Trans. Knowl. Data Eng., 2022

FedTSC: A Secure Federated Learning System for Interpretable Time Series Classification.
Proc. VLDB Endow., 2022

A meta learning approach for open information extraction.
Neural Comput. Appl., 2022

JointMatcher: Numerically-aware entity matching using pre-trained language models with attention concentration.
Knowl. Based Syst., 2022

Partial multi-label learning via specific label disambiguation.
Knowl. Based Syst., 2022

Mining Simple Path Traversal Patterns in Knowledge Graph.
J. Web Eng., 2022

ConsciousControlFlow(CCF): Conscious Artificial Intelligence Based on Needs.
J. Artif. Intell. Conscious., 2022

Evaluating community quality based on ground-truth.
Inf. Sci., 2022

A linear algorithm for semi-external cutnode computation.
Inf. Sci., 2022

Efficient semi-external depth-first search.
Inf. Sci., 2022

Auto-CASH: A meta-learning embedding approach for autonomous classification algorithm selection.
Inf. Sci., 2022

IoT data cleaning techniques: A survey.
Intell. Converged Networks, 2022

METransE: Manifold-like mechanism enhanced embedding for reasoning over knowledge graphs.
Expert Syst. Appl., 2022

MISS: finding optimal sample sizes for approximate analytics.
Distributed Parallel Databases, 2022

Differentiable Self-Adaptive Learning Rate.
CoRR, 2022

AAE: An Active Auto-Estimator for Improving Graph Storage.
CoRR, 2022

FIND: Explainable Framework for Meta-learning.
CoRR, 2022

AutoTS: Automatic Time Series Forecasting Model Design Based on Two-Stage Pruning.
CoRR, 2022

AutoMC: Automated Model Compression based on Domain Knowledge and Progressive search strategy.
CoRR, 2022

TPAD: Identifying Effective Trajectory Predictions Under the Guidance of Trajectory Anomaly Detection Model.
CoRR, 2022

Predict industrial equipment failure with time windows and transfer learning.
Appl. Intell., 2022

EEML: Ensemble Embedded Meta-Learning.
Proceedings of the Web Information Systems Engineering - WISE 2022, 2022

Automatic Scheduling Technology of Computing Power Network Driven by Knowledge Graph.
Proceedings of the International Conference on Service Science, 2022

Parallel Skyline Query Processing of Massive Incomplete Activity-Trajectories Data.
Proceedings of the Model and Data Engineering: 11th International Conference, 2022

A Generative Adversarial Active Learning Method for Effective Outlier Detection.
Proceedings of the 34th IEEE International Conference on Tools with Artificial Intelligence, 2022

A Dual-Store Structure for Knowledge Graphs (Extended Abstract).
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

CO-AutoML: An Optimizable Automated Machine Learning System.
Proceedings of the Database Systems for Advanced Applications, 2022

Time Series Data Quality Enhancing Based on Pattern Alignment.
Proceedings of the Database Systems for Advanced Applications. DASFAA 2022 International Workshops, 2022

ATPFL: Automatic Trajectory Prediction Model Design under Federated Learning Framework.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Multi-SQL: An Automatic Multi-model Data Management System.
Proceedings of the Web and Big Data - 6th International Joint Conference, 2022

2021
GPU-based efficient join algorithms on Hadoop.
J. Supercomput., 2021

IDCOS: optimization strategy for parallel complex expression computation on big data.
J. Supercomput., 2021

SDLER: stacked dedupe learning for entity resolution in big data era.
J. Supercomput., 2021

Bidirectional LSTM-RNN-based hybrid deep learning frameworks for univariate time series classification.
J. Supercomput., 2021

Assassin: an Automatic claSSificAtion system baSed on algorithm SelectIoN.
Proc. VLDB Endow., 2021

Attention-Based Deep Gated Fully Convolutional End-to-End Architectures for Time Series Classification.
Neural Process. Lett., 2021

Improving Open Information Extraction with Distant Supervision Learning.
Neural Process. Lett., 2021

ExperienceThinking: Constrained hyperparameter optimization based on knowledge and pruning.
Knowl. Based Syst., 2021

TAGAT: Type-Aware Graph Attention neTworks for reasoning over knowledge graphs.
Knowl. Based Syst., 2021

Semi-supervised multi-label feature selection with adaptive structure learning and manifold learning.
Knowl. Based Syst., 2021

Industrial time series determinative anomaly detection based on constraint hypergraph.
Knowl. Based Syst., 2021

Selectivity estimation with density-model-based multidimensional histogram.
Knowl. Inf. Syst., 2021

Impacts of Dirty Data on Classification and Clustering Models: An Experimental Evaluation.
J. Comput. Sci. Technol., 2021

LAQP: Learning-based approximate query processing.
Inf. Sci., 2021

Deep truth discovery for pattern-based fact extraction.
Inf. Sci., 2021

Index selection for NoSQL database with deep reinforcement learning.
Inf. Sci., 2021

Attributed community search based on effective scoring function and elastic greedy method.
Inf. Sci., 2021

Efficient class-specific shapelets learning for interpretable time series classification.
Inf. Sci., 2021

Methods for detecting and correcting contextual data quality problems.
Intell. Data Anal., 2021

Graph matching based reasoner: A symbolic approach to question answering.
Eng. Appl. Artif. Intell., 2021

Transformer based network for Open Information Extraction.
Eng. Appl. Artif. Intell., 2021

Experience-Enhanced Learning: One Size Still does not Fit All in Automatic Database.
CoRR, 2021

FL-AGCNS: Federated Learning Framework for Automatic Graph Convolutional Network Search.
CoRR, 2021

Approximate Query Processing for Group-By Queries based on Conditional Generative Models.
CoRR, 2021

Exploring Data and Knowledge combined Anomaly Explanation of Multivariate Industrial Data.
CoRR, 2021

EMM-CLODS: An Effective Microcluster and Minimal Pruning CLustering-Based Technique for Detecting Outliers in Data Streams.
Complex., 2021

Effective density-based clustering algorithms for incomplete data.
Big Data Min. Anal., 2021

Modulo 9 model-based learning for missing data imputation.
Appl. Soft Comput., 2021

Cost-Based Lightweight Storage Automatic Decision for In-Database Machine Learning.
Proceedings of the Web Information Systems Engineering - WISE 2021, 2021

Intelligent Storage System of Machine Learning Model Based on Task Similarity.
Proceedings of the Data Science, 2021

Data-Driven Prediction of Foodborne Disease Pathogens.
Proceedings of the Data Science, 2021

Constrained Truth Discovery (Extended Abstract).
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Concurrency Control Based on Transaction Clustering.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Leveraging Currency for Repairing Inconsistent and Incomplete Data (Extended Abstract).
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Dirty-Data Impacts on Regression Models: An Experimental Evaluation.
Proceedings of the Database Systems for Advanced Applications, 2021

The Regional Clusting Effect of the Blockchain Industry Base on Unsupervised Learning Methods.
Proceedings of the Blockchain and Trustworthy Systems - Third International Conference, 2021

A Study on Bitcoin Price Volatility Based on the SVAR Model and Impulse Response Analysis.
Proceedings of the Blockchain and Trustworthy Systems - Third International Conference, 2021

ALMSS: Automatic Learned Index Model Selection System.
Proceedings of the Web and Big Data - 5th International Joint Conference, 2021

2020
Biocyber Interface-Based Privacy for Internet of Bio-nano Things.
Wirel. Pers. Commun., 2020

On mixing reservoir targeted drug delivery Modeling-based Internet of Bio-NanoThings.
Wirel. Networks, 2020

Automatic weighted matching rectifying rule discovery for data repairing.
VLDB J., 2020

Efficient Entity Resolution on Heterogeneous Records.
IEEE Trans. Knowl. Data Eng., 2020

A new ensemble feature selection approach based on genetic algorithm.
Soft Comput., 2020

Effective Bayesian-network-based missing value imputation enhanced by crowdsourcing.
Knowl. Based Syst., 2020

SUM-optimal histograms for approximate query processing.
Knowl. Inf. Syst., 2020

TAILOR: time-aware facility location recommendation based on massive trajectories.
Knowl. Inf. Syst., 2020

A Parametric and Non-Parametric Approach for High-Accurate Outlier Detection.
J. Inf. Sci. Eng., 2020

Learned sketches for frequency estimation.
Inf. Sci., 2020

Multi-source data repairing powered by integrity constraints and source reliability.
Inf. Sci., 2020

Error Detection in a Large-Scale Lexical Taxonomy.
Inf., 2020

FRIEND: Feature selection on inconsistent data.
Neurocomputing, 2020

Diversification on big data in query processing.
Frontiers Comput. Sci., 2020

A survey of uncertain data management.
Frontiers Comput. Sci., 2020

Misplaced Subsequences Repairing with Application to Multivariate Industrial Time Series Data.
CoRR, 2020

Multi-SQL: An extensible multi-model data query language.
CoRR, 2020

Efficient RDF Graph Storage based on Reinforcement Learning.
CoRR, 2020

Auto-STGCN: Autonomous Spatial-Temporal Graph Convolutional Network Search Based on Reinforcement Learning and Existing Research Results.
CoRR, 2020

EM-RBR: a reinforced framework for knowledge graph completion from reasoning perspective.
CoRR, 2020

Automatic Storage Structure Selection for hybrid Workload.
CoRR, 2020

Auto-CASH: Autonomous Classification Algorithm Selection with Deep Q-Network.
CoRR, 2020

Multi-Objective Neural Architecture Search Based on Diverse Structures and Adaptive Recommendation.
CoRR, 2020

Index Selection for NoSQL Database with Deep Reinforcement Learning.
CoRR, 2020

ConsciousControlFlow(CCF): A Demonstration for conscious Artificial Intelligence.
CoRR, 2020

Automatic Hyper-Parameter Optimization Based on Mapping Discovery from Data to Hyper-Parameters.
CoRR, 2020

Mining conditional functional dependency rules on big data.
Big Data Min. Anal., 2020

Incomplete data classification with view-based decision tree.
Appl. Soft Comput., 2020

Power of Attention in MOOC Dropout Prediction.
IEEE Access, 2020

Data Cleaning About Student Information Based on Massive Open Online Course System.
Proceedings of the Data Science, 2020

General Model for Index Recommendation Based on Convolutional Neural Network.
Proceedings of the Data Science, 2020

Auto-Model: Utilizing Research Papers and HPO Techniques to Deal with the CASH problem.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Efficient Entity Resolution on Heterogeneous Records (Extended abstract).
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

GMDA: An Automatic Data Analysis System for Industrial Production.
Proceedings of the Database Systems for Advanced Applications, 2020

STRATEGY: A Flexible Job-Shop Scheduling System for Large-Scale Complex Products.
Proceedings of the Database Systems for Advanced Applications, 2020

April: An Automatic Graph Data Management System Based on Reinforcement Learning.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Automatic Document Data Storage System Based on Machine Learning.
Proceedings of the Web and Big Data - 4th International Joint Conference, 2020

An Effective Constraint-Based Anomaly Detection Approach on Multivariate Time Series.
Proceedings of the Web and Big Data - 4th International Joint Conference, 2020

A MOOC Courses Recommendation System Based on Learning Behaviours.
Proceedings of the ACM TUR-C'20: ACM Turing Celebration Conference, 2020

Conscious Knowledge Based Question Answering.
Proceedings of the ACM TUR-C'20: ACM Turing Celebration Conference, 2020

2019
Effective and Efficient Community Search Over Large Directed Graphs.
IEEE Trans. Knowl. Data Eng., 2019

Cleanits: A Data Cleaning System for Industrial Time Series.
Proc. VLDB Endow., 2019

PatternFinder: Pattern discovery for truth discovery.
Knowl. Based Syst., 2019

Hierarchical spatial-aware Siamese network for thermal infrared object tracking.
Knowl. Based Syst., 2019

AutoRepair: an automatic repairing approach over multi-source data.
Knowl. Inf. Syst., 2019

社交网络中同一用户的识别 (Identification of Same User in Social Networks).
计算机科学, 2019

LKAQ: Large-scale knowledge graph approximate query algorithm.
Inf. Sci., 2019

Data source selection for information integration in big data era.
Inf. Sci., 2019

Distracter-aware tracking via correlation filter.
Neurocomputing, 2019

Efficient OLAP algorithms on GPU-accelerated Hadoop clusters.
Distributed Parallel Databases, 2019

ExperienceThinking: Hyperparameter Optimization with Budget Constraints.
CoRR, 2019

A General Data Renewal Model for Prediction Algorithms in Industrial Data Analytics.
CoRR, 2019

Autoregressive-Model-Based Methods for Online Time Series Prediction with Missing Values: an Experimental Evaluation.
CoRR, 2019

LSTM-based Flow Prediction.
CoRR, 2019

Regular Expression Matching on billion-nodes Graphs.
CoRR, 2019

One-Pass Inconsistency Detection Algorithms for Big Data.
IEEE Access, 2019

Progress in Outlier Detection Techniques: A Survey.
IEEE Access, 2019

An Effective Minimal Probing Approach With Micro-Cluster for Distance-Based Outlier Detection in Data Streams.
IEEE Access, 2019

ECOQUG: An Effective Ensemble Community Scoring Function.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Rule-Based Entity Resolution on Database with Hidden Temporal Information (Extended Abstract).
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

An Iterative Scheme for Leverage-Based Approximate Aggregation.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Effective and Efficient Community Search Over Large Directed Graphs (Extended Abstract).
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

IMOptimizer: An Online Interactive Parameter Optimization System Based on Big Data.
Proceedings of the Database Systems for Advanced Applications, 2019

Panel: The Computing in Data Science.
Proceedings of the ACM Conference on Global Computing Education, 2019

The analysis and early warning of student loss in MOOC course.
Proceedings of the ACM Turing Celebration Conference - China, 2019

Predicting learning status in MOOCs using LSTM.
Proceedings of the ACM Turing Celebration Conference - China, 2019

An efficient distributed storage strategy for blockchain.
Proceedings of the ACM Turing Celebration Conference - China, 2019

2018
Rule-Based Entity Resolution on Database with Hidden Temporal Information.
IEEE Trans. Knowl. Data Eng., 2018

Data management on new processors: A survey.
Parallel Comput., 2018

Iteration acceleration for distributed learning systems.
Parallel Comput., 2018

FROG: Inference from knowledge base for missing value imputation.
Knowl. Based Syst., 2018

一种基于MapReduce的不确定图上的相似性连接方法 (Method of Similarity Join on Uncertain Graphs Using MapReduce).
计算机科学, 2018

O2iJoin: An Efficient Index-Based Algorithm for Overlap Interval Join.
J. Comput. Sci. Technol., 2018

Parallel algorithms for flexible pattern matching on big graphs.
Inf. Sci., 2018

A gray-box performance model for Apache Spark.
Future Gener. Comput. Syst., 2018

Efficient histogram-based range query estimation for dirty data.
Frontiers Comput. Sci., 2018

An effective weighted rule-based method for entity resolution.
Distributed Parallel Databases, 2018

Error Detection in a Large-Scale Lexical Taxonomy.
CoRR, 2018

On the Fairness of Quality-based Data Markets.
CoRR, 2018

Mining CFD Rules on Big Data.
CoRR, 2018

Schema Integration on Massive Data Sources.
CoRR, 2018

Improve3C: Data Cleaning on Consistency and Completeness with Currency.
CoRR, 2018

MISS: Finding Optimal Sample Sizes for Approximate Analytics.
CoRR, 2018

Impacts of Dirty Data: and Experimental Evaluation.
CoRR, 2018

A generic data analytics system for manufacturing production.
Big Data Min. Anal., 2018

Efficient Computation of Skyline Queries on Incomplete Dynamic Data.
IEEE Access, 2018

Statistical Learning-Based Prediction of Execution Time of Data-Intensive Program Under Hadoop2.0.
Proceedings of the Data Science, 2018

SFSC: Segment Feature Sampling Classifier for Time Series Classification.
Proceedings of the Data Science, 2018

An Anomaly Detection Method Based on Learning of "Scores Sequence".
Proceedings of the Data Science, 2018

A Method to Identify Spark Important Parameters Based on Machine Learning.
Proceedings of the Data Science, 2018

MobiDis: Relationship Discovery of Mobile Users from Spatial-Temporal Trajectories.
Proceedings of the Advances in Conceptual Modeling, 2018

Time Series Cleaning Under Variance Constraints.
Proceedings of the Database Systems for Advanced Applications, 2018

2017
A survey of query result diversification.
Knowl. Inf. Syst., 2017

FrepJoin: an efficient partition-based algorithm for edit similarity join.
Frontiers Inf. Technol. Electron. Eng., 2017

COSSET+: Crowdsourced Missing Value Imputation Optimized by Knowledge Base.
J. Comput. Sci. Technol., 2017

EntityManager: Managing Dirty Data Based on Entity Resolution.
J. Comput. Sci. Technol., 2017

Parallel trajectory search based on distributed index.
Inf. Sci., 2017

Hierarchical Siamese Network for Thermal Infrared Object Tracking.
CoRR, 2017

Life-long learning based on dynamic combination model.
Appl. Soft Comput., 2017

Baymax: A Mental-Analyzing Mobile App Based on Big Data.
Proceedings of the Data Science, 2017

Disease Prediction Based on Transfer Learning in Individual Healthcare.
Proceedings of the Data Science, 2017

Efficient Web-Based Data Imputation with Graph Model.
Proceedings of the Database Systems for Advanced Applications, 2017

Similarity Search Combining Query Relaxation and Diversification.
Proceedings of the Database Systems for Advanced Applications, 2017

Capture Missing Values with Inference on Knowledge Base.
Proceedings of the Database Systems for Advanced Applications, 2017

CleanCloud: Cleaning Big Data on Cloud.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Any-Time Methods for Time-Series Prediction with Missing Observations.
Proceedings of the 2017 IEEE International Congress on Big Data, 2017

The design of course architecture for big data.
Proceedings of the ACM Turing 50th Celebration Conference, 2017

Determining the currency of dynamic data.
Proceedings of the ACM Turing 50th Celebration Conference, 2017

Distributed skyline trajectory query processing.
Proceedings of the ACM Turing 50th Celebration Conference, 2017

2016
Repairing Data through Regular Expressions.
Proc. VLDB Endow., 2016

Extend tree edit distance for effective object identification.
Knowl. Inf. Syst., 2016

Efficient entity resolution based on subgraph cohesion.
Knowl. Inf. Syst., 2016

An efficient pruning strategy for approximate string matching over suffix tree.
Knowl. Inf. Syst., 2016

Efficient quality-driven source selection from massive data sources.
J. Syst. Softw., 2016

图结构模糊XML文档上的模式匹配算法 (Pattern Matching Algorithms for Graph Structured Fuzzy XML Documents).
计算机科学, 2016

Efficient accuracy evaluation for multi-modal sensed data.
J. Comb. Optim., 2016

Skyline for geo-textual data.
GeoInformatica, 2016

Parallelizing the extraction of fresh information from online social networks.
Future Gener. Comput. Syst., 2016

Efficient graph similarity join for information integration on graphs.
Frontiers Comput. Sci., 2016

Similarity Search Combining Query Relaxation and Diversification.
CoRR, 2016

SHMDRS: A Smartphone-Based Human Motion Detection and Response System.
Proceedings of the Wireless Algorithms, Systems, and Applications, 2016

Set-based Similarity Search for Time Series.
Proceedings of the 2016 International Conference on Management of Data, 2016

MapReduce for Big Data Analysis: Benefits, Limitations and Extensions.
Proceedings of the Social Computing, 2016

Efficient File Accessing Techniques on Hadoop Distributed File Systems.
Proceedings of the Social Computing, 2016

HierarSearch: Enhancing Performance of Search Engines by Mining Semantic Relationships Among Results.
Proceedings of the Social Computing, 2016

Optimization Analysis of Hadoop.
Proceedings of the Social Computing, 2016

VMPSP: Efficient Skyline Computation Using VMP-Based Space Partitioning.
Proceedings of the Database Systems for Advanced Applications, 2016

ISSA: Efficient Skyline Computation for Incomplete Data.
Proceedings of the Database Systems for Advanced Applications, 2016

Join Query Processing in Data Quality Management.
Proceedings of the Database Systems for Advanced Applications, 2016

Crowdsourcing-Enhanced Missing Values Imputation Based on Bayesian Network.
Proceedings of the Database Systems for Advanced Applications, 2016

Efficient Influence Maximization in Weighted Independent Cascade Model.
Proceedings of the Database Systems for Advanced Applications, 2016

Similarity Search on Massive Data Based on FPGA.
Proceedings of the Database Systems for Advanced Applications, 2016

A Distributed Load Balance Algorithm of MapReduce for Data Quality Detection.
Proceedings of the Database Systems for Advanced Applications, 2016

Incomplete Data Classification Based on Multiple Views.
Proceedings of the Web Technologies and Applications - 18th Asia-Pacific Web Conference, 2016

A Chronic Disease Analysis System Based on Dirty Data Mining.
Proceedings of the Web Technologies and Applications - 18th Asia-Pacific Web Conference, 2016

An Alarming and Prediction System for Infections Disease Based on Combined Models.
Proceedings of the Web Technologies and Applications - 18th Asia-Pacific Web Conference, 2016

Fuzzy Keywords Query.
Proceedings of the Web Technologies and Applications - 18th Asia-Pacific Web Conference, 2016

2015
Efficient subgraph join based on connectivity similarity.
World Wide Web, 2015

Indexable online time series segmentation with error bound guarantee.
World Wide Web, 2015

Cleanix: a Parallel Big Data Cleaning System.
SIGMOD Rec., 2015

Efficient Influence Maximization in Weighted Independent Cascade Model.
CoRR, 2015

Image Clustering Based on the Human Intelligence.
Proceedings of the 10th International Conference on Intelligent Systems and Knowledge Engineering, 2015

Graph Similarity Join with K-Hop Tree Indexing.
Proceedings of the Intelligent Computation in Big Data Era, 2015

Maximal Influence Spread for Social Network Based on MapReduce.
Proceedings of the Intelligent Computation in Big Data Era, 2015

Paradise Pointer : A Sightseeing Scenes Images Search Engine Based on Big Data Processing.
Proceedings of the Intelligent Computation in Big Data Era, 2015

Tree-Based Metric Learning for Distance Computation in Data Mining.
Proceedings of the Web Technologies and Applications - 17th Asia-PacificWeb Conference, 2015

A Fair Data Market System with Data Quality Evaluation and Repairing Recommendation.
Proceedings of the Web Technologies and Applications - 17th Asia-PacificWeb Conference, 2015

2014
Approximate joins for XML at label level.
Inf. Sci., 2014

Brief survey of crowdsourcing for data mining.
Expert Syst. Appl., 2014

Accuracy Evaluation for Sensed Data.
Proceedings of the Wireless Algorithms, Systems, and Applications, 2014

Capture Missing Values Based on Crowdsourcing.
Proceedings of the Wireless Algorithms, Systems, and Applications, 2014

Sectional and Conditional Functional Dependencies.
Proceedings of the Wireless Algorithms, Systems, and Applications, 2014

Truth Discovery Based on Crowdsourcing.
Proceedings of the Web-Age Information Management - 15th International Conference, 2014

Neighbor-base similarity matching for graphs.
Proceedings of the Workshops Proceedings of the 30th International Conference on Data Engineering Workshops, 2014

Harbinger: An Analyzing and Predicting System for Online Social Network Users' Behavior.
Proceedings of the Database Systems for Advanced Applications, 2014

Cleanix: A Big Data Cleaning Parfait.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

CrowdCleaner: A Data Cleaning System Based on Crowdsourcing.
Proceedings of the Web Technologies and Applications - 16th Asia-Pacific Web Conference, 2014

TruthOrRumor: Truth Judgment from Web.
Proceedings of the Web Technologies and Applications - 16th Asia-Pacific Web Conference, 2014

Efficient Subgraph Matching Using GPUs.
Proceedings of the Databases Theory and Applications, 2014

2013
A survey on tree edit distance lower bound estimation techniques for similarity join on XML data.
SIGMOD Rec., 2013

Efficient community detection with additive constrains on large networks.
Knowl. Based Syst., 2013

Graph-based reference table construction to facilitate entity matching.
J. Syst. Softw., 2013

An Efficient Entity Resolution Method for Large Relations.
Int. J. Cooperative Inf. Syst., 2013

Parallelization in Extracting Fresh Information from Online Social Network.
CoRR, 2013

A Data Cleaning Framework Based on User Feedback.
Proceedings of the Web-Age Information Management - 14th International Conference, 2013

Imputation for Categorical Attributes with Probabilistic Reasoning.
Proceedings of the Web-Age Information Management - 14th International Conference, 2013

Incremental Truth Discovery for Information from Multiple Data Sources.
Proceedings of the Web-Age Information Management, 2013

CUVIM: Extracting Fresh Information from Social Network.
Proceedings of the Web-Age Information Management - 14th International Conference, 2013

Entity Resolution on Uncertain Relations.
Proceedings of the Web-Age Information Management - 14th International Conference, 2013

ProductSeeker: entity-based product retrieval for e-commerce.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

EntityManager: An Entity-Based Dirty Data Management System.
Proceedings of the Database Systems for Advanced Applications, 2013

HITCleaner: A Light-Weight Online Data Cleaning System.
Proceedings of the Database Systems for Advanced Applications, 2013

2012
Efficient Subgraph Matching on Billion Node Graphs.
Proc. VLDB Endow., 2012

Range Query Estimation for Dirty Data Management System.
Proceedings of the Web-Age Information Management - 13th International Conference, 2012

Graph-Structured Data Compression Based on Frequent Subgraph Contraction.
Proceedings of the Web-Age Information Management, 2012

Fgram-Tree: An Index Structure Based on Feature Grams for String Approximate Search.
Proceedings of the Web-Age Information Management - 13th International Conference, 2012

Path-Based XML Stream Compression with XPath Query Support.
Proceedings of the Web-Age Information Management, 2012

Dynamic Graph Shortest Path Algorithm.
Proceedings of the Web-Age Information Management - 13th International Conference, 2012

2011
GXQuery: Extending XQuery for Querying Graph-structured XML Data.
J. Comput. Inf. Technol., 2011

Similarity Join on XML Based on k-Generation Set Distance.
Proceedings of the Web-Age Information Management, 2011

Context-based entity description rule for entity resolution.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Schema Mapping with Quality Assurance for Data Integration.
Proceedings of the Web Technologies and Applications - 13th Asia-Pacific Web Conference, 2011

Dirty Data Management in Cloud Database.
Proceedings of the Grid and Cloud Database Management, 2011

Labelling-Scheme-based Subgraph Query Processing on Graph Data.
Proceedings of the Graph Data Management: Techniques and Applications., 2011

2010
Graph Homomorphism Revisited for Graph Matching.
Proc. VLDB Endow., 2010

Approximate Joins for XML Using <i>g</i>-String.
Proceedings of the Database and XML Technologies, 2010

Compression Algorithms for Structural Query Results on XML Data.
Proceedings of the Web-Age Information Management, 2010

Efficient Duplicate Record Detection Based on Similarity Estimation.
Proceedings of the Web-Age Information Management, 11th International Conference, 2010

<i>pq</i>-Hash: An Efficient Method for Approximate XML Joins.
Proceedings of the Web-Age Information Management, 2010

EIF: A Framework of Effective Entity Identification.
Proceedings of the Web-Age Information Management, 11th International Conference, 2010

DCUBE: CUBE on Dirty Databases.
Proceedings of the Web-Age Information Management, 11th International Conference, 2010

2009
Query Optimization for Complex Path Queries on XML Data.
Proceedings of the Database Systems for Advanced Applications, 2009

Efficient Algorithms for Skyline Top-K Keyword Queries on XML Streams.
Proceedings of the Database Systems for Advanced Applications, 2009

Global Similarity and Local Variance in Human Gene Coexpression Networks.
Proceedings of the Artificial Intelligence and Computational Intelligence, 2009

2008
Coding-based Join Algorithms for Structural Queries on Graph-Structured XML Document.
World Wide Web, 2008

Clustered Chain Path Index for XML Document: Efficiently Processing Branch Queries.
World Wide Web, 2008

Hash-base subgraph query processing method for graph-structured XML documents.
Proc. VLDB Endow., 2008

Data sources selection for XML data sources.
Int. J. Intell. Inf. Database Syst., 2008

Efficient join algorithms for distributed information integration based on XML.
Int. J. Bus. Process. Integr. Manag., 2008

Constructing disease-specific gene networks using pair-wise relevance metric: Application to colon cancer identifies interleukin 8, desmin and enolase 1 as the central elements.
BMC Syst. Biol., 2008

C-kNN Query Processing in Object Tracking Sensor Networks.
Proceedings of the Wireless Algorithms, 2008

Efficient Top-k Keyword Search on XML Streams.
Proceedings of the 9th International Conference for Young Computer Scientists, 2008

2007
SAM: An Efficient Algorithm for F&B-Index Construction.
Proceedings of the Advances in Data and Web Management, 2007

2006
Caching Frequent XML Query Patterns.
Proceedings of the Advanced Web and Network Technologies, and Applications, 2006

Clustered Absolute Path Index for XML Document: On Efficient Processing of Twig Queries.
Proceedings of the Advanced Web and Network Technologies, and Applications, 2006

2005
Subgraph Join: Efficient Processing Subgraph Queries on Graph-Structured XML Document.
Proceedings of the Advances in Web-Age Information Management, 2005

Efficient Processing of XML Path Queries Using the Disk-based F&B Index.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

Efficient Join Algorithms for Integrating XML Data in Grid Environment.
Proceedings of the Grid and Cooperative Computing - GCC 2005, 4th International Conference, Beijing, China, November 30, 2005

Practical Indexing XML Document for Twig Query.
Proceedings of the Advances in Computer Science, 2005

Labeling Scheme and Structural Joins for Graph-Structured XML Data.
Proceedings of the Web Technologies Research and Development - APWeb 2005, 7th Asia-Pacific Web Conference, Shanghai, China, March 29, 2005

OLAP for XML Data.
Proceedings of the Fifth International Conference on Computer and Information Technology (CIT 2005), 2005

2004
Optimized Query Translation Strategy for XML Stored in Relational Database.
Proceedings of the Advances in Web-Age Information Management: 5th International Conference, 2004

The Compression of Massive Offline Relations.
Proceedings of the Advances in Web-Age Information Management: 5th International Conference, 2004

XCpaqs: Compression of XML Document with XPath Query Support.
Proceedings of the International Conference on Information Technology: Coding and Computing (ITCC'04), 2004

2003
Xaggregation: Flexible Aggregation of XML Data.
Proceedings of the Advances in Web-Age Information Management, 2003

Web Information Integration Based on Compressed XML.
Proceedings of the Databases in Networked Information Systems, 2003

An Effective Wrapper Architecture to Heterogeneous Data Source.
Proceedings of the 17th International Conference on Advanced Information Networking and Applications (AINA'03), 2003

2002
INEXP: Information Exchange Protocol for Interoperability.
Proceedings of the Digital Libraries: People, 2002


  Loading...