Yaoshu Wang

ORCID: 0000-0002-5760-5145

According to our database, Yaoshu Wang authored at least 25 papers between 2013 and 2024.

Bibliography

2024
Graph Association Analyses for Early Drug Discovery.
Proc. VLDB Endow., August, 2024

Rock: Cleaning Data with both ML and Logic Rules.
Proc. VLDB Endow., August, 2024

Enriching Relations with Additional Attributes for ER.
Proc. VLDB Endow., July, 2024

Rock: Cleaning Data by Embedding ML in Logic Rules.
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024

Efficient Mixture of Experts based on Large Language Models for Low-Resource Data Preprocessing.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

BClean: A Bayesian Data Cleaning System.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

2023
Splitting Tuples of Mismatched Entities.
Proc. ACM Manag. Data, December, 2023

Learning and Deducing Temporal Orders.
Proc. VLDB Endow., 2023

Discovering Top-k Rules using Subjective and Objective Criteria.
Proc. ACM Manag. Data, 2023

Adaptive Label Cleaning for Error Detection on Tabular Data.
Proceedings of the Web and Big Data - 7th International Joint Conference, 2023

TabMentor: Detect Errors on Tabular Data with Noisy Labels.
Proceedings of the Advanced Data Mining and Applications - 19th International Conference, 2023

Domain-Adapted Dependency Parsing for Cross-Domain Named Entity Recognition.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Parallel Rule Discovery from Large Datasets by Sampling.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

2021
Generalizing the Pigeonhole Principle for Similarity Search in Hamming Space.
IEEE Trans. Knowl. Data Eng., 2021

Consistent and Flexible Selectivity Estimation for High-Dimensional Data.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

High-Dimensional Similarity Query Processing for Data Science.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Improving the Efficiency and Effectiveness for BERT-based Entity Resolution.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Consistent and Flexible Selectivity Estimation for High-dimensional Data.
CoRR, 2020

Monotonic Cardinality Estimation of Similarity Selection: A Deep Learning Approach.
Proceedings of the 2020 International Conference on Management of Data, 2020

2018
Approximate Query Processing with Multiple Similarity Metrics.
PhD thesis, 2018

GPH: Similarity Search in Hamming Space.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

2017
Efficient Approximate Entity Matching Using Jaro-Winkler Distance.
Proceedings of the Web Information Systems Engineering - WISE 2017, 2017

2016
Negative Factor: Improving Regular-Expression Matching in Strings.
ACM Trans. Database Syst., 2016

2015
Local Filtering: Improving the Performance of Approximate Queries on String Collections.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

2013
Improving regular-expression matching on strings using negative factors.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

