Xu Chu

Orcid: 0000-0002-0520-7196

According to our database1, Xu Chu authored at least 93 papers between 2013 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
LoRA Dropout as a Sparsity Regularizer for Overfitting Control.
CoRR, 2024

Parameter Efficient Quasi-Orthogonal Fine-Tuning via Givens Rotation.
CoRR, 2024

Exploring the Potential of Large Language Models in Graph Generation.
CoRR, 2024

Learnable Prompt as Pseudo-Imputation: Reassessing the Necessity of Traditional EHR Data Imputation in Downstream Clinical Prediction.
CoRR, 2024

Infinite-Horizon Graph Filters: Leveraging Power Series to Enhance Sparse Information Aggregation.
CoRR, 2024

Imputation with Inter-Series Information from Prototypes for Irregular Sampled Time Series.
CoRR, 2024

2023
Transcriptional correlates of frequency-dependent brain functional activity associated with symptom severity in degenerative cervical myelopathy.
NeuroImage, December, 2023

Patient Health Representation Learning via Correlational Sparse Prior of Medical Features.
IEEE Trans. Knowl. Data Eng., November, 2023

Adaptive federated few-shot feature learning with prototype rectification.
Eng. Appl. Artif. Intell., November, 2023

A multimodal dual-fusion entity extraction model for large and complex devices.
Comput. Commun., October, 2023

Spatial-Attention and Demographic-Augmented Generative Adversarial Imputation Network for Population Health Data Reconstruction.
IEEE Trans. Big Data, August, 2023

Experiences and Lessons Learned from the SIGMOD Entity Resolution Programming Contests.
SIGMOD Rec., June, 2023

An Adaptive Fusion Risk-Zone Detection Network and its Application.
Int. J. Pattern Recognit. Artif. Intell., May, 2023

CTCD-Net: A Cross-Layer Transmission Network for Tiny Road Crack Detection.
Remote. Sens., April, 2023

Towards Sustainable Compressive Population Health: A GAN-based Year-By-Year Imputation Method.
ACM Trans. Comput. Heal., January, 2023

iFlipper: Label Flipping for Individual Fairness.
Proc. ACM Manag. Data, 2023

Ground Truth Inference for Weakly Supervised Entity Matching.
Proc. ACM Manag. Data, 2023

DiffPrep: Differentiable Data Preprocessing Pipeline Search for Learning over Tabular Data.
Proc. ACM Manag. Data, 2023

Think and Retrieval: A Hypothesis Knowledge Graph Enhanced Medical Large Language Models.
CoRR, 2023

Graph Interpolation via Fast Fused-Gromovization.
CoRR, 2023

A new class of differential quasivariational inequalities with an application to a quasistatic viscoelastic frictional contact problem.
Commun. Nonlinear Sci. Numer. Simul., 2023

SeqCare: Sequential Training with External Medical Knowledge Graph for Diagnosis Prediction in Healthcare Data.
Proceedings of the ACM Web Conference 2023, 2023

Fused Gromov-Wasserstein Graph Mixup for Graph-level Classifications.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Discovering Process-Based Drivers for Case-Level Outcome Explanation.
Proceedings of the Process Mining Workshops, 2023

Wasserstein Barycenter Matching for Graph Size Generalization of Message Passing Neural Networks.
Proceedings of the International Conference on Machine Learning, 2023

Learning Hyper Label Model for Programmatic Weak Supervision.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Improving Generalization of Meta-Learning with Inverted Regularization at Inner-Level.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Enhancing Neural Topic Model with Multi-Level Supervisions from Seed Words.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Single-Phase to Ground Fault Line Identification for Medium Voltage Islanded Microgrids With Neutral Ineffectively Grounded Modes.
IEEE Trans. Smart Grid, 2022

Electrical Compensation for Magnetization Distortion of Magnetic Fluxgate Current Sensor.
IEEE Trans. Instrum. Meas., 2022

Defect Detection for a Vertical Shaft Surface Based on Multimodal Sensors.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2022

A Cluster-then-label Approach for Few-shot Learning with Application to Automatic Image Data Labeling.
ACM J. Data Inf. Qual., 2022

M<sup>3</sup>Care: Learning with Missing Modalities in Multimodal Healthcare Data.
CoRR, 2022

Learned Label Aggregation for Weak Supervision.
CoRR, 2022

MedFACT: Modeling Medical Feature Correlations in Patient Health Representation Learning via Feature Clustering.
CoRR, 2022

Cost-sensitive matrixized classification learning with information entropy.
Appl. Soft Comput., 2022

BiTMulV: Bidirectional-Decoding Based Transformer with Multi-view Visual Representation.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

M3Care: Learning with Missing Modalities in Multimodal Healthcare Data.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Domain Generalization through the Lens of Angular Invariance.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

DNA: Domain Generalization with Diversified Neural Averaging.
Proceedings of the International Conference on Machine Learning, 2022

Enhancing Robust Text Classification via Category Description.
Proceedings of the IEEE International Conference on Data Mining, 2022

A Model-Agnostic Approach for Learning with Noisy Labels of Arbitrary Distributions.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Research on Multi-parameter Tracking Effect Evaluation system Based on Photoelectric theodolite.
Proceedings of the 5th International Conference on Data Science and Information Technology, 2022

2021
Waveform Difference Feature-Based Protection Scheme for Islanded Microgrids.
IEEE Trans. Smart Grid, 2021

A Joint Optimization Framework of the Embedding Model and Classifier for Meta-Learning.
Sci. Program., 2021

Demonstration of Panda: A Weakly Supervised Entity Matching System.
Proc. VLDB Endow., 2021

Learning to be a Statistician: Learned Estimator for Number of Distinct Values.
Proc. VLDB Endow., 2021

Joint Accuracy and Resource Allocation for Green Federated Learning Networks.
Proceedings of the Smart Computing and Communication - 6th International Conference, 2021

FairRover: explorative model building for fair and responsible machine learning.
Proceedings of the Fifth Workshop on Data Management for End-To-End Machine Learning, 2021

OmniFair: A Declarative System for Model-Agnostic Group Fairness in Machine Learning.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Auto-FuzzyJoin: Auto-Program Fuzzy Similarity Joins Without Labeled Examples.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

CleanML: A Study for Evaluating the Impact of Data Cleaning on ML Classification Tasks.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Establishment of a unified data model based on OPC UA and analysis of the efficiency of communication protocol.
Proceedings of the EITCE 2021: 5th International Conference on Electronic Information Technology and Computer Engineering, Xiamen, China, October 22, 2021

Application of Rough Sets Conditional Information Entropy in Climatic Evaluation of Flue-Cured Tobacco Growing Areas.
Proceedings of the EBIMCS 2021: 4th International Conference on E-Business, Information Management and Computer Science, Hong Kong, SAR, China, December 29, 2021

2020
Nearest Neighbor Classifiers over Incomplete Information: From Certain Answers to Certain Predictions.
Proc. VLDB Endow., 2020

ZeroER: Entity Resolution using Zero Labeled Examples.
Proceedings of the 2020 International Conference on Management of Data, 2020

GOGGLES: Automatic Image Labeling with Affinity Coding.
Proceedings of the 2020 International Conference on Management of Data, 2020

Distance Metric Learning with Joint Representation Diversification.
Proceedings of the 37th International Conference on Machine Learning, 2020

Neighbor Profile: Bagging Nearest Neighbors for Unsupervised Time Series Mining.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Study on Classification of Flue-cured Tobacco Planting area Based on Different Clustering Analysis Methods.
Proceedings of the EBIMCS 2020: 3rd International Conference on E-Business, 2020

Set-Sequence-Graph: A Multi-View Approach Towards Exploiting Reviews for Recommendation.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

2019
Data Cleaning.
Proceedings of the Encyclopedia of Big Data Technologies., 2019

Using the Area Ratio to Differentiate Between Benign and Malignant Breast Lesions: Ultrasound Strain Elastography versus Contrast Enhanced Ultrasound.
J. Medical Imaging Health Informatics, 2019

AutoER: Automated Entity Resolution using Generative Modelling.
CoRR, 2019

CleanML: A Benchmark for Joint Data Cleaning and Machine Learning [Experiments and Analysis].
CoRR, 2019

GOGGLES: Automatic Training Data Generation with Affinity Coding.
CoRR, 2019

PIClean: A Probabilistic and Interactive Data Cleaning System.
Proceedings of the 2019 International Conference on Management of Data, 2019

MLRDA: A Multi-Task Semi-Supervised Learning Framework for Drug-Drug Interaction Prediction.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

STAR: Spatio-Temporal Taxonomy-Aware Tag Recommendation for Citizen Complaints.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Data Cleaning
ACM Books 28, ACM, ISBN: 978-1-4503-7152-0, 2019

2018
Transform-Data-by-Example (TDE): An Extensible Search Engine for Data Transformations.
Proc. VLDB Endow., 2018

Multi-Label Robust Factorization Autoencoder and its Application in Predicting Drug-Drug Interactions.
CoRR, 2018

Transform-Data-by-Example (TDE): Extensible Data Transformation in Excel.
Proceedings of the 2018 International Conference on Management of Data, 2018

CAPED: Context-Aware Powerlet-Based Energy Disaggregation.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2018

Characteristic Subspace Learning for Time Series Classification.
Proceedings of the IEEE International Conference on Data Mining, 2018

Mining Rules from Real-Valued Time Series: A Relative Information-Gain-Based Approach.
Proceedings of the 2018 IEEE 42nd Annual Computer Software and Applications Conference, 2018

Pseudonym Inference in Cooperative Vehicular Traffic Scenarios.
Proceedings of the 2018 IEEE Conference on Communications and Network Security, 2018

2017
Scalable and Holistic Qualitative Data Cleaning.
PhD thesis, 2017

HoloClean: Holistic Data Repairs with Probabilistic Inference.
Proc. VLDB Endow., 2017

Motif-based Rule Discovery for Predicting Real-valued Time Series.
CoRR, 2017

2016
Distributed Data Deduplication.
Proc. VLDB Endow., 2016

Qualitative Data Cleaning.
Proc. VLDB Endow., 2016

Detecting Data Errors: Where are we and what needs to be done?
Proc. VLDB Endow., 2016

CLAMS: Bringing Quality to Data Lakes.
Proceedings of the 2016 International Conference on Management of Data, 2016

Data Cleaning: Overview and Emerging Challenges.
Proceedings of the 2016 International Conference on Management of Data, 2016

2015
SEMA-JOIN: Joining Semantically-Related Tables Using Big Table Corpora.
Proc. VLDB Endow., 2015

KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing.
Proc. VLDB Endow., 2015

Trends in Cleaning Relational Data: Consistency and Deduplication.
Found. Trends Databases, 2015

KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

TEGRA: Table Extraction by Global Record Alignment.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

2014
RuleMiner: Data quality rules discovery.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

2013
Discovering Denial Constraints.
Proc. VLDB Endow., 2013

Holistic data cleaning: Putting violations into context.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013


  Loading...