Da Yan

Orcid: 0000-0002-4653-0408

Affiliations:
  • Indiana University Bloomington, IN, USA
  • University of Alabama at Birmingham, AL, USA (former)
  • Hong Kong University of Science and Technology, Hong Kong, SAR, China (Ph.D., 2014)


According to our database1, Da Yan authored at least 126 papers between 2011 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Efficient Enumeration of Large Maximal k-Plexes.
Proceedings of the Proceedings 28th International Conference on Extending Database Technology, 2025

2024
Fraction-Score: A Generalized Support Measure for Weighted and Maximal Co-Location Pattern Mining.
IEEE Trans. Knowl. Data Eng., April, 2024

Guest Editorial Selected Papers From BIOKDD 2022.
IEEE ACM Trans. Comput. Biol. Bioinform., 2024

Systems for Scalable Graph Analytics and Machine Learning: Trends and Methods.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

23<sup>rd</sup> International Workshop on Data Mining in Bioinformatics (BIOKDD 2024).
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

EvaNet: Elevation-Guided Flood Extent Mapping on Earth Imagery.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

G<sup>2</sup>-AIMD: A Memory-Efficient Subgraph-Centric Framework for Efficient Subgraph Finding on GPUs.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

Faster Depth-First Subgraph Matching on GPUs.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

FSM-Explorer: An Interactive Tool for Frequent Subgraph Pattern Mining From a Big Graph.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

Advancing Certified Robustness of Explanation via Gradient Quantization.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

DirDense: A Tool for Mining Dense Subgraphs from a Big Directed Graph.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Systems for Scalable Graph Analytics and Machine Learning: Trends and Methods.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Spatial-Logic-Aware Weakly Supervised Learning for Flood Mapping on Earth Imagery.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
T-FSM: A Task-Based System for Massively Parallel Frequent Subgraph Pattern Mining from a Big Graph.
Proc. ACM Manag. Data, 2023

Structure prediction and materials design with generative neural networks.
Nat. Comput. Sci., 2023

A Summary of ICDE 2022 Research Session Panels.
IEEE Data Eng. Bull., 2023

Effective Network Analytics: Network Visualization and Graph Data Management.
IEEE Computer Graphics and Applications, 2023

A Hidden Markov Forest Model for Terrain-Aware Flood Inundation Mapping from Earth Imagery.
Proceedings of the 2023 SIAM International Conference on Data Mining, 2023

22nd International Workshop on Data Mining in Bioinformatics (BIOKDD 2023).
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Fast Federated Machine Unlearning with Nonlinear Functional Theory.
Proceedings of the International Conference on Machine Learning, 2023

Dimension-independent Certified Neural Network Watermarks via Mollifier Smoothing.
Proceedings of the International Conference on Machine Learning, 2023

Rethinking Graph Lottery Tickets: Graph Sparsity Matters.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Reinforcement Learning Enhanced Weighted Sampling for Accurate Subgraph Counting on Fully Dynamic Graph Streams.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Accelerating k-Core Decomposition by a GPU.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

2022
PrefixFPM: a parallel framework for general-purpose mining of frequent and closed patterns.
VLDB J., 2022

G-thinker: a general distributed framework for finding qualified subgraphs in a big graph with load balancing.
VLDB J., 2022

Parallel mining of large maximal quasi-cliques.
VLDB J., 2022

Mining Order-preserving Submatrices under Data Uncertainty: A Possible-world Approach and Efficient Approximation Methods.
ACM Trans. Database Syst., 2022

Unsupervised Adversarial Network Alignment with Reinforcement Learning.
ACM Trans. Knowl. Discov. Data, 2022

Earth Imagery Segmentation on Terrain Surface with Limited Training Labels: A Semi-supervised Approach based on Physics-Guided Graph Co-Training.
ACM Trans. Intell. Syst. Technol., 2022

Guest Editorial for Selected Papers From BIOKDD 2020.
IEEE ACM Trans. Comput. Biol. Bioinform., 2022

Guest Editorial for Selected Papers From BIOKDD 2021.
IEEE ACM Trans. Comput. Biol. Bioinform., 2022

Bringing Graph Databases and Network Visualization Together (Dagstuhl Seminar 22031).
Dagstuhl Reports, 2022

Efficient Algorithms for Maximal k-Biplex Enumeration.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Quantifying and Reducing Registration Uncertainty of Spatial Vector Labels on Earth Imagery.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

21th International Workshop on Data Mining in Bioinformatics (BIOKDD 2022).
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Center-Based iPSC Colony Counting with Multi-Task Learning.
Proceedings of the IEEE International Conference on Data Mining, 2022

MentalNet: Heterogeneous Graph Representation for Early Depression Detection.
Proceedings of the IEEE International Conference on Data Mining, 2022

Federated Fingerprint Learning with Heterogeneous Architectures.
Proceedings of the IEEE International Conference on Data Mining, 2022

Distributed Task-Based Training of Tree Models.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Time-sensitive POI Recommendation by Tensor Completion with Side Information.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Maximal Directed Quasi -Clique Mining.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Realistic urban traffic simulation with ride-hailing services: a revisit to network kernel density estimation (systems paper).
Proceedings of the 30th International Conference on Advances in Geographic Information Systems, 2022

An elevation-guided annotation tool for flood extent mapping on earth imagery (demo paper).
Proceedings of the 30th International Conference on Advances in Geographic Information Systems, 2022

2021
Guest Editorial for Selected Papers From BIOKDD 2019.
IEEE ACM Trans. Comput. Biol. Bioinform., 2021

Scalable De Novo Genome Assembly Using a Pregel-Like Graph-Parallel System.
IEEE ACM Trans. Comput. Biol. Bioinform., 2021

Editorial: AI-Enabled Data Science for COVID-19.
Frontiers Big Data, 2021

Flood Inundation Mapping with Limited Observations Based on Physics-Aware Topography Constraint.
Frontiers Big Data, 2021

The future is big graphs: a community view on graph processing systems.
Commun. ACM, 2021

Deep Neural Network for 3D Surface Segmentation based on Contour Tree Hierarchy.
Proceedings of the 2021 SIAM International Conference on Data Mining, 2021

TrajNet: A Trajectory-Based Deep Learning Model for Traffic Prediction.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

20th International Workshop on Data Mining in Bioinformatics (BIOKDD 2021).
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Weakly Supervised Spatial Deep Learning based on Imperfect Vector Labels with Registration Errors.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Forecasting Stock Prices Using Stock Correlation Graph: A Graph Convolutional Network Approach.
Proceedings of the International Joint Conference on Neural Networks, 2021

Predicting Autism Spectrum Disorder from Brain Imaging Data by Graph Convolutional Network.
Proceedings of the International Joint Conference on Neural Networks, 2021

Expressive 1-Lipschitz Neural Networks for Robust Multiple Graph Learning against Adversarial Attacks.
Proceedings of the 38th International Conference on Machine Learning, 2021

Trajectory WaveNet: A Trajectory-Based Model for Traffic Forecasting.
Proceedings of the IEEE International Conference on Data Mining, 2021

EDGE: Entity-Diffusion Gaussian Ensemble for Interpretable Tweet Geolocation Prediction.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Mapping Road Safety Barriers Across Street View Image Sequences: A Hybrid Object Detection and Recurrent Model.
Proceedings of the GeoAI@SIGSPATIAL 2021: Proceedings of the 4th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery, 2021

Adversarial Attack against Cross-lingual Knowledge Graph Alignment.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

MentalSpot: Effective Early Screening for Depression Based on Social Contagion.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Drone-Based Tower Survey by Multi-Task Learning.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

Realistic Transport Simulation for Studying the Impacts of Shared Micromobility Services.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

Traffic Study of Shared Micromobility Services by Transportation Simulation.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

Node-Polysemy Aware Recommendation by Matrix Completion with Side Information.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

Improving Brain Dysfunction Prediction by GAN: A Functional-Connectivity Generator Approach.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

2020
Guest Editorial for Selected Papers from BIOKDD 2018 and DMBIH 2018.
IEEE ACM Trans. Comput. Biol. Bioinform., 2020

Scalable Mining of Maximal Quasi-Cliques: An Algorithm-System Codesign Approach.
Proc. VLDB Endow., 2020

Spatial Classification With Limited Observations Based On Physics-Aware Structural Constraint.
CoRR, 2020

Scalable Mining of Maximal Quasi-Cliques: An Algorithm-System Codesign Approach.
CoRR, 2020

Proceedings of the 2019 MidSouth Computational Biology and Bioinformatics Society (MCBIOS) Conference.
BMC Bioinform., 2020

Parallel Mining of Frequent Subtree Patterns.
Proceedings of the Software Foundations for Data Interoperability and Large Scale Graph Data Analytics, 2020

Lane Extraction and Quality Evaluation: A Hough Transform Based Approach.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

PrefixFPM: A Parallel Framework for General-Purpose Frequent Pattern Mining.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

G-thinker: A Distributed Framework for Mining Subgraphs in a Big Graph.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Predicting Economic Growth by Region Embedding: A Multigraph Convolutional Network Approach.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Price Forecast with High-Frequency Finance Data: An Autoregressive Recurrent Neural Network Model with Technical Indicators.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Unsupervised Multiple Network Alignment with Multinominal GAN and Variational Inference.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

BELT: A Pipeline for Stock Price Prediction Using News.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

Accurate Tensor Decomposition with Simultaneous Rank Approximation for Surveillance Videos.
Proceedings of the 54th Asilomar Conference on Signals, Systems, and Computers, 2020

Spatial Classification with Limited Observations Based on Physics-Aware Structural Constraint.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Parallel Graph Processing.
Proceedings of the Encyclopedia of Big Data Technologies., 2019

T-thinker: a task-centric distributed framework for compute-intensive divide-and-conquer algorithms.
Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2019

Lightweight Fault Tolerance in Pregel-Like Systems.
Proceedings of the 48th International Conference on Parallel Processing, 2019

Mining Order-Preserving Submatrices Under Data Uncertainty: A Possible-World Approach.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Fraction-Score: A New Support Measure for Co-location Pattern Mining.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Parallel Clique-Like Subgraph Counting and Listing.
Proceedings of the Conceptual Modeling - 38th International Conference, 2019

Realistic Transport Simulation with Open Data.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

Realistic Transport Simulation: Tackling the Small Data Challenge with Open Data.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

EasyRain: A User-Friendly Platform for Comparing Precipitation Nowcasting Models.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

2018
GraphD: Distributed Vertex-Centric Graph Processing Beyond the Memory Limit.
IEEE Trans. Parallel Distributed Syst., 2018

Fuzzy Double Trace Norm Minimization for Recommendation Systems.
IEEE Trans. Fuzzy Syst., 2018

Challenges Towards Deploying Data Intensive Scientific Applications on Extreme Heterogeneity Supercomputers.
CoRR, 2018

Scalable De Novo Genome Assembly Using Pregel.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

G-Miner: an efficient task-oriented graph mining system.
Proceedings of the Thirteenth EuroSys Conference, 2018

2017
Systems for Big Graph Analytics
Springer Briefs in Computer Science, Springer, ISBN: 978-3-319-58216-0, 2017

Big Graph Analytics Platforms.
Found. Trends Databases, 2017

G-thinker: Big Graph Mining Made Easier and Faster.
CoRR, 2017

Efficient Processing of Growing Temporal Graphs.
Proceedings of the Database Systems for Advanced Applications, 2017

Architectural implications on the performance and cost of graph analytics systems.
Proceedings of the 2017 Symposium on Cloud Computing, SoCC 2017, Santa Clara, CA, USA, 2017

2016
A General-Purpose Query-Centric Framework for Querying Big Graphs.
Proc. VLDB Endow., 2016

Efficient Processing of Very Large Graphs in a Small Cluster.
CoRR, 2016

Lightweight Fault Tolerance in Large-Scale Distributed Graph Processing.
CoRR, 2016

Quegel: A General-Purpose Query-Centric Framework for Querying Big Graphs.
CoRR, 2016

Quegel: A General-Purpose System for Querying Big Graphs.
Proceedings of the 2016 International Conference on Management of Data, 2016

Big Graph Analytics Systems.
Proceedings of the 2016 International Conference on Management of Data, 2016

Diversified Temporal Subgraph Pattern Mining.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

SGraph: A Distributed Streaming System for Processing Big Graphs.
Proceedings of the Big Data Computing and Communications - Second International Conference, 2016

2015
Probabilistic Convex Hull Queries over Uncertain Data.
IEEE Trans. Knowl. Data Eng., 2015

Efficient processing of optimal meeting point queries in Euclidean space and road networks.
Knowl. Inf. Syst., 2015

Efficient location-based search of trajectories with location importance.
Knowl. Inf. Syst., 2015

Effective Techniques for Message Reduction and Load Balancing in Distributed Graph Computation.
Proceedings of the 24th International Conference on World Wide Web, 2015

Core decomposition in large temporal graphs.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

2014
Mining Probabilistically Frequent Sequential Patterns in Large Uncertain Databases.
IEEE Trans. Knowl. Data Eng., 2014

Pregel Algorithms for Graph Connectivity Problems with Performance Guarantees.
Proc. VLDB Endow., 2014

Blogel: A Block-Centric Framework for Distributed Computation on Real-World Graphs.
Proc. VLDB Endow., 2014

Large-Scale Distributed Graph Computing Systems: An Experimental Evaluation.
Proc. VLDB Endow., 2014

2013
A transfer learning based framework of crowd-selection on twitter.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Finding distance-preserving subgraphs in large road networks.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

2012
A probabilistic convex hull query tool.
Proceedings of the 15th International Conference on Extending Database Technology, 2012

Mining probabilistically frequent sequential patterns in uncertain databases.
Proceedings of the 15th International Conference on Extending Database Technology, 2012

Monochromatic and bichromatic reverse nearest neighbor queries on land surfaces.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Leveraging read rates of passive RFID tags for real-time indoor location tracking.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Efficient Algorithms for Finding Optimal Meeting Point on Road Networks.
Proc. VLDB Endow., 2011

Robust Ranking of Uncertain Data.
Proceedings of the Database Systems for Advanced Applications, 2011

Efficient methods for finding influential locations with adaptive grids.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011


  Loading...