Yanlei Diao

ORCID: 0009-0000-7793-9671

Affiliations:
  • University of Massachusetts Amherst, USA


According to our database, Yanlei Diao authored at least 78 papers between 2000 and 2024.

Bibliography

2024
Efficient and robust active learning methods for interactive database exploration.
VLDB J., July, 2024

A Spark Optimizer for Adaptive, Fine-Grained Parameter Tuning.
Proc. VLDB Endow., July, 2024

Efficient Version Space Algorithms for Human-in-the-loop Model Development.
ACM Trans. Knowl. Discov. Data, April, 2024

Forecasting Algorithms for Intelligent Resource Scaling: An Experimental Analysis.
Proceedings of the 2024 ACM Symposium on Cloud Computing, 2024

2022
Fine-Grained Modeling and Optimization for Intelligent Resource Management in Big Data Processing.
Proc. VLDB Endow., 2022

2021
A Demonstration of the Exathlon Benchmarking Platform for Explainable Anomaly Detection.
Proc. VLDB Endow., 2021

Exathlon: A Benchmark for Explainable Anomaly Detection over Time Series.
Proc. VLDB Endow., 2021

Neural-based Modeling for Performance Tuning of Spark Data Analytics.
CoRR, 2021

Efficient Exploration of Interesting Aggregates in RDF Graphs.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Spark-based Cloud Data Analytics using Multi-Objective Optimization.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Explainable anomaly detection on high-dimensional time series data.
Proceedings of the 15th ACM International Conference on Distributed and Event-based Systems, 2021

2020
AnomalyBench: An Open Benchmark for Explainable Anomaly Detection.
CoRR, 2020

Boosting Cloud Data Analytics using Multi-Objective Optimization.
CoRR, 2020

2019
UDAO: A Next-Generation Unified Data Analytics Optimizer.
Proc. VLDB Endow., 2019

Spade: A Modular Framework for Analytical Exploration of RDF Graphs.
Proc. VLDB Endow., 2019

A Factorized Version Space Algorithm for "Human-In-the-Loop" Data Exploration.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

2018
XML Publish/Subscribe.
Encyclopedia of Database Systems, Second Edition, 2018

Publish/Subscribe Over Streams.
Encyclopedia of Database Systems, Second Edition, 2018

Optimization for Active Learning-based Interactive Database Exploration.
Proc. VLDB Endow., 2018

EXAD: A System for Explainable Anomaly Detection on Big Data Traces.
Proceedings of the 2018 IEEE International Conference on Data Mining Workshops, 2018

Anomaly Detection and Explanation Discovery on Event Streams.
Proceedings of the International Workshop on Real-Time Business Intelligence and Analytics, 2018

2017
Massively Parallel Processing of Whole Genome Sequence Data: An In-Depth Performance Study.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

Dagger: Digging for Interesting Aggregates in RDF Graphs.
Proceedings of the ISWC 2017 Posters & Demonstrations and Industry Tracks co-located with 16th International Semantic Web Conference (ISWC 2017), Vienna, Austria, October 23rd, 2017

EXstream: Explaining Anomalies in Event Stream Monitoring.
Proceedings of the 20th International Conference on Extending Database Technology, 2017

2016
AIDE: An Active Learning-Based Approach for Interactive Data Exploration.
IEEE Trans. Knowl. Data Eng., 2016

Interactive Data Exploration via Machine Learning Models.
IEEE Data Eng. Bull., 2016

High-Performance XML Message Brokering.
Data Stream Management - Processing High-Speed Data Streams, 2016

2015
Supporting Scalable Analytics with Latency Constraints.
Proc. VLDB Endow., 2015

AIDE: An Automatic User Navigation System for Interactive Data Exploration.
Proc. VLDB Endow., 2015

AIDE: An Automated Sample-based Approach for Interactive Data Exploration.
CoRR, 2015

Supporting Data Uncertainty in Array Databases.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

Explore-By-Example: A New Database Service for Interactive Data Exploration.
Proceedings of the Second International Workshop on Exploratory Search in Databases and the Web, 2015

Building Highly-Optimized, Low-Latency Pipelines for Genomic Data Analysis.
Proceedings of the Seventh Biennial Conference on Innovative Data Systems Research, 2015

2014
On complexity and optimization of expensive queries in complex event processing.
Proceedings of the International Conference on Management of Data, 2014

Explore-by-example: an automatic query steering framework for interactive data exploration.
Proceedings of the International Conference on Management of Data, 2014

Interactive data exploration based on user relevance feedback.
Workshops Proceedings of the 30th International Conference on Data Engineering Workshops, 2014

2013
Supporting User-Defined Functions on Uncertain Data.
Proc. VLDB Endow., 2013

Front Matter.
Proc. VLDB Endow., 2013

SEDGE: Symbolic example data generation for dataflow programs.
Proceedings of the 2013 28th IEEE/ACM International Conference on Automated Software Engineering, 2013

A Science Fiction Talk.
Proceedings of the Sixth Biennial Conference on Innovative Data Systems Research, 2013

Query Steering for Interactive Data Exploration.
Proceedings of the Sixth Biennial Conference on Innovative Data Systems Research, 2013

2012
CLARO: modeling and processing uncertain data streams.
VLDB J., 2012

SCALLA: A Platform for Scalable One-Pass Analytics Using MapReduce.
ACM Trans. Database Syst., 2012

SPIRE: Efficient Data Inference and Compression over RFID Streams.
IEEE Trans. Knowl. Data Eng., 2012

Massive Genomic Data Processing and Deep Analysis.
Proc. VLDB Endow., 2012

2011
Optimizing Probabilistic Query Processing on Continuous Uncertain Data.
Proc. VLDB Endow., 2011

Distributed inference and query processing for RFID tracking and monitoring.
Proc. VLDB Endow., 2011

Quality-biased ranking of web documents.
Proceedings of the Fourth International Conference on Web Search and Web Data Mining, 2011

A platform for scalable one-pass analytics using MapReduce.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

Towards Scalable One-Pass Analytics Using MapReduce.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

2010
Fast Packet Pattern-Matching Algorithms.
Algorithms for Next Generation Networks, 2010

Recognizing Patterns in Streams with Imprecise Timestamps.
Proc. VLDB Endow., 2010

Conditioning and Aggregating Uncertain Data Streams: Going Beyond Expectations.
Proc. VLDB Endow., 2010

PODS: a new model and processing algorithms for uncertain data streams.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Exploiting the Interplay between Memory and Flash Storage in Embedded Sensor Devices.
Proceedings of the 16th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, 2010

2009
XML Publish/Subscribe.
Encyclopedia of Database Systems, 2009

Publish/Subscribe over Streams.
Encyclopedia of Database Systems, 2009

Lazy-Adaptive Tree: An Optimized Index Structure for Flash Devices.
Proc. VLDB Endow., 2009

Probabilistic Inference over RFID Streams in Mobile Environments.
Proceedings of the 25th International Conference on Data Engineering, 2009

Refining Keyword Queries for XML Retrieval by Combining Content and Structure.
Proceedings of the Advances in Information Retrieval, 2009

Capturing Data Uncertainty in High-Volume Stream Processing.
Proceedings of the Fourth Biennial Conference on Innovative Data Systems Research, 2009

2008
Efficient pattern matching over event streams.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

On Supporting Kleene Closure over Event Streams.
Proceedings of the 24th International Conference on Data Engineering, 2008

Efficient Data Interpretation and Compression over RFID Streams.
Proceedings of the 24th International Conference on Data Engineering, 2008

2007
SASE: Complex Event Processing over Streams (Demo).
Proceedings of the Third Biennial Conference on Innovative Data Systems Research, 2007

Rethinking Data Management for Storage-centric Sensor Networks.
Proceedings of the Third Biennial Conference on Innovative Data Systems Research, 2007

2006
SASE: Complex Event Processing over Streams.
CoRR, 2006

High-performance complex event processing over streams.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2006

Fast and memory-efficient regular expression matching for deep packet inspection.
Proceedings of the 2006 ACM/IEEE Symposium on Architecture for Networking and Communications Systems, 2006

2004
Implementing Memoization in a Streaming XQuery Processor.
Proceedings of the Database and XML Technologies, 2004

Towards an Internet-Scale XML Dissemination Service.
(e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

2003
Path sharing and predicate evaluation for high-performance XML filtering.
ACM Trans. Database Syst., 2003

High-Performance XML Filtering: An Overview of YFilter.
IEEE Data Eng. Bull., 2003

Query Processing for High-Volume XML Message Brokering.
Proceedings of 29th International Conference on Very Large Data Bases, 2003

2002
YFilter: Efficient and Scalable Filtering of XML Documents.
Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, February 26, 2002

2000
Toward Learning Based Web Query Processing.
Proceedings of the VLDB 2000, 2000

Fact: A Learning Based Web Query Processing System.
Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, 2000

A Comparative Study of Classification Based Personal E-mail Filtering.
Proceedings of the Knowledge Discovery and Data Mining, 2000

