Sebastian Michel

Orcid: 0000-0002-2238-0185

Affiliations:
  • University of Kaiserslautern, Germany


According to our database1, Sebastian Michel authored at least 119 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Partition, Don't Sort! Compression Boosters for Cloud Data Ingestion Pipelines.
Proc. VLDB Endow., July, 2024

Grid-AR: A Grid-based Booster for Learned Cardinality Estimation and Range Joins.
CoRR, 2024

Learning over Sets for Databases.
Proceedings of the Proceedings 27th International Conference on Extending Database Technology, 2024

2023
To UDFs and Beyond: Demonstration of a Fully Decomposed Data Processor for General Data Wrangling Tasks.
Proc. VLDB Endow., 2023

Schema-based Column Reordering for Dremel-encoded Data.
Proceedings of the International Workshop on Big Data in Emergent Distributed Environments, 2023

Tuple Bubbles: Learned Tuple Representations for Tunable Approximate Query Processing.
Proceedings of the Sixth International Workshop on Exploiting Artificial Intelligence Techniques for Data Management, 2023

2022
Approximate Query Processing via Tuple Bubbles.
CoRR, 2022

Compressing (Multidimensional) Learned Bloom Filters.
CoRR, 2022

BETZE: Benchmarking Data Exploration Tools with (Almost) Zero Effort.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

LMKG: Learned Models for Cardinality Estimation in Knowledge Graphs.
Proceedings of the 25th International Conference on Extending Database Technology, 2022

Ameliorating data compression and query performance through cracked Parquet.
Proceedings of the BiDEDE '22: Proceedings of The International Workshop on Big Data in Emergent Distributed Environments, 2022

Summarizing Edge-Device Data via Core Items.
Proceedings of the Advances in Databases and Information Systems, 2022

2021
Utilizing Delta Trees for Efficient, Iterative Exploration and Transformation of Semi-Structured Contents.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Optimizing Multiple Multi-Way Stream Joins.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Natural Joins for Schema-free Data.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

2020
JODA: A Vertically Scalable, Lightweight JSON Processor for Big Data Transformations.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Scaling Out Schema-free Stream Joins.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Partially Materializable Delta Trees for Efficient Data Wrangling of Semi-Structured Contents.
Proceedings of the 23rd International Conference on Extending Database Technology, 2020

Distributed Similarity Joins over Top-K Rankings.
Proceedings of the 23rd International Conference on Extending Database Technology, 2020

The ML-Index: A Multidimensional, Learned Index for Point, Range, and Nearest-Neighbor Queries.
Proceedings of the 23rd International Conference on Extending Database Technology, 2020

2019
Concept and Computation of Ranking-based Dominance.
Inf. Syst., 2019

CLASH: A High-Level Abstraction for Optimized, Multi-Way Stream Joins over Apache Storm.
Proceedings of the 2019 International Conference on Management of Data, 2019

Similarity Search and Data Exploration over Entity Rankings (Ähnlichkeitssuche und Datenexploration über Entitäts-Ranglisten).
Proceedings of the 31st GI-Workshop Grundlagen von Datenbanken, Saarburg, 2019

Scaling Out Multi-Way Stream Joins using Optimized, Iterative Probing.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

2018
Editorial.
Datenbank-Spektrum, 2018

Processing Class-Constraint K-NN Queries with MISP.
Proceedings of the 21st International Workshop on the Web and Databases, 2018

Learning interesting attributes for automated data categorization.
Proceedings of the 30th International Conference on Scientific and Statistical Database Management, 2018

Exploring Pros and Cons of Ranked Entities with COMPETE.
Proceedings of the 5th International Workshop on Exploratory Search in Databases and the Web, 2018

Class-constraint similarity queries.
Proceedings of the 33rd Annual ACM Symposium on Applied Computing, 2018

2017
Learning Interesting Categorical Attributes for Refined Data Exploration.
CoRR, 2017

LSH-Based Probabilistic Pruning of Inverted Indices for Sets and Ranked Lists.
Proceedings of the 20th International Workshop on the Web and Databases, 2017

Scaling Out Continuous Multi-Way Theta-Joins.
Proceedings of the 4th ACM SIGMOD Workshop on Algorithms and Systems for MapReduce and Beyond, 2017

Playing Information LEGO at Large Scale.
Proceedings of the 8th Biennial Conference on Innovative Data Systems Research, 2017

Reverse Engineering Top-k Join Queries.
Proceedings of the Datenbanksysteme für Business, 2017

2016
Exploring Databases via Reverse Engineering Ranking Queries with PALEO.
Proc. VLDB Endow., 2016

Mining Entity Rankings.
Datenbank-Spektrum, 2016

Die Arbeitsgruppen für Datenbanken und Informationssysteme an der TU Kaiserslautern.
Datenbank-Spektrum, 2016

Efficient Similarity Search across Top-k Lists under the Kendall's Tau Distance.
Proceedings of the 28th International Conference on Scientific and Statistical Database Management, 2016

Computing similar entity rankings via reverse engineering of top-k database queries.
Proceedings of the 32nd IEEE International Conference on Data Engineering Workshops, 2016

Playing LEGO with JSON: Probabilistic joins over attribute-value fragments.
Proceedings of the 32nd IEEE International Conference on Data Engineering Workshops, 2016

Reverse Engineering Top-k Database Queries with PALEO.
Proceedings of the 19th International Conference on Extending Database Technology, 2016

A Data Mining Approach to Choosing Categorical Attributes for Ranked Lists.
Proceedings of the 19th International Conference on Extending Database Technology, 2016

Quantifying Likelihood of Change through Update Propagation across Top-k Rankings.
Proceedings of the 19th International Conference on Extending Database Technology, 2016

2015
ligDB - Online Query Processing Without (almost) any Storage.
Proceedings of the 18th International Conference on Extending Database Technology, 2015

The Sweet Spot between Inverted Indices and Metric-Space Indexing for Top-K-List Similarity Search.
Proceedings of the 18th International Conference on Extending Database Technology, 2015

2014
An LSH Index for Computing Kendall's Tau over Top-k Lists.
CoRR, 2014

Tracking set correlations at large scale.
Proceedings of the International Conference on Management of Data, 2014

2013
Benchmarking Soundtrack Recommendation Systems with SRBench.
CoRR, 2013

A thin monitoring layer for top-k aggregation queries over a database.
Proceedings of the 7th International Workshop on Ranking in Databases (co-located with VLDB 2013), 2013

Automated educated guessing.
Proceedings of the Workshops Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

Scalable, continuous tracking of tag co-occurrences between short sets using (almost) disjoint tag partitions.
Proceedings of the 3rd ACM SIGMOD Workshop on Databases and Social Networks, 2013

Interesting event detection through hall of fame rankings.
Proceedings of the 3rd ACM SIGMOD Workshop on Databases and Social Networks, 2013

SRbench-a benchmark for soundtrack recommendation systems.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

The essence of knowledge (bases) through entity rankings.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

2012
Efficient monitoring of personalized hot news over Web 2.0 streams.
Comput. Sci. Res. Dev., 2012

Sequoia - An Approach to Declarative Information Retrieval.
Datenbank-Spektrum, 2012

Building and Maintaining Halls of Fame over a Database
CoRR, 2012

Enhancing Locality Sensitive Hashing with Peek Probing and Nearest Neighbor Links.
Proceedings of the 15th International Workshop on the Web and Databases 2012, 2012

Top-k interesting phrase mining in ad-hoc collections using sequence pattern indexing.
Proceedings of the 15th International Conference on Extending Database Technology, 2012

See what's enBlogue: real-time emergent topic identification in social media.
Proceedings of the 15th International Conference on Extending Database Technology, 2012

Being picky: processing top-k queries with set-defined selections.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
EnBlogue: emergent topic detection in web 2.0 streams.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

Picasso - to sing, you must close your eyes and draw.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

PICASSO: automated soundtrack suggestion for multi-modal data.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Tracking Hot-k Items over Web 2.0 Streams.
Proceedings of the Datenbanksysteme für Business, 2011

Efficient Interest Group Discovery in Social Networks using an Integrated Structure/Quality Index.
Proceedings of the Datenbanksysteme für Business, 2011

2010
RankReduce - Processing K-Nearest Neighbor Queries on Top of MapReduce.
Proceedings of the 8th Workshop on Large-Scale Distributed Systems for Information Retrieval, 2010

Continuous query evaluation over distributed sensor networks.
Proceedings of the 26th International Conference on Data Engineering, 2010

The gist of everything new: personalized top-k processing over web 2.0 streams.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

2009
Workshop on large-scale distributed systems for information retrieval.
SIGIR Forum, 2009

Distributed top-<i>k</i> aggregation queries at large.
Distributed Parallel Databases, 2009

Rethinking email message and people search.
Proceedings of the 18th International Conference on World Wide Web, 2009

Knowing When to Slide - Efficient Scheduling for Sliding Window Processing.
Proceedings of the MDM 2009, 2009


Neighborhood-Based Tag Prediction.
Proceedings of the Semantic Web: Research and Applications, 2009

Distributed similarity search in high dimensions using locality sensitive hashing.
Proceedings of the EDBT 2009, 2009

Processing publish/subscribe queries over distributed data streams.
Proceedings of the Third ACM International Conference on Distributed Event-Based Systems, 2009

Evaluating top-k queries over incomplete data streams.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Top-k Aggegation Queries in Large-Scale Distributed Systems.
Proceedings of the Datenbanksysteme in Business, 2009

2008
The Juxtaposed approximate PageRank method for robust PageRank approximation in a peer-to-peer web search network.
VLDB J., 2008

Making SENSE: socially enhanced search and exploration.
Proc. VLDB Endow., 2008

Efficiently Handling Dynamics in Distributed Link Based Authority Analysis.
Proceedings of the Web Information Systems Engineering, 2008

Optimizing Distributed Top-k Queries.
Proceedings of the Web Information Systems Engineering, 2008

From Web 1.0 to Web 2.0 and back -: how did your grandma use to tag?
Proceedings of the 10th ACM International Workshop on Web Information and Data Management (WIDM 2008), 2008

LSH At Large - Distributed KNN Search in High Dimensions.
Proceedings of the 11th International Workshop on the Web and Databases, 2008

Efficient top-k querying over social-tagging networks.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

Social recommendations at work.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

To tag or not to tag -: harvesting adjacent metadata in large-scale tagging systems.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

Exploiting social relations for query expansion and result ranking.
Proceedings of the 24th International Conference on Data Engineering Workshops, 2008

Sensor Metadata Management and Its Application in Collaborative Environmental Research.
Proceedings of the Fourth International Conference on e-Science, 2008

Smooth Interpolating Histograms with Error Guarantees.
Proceedings of the Sharing Data, 2008

2007
Top-k aggregation queries in large-scale distributed systems.
PhD thesis, 2007

p2pDating: Real life inspired semantic overlay networks for Web search.
Inf. Process. Manag., 2007

Algebraic query optimization for distributed top-k queries.
Inform. Forsch. Entwickl., 2007

Peer-to-Peer Information Search: Semantic, Social, or Spiritual?
IEEE Data Eng. Bull., 2007

Search for the Best but Expect the Worst - Distributed Top-k Queries over Decreasing Aggregated Scores.
Proceedings of the Tenth International Workshop on the Web and Databases, 2007

P2P Authority Analysis for Social Communities.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Peer-to-Peer Information Search.
Proceedings of the XXII Simpósio Brasileiro de Banco de Dados, 2007

P2P Web Search: Make It Light, Make It Fly (Demo).
Proceedings of the Third Biennial Conference on Innovative Data Systems Research, 2007

2006
Global Document Frequency Estimation in Peer-to-Peer Web Search.
Proceedings of the Ninth International Workshop on the Web and Databases, 2006

Efficient and Decentralized PageRank Approximation in a Peer-to-Peer Web Search Network.
Proceedings of the 32nd International Conference on Very Large Data Bases, 2006

P2P Content Search: Give the Web Back to the People.
Proceedings of the 5th International workshop on Peer-To-Peer Systems, 2006

P2P Directories for Distributed Web Search: From Each According to His Ability, to Each According to His Needs.
Proceedings of the 22nd International Conference on Data Engineering Workshops, 2006

A Reproducible Benchmark for P2P Retrieval.
Proceedings of the First International Workshop on Performance and Evaluation of Data Management Systems, 2006

IQN Routing: Integrating Quality and Novelty in P2P Querying and Ranking.
Proceedings of the Advances in Database Technology, 2006

A Comparative Study of Pub/Sub Methods in Structured P2P Networks.
Proceedings of the Databases, 2006

Size doesn't always matter: exploiting pageRank for query routing in distributed IR.
Proceedings of the international workshop on Information retrieval in peer-to-peer networks, 2006

Discovering and exploiting keyword and attribute-value co-occurrences to improve P2P routing indices.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

2005
Das MINERVA-Projekt: Datenbankselektion für Peer-to-Peer-Websuche.
Inform. Forsch. Entwickl., 2005

KLEE: A Framework for Distributed Top-k Query Algorithms.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

MINERVA: Collaborative P2P Search.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

Improving collection selection with overlap awareness in P2P search engines.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

On the Usage of Global Document Occurrences in Peer-to-Peer Information Systems.
Proceedings of the On the Move to Meaningful Internet Systems 2005: CoopIS, 2005

MINERVA<sub>infinity</sub>: A Scalable Efficient Peer-to-Peer Search Engine.
Proceedings of the Middleware 2005, ACM/IFIP/USENIX, 6th International Middleware Conference, Grenoble, France, November 28, 2005

Database Selection and Result Merging in P2P Web Search.
Proceedings of the Databases, 2005

The MINERVA Project: Database Selection in the Context of P2P Search.
Proceedings of the Datenbanksysteme in Business, 2005

2004
Bookmark-driven Query Routing in Peer-to-Peer Web Search.
Proceedings of the SIGIR Workshop on Peer-to-Peer Information Retrieval, 2004

The MINERVAMinerva is the Roman goddess of science, wisdom, and learning. Project: Towards Collaborative Search in Digital Libraries Using Peer-to-Peer Technology.
Proceedings of the Peer-to-Peer, 2004

Towards Collaborative Search in Digital Libraries Using Peer-to-Peer Technology.
Proceedings of the Digital Library Architectures: Peer-to-Peer, 2004


  Loading...