Shimin Chen

ORCID: 0009-0000-1043-6236

Affiliations:
  • Chinese Academy of Sciences, Institute of Computing Technology, State Key Laboratory of Computer Architecture, Beijing, China
  • University of Chinese Academy of Sciences (UCAS), Beijing, China
  • Intel Labs, Pittsburgh, PA, USA (former)
  • Carnegie Mellon University, School of Computer Science, Pittsburgh, PA, USA (former, PhD 2005)


According to our database, Shimin Chen authored at least 77 papers between 2001 and 2024.

Bibliography

2024
LITS: An Optimized Learned Index for Strings.
Proc. VLDB Endow., July, 2024

Cabin: A Compressed Adaptive Binned Scan Index.
Proc. ACM Manag. Data, February, 2024

Technical Report for ActivityNet Challenge 2022 - Temporal Action Localization.
CoRR, 2024

Technical Report for Soccernet 2023 - Dense Video Captioning.
CoRR, 2024

Technical Report for SoccerNet Challenge 2022 - Replay Grounding Task.
CoRR, 2024

LITS: An Optimized Learned Index for Strings (An Extended Version).
CoRR, 2024

Fewer Tokens and Fewer Videos: Extending Video Understanding Abilities in Large Vision-Language Models.
CoRR, 2024

CPU and GPU Hash Joins on Skewed Data.
Proceedings of the 40th International Conference on Data Engineering, ICDE 2024, 2024

A Segmentation-Based Data Contention Resolution Method For Optical Named Data Networking.
Proceedings of the International Conference on Computing, Networking and Communications, 2024

2023
MOST: Model-Based Compression with Outlier Storage for Time Series Data.
Proc. ACM Manag. Data, December, 2023

Zen+: a robust NUMA-aware OLTP engine optimized for non-volatile main memory.
VLDB J., January, 2023

Pea Hash: A Performant Extendible Adaptive Hashing Index.
Proc. ACM Manag. Data, 2023

2022
Index Checkpoints for Instant Recovery in In-Memory Database Systems.
Proc. VLDB Endow., 2022

Density-optimized Intersection-free Mapping and Matrix Multiplication for Join-Project Operations.
Proc. VLDB Endow., 2022

Introduction to the special issue on self-managing and hardware-optimized database systems 2020.
Distributed Parallel Databases, 2022

Density-optimized Intersection-free Mapping and Matrix Multiplication for Join-Project Operations (extended version).
CoRR, 2022

SEAL: A Large-scale Video Dataset of Multi-grained Spatio-temporally Action Localization.
CoRR, 2022

Faster-TAD: Towards Temporal Action Detection with Proposal Generation and Classification in a Unified Network.
CoRR, 2022


MV-TAL: Mulit-view Temporal Action Localization in Naturalistic Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
Updatable Learned Index with Precise Positions.
Proc. VLDB Endow., 2021

Zen: a High-Throughput Log-Free OLTP Engine for Non-Volatile Main Memory.
Proc. VLDB Endow., 2021

TransGPerf: Exploiting Transfer Learning for Modeling Distributed Graph Computation Performance.
J. Comput. Sci. Technol., 2021

How Does Updatable Learned Index Perform on Non-Volatile Main Memory?
Proceedings of the 37th IEEE International Conference on Data Engineering Workshops, 2021

ROART: Range-query Optimized Persistent ART.
Proceedings of the 19th USENIX Conference on File and Storage Technologies, 2021

Progressive Join Algorithms Considering User Preference.
Proceedings of the 11th Conference on Innovative Data Systems Research, 2021

2020
LB+-Trees: Optimizing Persistent Index Performance on 3DXPoint Memory.
Proc. VLDB Endow., 2020

Initial experience with 3D XPoint main memory.
Distributed Parallel Databases, 2020

Introduction to the special issue on Self-managing and Hardware-Optimized Database Systems 2019.
Distributed Parallel Databases, 2020

2019
Storing and Querying Large-Scale Spatio-Temporal Graphs with High-Throughput Edge Insertions.
CoRR, 2019

Auxo: a temporal graph management system.
Big Data Min. Anal., 2019

Efficient Partitioning and Query Processing of Spatio-Temporal Graphs with Trillion Edges.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

2018
Efficient Graph Computation for Node2Vec.
CoRR, 2018

2017
STEED: An Analytical Database System for TrEE-structured Data.
Proc. VLDB Endow., 2017

Exploiting Common Patterns for Tree-Structured Data.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

2016
Efficient Distributed Density Peaks for Clustering Large Data Sets in MapReduce.
IEEE Trans. Knowl. Data Eng., 2016

i²MapReduce: Incremental MapReduce for mining evolving big data.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

Exploiting SIMD for complex numerical predicates.
Proceedings of the 32nd IEEE International Conference on Data Engineering Workshops, 2016

2015
Online Updates on Data Warehouses via Judicious Use of Solid-State Storage.
ACM Trans. Database Syst., 2015

Exploitation and Exploration Balanced Hierarchical Summary for Landmark Images.
IEEE Trans. Multim., 2015

i²MapReduce: Incremental MapReduce for Mining Evolving Big Data.
IEEE Trans. Knowl. Data Eng., 2015

Report on the International Workshop on Big Data Management on Emerging Hardware (HardBD 2015).
SIGMOD Rec., 2015

Persistent B+-Trees in Non-Volatile Main Memory.
Proc. VLDB Endow., 2015

Optimizing CPU cache performance for Pregel-like graph computation.
Proceedings of the 31st IEEE International Conference on Data Engineering Workshops, 2015

2014
Efficient and Flexible Index Access in MapReduce.
Proceedings of the 17th International Conference on Extending Database Technology, 2014

2013
Wiki3C: exploiting wikipedia for context-aware concept categorization.
Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, 2013

iPLUG: Personalized List Recommendation in Twitter.
Proceedings of Web Information Systems Engineering - WISE 2013, 2013

i²MapReduce: incremental iterative MapReduce.
Proceedings of the 2nd International Workshop on Cloud Intelligence (colocated with VLDB 2013), 2013

LogKV: Exploiting Key-Value Stores for Log Processing.
Proceedings of the Sixth Biennial Conference on Innovative Data Systems Research, 2013

2012
Chrysalis analysis: incorporating synchronization arcs in dataflow-analysis-based parallel monitoring.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012

2011
Log-based architectures: using multicore to help software behave correctly.
ACM SIGOPS Oper. Syst. Rev., 2011

MaSM: efficient online updates in data warehouses.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

Location-Aware MapReduce in Virtual Cloud.
Proceedings of the International Conference on Parallel Processing, 2011

QMD: exploiting flash for energy efficient disk arrays.
Proceedings of the Seventh International Workshop on Data Management on New Hardware, 2011

Collaborative integrity verification in hybrid clouds.
Proceedings of the 7th International Conference on Collaborative Computing: Networking, 2011

Rethinking Database Algorithms for Phase Change Memory.
Proceedings of the Fifth Biennial Conference on Innovative Data Systems Research, 2011

Poster: temporal attribute-based encryption in clouds.
Proceedings of the 18th ACM Conference on Computer and Communications Security, 2011

2010
TPC-E vs. TPC-C: characterizing the new TPC-E benchmark via an I/O comparison study.
SIGMOD Rec., 2010

Flash in a DBMS: Where and How?
IEEE Data Eng. Bull., 2010

PR-join: a non-blocking join achieving higher early result rate with statistical guarantees.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Decoupled lifeguards: enabling path optimizations for dynamic correctness checking tools.
Proceedings of the 2010 ACM SIGPLAN Conference on Programming Language Design and Implementation, 2010

ParaLog: enabling and accelerating online parallel monitoring of multithreaded applications.
Proceedings of the 15th International Conference on Architectural Support for Programming Languages and Operating Systems, 2010

Butterfly analysis: adapting dataflow analysis to dynamic parallel monitoring.
Proceedings of the 15th International Conference on Architectural Support for Programming Languages and Operating Systems, 2010

2009
Flexible Hardware Acceleration for Instruction-Grain Lifeguards.
IEEE Micro, 2009

FlashLogging: exploiting flash devices for synchronous logging performance.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2009

2008
Parallelizing dynamic information flow tracking.
Proceedings of the 20th Annual ACM Symposium on Parallelism in Algorithms and Architectures (SPAA 2008), 2008

Provably good multicore cache performance for divide-and-conquer algorithms.
Proceedings of the Nineteenth Annual ACM-SIAM Symposium on Discrete Algorithms, 2008

Flexible Hardware Acceleration for Instruction-Grain Program Monitoring.
Proceedings of the 35th International Symposium on Computer Architecture (ISCA 2008), 2008

2007
Improving hash join performance through prefetching.
ACM Trans. Database Syst., 2007

Scheduling threads for constructive cache sharing on CMPs.
Proceedings of the 19th Annual ACM Symposium on Parallelism in Algorithms and Architectures (SPAA 2007), 2007

2006
Parallel depth first vs. work stealing schedulers on CMP architectures.
Proceedings of the 18th Annual ACM Symposium on Parallelism in Algorithms and Architectures (SPAA 2006), Cambridge, Massachusetts, USA, July 30, 2006

Log-based architectures for general-purpose monitoring of deployed code.
Proceedings of the 1st Workshop on Architectural and System Support for Improving Software Dependability, 2006

2005
Inspector Joins.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

Database-Centric Programming for Wide-Area Sensor Systems.
Proceedings of Distributed Computing in Sensor Systems, 2005

2004
Improving Logging and Recovery Performance in Phoenix/App.
Proceedings of the 20th International Conference on Data Engineering, 2004

2002
Fractal prefetching B+-Trees: optimizing both cache and disk performance.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

2001
Improving Index Performance through Prefetching.
Proceedings of the 2001 ACM SIGMOD international conference on Management of data, 2001

