Kai Li

Orcid: 0000-0002-2095-7024

Affiliations:
  • Princeton University, NJ, USA


According to our database1, Kai Li authored at least 156 papers between 1986 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Evaluating Copyright Takedown Methods for Language Models.
CoRR, 2024

SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors.
CoRR, 2024

Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving.
Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles, 2024

Catastrophic Jailbreak of Open-source LLMs via Exploiting Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Retrospective: Avoiding the Disk Bottleneck in the Data Domain Deduplication File System.
Proceedings of the 12th International Conference on Fun with Algorithms, 2024

2023
A Dataset Auditing Method for Collaboratively Trained Machine Learning Models.
IEEE Trans. Medical Imaging, 2023

Privacy Implications of Retrieval-Based Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
RT-Cloud: A cloud-based software framework to simplify and standardize real-time fMRI.
NeuroImage, 2022

Catalytic activity <i>in vitro</i> of the human protein kinase ASK1 mutants: Experimental and molecular simulation study.
Comput. Biol. Chem., 2022

Recovering Private Text in Federated Learning of Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Evaluating Gradient Inversion Attacks and Defenses in Federated Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

EMA: Auditing Data Removal from Trained Models.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

2020
Sparse multi-output Gaussian processes for online medical time series prediction.
BMC Medical Informatics Decis. Mak., 2020

MixCon: Adjusting the Separability of Data Representations for Harder Data Recovery.
CoRR, 2020

Privacy-preserving Learning via Deep Net Pruning.
CoRR, 2020

InstaHide: Instance-hiding Schemes for Private Distributed Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

TextHide: Tackling Data Privacy for Language Understanding Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Patient-Specific Effects of Medication Using Latent Force Models with Gaussian Processes.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019
The anatomy of efficient FFT and winograd convolutions on modern CPUs.
Proceedings of the ACM International Conference on Supercomputing, 2019

Compressed Sensing MRI Reconstruction on Intel HARPv2.
Proceedings of the 27th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2019

PZnet: Efficient 3D ConvNet Inference on Manycore CPUs.
Proceedings of the Advances in Computer Vision, 2019

2018
FFT Convolutions are Faster than Winograd on Modern CPUs, Here is Why.
CoRR, 2018

Scheduling Computation Graphs of Deep Learning Models on Manycore CPUs.
CoRR, 2018

Optimizing N-dimensional, winograd-based convolution for manycore CPUs.
Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2018

2017
Intelligent Probing for Locality Sensitive Hashing: Multi-Probe LSH and Beyond.
Proc. VLDB Endow., 2017

2016
PARSEC3.0: A Multicore Benchmark Suite with Network Stacks and SPLASH-2X.
SIGARCH Comput. Archit. News, 2016

Erasing Belady's Limitations: In Search of Flash Cache Offline Optimality.
Proceedings of the 2016 USENIX Annual Technical Conference, 2016

Disruptive Research and Innovation.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Real-time full correlation matrix analysis of fMRI data.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

2015
Full correlation matrix analysis of fMRI data on Intel® Xeon Phi™ coprocessors.
Proceedings of the International Conference for High Performance Computing, 2015

RIPQ: Advanced Photo Caching on Flash for Facebook.
Proceedings of the 13th USENIX Conference on File and Storage Technologies, 2015

2014
Author retrospective for search and replication in unstructured peer-to-peer networks.
Proceedings of the ACM International Conference on Supercomputing 25th Anniversary Volume, 2014

2012
High-confidence near-duplicate image detection.
Proceedings of the International Conference on Multimedia Retrieval, 2012

2011
Management of Multilevel, Multiclient Cache Hierarchies with Application Hints.
ACM Trans. Comput. Syst., 2011

Efficient k-nearest neighbor graph construction for generic similarity measures.
Proceedings of the 20th International Conference on World Wide Web, 2011

Tradeoffs in Scalable Data Routing for Deduplication Clusters.
Proceedings of the 9th USENIX Conference on File and Storage Technologies, 2011

2010
DFS: A file system for virtualized flash storage.
ACM Trans. Storage, 2010

Characteristics of Workloads Using the Pipeline Programming Model.
Proceedings of the Computer Architecture, 2010

Fidelity and scaling of the PARSEC benchmark inputs.
Proceedings of the 2010 IEEE International Symposium on Workload Characterization, 2010

What Does Classifying More Than 10, 000 Image Categories Tell Us?
Proceedings of the Computer Vision - ECCV 2010, 2010

Scaling of the PARSEC benchmark inputs.
Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, 2010

2009
Directing Experimental Biology: A Case Study in Mitochondrial Biogenesis.
PLoS Comput. Biol., 2009

ImageNet: A large-scale hierarchical image database.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Tech-note: Device-free interaction spaces.
Proceedings of the IEEE Symposium on 3D User Interfaces, 2009

2008
Asymmetric distance estimation with sketches for similarity search in high-dimensional spaces.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

Efficiently matching sets of features with random histograms.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

PARSEC vs. SPLASH-2: A quantitative comparison of two multithreaded benchmark suites on Chip-Multiprocessors.
Proceedings of the 4th International Symposium on Workload Characterization (IISWC 2008), 2008

MC2: Multiple Clients on a Multilevel Cache.
Proceedings of the 28th IEEE International Conference on Distributed Computing Systems (ICDCS 2008), 2008

Avoiding the Disk Bottleneck in the Data Domain Deduplication File System.
Proceedings of the 6th USENIX Conference on File and Storage Technologies, 2008

Towards Scalable Dataset Construction: An Active Learning Approach.
Proceedings of the Computer Vision, 2008

Modeling LSH for performance tuning.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

The PARSEC benchmark suite: characterization and architectural implications.
Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques, 2008

2007
Exploring the functional landscape of gene expression: directed search of large microarray compendia.
Bioinform., 2007

Multi-Probe LSH: Efficient Indexing for High-Dimensional Similarity Search .
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Virtually Shared Displays and User Input Devices.
Proceedings of the 2007 USENIX Annual Technical Conference, 2007

Sizing sketches: a rank-based analysis for similarity search.
Proceedings of the 2007 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 2007

Viewing the Larger Context of Genomic Data through Horizontal Integration.
Proceedings of the 11th International Conference on Information Visualisation, 2007

Scalable, Dynamic Analysis and Visualization for Genomic Datasets.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Filtering Image Spam with Near-Duplicate Detection.
Proceedings of the CEAS 2007, 2007

ISA Support for Fingerprinting and Erasure Codes.
Proceedings of the IEEE International Conference on Application-Specific Systems, 2007

2006
Efficient filtering with sketches in the ferret toolkit.
Proceedings of the 8th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2006

Systems Support for Remote Visualization of Genomics Applications over Wide Area Networks.
Proceedings of the Distributed, 2006

Ferret: a toolkit for content-based similarity search of feature-rich data.
Proceedings of the 2006 EuroSys Conference, Leuven, Belgium, April 18-21, 2006, 2006

2005
Memory Performance Optimizations For Real-Time Software HDTV Decoding.
J. VLSI Signal Process., 2005

VI-Attached Database Storage.
IEEE Trans. Parallel Distributed Syst., 2005

Bridging the digital divide: storage media + postal network = generic high-bandwidth communication.
ACM Trans. Storage, 2005

Tools and Applications for Large-Scale Display Walls.
IEEE Computer Graphics and Applications, 2005

Visualization methods for statistical analysis of microarray clusters.
BMC Bioinform., 2005

Dynamic Scalable Visualization for Collaborative Scientific Applications.
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

2004
Second-Level Buffer Cache Management.
IEEE Trans. Parallel Distributed Syst., 2004

Image similarity search with compact data structures.
Proceedings of the 2004 ACM CIKM International Conference on Information and Knowledge Management, 2004

Fast Paths in Concurrent Programs.
Proceedings of the 13th International Conference on Parallel Architectures and Compilation Techniques (PACT 2004), 29 September, 2004

2003
Remote View-dependent Isosurface Visualization.
Proceedings of the 3rd Eurographics / IEEE TCVG International Workshop on Volume Graphics, 2003

Eviction-based Cache Placement for Storage Caches.
Proceedings of the General Track: 2003 USENIX Annual Technical Conference, 2003

Color Gamut Matching for Tiled DisplayWalls.
Proceedings of the 7th International Workshop on Immersive Projection Technology, 2003

2002
Improving progressive view-dependent isosurface propagation.
Comput. Graph., 2002

Scalable Alignment of Large-Format Multi-Projector Displays Using Camera Homography Trees.
Proceedings of the 13th IEEE Visualization Conference, 2002

Using Model Checking to Debug Device Firmware.
Proceedings of the 5th Symposium on Operating System Design and Implementation (OSDI 2002), 2002

Dynamic memory management for programmable devices.
Proceedings of The Workshop on Memory Systems Performance (MSP 2002), 2002

Experiences with VI Communication for Database Storage.
Proceedings of the 29th International Symposium on Computer Architecture (ISCA 2002), 2002

A Parallel Ultra-High Resolution MPEG-2 Video Decoder for PC Cluster Based Tiled Display Systems.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

Search and replication in unstructured peer-to-peer networks.
Proceedings of the 16th international conference on Supercomputing, 2002

2001
Data distribution strategies for high-resolution displays.
Comput. Graph., 2001

Progressive View-Dependent Isosurface Propagation.
Proceedings of the 3rd Joint Eurographics - IEEE TCVG Symposium on Visualization, 2001

The Multi-Queue Replacement Algorithm for Second Level Buffer Caches.
Proceedings of the General Track: 2001 USENIX Annual Technical Conference, 2001

Parallel rendering with k-way replication.
Proceedings of the IEEE 2001 Symposium on Parallel and Large-Data Visualization and Graphics, 2001

ESP: A Language for Programmable Devices.
Proceedings of the 2001 ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI), 2001

Software Environments For Cluster-Based Display Systems.
Proceedings of the First IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2001), 2001

2000
Building and Using A Scalable Display Wall System.
IEEE Computer Graphics and Applications, 2000

Guest Editors' Introduction: Large-Format Displays.
IEEE Computer Graphics and Applications, 2000

Automatic alignment of high-resolution multi-projector display using an un-calibrated camera.
Proceedings of the 11th IEEE Visualization Conference, 2000

Next-generation visualization displays: the research challenges of building tiled displays (panel session).
Proceedings of the 11th IEEE Visualization Conference, 2000

Trading Capacity for Performance in a Disk Array.
Proceedings of the 4th Symposium on Operating System Design and Implementation (OSDI 2000), 2000

Hybrid Sort-First and Sort-Last Parallel Rendering with a Cluster of PCs.
Proceedings of the 2000 ACM SIGGRAPH/EUROGRAPHICS Workshop on Graphics Hardware, 2000

1999
Memory Exclusion: Optimizing the Performance of Checkpointing Systems.
Softw. Pract. Exp., 1999

Thread Scheduling for Out-of-core Applications with Memory Server on Multicomputers.
Proceedings of the Sixth Workshop on I/O in Parallel and Distributed Systems, 1999

Fast cluster failover using virtual memory-mapped communication.
Proceedings of the 13th international conference on Supercomputing, 1999

Shared virtual memory with automatic update support.
Proceedings of the 13th international conference on Supercomputing, 1999

OS Support for General-Purpose Routers.
Proceedings of The Seventh Workshop on Hot Topics in Operating Systems, 1999

Load Balancing for Multi-Projector Rendering Systems.
Proceedings of the 1999 ACM SIGGRAPH/EUROGRAPHICS Workshop on Graphics Hardware, 1999

1998
Diskless Checkpointing.
IEEE Trans. Parallel Distributed Syst., 1998

Scope Consistency: A Bridge between Release Consistency and Entry Consistency.
Theory Comput. Syst., 1998

Myrinet communication.
IEEE Micro, 1998

Performance Measurements for Multithreaded Programs.
Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems, 1998

Retrospective: Virtual Memory Mapped Network Interface for the SHRIMP Multicomputer.
Proceedings of the 25 Years of the International Symposia on Computer Architecture (Selected Papers)., 1998

Design Choices in the SHRIMP System: An Empirical Study.
Proceedings of the 25th Annual International Symposium on Computer Architecture, 1998

Performance Issues of a Distributed Frame Buffer on a Multicomputer.
Proceedings of the 1998 ACM SIGGRAPH/EUROGRAPHICS Workshop on Graphics Hardware, Lisbon, Portugal, August 31, 1998

UTLB: A Mechanism for Address Translation on Network Interfaces.
Proceedings of the ASPLOS-VIII Proceedings of the 8th International Conference on Architectural Support for Programming Languages and Operating Systems, 1998

1997
CLIP: A Checkpointing Tool for Message Passing Parallel Programs.
Proceedings of the ACM/IEEE Conference on Supercomputing, 1997

Relaxed Consistency and Coherence Granularity in DSM Systems: A Performance Evaluation.
Proceedings of the Sixth ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPOPP), 1997

Design and Implementation of Virtual Memory-Mapped Communication on Myrinet.
Proceedings of the 11th International Parallel Processing Symposium (IPPS '97), 1997

1996
Implementation and Performance of Integrated Application-Controlled File Caching, Prefetching, and Disk Scheduling.
ACM Trans. Comput. Syst., 1996

Applications, Storage Hierarchy, and Integration.
ACM Comput. Surv., 1996

Integrating Parallel Prefetching and Caching.
Proceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, 1996

Performance Evaluation of Two Home-Based Lazy Release Consistency Protocols for Shared Virtual Memory Systems.
Proceedings of the Second USENIX Symposium on Operating Systems Design and Implementation (OSDI), 1996

A Trace-Driven Comparison of Algorithms for Parallel Prefetching and Caching.
Proceedings of the Second USENIX Symposium on Operating Systems Design and Implementation (OSDI), 1996

Understanding Application Performance on Shared Virtual Memory Systems.
Proceedings of the 23rd Annual International Symposium on Computer Architecture, 1996

Early Experience with Message-Passing on the SHRIMP Multicomputer.
Proceedings of the 23rd Annual International Symposium on Computer Architecture, 1996

Software Support for Virtual Memory-Mapped Communication.
Proceedings of IPPS '96, 1996

Design and Implementation of NX Message Passing Using Shrimp Virtual Memory Mapped Communication.
Proceedings of the 1996 International Conference on Parallel Processing, 1996

Improving Release-Consistent Shared Virtual Memory Using Automatic Update.
Proceedings of the Second International Symposium on High-Performance Computer Architecture, 1996

Protected, User-Level DMA for the SHRIMP Network Interface.
Proceedings of the Second International Symposium on High-Performance Computer Architecture, 1996

Thread Scheduling for Cache Locality.
Proceedings of the ASPLOS-VII Proceedings, 1996

1995
Virtual-Memory-Mapped Network Interfaces.
IEEE Micro, 1995

Multiprocessor Cache Coherence Based on Virtual Memory Support.
J. Parallel Distributed Comput., 1995

Libckpt: Transparent Checkpointing under UNIX.
Proceedings of the USENIX 1995 Technical Conference on UNIX and Advanced Computing Systems, 1995

A Study of Integrated Prefetching and Caching Strategies.
Proceedings of the 1995 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems, 1995

Synchronization for a multi-port frame buffer on a mesh-connected multicomputer.
Proceedings of the IEEE Symposium on Parallel Rendering, 1995

Evaluating Multi-Port Frame Buffer Designs for a Mesh-Connected Multicomputer.
Proceedings of the 22nd Annual International Symposium on Computer Architecture, 1995

1994
Low-Latency, Concurrent Checkpointing for Parallel Programs.
IEEE Trans. Parallel Distributed Syst., 1994

ickp: a consistent checkpointer for multicomputers.
IEEE Parallel Distributed Technol. Syst. Appl., 1994

Application-Controlled File Caching Policies.
Proceedings of the USENIX Summer 1994 Technical Conference, 1994

Network Interface Support for User-Level Buffer Management.
Proceedings of the Parallel Computer Routing and Communication, 1994

Storage Alternatives for Mobile Computers.
Proceedings of the First USENIX Symposium on Operating Systems Design and Implementation (OSDI), 1994

Implementation and Performance of Application-Controlled File Caching.
Proceedings of the First USENIX Symposium on Operating Systems Design and Implementation (OSDI), 1994

Storage Alternatives for Mobile Computers.
Proceedings of the Mobile Computing [Mobidata Workshop on Mobile and Wireless Information Systems, Rutgers University, NJ, USA, October 31, 1994

Virtual Memory Mapped Network Interface for the SHRIMP Multicomputer.
Proceedings of the 21st Annual International Symposium on Computer Architecture. Chicago, 1994

An Evaluation of Multiprocessor Cache Coherence Based on Virtual Memory Support.
Proceedings of the 8th International Symposium on Parallel Processing, 1994

Two virtual memory mapped network interface designs.
Proceedings of the Hot Interconnects II, 1994

Faster Checkpointing with <i>N</i>+1 Parity.
Proceedings of the Digest of Papers: FTCS/24, 1994

1993
Cache Coherence for Shared Memory Multiprocessors Based on Virtual Memory Support.
Proceedings of the Seventh International Parallel Processing Symposium, 1993

Operating System Implications of Solid-State Mobile Computers.
Proceedings of the Proceedings Fourth Workshop on Workstation Operating Systems, 1993

1992
Heterogeneous Distributed Shared Memory.
IEEE Trans. Parallel Distributed Syst., 1992

Software Support for Speculative Loads.
Proceedings of the ASPLOS-V Proceedings, 1992

1991
An efficient checkpointing method for multicomputers with wormhole routing.
Int. J. Parallel Program., 1991

Checkpointing Multicomputer Applications.
Proceedings of the Tenth Symposium on Reliable Distributed Systems, 1991

Empirical Studies of Competitive Spinning for a Shared-Memory Multiprocessor.
Proceedings of the Thirteenth ACM Symposium on Operating System Principles, 1991

Evaluation of Memory System Extensions.
Proceedings of the 18th Annual International Symposium on Computer Architecture. Toronto, 1991

Virtual Memory Primitives for User Programs.
Proceedings of the ASPLOS-IV Proceedings, 1991

1990
Real-Time, Concurrent Checkpoint for Parallel Programs.
Proceedings of the Second ACM SIGPLAN Symposium on Princiles & Practice of Parallel Programming (PPOPP), 1990

1989
Memory Coherence in Shared Virtual Memory Systems.
ACM Trans. Comput. Syst., 1989

A Hypercube Shared Virtual Memory System.
Proceedings of the International Conference on Parallel Processing, 1989

1988
Real-Time Concurrent Collection on Stock Multiprocessors.
Proceedings of the ACM SIGPLAN'88 Conference on Programming Language Design and Implementation (PLDI), 1988

IVY: A Shared Virtual Memory System for Parallel Computing.
Proceedings of the International Conference on Parallel Processing, 1988

Multiprocessor Main Memory Transaction Processing.
Proceedings of the International Symposium on Databases in Parallel and Distributed Systems, 1988

1986
A New List Compaction Method.
Softw. Pract. Exp., 1986


  Loading...