Mor Harchol-Balter

Orcid: 0000-0003-1721-6759

Affiliations:
  • Carnegie Mellon University, Pittsburgh, USA


According to our database, Mor Harchol-Balter authored at least 135 papers between 1994 and 2024.

Awards

ACM Fellow

ACM Fellow 2017, "For contributions to performance modeling and analysis of distributed computing systems".

Bibliography

2024
How to Rent GPUs on a Budget.
CoRR, 2024

Analysis of Markovian Arrivals and Service with Applications to Intermittent Overload.
CoRR, 2024

Can Increasing the Hit Ratio Hurt Cache Throughput?
CoRR, 2024

Asymptotically Optimal Scheduling of Multiple Parallelizable Job Classes.
CoRR, 2024

2023
The RESET and MARC techniques, with application to multiserver-job analysis.
Perform. Evaluation, November, 2023

New Stability Results for Multiserver-job Models via Product-form Saturated Systems.
SIGMETRICS Perform. Evaluation Rev., September, 2023

Invited Paper: ServerFilling: A better approach to packing multiserver jobs.
Proceedings of the 5th workshop on Advanced tools, 2023

2022
The most common queueing theory questions asked by computer systems practitioners.
SIGMETRICS Perform. Evaluation Rev., 2022

The multiserver job queueing model.
Queueing Syst. Theory Appl., 2022

WCFS: a new framework for analyzing multiserver systems.
Queueing Syst. Theory Appl., 2022

The case for phase-aware scheduling of parallelizable jobs.
Perform. Evaluation, 2022

2021
Correction to: Multi-server queueing systems with multiple priority classes.
Queueing Syst. Theory Appl., 2021

Open problems in queueing theory inspired by datacenter computing.
Queueing Syst. Theory Appl., 2021

The M/M/k with Deterministic Setup Times.
Proc. ACM Meas. Anal. Comput. Syst., 2021

Zero Queueing for Multi-Server Jobs.
Proc. ACM Meas. Anal. Comput. Syst., 2021

Nudge: Stochastically Improving upon FCFS.
Proc. ACM Meas. Anal. Comput. Syst., 2021

Optimal Scheduling in the Multiserver-job Model under Heavy Traffic.
Proc. ACM Meas. Anal. Comput. Syst., 2021

Optimal multiserver scheduling with unknown job sizes in heavy traffic.
Perform. Evaluation, 2021

To clean or not to clean: Malware removal strategies for servers under load.
Eur. J. Oper. Res., 2021

How to Schedule Near-Optimally under Real-World Constraints.
CoRR, 2021

The Finite-Skip Method for Multiserver Analysis.
CoRR, 2021

The Gittins Policy in the M/G/1 Queue.
Proceedings of the 19th International Symposium on Modeling and Optimization in Mobile, 2021

Load balancing guardrails: keeping your heavy traffic on the road to low response times (invited paper).
Proceedings of the STOC '21: 53rd Annual ACM SIGACT Symposium on Theory of Computing, 2021

Computing the Death Rate of COVID-19.
Proceedings of the Computer Science Protecting Human Society Against Epidemics, 2021

2020
Simple Near-Optimal Scheduling for the M/G/1.
Proc. ACM Meas. Anal. Comput. Syst., 2020

The Gittins Policy is Nearly Optimal in the M/G/k under Extremely General Conditions.
Proc. ACM Meas. Anal. Comput. Syst., 2020

heSRPT: Parallel scheduling to minimize mean slowdown.
Perform. Evaluation, 2020

Stability for Two-class Multiserver-job Systems.
CoRR, 2020

Optimal Resource Allocation for Elastic and Inelastic Jobs.
Proceedings of the SPAA '20: 32nd ACM Symposium on Parallelism in Algorithms and Architectures, 2020

The CacheLib Caching Engine: Design and Experiences at Scale.
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

Borg: the next generation.
Proceedings of the EuroSys '20: Fifteenth EuroSys Conference 2020, 2020

2019
heSRPT: Optimal Scheduling of Parallel Jobs with Known Sizes.
SIGMETRICS Perform. Evaluation Rev., 2019

Delay asymptotics and bounds for multitask parallel jobs.
Queueing Syst. Theory Appl., 2019

Load Balancing Guardrails: Keeping Your Heavy Traffic on the Road to Low Response Times.
Proc. ACM Meas. Anal. Comput. Syst., 2019

heSRPT: Optimal Parallel Scheduling of Jobs With Known Sizes.
CoRR, 2019

2018
Delay Asymptotics and Bounds for Multi-Task Parallel Jobs.
SIGMETRICS Perform. Evaluation Rev., 2018

SOAP: One Clean Analysis of All Age-Based Scheduling Policies.
Proc. ACM Meas. Anal. Comput. Syst., 2018

Practical Bounds on Optimal Caching with Variable Object Sizes.
Proc. ACM Meas. Anal. Comput. Syst., 2018

SRPT for multiserver systems.
Perform. Evaluation, 2018

Optimal Scheduling and Exact Response Time Analysis for Multistage Jobs.
CoRR, 2018

Towards Optimality in Parallel Job Scheduling.
Proceedings of the Abstracts of the 2018 ACM International Conference on Measurement and Modeling of Computer Systems, 2018

RobinHood: Tail Latency Aware Caching - Dynamic Reallocation from Cache-Rich to Cache-Poor.
Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

SOAP Bubbles: Robust Scheduling Under Adversarial Noise.
Proceedings of the 56th Annual Allerton Conference on Communication, 2018

2017
A Better Model for Job Redundancy: Decoupling Server Slowdown and Job Size.
IEEE/ACM Trans. Netw., 2017

Optimally Scheduling Jobs with Multiple Tasks.
SIGMETRICS Perform. Evaluation Rev., 2017

Towards Optimality in Parallel Scheduling.
Proc. ACM Meas. Anal. Comput. Syst., 2017

Scheduling for efficiency and fairness in systems with redundancy.
Perform. Evaluation, 2017

Redundancy-d: The Power of d Choices for Redundancy.
Oper. Res., 2017

Asymptotic response time analysis for multi-task parallel jobs.
CoRR, 2017

AdaptSize: Orchestrating the Hot Object Memory Cache in a Content Delivery Network.
Proceedings of the 14th USENIX Symposium on Networked Systems Design and Implementation, 2017

WorkloadCompactor: reducing datacenter cost while providing tail latency SLO guarantees.
Proceedings of the 2017 Symposium on Cloud Computing, SoCC 2017, Santa Clara, CA, USA, 2017

2016
Understanding Response Time in the Redundancy-d System.
SIGMETRICS Perform. Evaluation Rev., 2016

Queueing with redundant requests: exact analysis.
Queueing Syst. Theory Appl., 2016

A Better Model for Task Assignment in Server Farms: How Replication can Help.
Proceedings of the 2016 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Science, 2016

The Power of d Choices for Redundancy.
Proceedings of the 2016 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Science, 2016

A Better Model for Job Redundancy: Decoupling Server Slowdown and Job Size.
Proceedings of the 24th IEEE International Symposium on Modeling, 2016

TetriSched: global rescheduling with adaptive plan-ahead in dynamic heterogeneous clusters.
Proceedings of the Eleventh European Conference on Computer Systems, 2016

SNC-Meister: Admitting More Tenants with Tail Latency SLOs.
Proceedings of the Seventh ACM Symposium on Cloud Computing, 2016

2015
Reducing Latency via Redundant Requests: Exact Analysis.
Proceedings of the 2015 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 2015

Optimal scheduling for jobs with progressive deadlines.
Proceedings of the 2015 IEEE Conference on Computer Communications, 2015

2014
Exact analysis of the M/M/k/setup class of Markov chains via recursive renewal reward.
Queueing Syst. Theory Appl., 2014

Value driven load balancing.
Perform. Evaluation, 2014

PriorityMeister: Tail Latency QoS for Shared Networked Storage.
Proceedings of the ACM Symposium on Cloud Computing, 2014

2013
M/G/k with staggered setup.
Oper. Res. Lett., 2013

2012
AutoScale: Dynamic, Robust Capacity Management for Multi-Tier Data Centers.
ACM Trans. Comput. Syst., 2012

Many Flows Asymptotics for SMART Scheduling Policies.
IEEE Trans. Autom. Control., 2012

Pricing and queueing.
SIGMETRICS Perform. Evaluation Rev., 2012

SOFTScale: Stealing Opportunistically for Transient Scaling.
Proceedings of the Middleware 2012, 2012

Saving Cash by Using Less Cache.
Proceedings of the 4th USENIX Workshop on Hot Topics in Cloud Computing, 2012

Are sleep states effective in data centers?
Proceedings of the 2012 International Green Computing Conference, 2012

2011
Thread Cluster Memory Scheduling.
IEEE Micro, 2011

The case for sleep states in servers.
Proceedings of the 4th Workshop on Power-Aware Computing and Systems, 2011

How data center size impacts the effectiveness of dynamic power management.
Proceedings of the 49th Annual Allerton Conference on Communication, 2011

2010
Decomposition results for an m/m/k with staggered setup.
SIGMETRICS Perform. Evaluation Rev., 2010

On the inapproximability of M/G/K: why two moments of job size distribution are not enough.
Queueing Syst. Theory Appl., 2010

Analysis of scheduling policies under correlated job sizes.
Perform. Evaluation, 2010

Server farms with setup costs.
Perform. Evaluation, 2010

Optimality analysis of energy-performance trade-off for server farm management.
Perform. Evaluation, 2010

Thread Cluster Memory Scheduling: Exploiting Differences in Memory Access Behavior.
Proceedings of the 43rd Annual IEEE/ACM International Symposium on Microarchitecture, 2010

ATLAS: A scalable and high-performance scheduling algorithm for multiple memory controllers.
Proceedings of the 16th International Conference on High-Performance Computer Architecture (HPCA-16 2010), 2010

2009
Surprising results on task assignment in server farms with high-variability workloads.
Proceedings of the Eleventh International Joint Conference on Measurement and Modeling of Computer Systems, 2009

Self-adaptive admission control policies for resource-sharing systems.
Proceedings of the Eleventh International Joint Conference on Measurement and Modeling of Computer Systems, 2009

Optimal power allocation in server farms.
Proceedings of the Eleventh International Joint Conference on Measurement and Modeling of Computer Systems, 2009

Why segregating short jobs from long jobs under high variability is not always a win.
Proceedings of the 47th Annual Allerton Conference on Communication, 2009

2008
An Exact and Efficient Algorithm for the Constrained Dynamic Operator Staffing Problem for Call Centers.
Manag. Sci., 2008

Scheduling for Server Farms: Approaches and Open Problems.
Proceedings of the Performance Evaluation: Metrics, 2008

2007
Foreword.
SIGMETRICS Perform. Evaluation Rev., 2007

Insensitivity for PS server farms with JSQ routing.
SIGMETRICS Perform. Evaluation Rev., 2007

The effect of higher moments of job size distribution on the performance of an M/G/s queueing system.
SIGMETRICS Perform. Evaluation Rev., 2007

Analysis of join-the-shortest-queue routing for web server farms.
Perform. Evaluation, 2007

2006
Web servers under overload: How scheduling can help.
ACM Trans. Internet Techn., 2006

Fluid and diffusion limits for transient sojourn times of processor sharing queues with time varying rates.
Queueing Syst. Theory Appl., 2006

How many servers are best in a dual-priority M/PH/k system?
Perform. Evaluation, 2006

Closed form solutions for mapping general distributions to quasi-minimal PH distributions.
Perform. Evaluation, 2006

Tail asymptotics for policies favoring short jobs in a many-flows regime.
Proceedings of the Joint International Conference on Measurement and Modeling of Computer Systems, 2006

Fundamental characteristics of queues with fluctuating load.
Proceedings of the Joint International Conference on Measurement and Modeling of Computer Systems, 2006

Open Versus Closed: A Cautionary Tale.
Proceedings of the 3rd Symposium on Networked Systems Design and Implementation (NSDI 2006), 2006

How to Determine a Good Multi-Programming Level for External Scheduling.
Proceedings of the 22nd International Conference on Data Engineering, 2006

Achieving Class-Based QoS for Transactional Workloads.
Proceedings of the 22nd International Conference on Data Engineering, 2006

2005
Robustness of threshold policies in beneficiary-donor model.
SIGMETRICS Perform. Evaluation Rev., 2005

Multi-Server Queueing Systems with Multiple Priority Classes.
Queueing Syst. Theory Appl., 2005

Analysis of cycle stealing with switching times and thresholds.
Perform. Evaluation, 2005

Nearly insensitive bounds on SMART scheduling.
Proceedings of the International Conference on Measurements and Modeling of Computer Systems, 2005

Classifying scheduling policies with respect to higher moments of conditional response time.
Proceedings of the International Conference on Measurements and Modeling of Computer Systems, 2005

Improving Preemptive Prioritization via Statistical Characterization of OLTP Locking.
Proceedings of the 21st International Conference on Data Engineering, 2005

2004
Formalizing SMART scheduling.
SIGMETRICS Perform. Evaluation Rev., 2004

A recursive analysis technique for multi-dimensionally infinite Markov chains.
SIGMETRICS Perform. Evaluation Rev., 2004

A note on comparing response times in the M/GI/1/FB and M/GI/1/PS queues.
Oper. Res. Lett., 2004

Evaluation of Task Assignment Policies for Supercomputing Servers: The Case for Load Unbalancing and Fairness.
Clust. Comput., 2004

Priority Mechanisms for OLTP and Transactional Web Applications.
Proceedings of the 20th International Conference on Data Engineering, 2004

2003
Size-based scheduling to improve web performance.
ACM Trans. Comput. Syst., 2003

Cycle stealing under immediate dispatch task assignment.
Proceedings of the SPAA 2003: Proceedings of the Fifteenth Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2003

Classifying scheduling policies with respect to unfairness in an M/GI/1.
Proceedings of the International Conference on Measurements and Modeling of Computer Systems, 2003

Analysis of cycle stealing with switching cost.
Proceedings of the International Conference on Measurements and Modeling of Computer Systems, 2003

Analysis of Task Assignment with Cycle Stealing under Central Queue.
Proceedings of the 23rd International Conference on Distributed Computing Systems (ICDCS 2003), 2003

A Closed-Form Solution for Mapping General Distributions to Minimal PH Distributions.
Proceedings of the Computer Performance Evaluations, 2003

Necessary and Sufficient Conditions for Representing General Distributions by Coxians.
Proceedings of the Computer Performance Evaluations, 2003

2002
Understanding the slowdown of large jobs in an M/GI/1 system.
SIGMETRICS Perform. Evaluation Rev., 2002

Asymptotic convergence of scheduling policies with respect to slowdown.
Perform. Evaluation, 2002

Task assignment with unknown duration.
J. ACM, 2002

2001
Job placement with unknown duration and no preemption.
SIGMETRICS Perform. Evaluation Rev., 2001

Analysis of M/G/1/SRPT under transient overload.
SIGMETRICS Perform. Evaluation Rev., 2001

Analysis of SRPT scheduling: investigating unfairness.
Proceedings of the Joint International Conference on Measurements and Modeling of Computer Systems, 2001

SRPT Scheduling for Web Servers.
Proceedings of the Job Scheduling Strategies for Parallel Processing, 2001

2000
General Dynamic Routing with Per-Packet Delay Guarantees of O(Distance + 1/Session Rate).
SIAM J. Comput., 2000

1999
On Choosing a Task Assignment Policy for a Distributed Server System.
J. Parallel Distributed Comput., 1999

Connection Scheduling in Web Servers.
Proceedings of the 2nd USENIX Symposium on Internet Technologies and Systems, 1999

Resource Discovery in Distributed Networks.
Proceedings of the Eighteenth Annual ACM Symposium on Principles of Distributed Computing, 1999

1998
Task Assignment in a Distributed System: Improving Performance by Unbalancing Load (Extended Abstract).
Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems, 1998

1997
Exploiting Process Lifetime Distributions for Dynamic Load Balancing.
ACM Trans. Comput. Syst., 1997

General Dynamic Routing with Per-Packet Delay Guarantees of O(distance + 1 / session rate).
Proceedings of the 38th Annual Symposium on Foundations of Computer Science, 1997

1995
Bounding delays in packet-routing networks.
Proceedings of the Twenty-Seventh Annual ACM Symposium on Theory of Computing, 1995

1994
Tight Bounds on Expected Time to Add Correctly and Add Mostly Correctly.
Inf. Process. Lett., 1994

Queueing Analysis of Oblivious Packet-Routing Networks.
Proceedings of the Fifth Annual ACM-SIAM Symposium on Discrete Algorithms. 23-25 January 1994, 1994

Selection in the Presence of Noise: The Design of Playoff Systems.
Proceedings of the Fifth Annual ACM-SIAM Symposium on Discrete Algorithms. 23-25 January 1994, 1994

