Mosharaf Chowdhury

ORCID: 0000-0003-0884-6740

Affiliations:
  • University of Michigan, USA


According to our database, Mosharaf Chowdhury authored at least 101 papers between 2007 and 2024.

Bibliography

2024
Pyxis: Scheduling Mixed Tasks in Disaggregated Datacenters.
IEEE Trans. Parallel Distributed Syst., September, 2024

Fed-ensemble: Ensemble Models in Federated Learning for Improved Generalization and Uncertainty Quantification.
IEEE Trans. Autom. Sci. Eng., July, 2024

Efficient Large Language Models: A Survey.
Trans. Mach. Learn. Res., 2024

Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services.
CoRR, 2024

Toward Cross-Layer Energy Optimizations in Machine Learning Systems.
CoRR, 2024

Reducing Energy Bloat in Large Model Training.
Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles, 2024

Managing Memory Tiers with CXL in Virtualized Environments.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

Vulcan: Automatic Query Planning for Live ML Analytics.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

FedTrans: Efficient Federated Learning via Multi-Model Transformation.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024

2023
Memory Disaggregation: Advances and Open Challenges.
ACM SIGOPS Oper. Syst. Rev., 2023

Memtrade: Marketplace for Disaggregated Memory Clouds.
Proc. ACM Meas. Anal. Comput. Syst., 2023

Venn: Resource Management Across Federated Learning Jobs.
CoRR, 2023

Perseus: Removing Energy Bloat from Large Model Training.
CoRR, 2023

Chasing Low-Carbon Electricity for Practical and Sustainable DNN Training.
CoRR, 2023

FLINT: A Platform for Federated Learning Integration.
CoRR, 2023

Oobleck: Resilient Distributed Training of Large Models Using Pipeline Templates.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

AdaEmbed: Adaptive Embedding for Large-Scale Recommendation Models.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

Zeus: Understanding and Optimizing GPU Energy Consumption of DNN Training.
Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

ModelKeeper: Accelerating DNN Training via Automated Training Warmup.
Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

FLINT: A Platform for Federated Learning Integration.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

Simplifying Cloud Management with Cloudless Computing.
Proceedings of the 22nd ACM Workshop on Hot Topics in Networks, 2023

Egeria: Efficient DNN Training with Knowledge-Guided Layer Freezing.
Proceedings of the Eighteenth European Conference on Computer Systems, 2023

Flamingo: A User-Centric System for Fast and Energy-Efficient DNN Training on Smartphones.
Proceedings of the 4th International Workshop on Distributed Machine Learning, 2023

Auxo: Efficient Federated Learning via Scalable Client Clustering.
Proceedings of the 2023 ACM Symposium on Cloud Computing, SoCC 2023, 2023

TPP: Transparent Page Placement for CXL-Enabled Tiered-Memory.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
CDI-E: An Elastic Cloud Service for Data Engineering.
Proc. VLDB Endow., 2022

Packing Privacy Budget Efficiently.
CoRR, 2022

Auxo: Heterogeneity-Mitigating Federated Learning via Scalable Client Clustering.
CoRR, 2022

Orloj: Predictably Serving Unpredictable DNNs.
CoRR, 2022

Swan: A Neural Engine for Efficient DNN Training on Smartphone SoCs.
CoRR, 2022

Elastic Model Aggregation with Parameter Service.
CoRR, 2022

Efficient DNN Training with Knowledge-Guided Layer Freezing.
CoRR, 2022

Treehouse: A Case For Carbon-Aware Datacenter Software.
CoRR, 2022

Aequitas: admission control for performance-critical RPCs in datacenters.
Proceedings of the SIGCOMM '22: ACM SIGCOMM 2022 Conference, Amsterdam, The Netherlands, August 22, 2022

Justitia: Software Multi-Tenancy in Hardware Kernel-Bypass Networks.
Proceedings of the 19th USENIX Symposium on Networked Systems Design and Implementation, 2022

FedScale: Benchmarking Model and System Performance of Federated Learning at Scale.
Proceedings of the International Conference on Machine Learning, 2022

Hydra: Resilient and Highly Available Remote Memory.
Proceedings of the 20th USENIX Conference on File and Storage Technologies, 2022

2021
The Internet of Federated Things (IoFT): A Vision for the Future and In-depth Survey of Data-driven Approaches for Federated Learning.
CoRR, 2021

Memtrade: A Disaggregated-Memory Marketplace for Public Clouds.
CoRR, 2021

Fed-ensemble: Improving Generalization through Model Ensembling in Federated Learning.
CoRR, 2021

FedScale: Benchmarking Model and System Performance of Federated Learning.
CoRR, 2021

The Internet of Federated Things (IoFT).
IEEE Access, 2021

FedScale: Benchmarking Model and System Performance of Federated Learning.
Proceedings of the ResilientFL '21: Proceedings of the First Workshop on Systems Challenges in Reliable and Secure Federated Learning, 2021

Programmable packet scheduling with a single queue.
Proceedings of the ACM SIGCOMM 2021 Conference, Virtual Event, USA, August 23-27, 2021, 2021

Oort: Efficient Federated Learning via Guided Participant Selection.
Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation, 2021

Ship Compute or Ship Data? Why Not Both?
Proceedings of the 18th USENIX Symposium on Networked Systems Design and Implementation, 2021

Fluid: Resource-aware Hyperparameter Tuning Engine.
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

2020
A Systematic Review of Machine Learning Techniques in Hematopoietic Stem Cell Transplantation (HSCT).
Sensors, 2020

Oort: Informed Participant Selection for Scalable Federated Learning.
CoRR, 2020

Effectively Prefetching Remote Memory with Leap.
Proceedings of the 2020 USENIX Annual Technical Conference, 2020

NetLock: Fast, Centralized Lock Management Using Programmable Switches.
Proceedings of the SIGCOMM '20: Proceedings of the 2020 Annual conference of the ACM Special Interest Group on Data Communication on the applications, 2020

Near-Optimal Latency Versus Cost Tradeoffs in Geo-Distributed Storage.
Proceedings of the 17th USENIX Symposium on Networked Systems Design and Implementation, 2020

Sol: Fast Distributed Computation Over Slow Networks.
Proceedings of the 17th USENIX Symposium on Networked Systems Design and Implementation, 2020

Fine-Grained GPU Sharing Primitives for Deep Learning Applications.
Proceedings of the Third Conference on Machine Learning and Systems, 2020

AlloX: compute allocation in hybrid clusters.
Proceedings of the EuroSys '20: Fifteenth EuroSys Conference 2020, 2020

2019
Mitigating the Performance-Efficiency Tradeoff in Resilient Memory Disaggregation.
CoRR, 2019

RDMA Performance Isolation With Justitia.
CoRR, 2019

Terra: Scalable Cross-Layer GDA Optimizations.
CoRR, 2019

Salus: Fine-Grained GPU Sharing Primitives for Deep Learning Applications.
CoRR, 2019

Near Optimal Coflow Scheduling in Networks.
Proceedings of the 31st ACM on Symposium on Parallelism in Algorithms and Architectures, 2019

Tiresias: A GPU Cluster Manager for Distributed Deep Learning.
Proceedings of the 16th USENIX Symposium on Networked Systems Design and Implementation, 2019

2018
Fair Allocation of Heterogeneous and Interchangeable Resources.
SIGMETRICS Perform. Evaluation Rev., 2018

AlloX: Allocation across Computing Resources for Hybrid CPU/GPU clusters.
SIGMETRICS Perform. Evaluation Rev., 2018

BoPF: Mitigating the Burstiness-Fairness Tradeoff in Multi-Resource Clusters.
SIGMETRICS Perform. Evaluation Rev., 2018

Distributed Lock Management with RDMA: Decentralization without Starvation.
Proceedings of the 2018 International Conference on Management of Data, 2018

Dynamic Query Re-Planning using QOOP.
Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

Mitigating the Latency-Accuracy Trade-off in Mobile Data Analytics Systems.
Proceedings of the 24th Annual International Conference on Mobile Computing and Networking, 2018

To Relay or Not to Relay for Inter-Cloud Transfers?
Proceedings of the 10th USENIX Workshop on Hot Topics in Cloud Computing, 2018

Monarch: Gaining Command on Geo-Distributed Graph Analytics.
Proceedings of the 10th USENIX Workshop on Hot Topics in Cloud Computing, 2018

Bridging the GAP: towards approximate graph analytics.
Proceedings of the 1st ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA), 2018

Pas de deux: Shape the Circuits, and Shape the Apps too!
Proceedings of the 2nd Asia-Pacific Workshop on Networking, 2018

2017
Decentralized Memory Disaggregation Over Low-Latency Networks.
login Usenix Mag., 2017

Resilient Datacenter Load Balancing in the Wild.
Proceedings of the Conference of the ACM Special Interest Group on Data Communication, 2017

Performance Isolation Anomalies in RDMA.
Proceedings of the Workshop on Kernel-Bypass Networks, 2017

Efficient Memory Disaggregation with Infiniswap.
Proceedings of the 14th USENIX Symposium on Networked Systems Design and Implementation, 2017

No!: Not Another Deep Learning Framework.
Proceedings of the 16th Workshop on Hot Topics in Operating Systems, 2017

2016
Fast and Accurate Performance Analysis of LTE Radio Access Networks.
CoRR, 2016

CODA: Toward Automatically Identifying and Scheduling Coflows in the Dark.
Proceedings of the ACM SIGCOMM 2016 Conference, Florianopolis, Brazil, August 22-26, 2016, 2016

EC-Cache: Load-Balanced, Low-Latency Cluster Caching with Online Erasure Coding.
Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation, 2016

Altruistic Scheduling in Multi-Resource Clusters.
Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation, 2016

HUG: Multi-Resource Fairness for Correlated and Elastic Demands.
Proceedings of the 13th USENIX Symposium on Networked Systems Design and Implementation, 2016

2015
Coflow: A Networking Abstraction for Distributed Data-Parallel Applications.
PhD thesis, 2015

Efficient Coflow Scheduling Without Prior Knowledge.
Proceedings of the 2015 ACM Conference on Special Interest Group on Data Communication, 2015

2014
Efficient coflow scheduling with Varys.
Proceedings of the ACM SIGCOMM 2014 Conference, 2014

2013
PolyViNE: policy-based virtual network embedding across multiple domains.
J. Internet Serv. Appl., 2013

Leveraging endpoint flexibility in data-intensive clusters.
Proceedings of the ACM SIGCOMM 2013 Conference, 2013

2012
Fast and Interactive Analytics over Hadoop Data with Spark.
login Usenix Mag., 2012

ViNEYard: Virtual Network Embedding Algorithms With Coordinated Node and Link Mapping.
IEEE/ACM Trans. Netw., 2012

FairCloud: sharing the network in cloud computing.
Proceedings of the ACM SIGCOMM 2012 Conference, 2012

Surviving failures in bandwidth-constrained datacenters.
Proceedings of the ACM SIGCOMM 2012 Conference, 2012

Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing.
Proceedings of the 9th USENIX Symposium on Networked Systems Design and Implementation, 2012

Coflow: a networking abstraction for cluster applications.
Proceedings of the 11th ACM Workshop on Hot Topics in Networks, 2012

A Case for Performance-Centric Network Allocation.
Proceedings of the 4th USENIX Workshop on Hot Topics in Cloud Computing, 2012

2011
Managing data transfers in computer clusters with orchestra.
Proceedings of the ACM SIGCOMM 2011 Conference on Applications, 2011

2010
A survey of network virtualization.
Comput. Networks, 2010

Topology-Awareness and Reoptimization Mechanism for Virtual Network Embedding.
Proceedings of the NETWORKING 2010, 2010

Spark: Cluster Computing with Working Sets.
Proceedings of the 2nd USENIX Workshop on Hot Topics in Cloud Computing, 2010

2009
Network virtualization: state of the art and research challenges.
IEEE Commun. Mag., 2009

Virtual Network Embedding with Coordinated Node and Link Mapping.
Proceedings of the INFOCOM 2009. 28th IEEE International Conference on Computer Communications, 2009

iMark: An identity management framework for network virtualization environment.
Proceedings of the Integrated Network Management, 2009

2007
DiskTrie: An Efficient Data Structure using Flash Memory for Mobile Devices.
Proceedings of the Workshop on Algorithms and Computation 2007, 2007

