Carole-Jean Wu
Orcid: 0000-0002-9032-7239Affiliations:
- Facebook AI Research
- Arizona State University, AZ, USA
According to our database1,
Carole-Jean Wu
authored at least 116 papers
between 2011 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
CoRR, 2024
CoRR, 2024
HeteroSwitch: Characterizing and Taming System-Induced Data Heterogeneity in Federated Learning.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2024
MAD-Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Eighth Workshop on Data Management for End-to-End Machine Learning, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Architectural CO<sub>2</sub> Footprint Tool: Designing Sustainable Computer Systems With an Architectural Carbon Modeling Tool.
IEEE Micro, 2023
Federated Ensemble Learning: Increasing the Capacity of Label Private Recommendation Systems.
IEEE Data Eng. Bull., 2023
Decoding Data Quality via Synthetic Corruptions: Embedding-guided Pruning of Code Data.
CoRR, 2023
Design Space Exploration and Optimization for Carbon-Efficient Extended Reality Systems.
CoRR, 2023
Towards MoE Deployment: Mitigating Inefficiencies in Mixture-of-Expert (MoE) Inference.
CoRR, 2023
CoRR, 2023
Proceedings of the 2023 USENIX Annual Technical Conference, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023
Proceedings of the 2nd Workshop on Sustainable Computer Systems, 2023
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023
2022
EdgeWise: Energy-efficient CNN Computation on Edge Devices under Stochastic Communication Delays.
ACM Trans. Embed. Comput. Syst., September, 2022
IEEE Trans. Computers, 2022
FEL: High Capacity Learning for Recommendation and Ranking via Federated Ensemble Learning.
CoRR, 2022
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022
Towards Fair Federated Recommendation Learning: Characterizing the Inter-Dependence of System and Data Heterogeneity.
Proceedings of the RecSys '22: Sixteenth ACM Conference on Recommender Systems, Seattle, WA, USA, September 18, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022
Understanding data storage and ingestion for large-scale deep recommendation model training: industrial product.
Proceedings of the ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022
ACT: designing sustainable computer systems with an architectural carbon modeling tool.
Proceedings of the ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022
Proceedings of the IEEE International Symposium on Workload Characterization, 2022
FedGPO: Heterogeneity-Aware Global Parameter optimization for Efficient Federated Learning.
Proceedings of the IEEE International Symposium on Workload Characterization, 2022
Hercules: Heterogeneity-Aware Inference Serving for At-Scale Personalized Recommendation.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022
A joint management middleware to improve training performance of deep recommendation systems with SSDs.
Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022
RecShard: statistical feature-based memory optimization for industry-scale neural recommendation.
Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022, 2022
2021
ACM Trans. Archit. Code Optim., 2021
Dynamic Temperature Management of Near-Sensor Processing for Energy-Efficient High-Fidelity Imaging.
Sensors, 2021
IEEE Micro, 2021
IACR Cryptol. ePrint Arch., 2021
Understanding and Co-designing the Data Ingestion Pipeline for Industry-Scale RecSys Training.
CoRR, 2021
Proceedings of the IEEE International Conference on Smart Computing, 2021
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021
Understanding and Improving Failure Tolerant Training for Deep Learning Recommendation with Partial Recovery.
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021
Proceedings of the MICRO '21: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021
RecPipe: Co-designing Models and Hardware to Jointly Optimize Recommendation Quality and Performance.
Proceedings of the MICRO '21: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2021
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2021
Proceedings of the ASPLOS '21: 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2021
2020
ACM Trans. Archit. Code Optim., 2020
IEEE Micro, 2020
CPR: Understanding and Improving Failure Tolerant Training for Deep Learning Recommendation with Partial Recovery.
CoRR, 2020
AutoScale: Optimizing Energy Efficiency of End-to-End Edge Inference under Stochastic Variance.
CoRR, 2020
Proceedings of the Third Conference on Machine Learning and Systems, 2020
AutoScale: Energy Efficiency Optimization for Stochastic Edge Inference Using Reinforcement Learning.
Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020
Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020
Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020
DeepRecSys: A System for Optimizing End-To-End At-Scale Neural Recommendation Inference.
Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020
Proceedings of the IEEE International Symposium on Workload Characterization, 2020
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2020
Proceedings of the GECCO '20: Genetic and Evolutionary Computation Conference, 2020
Proceedings of the 2020 Design, Automation & Test in Europe Conference & Exhibition, 2020
2019
Optimizing User Satisfaction of Mobile Workloads Subject to Various Sources of Uncertainties.
IEEE Trans. Mob. Comput., 2019
Configurable-ECC: Architecting a Flexible ECC Scheme to Support Different Sized Accesses in High Bandwidth Memory Systems.
IEEE Trans. Computers, 2019
CoRR, 2019
CoRR, 2019
Proceedings of the 6th International Workshop on Genetic Improvement, 2019
Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019
Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019
2018
DORA: Optimizing Smartphone Energy Efficiency and Web Browser Performance under Interference.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2018
LATTE-CC: Latency Tolerance Aware Adaptive Cache Compression Management for Energy Efficient GPUs.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2018
2017
Proceedings of the 44th Annual International Symposium on Computer Architecture, 2017
Understanding the thermal challenges of high-performance mobile devices with a detailed platform temperature model.
Proceedings of the 2017 IEEE International Symposium on Workload Characterization, 2017
Performance characterization, prediction, and optimization for heterogeneous systems with multi-level memory interference.
Proceedings of the 2017 IEEE International Symposium on Workload Characterization, 2017
2016
Using Low Cost Erasure and Error Correction Schemes to Improve Reliability of Commodity DRAM Systems.
IEEE Trans. Computers, 2016
RATT-ECC: Rate Adaptive Two-Tiered Error Correction Codes for Reliable 3D Die-Stacked Memory.
ACM Trans. Archit. Code Optim., 2016
Proceedings of the 2016 IEEE International Symposium on Workload Characterization, 2016
Proceedings of the 34th IEEE International Conference on Computer Design, 2016
Improving smartphone user experience by balancing performance and energy with probabilistic QoS guarantee.
Proceedings of the 2016 IEEE International Symposium on High Performance Computer Architecture, 2016
2015
E-ECC: Low Power Erasure and Error Correction Schemes for Increasing Reliability of Commodity DRAM Systems.
Proceedings of the 2015 International Symposium on Memory Systems, 2015
Proceedings of the 2015 IEEE International Symposium on Performance Analysis of Systems and Software, 2015
CAWA: coordinated warp scheduling and cache prioritization for critical warp acceleration of GPGPU workloads.
Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015
Characterization and Throttling-Based Mitigation of Memory Interference for Heterogeneous Smartphones.
Proceedings of the 2015 IEEE International Symposium on Workload Characterization, 2015
2014
ACM Trans. Embed. Comput. Syst., 2014
IEEE Comput. Archit. Lett., 2014
Proceedings of the 2014 IEEE International Symposium on Performance Analysis of Systems and Software, 2014
Quantifying the energy cost of data movement for emerging smart phone workloads on mobile platforms.
Proceedings of the 2014 IEEE International Symposium on Workload Characterization, 2014
ReMAP: Reuse and memory access cost aware eviction policy for last level cache management.
Proceedings of the 32nd IEEE International Conference on Computer Design, 2014
Proceedings of the 51st Annual Design Automation Conference 2014, 2014
Proceedings of the International Conference on Parallel Architectures and Compilation, 2014
2013
Performance, energy characterizations and architectural implications of an emerging mobile platform benchmark suite - MobileBench.
Proceedings of the IEEE International Symposium on Workload Characterization, 2013
2011
Adaptive timekeeping replacement: Fine-grained capacity management for shared CMP caches.
ACM Trans. Archit. Code Optim., 2011
Proceedings of the 44rd Annual IEEE/ACM International Symposium on Microarchitecture, 2011
Proceedings of the 44rd Annual IEEE/ACM International Symposium on Microarchitecture, 2011
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2011