Djordje Jevdjic
Orcid: 0000-0002-3341-9364
According to our database1,
Djordje Jevdjic
authored at least 26 papers
between 2012 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
AttentionStore: Cost-effective Attention Reuse across Multi-turn Conversations in Large Language Model Serving.
CoRR, 2024
Proceedings of the 2024 USENIX Annual Technical Conference, 2024
Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2024
2023
Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, 2023
2022
Proceedings of the International IEEE Symposium on Performance Analysis of Systems and Software, 2022
Proceedings of the International IEEE Symposium on Performance Analysis of Systems and Software, 2022
Proceedings of the ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022
2021
ACM Comput. Surv., 2021
2020
CoRR, 2020
2018
2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, 2017
Proceedings of the 26th International Conference on Parallel Architectures and Compilation Techniques, 2017
2015
2014
Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, 2014
2013
Die-stacked DRAM caches for servers: hit ratio, latency, or bandwidth? have it all with footprint cache.
Proceedings of the 40th Annual International Symposium on Computer Architecture, 2013
From A to E: analyzing TPC's OLTP benchmarks: the obsolete, the ubiquitous, the unexplored.
Proceedings of the Joint 2013 EDBT/ICDT Conferences, 2013
2012
Quantifying the Mismatch between Emerging Scale-Out Applications and Modern Processors.
ACM Trans. Comput. Syst., 2012
Proceedings of the 39th International Symposium on Computer Architecture (ISCA 2012), 2012
Proceedings of the 30th International IEEE Conference on Computer Design, 2012
Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems, 2012