Publications

Papers

  1. J. Stojkovic, C. Alverti, A. Andrade, N. Iliakopoulou, T. Xu, H. Franke, J. Torrellas. (March 2025). "Concord: Rethinking Distributed Coherence for Software Caches in Serverless Environments". To Appear in Proceedings of the 31st IEEE International Symposium on High-Performance Computer Architecture (HPCA). Paper: [PDF]
  2. J. Stojkovic, C. Zhang, Í. Goiri, J. Torrellas, E. Choukse. (March 2025). "DynamoLLM: Designing LLM Inference Clusters for Performance and Energy Efficiency". To Appear in Proceedings of the 31st IEEE International Symposium on High-Performance Computer Architecture (HPCA). Paper: [PDF]
  3. J. Stojkovic, E. Choukse, E. Saurez, Í. Goiri, J. Torrellas. (November 2024). "Mosaic: Harnessing the Micro-architectural Resources of Servers in Serverless Environments". In Proceedings of the 57th International Symposium on Microarchitecture (MICRO). Paper: [PDF]. Presentation: [PDF]
  4. J. Stojkovic, N. Iliakopoulou, T. Xu, H. Franke, J. Torrellas. (June 2024). "EcoFaaS: Rethinking the Design of Serverless Environments for Energy Efficiency ". In Proceedings of the 51st International Symposium on Computer Architecture (ISCA). Paper: [PDF]. Presentation: [PDF]
  5. J. Stojkovic, P. Misra, Í. Goiri, S. Whitlock, E. Choukse, M. Das, C. Bansal, J. Lee, Z. Sun, H. Qiu, R. Zimmermann, S. Samal, B. Warrier, A. Raniwala, R. Bianchini. (June 2024). "SmartOClock: Workload- and Risk-Aware Overclocking in the Cloud". Proceedings of the 51st International Symposium on Computer Architecture (ISCA). Paper: [PDF]. Presentation: [PDF]
  6. J. Stojkovic, T. Xu, H. Franke, J. Torrellas. (June 2023). "MXFaaS: Resource Sharing in Serverless Environments for Parallelism and Efficiency". In Proceedings of the 50th International Symposium on Computer Architecture (ISCA). Paper: [PDF]. Presentation: [PDF] Artifact: [GitHub]
  7. J. Stojkovic, C. Liu, M. Shahbaz, J. Torrellas. (June 2023). "µManycore: A Cloud Native CPU for Tail at Scale". In Proceedings of the 50th International Symposium on Computer Architecture (ISCA). Paper: [PDF]. Presentation: [PDF] [Selected as an Honorable Mention in IEEE Micro Top Picks from Computer Architecture Conferences]
  8. J. Stojkovic, T. Xu, H. Franke, J. Torrellas. (February 2023). "SpecFaaS: Accelerating Serverless Applications with Speculative Function Execution". In Proceedings of the 29th IEEE International Symposium on High-Performance Computer Architecture (HPCA). Paper: [PDF]. Presentation: [PDF]
  9. J. Stojkovic, N. Mantri, D. Skarlatos, T. Xu, J. Torrellas. (February 2023). "Memory Efficient Hashed Page Tables". In Proceedings of the 29th IEEE International Symposium on High-Performance Computer Architecture (HPCA). Paper: [PDF]. Presentation: [PDF]
  10. J. Stojkovic, D. Skarlatos, A. Kokolis, T. Xu, J. Torrellas. (March 2022). "Parallel Virtualized Memory Translation with Nested Elastic Cuckoo Page Tables". In Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS). Paper: [PDF]. Presentation: [PDF]
  11. G. Lan, Z. Liu, Y. Zhang, T. Scargill, J. Stojkovic, C. Joe-Wong, M. Gorlatova. (February 2022). "Edge-assisted Collaborative Image Recognition for Mobile Augmented Reality". ACM Transactions on Sensor Networks. Paper: [PDF]
  12. Z. Liu, G. Lan, J. Stojkovic, Y. Zhang, C. Joe-Wong, M. Gorlatova. (April 2020). "CollabAR: Edge-assisted Collaborative Image Recognition for Mobile Augmented Reality". In Proceedings of the International Conference on Information Processing on Sensor Networks (IPSN). Paper: [PDF] [Best Research Artifact Award]
  13. J. Stojkovic, M. Misic, J. Protic. (November 2019). "Collaboration Network Analysis of Scientific Production at UB-SEE". In 27th Telecommunications Forum (TELFOR). Paper: [PDF]

Preprints

  1. J. Stojkovic, C. Zhang, Í. Goiri, J. Torrellas, E. Choukse. (August 2024). "DynamoLLM: Designing LLM Inference Clusters for Performance and Energy Efficiency". CoRR, vol. abs/2408.00741. Paper: [PDF]
  2. L. Huang, A. Parayil, J. Zhang, X. Qin, C. Bansal, J. Stojkovic, P. Zardoshti, P. Misra, E. Cortez, R. Ghelman, Í. Goiri, S. Rajmohan, J. Kleewein, R. Fonseca, T. Zhu, R. Bianchini (April 2024). "Workload Intelligence: Punching Holes Through the Cloud Abstraction". CoRR, vol. abs/2404.19143. Paper: [PDF]

Workshops, Posters, Demo

  1. J. Stojkovic, E. Choukse, C. Zhang, Í. Goiri, J. Torrellas (April 2024). "Towards Greener LLMs: Bringing Energy-Efficiency to the Forefront of LLM Inference". 9th Workshop on Energy Efficient Machine Learning and Cognitive Computing (EMC2 '24) in conjunction with ASPLOS'24. Paper: [PDF]
  2. J. Stojkovic, T. Xu, H. Franke, J. Torrellas (April 2024). "UniCache: The Next 700 Caches for Serverless Computing". 5th International Workshop on Cloud Intelligence / AIOps (AIOps'24, in conjunction with ASPLOS'24). Paper: [PDF].
  3. N. Stojkovic, J. Stojkovic (April 2024). "OasisRPC: Hiding the Overheads of RPCs in Microservice Environments". 6th Young Architect Workshop (YArch'24, in conjuction with ASPLOS'24).
  4. J. Stojkovic, C. Liu, M. Shahbaz, J. Torrellas. (March 2023). "Hardware Support for Efficient and Secure Resource Harvesting in the Cloud". 5th Young Architect Workshop (YArch'23, in conjuction with ASPLOS'23).
  5. J. Stojkovic, T. Xu, H. Franke, J. Torrellas (October 2022). "Super Scalar Clouds". 7th Workshop on the Future of Computing Architectures (FOCA'22).
  6. J. Stojkovic, J. Torrellas. (March 2022). "Nested Elastic Cuckoo Page Tables". NSF Arch-1 Workshop.
  7. J. Stojkovic, G. Lan, M. Gorlatova. (July 2019). "Edge Computing Platform for Collaborative Augmented Reality". Duke Summer REU Symposium.
  8. J. Stojkovic, Z. Liu, G. Lan, C. Joe-Wong, M. Gorlatova. (November 2019). "Demo: Edge-assisted Collaborative Image Recognition for Augmented Reality". In ACM Conference on Embedded Networked Sensor Systems (SenSys). Paper: [PDF]