Srinivas Kola. (2024). Monitoring and scaling GPU workloads in production with Nvidia DCGM and Prometheus. ISCSITR - INTERNATIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING (ISCSITR-IJSRAIML) ISSN (Online): 3067-753X, 5(2), 8-28. https://doi.org/10.63397/ISCSITR-IJSRAIML_05_02_002