Cloud-Native AI: Challenges and Innovations in Deploying Large-Scale Machine Learning Models

Authors

  • Rahul Amte Cloud AI Infrastructure Engineer, Nivid Infotech Inc, Austin, TX, USA. Author

Keywords:

Cloud, AI, Machine Learning, Deployment

Abstract

Large scale machine learning models are deployed with revolutionary speed in cloud native AI for the reasons of scalability, flexibility and cost efficiency. However, most of these challenges face a latency problem, resource management issue, and most of all security risk. This paper considers challenges in producing cloud native AI, such as containerization, microservices, and optimized inference pipelines and then it studies innovations in these areas. By qualitative and quantitative analysis, we examine best practices of deploying AI models effectively in cloud environment. The results indicate ways of cost effective and high-performance AI implementation, with a focus on automation, edge computation, and serverless hands in increasing the deployment of AI on a sustainable basis.

References

Lu, Y., Bian, S., Chen, L., He, Y., Hui, Y., Lentz, M., ... & Zhuo, D. (2024). Computing in the era of large generative models: From cloud-native to ai-native. arXiv preprint arXiv:2401.12230. https://doi.org/10.48550/arXiv.2401.12230

Gupta, A., & Chaturvedi, Y. Cloud-Native ML: Architecting AI Solutions for Cloud-First Infrastructures. 10.62441/nano-ntp.v20i7.4004

Sikha, V. K. Cloud-Native Application Development for AI-Conducive Architectures. https://ijritcc.org/index.php/ijritcc/article/view/11325

BRUCE WILLIAMS, J. K., & SAMAD, A. (2024). SCALABLE CLOUD-NATIVE SOLUTIONS FOR AI WORKLOADS WITH .NET AND KUBERNETES. https://www.researchgate.net/profile/Bruce-William-2/publication/387089591_SCALABLE_CLOUD-NATIVE_SOLUTIONS_FOR_AI_WORKLOADS_WITH_NET_AND_KUBERNETES/links/676032fe996d2552c3ec776a/SCALABLE-CLOUD-NATIVE-SOLUTIONS-FOR-AI-WORKLOADS-WITH-NET-AND-KUBERNETES.pdf

Oladoja, T. (2024). Artificial Intelligence-Driven Innovations in VLSI, DevOps Security, and Cloud-Native Platforms: Addressing Challenges in Modern Technology Development. https://www.researchgate.net/profile/Oladoja-Timilehin/publication/386991785_Artificial_Intelligence-Driven_Innovations_in_VLSI_DevOps_Security_and_Cloud-Native_Platforms_Addressing_Challenges_in_Modern_Technology_Development/links/675bd1fdda24c8537c69f6c6/Artificial-Intelligence-Driven-Innovations-in-VLSI-DevOps-Security-and-Cloud-Native-Platforms-Addressing-Challenges-in-Modern-Technology-Development.pdf

Oyeniran, O. C., Modupe, O. T., Otitoola, A. A., Abiona, O. O., Adewusi, A. O., & Oladapo, O. J. (2024). A comprehensive review of leveraging cloud-native technologies for scalability and resilience in software development. International Journal of Science and Research Archive, 11(2), 330-337. https://doi.org/10.30574/ijsra.2024.11.2.0432

Nguyen, H. (2021). Machine Learning Model Management using Cloud-Native Technologies for IoT. https://urn.fi/URN:NBN:fi:aalto-202101311775

Anbalagan, K. (2024). AI in Cloud Computing: Enhancing Services And Performance. International Journal of Computer Engineering And Technology (IJCET), 15(4), 622-635. https://lib-index.com/index.php/IJCET/article/view/IJCET_15_04_055

KODAKANDLA, N. (2021). Serverless Architectures: A Comparative Study of Performance, Scalability, and Cost in Cloud-native Applications. Iconic Research And Engineering Journals, 5(2), 136-150. https://www.researchgate.net/profile/Naveen-Kodakandla/publication/386876894_Serverless_Architectures_A_Comparative_Study_of_Performance_Scalability_and_Cost_in_Cloud-native_Applications/links/675a697472215358fe2b77bd/Serverless-Architectures-A-Comparative-Study-of-Performance-Scalability-and-Cost-in-Cloud-native-Applications.pdf

Thatikonda, K. C. AUTOMATING REGULATORY COMPLIANCE IN CLOUD-NATIVE ARCHITECTURES: A DEEP LEARNING PERSPECTIVE. https://www.researchgate.net/profile/Kalyan-Chakravarthy-Thatikonda/publication/389550950_AUTOMATING_REGULATORY_COMPLIANCE_IN_CLOUD-NATIVE_ARCHITECTURES_A_DEEP_LEARNING_PERSPECTIVE/links/67c734dd8311ce680c7cada8/Automating-Regulatory-Compliance-in-Cloud-Native-Architectures-A-Deep-Learning-Perspective.pdf

Downloads

Published

2025-03-21

How to Cite

Rahul Amte. (2025). Cloud-Native AI: Challenges and Innovations in Deploying Large-Scale Machine Learning Models. ISCSITR - INTERNATIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING (ISCSITR-IJSRAIML) ISSN (Online): 3067-753X, 6(2), 9-18. https://iscsitr.in/index.php/ISCSITR-IJSRAIML/article/view/ISCSITR-IJSRAIML_06_02_002