International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 7

United States | Computer Engineering | Volume 13 Issue 3, March 2024 | Pages: 630 - 633


Advancing AI: Enhancing Large Language Model Performance through GPU Optimization Techniques

Sriram Sagi

Abstract: This study delves into optimizing GPU utilization for supporting Large Language Models LLMs within Generative AI frameworks. Focusing on dynamic resource allocation, kernel optimization, and memory management, our investigation reveals significant improvements in LLM efficiency and performance. By integrating NVIDIAs advanced AI technologies, we propose a scalable, cost - effective approach for deploying AI applications at the enterprise level. The findings underscore the pivotal role of GPU optimization in enhancing AI accessibility and fostering innovation across diverse sectors.

Keywords: Large Language Models (LLMs), GPU Optimization, Generative AI, Artificial Intelligence Deployment

How to Cite?: Sriram Sagi, "Advancing AI: Enhancing Large Language Model Performance through GPU Optimization Techniques", Volume 13 Issue 3, March 2024, International Journal of Science and Research (IJSR), Pages: 630-633, https://www.ijsr.net/getabstract.php?paperid=SR24309100709, DOI: https://dx.doi.org/10.21275/SR24309100709


Download Article PDF


Rate This Article!

Received Comments

No approved comments available.


Top