Scaling AI Workloads with Kubernetes: Sharing GPU Resources Across Multiple Containers - Jack Ong

  3,481 views

The Linux Foundation

7 months ago

Scaling AI Workloads with Kubernetes: Sharing GPU Resources Across Multiple Containers - Jack Min Ong, Jina AI
With the rise of AI and machine learning applications, GPU resources have become a critical bottleneck in scaling infrastructure to serve AI workloads efficiently. Kubernetes, an open-source container orchestration platform, addresses this problem through the NVIDIA device plugin, which allows multiple containers to share access to GPU devices. In this talk, we will explore how Kubernetes can be used to scale AI workloads efficiently by sharing GPU resources across multiple containers. We will discuss the challenges of GPU resource management, explore various techniques for optimizing GPU usage, and show how to set resource limits to ensure fair and efficient allocation of GPU resources among containers. By the end of this talk, attendees will have a solid understanding of how Kubernetes can be used to share GPU resources across multiple containers, allowing them to make the most of their GPU investments and achieve faster, more accurate results in their AI applications.
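
The abstract does not say which sharing mechanism the talk uses. As one possible illustration, the sketch below assumes the NVIDIA device plugin's time-slicing option, in which each physical GPU is advertised as several schedulable `nvidia.com/gpu` units. The helper names (`time_slicing_config`, `gpu_pod`, `max_shared_pods`), the pod name, and the image `example.com/embedder:latest` are hypothetical and chosen only for this example. It builds the plugin's sharing config and a pod manifest as plain dictionaries and prints them as JSON.

```python
import json

# Extended resource name advertised by the NVIDIA device plugin.
GPU_RESOURCE = "nvidia.com/gpu"


def time_slicing_config(replicas: int) -> dict:
    """Sharing config that advertises each physical GPU as `replicas` units."""
    if replicas < 2:
        raise ValueError("time-slicing needs at least 2 replicas per GPU")
    return {
        "version": "v1",
        "sharing": {
            "timeSlicing": {
                "resources": [{"name": GPU_RESOURCE, "replicas": replicas}]
            }
        },
    }


def gpu_pod(name: str, image: str, gpus: int = 1) -> dict:
    """Pod manifest whose single container is limited to `gpus` GPU units.

    For extended resources, setting only `limits` is enough: Kubernetes
    uses the same value as the request.
    """
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "restartPolicy": "Never",
            "containers": [
                {
                    "name": name,
                    "image": image,  # hypothetical image name
                    "resources": {"limits": {GPU_RESOURCE: gpus}},
                }
            ],
        },
    }


def max_shared_pods(physical_gpus: int, replicas: int, gpus_per_pod: int = 1) -> int:
    """How many such pods the scheduler can place on the advertised units."""
    return (physical_gpus * replicas) // gpus_per_pod


if __name__ == "__main__":
    print(json.dumps(time_slicing_config(replicas=4), indent=2))
    print(json.dumps(gpu_pod("embedder", "example.com/embedder:latest"), indent=2))
    print(max_shared_pods(physical_gpus=2, replicas=4))
```

The JSON manifest can be passed to `kubectl apply -f`, which accepts JSON as well as YAML. Note that time-slicing interleaves workloads on the same GPU without isolating their memory, so a per-container limit of one unit bounds scheduling, not actual GPU memory use.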
