Scaling AI Workloads with Kubernetes: Sharing GPU Resources Across Multiple Containers - Jack Ong

  Переглядів 3,494

The Linux Foundation

The Linux Foundation

7 місяців тому

Scaling AI Workloads with Kubernetes: Sharing GPU Resources Across Multiple Containers - Jack Min Ong, Jina AI
With the rise of AI and machine learning applications, GPU resources have become a critical bottleneck in scaling infrastructure to efficiently serve AI workloads. Kubernetes, an open-source container orchestration platform, provides a solution to this problem through the NVIDIA device plugin which allows multiple containers to share access to GPU devices. In this talk, we will explore how Kubernetes can be used to efficiently scale AI workloads by sharing GPU resources across multiple containers. We will discuss the challenges of GPU resource management, explore various techniques for optimizing GPU usage and set resource limits to ensure fair and efficient allocation of GPU resources among containers. By the end of this talk, attendees will have a solid understanding of how Kubernetes can be used to share GPU resources across multiple containers, allowing them to make the most of their GPU investments and achieve faster, more accurate results in their AI applications.

КОМЕНТАРІ
Mastering GPU Management in Kubernetes Using the Operator Pattern- Shiva Krishna Merla & Kevin Klues
47:53
Machine Learning on Kubernetes | Salman Iqbal
25:45
Kubernetes Community Days UK
Переглядів 2,4 тис.
LIVE - Парад Победы в Москве. 9 Мая 2024
2:27:56
AKIpress news
Переглядів 2,2 млн
How I Would Start Gamedev (if I had to start over)
9:02
Sasquatch B Studios
Переглядів 4 тис.
New GPT-4o VS GPT-4 - Ultimate Test (Prompts Included)
13:52
Skill Leap AI
Переглядів 23 тис.
Enabling Cost-Efficient LLM Serving with Ray Serve
30:28
Anyscale
Переглядів 3,3 тис.
Everything you Need to Know about using GPUs with Kubernetes - Rohit Agarwal, Google
31:33
CNCF [Cloud Native Computing Foundation]
Переглядів 8 тис.
Running Generative AI & LLM on a Kubernetes Cluster | Cloud Institute
30:32
Cloud Institute
Переглядів 4,2 тис.
How To Auto-Scale Kubernetes Clusters With Karpenter
26:58
DevOps Toolkit
Переглядів 22 тис.
Keynote: Accelerating AI Workloads with GPUs in Kubernetes - Kevin Klues & Sanjay Chatterjee
15:28
CNCF [Cloud Native Computing Foundation]
Переглядів 2,4 тис.
Improving GPU Utilization using Kubernetes - Maulin Patel & Pradeep Venkatachalam, Google
37:53
CNCF [Cloud Native Computing Foundation]
Переглядів 2,9 тис.
KubeRay: A Ray cluster management solution on Kubernetes
25:00
Anyscale
Переглядів 2,4 тис.
How much charging is in your phone right now? 📱➡️ 🔋VS 🪫
0:11
Why spend $10.000 on a flashlight when these are $200🗿
0:12
NIGHTOPERATOR
Переглядів 17 млн
поворотний механізм для антени
0:17
Lazeruk
Переглядів 14 тис.
How Neuralink Works 🧠
0:28
Zack D. Films
Переглядів 26 млн