kubernetes

Tag

Cards List
#kubernetes

Here's why data center company IREN bought cloud-native power Mirantis

Reddit r/ArtificialInteligence · 20h ago Cached

IREN acquires Mirantis for $625 million to integrate its cloud-native Kubernetes and AI infrastructure software into IREN's data centers, aiming to offer a full AI cloud platform.

0 favorites 0 likes
#kubernetes

@smratitiwa86867: This is wild. Ex-Google engineers just dropped a full map of their internal tools… and the exact open-source versions y…

X AI KOLs Timeline · yesterday

Ex-Google engineers published a map of Google's internal tools and their open-source equivalents, providing a cheat code for building scalable infrastructure.

0 favorites 0 likes
#kubernetes

Kubernetes v1.36: Haru

Lobsters Hottest · 2026-04-23 Cached

Kubernetes v1.36 “Haru” ships 70 enhancements—18 stable, 25 beta, 25 alpha—plus deprecations and removals.

0 favorites 0 likes
#kubernetes

Is there a clean way to define AI workloads that can run across different GPU providers without provider-specific configuration[D]

Reddit r/MachineLearning · 2026-04-23

Developer explores how to abstract GPU workloads so they can run across multiple GPU providers without provider-specific configuration, leaning toward separating workload definition from infrastructure binding.

0 favorites 0 likes
#kubernetes

@K8sFM: ByteDance open sources Gödel, their high-performance Kubernetes scheduler, to give back to the open source community Wa…

X AI KOLs Timeline · 2026-04-20

ByteDance has open-sourced Gödel, a high-performance Kubernetes scheduler, contributing it to the open-source community.

0 favorites 0 likes
#kubernetes

Advancing Open Source AI, NVIDIA Donates Dynamic Resource Allocation Driver for GPUs to Kubernetes Community

NVIDIA Blog · 2026-03-24 Cached

NVIDIA is donating its Dynamic Resource Allocation (DRA) Driver for GPUs to the Cloud Native Computing Foundation (CNCF) and Kubernetes community, moving it from vendor-governed to community-owned. The donation aims to simplify GPU resource management in Kubernetes for AI workloads and includes GPU support for Kata Containers through collaboration with CNCF's Confidential Containers community.

0 favorites 0 likes
#kubernetes

Scaling Kubernetes to 7,500 nodes

OpenAI Blog · 2021-01-25 Cached

OpenAI shares detailed lessons learned from scaling a single Kubernetes cluster to 7,500 nodes to support large machine learning workloads, covering networking, scheduling, and infrastructure challenges. The post builds on their earlier experience scaling to 2,500 nodes and aims to help the broader Kubernetes community.

0 favorites 0 likes
#kubernetes

Scaling Kubernetes to 2,500 nodes

OpenAI Blog · 2018-01-18 Cached

OpenAI shares infrastructure lessons from scaling Kubernetes to 2,500 nodes, detailing optimizations for container image pulls including kubelet configuration changes, Docker overlay2 migration, and preloading strategies to resolve Pending pod issues.

0 favorites 0 likes
#kubernetes

Infrastructure for deep learning

OpenAI Blog · 2016-08-29 Cached

OpenAI shares their deep learning infrastructure approach and open-sources kubernetes-ec2-autoscaler, a batch-optimized scaling manager for Kubernetes, emphasizing how infrastructure quality multiplies research progress.

0 favorites 0 likes
← Back to home

Submit Feedback