menu
techminis

A naukri.com initiative

google-web-stories
Home

>

Devops News

>

Google Kub...
source image

The New Stack

1M

read

293

img
dot

Image Credit: The New Stack

Google Kubernetes Engine Customized for Faster AI Work

  • Google Cloud enhanced Google Kubernetes Engine (GKE) for streamlined AI workloads at GoogleNext conference.
  • Hosted GKE-based supercomputing service with AI workload focus unveiled by Google Cloud.
  • Customers showcasing 900% growth in the use of AI-oriented GPUs and TPUs in the past year.
  • 15 top GKE customers are now utilizing the service for AI and ML workloads.
  • AI expected to generate over $200 billion in annual infrastructure cloud services by 2028.
  • GKE enhancements include support for Kubernetes standard Gateway API Inference Extension.
  • Introduction of GKE supercomputing service, Cluster Director, for large AI modeling jobs.
  • Gemini AI-based chat client, Gemini Cloud Assist Investigations, for debugging on GKE admin dashboard.
  • Public preview of GKE Inference Gateway for intelligent routing and load balancing for AI inference workloads.
  • Extension offers benefits like end-to-end observability, workload isolation, and optimized routing for multiple models.

Read Full Article

like

17 Likes

For uninterrupted reading, download the app