Skip to content

awslabs/ai-on-eks

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

AI on EKS

(Pronounced: "AI on EKS")

๐Ÿ’ก Optimized Solutions for AI and ML on EKS

โš ๏ธ This repository is under active development as we support the new infrastructure format. Please raise any issues you may encounter

Build, Scale, and Optimize AI/ML Platforms on Amazon EKS ๐Ÿš€

Welcome to AI on EKS, your gateway to scaling AI and ML workloads on Amazon EKS. Unlock the potential of AI with a rich collection of Terraform Blueprints featuring best practices for deploying robust solutions with advanced logging and observability.

Explore practical patterns for running AI/ML workloads on EKS, leveraging the power of the Ray ecosystem for distributed computing. Utilize advanced serving solutions like NVIDIA Triton Server, vLLM for efficient and scalable model inference, and TensorRT-LLM for optimizing deep learning models.

Take advantage of high-performance NVIDIA GPUs for intensive computational tasks and leverage AWSโ€™s specialized hardware, including AWS Trainium for efficient model training and AWS Inferentia for cost-effective model inference at scale.

Note: AIoEKS is in active development. For upcoming features and enhancements, check out the issues section.

๐Ÿƒโ€โ™€๏ธGetting Started

In this repository, you'll find a variety of deployment blueprints for creating AI/ML platforms with Amazon EKS clusters. These examples are just a small selection of the available blueprints - visit the AIoEKS website for the complete list of options.

๐Ÿง  AI

๐Ÿš€ JARK-Stack on EKS ๐Ÿ‘ˆ This blueprint deploys JARK stack for AI workloads with NVIDIA GPUs.

๐Ÿš€ Generative AI on EKS ๐Ÿ‘ˆ Collection of Generative AI Training and Inference LLM deployment patterns

๐Ÿ“š Documentation

For instructions on how to deploy AI on EKS patterns and run sample tests, visit the AIoEKS website.

๐Ÿ† Motivation

Kubernetes is a widely adopted system for orchestrating containerized software at scale. As more users migrate their AI and machine learning workloads to Kubernetes, they often face the complexity of managing the Kubernetes ecosystem and selecting the right tools and configurations for their specific needs.

At AWS, we understand the challenges users encounter when deploying and scaling AI/ML workloads on Kubernetes. To simplify the process and enable users to quickly conduct proof-of-concepts and build production-ready clusters, we have developed AI on EKS (AIoEKS). AIoEKS offers opinionated open-source blueprints that provide end-to-end logging and observability, making it easier for users to deploy and manage Ray, vLLM, Kubeflow, MLFlow, Jupyter and other AI/ML workloads. With AIoEKS, users can confidently leverage the power of Kubernetes for their AI and machine learning needs without getting overwhelmed by its complexity.

๐Ÿค Support & Feedback

AIoEKS is maintained by AWS Solution Architects and is not an AWS service. Support is provided on a best effort basis by the AI on EKS community. If you have feedback, feature ideas, or wish to report bugs, please use the Issues section of this GitHub.

๐Ÿ” Security

See CONTRIBUTING for more information.

๐Ÿ’ผ License

This library is licensed under the Apache 2.0 License.

๐Ÿ™Œ Community

We welcome all individuals who are enthusiastic about AI on Kubernetes to become a part of this open source community. Your contributions and participation are invaluable to the success of this project.

Built with โค๏ธ at AWS.

About

AI on EKS - Tested AI/ML for Amazon Elastic Kubernetes Service

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 47