How to run and deploy LLMs using Red Hat OpenShift AI on a Red Hat OpenShift Service on AWS cluster

Learning path • 7 resources • 1 hr and 4 mins • Published on August 16, 2024

Learn how to install the Red Hat® OpenShift® AI (RHOAI) operator and Jupyter notebook, create an Amazon S3 bucket, and run the LLM model on a Red Hat OpenShift Service on AWS (ROSA) cluster.

Disclaimer: this content is authored by Red Hat experts, but has not yet been tested on every supported configuration.

Learn how to install the Red Hat® OpenShift® AI (RHOAI) operator and Jupyter notebook, create an Amazon S3 bucket, and run the LLM model on a Red Hat OpenShift Service on AWS (ROSA) cluster.

Disclaimer: this content is authored by Red Hat experts, but has not yet been tested on every supported configuration.

Prerequisites for deploying LLMs using Red Hat OpenShift AI on a Red Hat OpenShift Service on AWS cluster

2 mins

Before beginning, you’ll need a Red Hat® OpenShift® Service on AWS (ROSA) cluster. If you don’t have one already, visit our learning path Getting Started with Red Hat OpenShift Service on AWS (ROSA) to view instructions for deploying a cluster using the console interface, which you can access with the button below.

Go to console

What will you learn?

How to deploy a ROSA cluster
Accessing the cluster console

What do you need before starting?

Red Hat account

Steps for meeting the prerequisites

Deploy your ROSA cluster.
1. In this learning path, we’ll use a single-AZ ROSA 4.15.10 cluster with m5.4xlarge node with auto-scaling enabled up to 10 nodes. The cluster has 64 vCPUs with ~278Gi memory.
2. Note that ROSA and RHOAI also support GPU, however, for the sake of simplicity, we’ll be using only CPU for compute in this guide.
3. Please be sure that you have admin access to the cluster.
Access the cluster console.

You are now ready to install Red Hat OpenShift AI (RHOAI) and Jupyter notebook in the next resource.

Get more support

Troubleshoot with Red Hat support

Previous resource

Overview: How to run and deploy LLMs using Red Hat OpenShift AI on a Red Hat OpenShift Service on AWS cluster

Next resource

Installing RHOAI and Jupyter notebook

This learning path is for operations teams or system administrators.

Developers might want to check out how to create a natural language processing (NLP) application using Red Hat OpenShift AI on developers.redhat.com.

Get started on developers.redhat.com

How to run and deploy LLMs using Red Hat OpenShift AI on a Red Hat OpenShift Service on AWS cluster

Resources in this path

Prerequisites for deploying LLMs using Red Hat OpenShift AI on a Red Hat OpenShift Service on AWS cluster

What will you learn?

What do you need before starting?

Steps for meeting the prerequisites

Get more support

Platforms

Tools

Try, buy, sell

Communicate

About Red Hat

Red Hat legal and privacy links

Red Hat legal and privacy links

How to run and deploy LLMs using Red Hat OpenShift AI on a Red Hat OpenShift Service on AWS cluster

View the resources in this path

Resources in this path

Prerequisites for deploying LLMs using Red Hat OpenShift AI on a Red Hat OpenShift Service on AWS cluster

What will you learn?

What do you need before starting?

Steps for meeting the prerequisites

Get more support

Platforms

Tools

Try, buy, sell

Communicate

About Red Hat

Red Hat legal and privacy links

Red Hat legal and privacy links