LLM Deployment - Workshop

Main Details

Author Victor Calderon - LinkedIn
Date and Time Feb 28th, 2025 @ 12pm PT
Event Link Partiful Invite
Github Repo 2025-02-GenAI-workshop-LLM-Deployment
Documentation https://vcalderon2009.github.io/2025-02-GenAI-workshop-LLM-Deployment/
Slides Google Slides

Description

The following workshop will cover examples and guides for how to deploy Large-Language Models (LLMs) into a production environment.

We’ll cover the following:

  • How to set up an endpoint in HuggingFace.
  • How to use AWS infrastructure and Terraform to fully deploy a LLM into production.
  • Setting up GitHub’s CICD to deploy such applications.

Some of the tools that we’ll use in this workshop include, but not limited to:

  • Docker
  • FastAPI
  • Terraform
  • AWS infra (e.g. ECS, ECR, and more)

All of the source code can be found in the main repository in their corresponding branches.