Members-Only
Recent Talks & Demos are for members only
You must be an AI Tinkerers active member to view these talks and demos.
Covalent: Deploying LLMs in Python
Learn how to fine‑tune an LLM, then deploy it as an API service using Covalent to orchestrate cloud resources directly from a Jupyter notebook.
What does it take to fine-tune and deploy a customized LLM on state-of-the-art cloud hardware? In this talk, we explore a fully Pythonic solution to this problem, using just a few extra lines on top of ordinary code. No cloud expertise is required to follow along. We’ll start with a simple example and scale things gradually to arrive at a powerful, high-compute workflow that creates a “model inference” service with a custom API—all within the confines of a Jupyter Notebook!
Covalent orchestrates Pythonic ML/HPC/Quantum workflows across heterogeneous compute environments.