Sign in to view content

Sign in to view this lesson and continue learning.

End-to-End AI Applications Day1 Lecture

Module
36 mins
MLOpsLLMsCI/CD

Description

In this lecture, Zach talks about end to end AI apps, focusing on MLOps, evaluations, and reliability. He explains where agents break, like user behavior drift, code and tool changes via MCP, model upgrades, RAG data changes, and even tool or web downtime, plus latency and context window issues. He breaks down unit tests, integration tests, and end to end evaluation, and four evaluation styles, cosine distance, ROUGE, exact match, custom scoring, and LLM as a judge.