Loading…
Thursday June 4, 2026 10:30 - 11:10 EEST
As organizations rush to adopt Large Language Models (LLMs), many discover that building reliable, trustworthy applications is far from straightforward. Unlike traditional software, LLM outputs are non-deterministic, context-dependent, and vulnerable to issues like bias, hallucinations, and prompt injection. Ensuring quality requires more than testing—it demands a holistic approach that blends architecture, safety, observability, and continuous feedback. This talk explores practical strategies for embedding quality into LLM-powered systems from the ground up. We’ll cover methods for prompt design, evaluation frameworks, guardrails, and hybrid architectures that improve accuracy and safety. Attendees will leave with a clearer understanding of how to balance innovation with reliability and how to design AI applications that are not only powerful but also consistent, secure, and user-focused.


Key takeaways:

  1. Testing LLMs requires new methods, not just old QA practices.Combine automation + human oversight for best results.
  2. Build feedback and safety into the system from the start.
  3. Quality is a continuous journey, not a release milestone.

Speakers
avatar for Craig Risi

Craig Risi

Head of Engineering, Old Mutual
Craig is a software enthusiast with over 20 years of experience across development, testing, and leadership, yet still claims to learn something new every day. Equal parts tech nerd and people person, he’s passionate about designing systems that prioritize quality in a fast-evolving... Read More →
Thursday June 4, 2026 10:30 - 11:10 EEST
BlackBox Kultuurikatel

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link