Conference Program

Please note:
On this page you will only see the English-language presentations of the conference. You can find all conference sessions, including the German speaking ones, here.

The times given in the conference program correspond to Central European Time (CET).

ENTFÄLLT: Lessons from the Trenches: What It Actually Takes to Properly Test GenAI Applications

Generative AI is a powerful asset if you know how to tame it. As this technology rapidly transforms the software landscape, one of the key challenges lies in effectively testing and validating GenAI applications. Traditional testing methodologies fall short in addressing the unique complexities posed by these systems, especially in enterprise environments.

Drawing from real-world experiences and hard-earned insights, we'll explore how to adapt established software engineering principles to the world of GenAI, and where entirely new approaches are necessary. Key topics include:

  • Strategies for testing non-deterministic outputs
  • Which metrics to choose and how to cope with conflicting metrics
  • Unit-Tests vs End-To-End-Tests in the World of GenAI
  • How to evaluate the evaluators: Aligning automatic evaluation with human evaluation
  • Techniques for evaluating model hallucinations and biases
  • Approaches to performance and scalability testing for GenAI systems
  • Methods for ensuring data privacy and security in AI-driven applications
  • Practical tools and frameworks for continuous testing in GenAI pipelines

Attendees will gain actionable knowledge on implementing robust testing practices for GenAI applications, bridging the gap between traditional QA and the demands of modern AI systems. This session is essential for developers, QA engineers, and technical leaders navigating the complexities of deploying GenAI in production environments.

Target Audience: AI engineers, software engineers, architects, and stakeholders working with GenAI applications
Prerequisites: Attendees should have a foundational understanding of AI
Level: Advanced

Steve Haupt, an agile software developer at andrena objects, views software development as a quality-driven craft. Fascinated by AI, he explores its implications for software craftsmanship, working on AI projects and developing best practices. Steve focuses on applying Clean Code and XP principles to AI development. He regularly speaks on AI and co-created an AI training course, aiming to bridge traditional software development with modern AI technologies for sustainable solutions.

More content from this speaker? Have a look at sigs.de: https://www.sigs.de/experten/steve-haupt/

Steve Haupt
18:30 - 20:00
Vortrag: Ndo 3

Vortrag Teilen