Eric D. Schabell: KubeCon EU 2023 - Beyond O11y at Cloud Native Scale and Tracing Adventures

Wednesday, November 30, 2022

KubeCon EU 2023 - Beyond O11y at Cloud Native Scale and Tracing Adventures

Next year in Amsterdam, the KubeCon and CloudNativeCon Europe 2023 conference kicks off from 17-21 April 2023.

There will be co-located events on Monday and Tuesday, which are one day events like Prometheus Day and Open Observability Day, then a three day event with the focus on open source and Cloud Native Computing Foundation (CNCF) projects. 

I've attended these events in both Europe and North America over the years but have yet to try speaking at one. That changes next year as I've jumped on their call for papers with several colleagues, one old and one new!

Below you'll find my submissions to the CNCF KubeCon and CloudNativeCon Eu event.

The first talk I've submitted with an old friend from my days at Red Hat, Marius Bogoevici, Chief Solutions Architect. 

Going beyond observability for running complex applications at cloud native scale

For many organizations, reliance on business-critical applications that need to be running 24/7, at scale, and with great resiliency has become a way of life. Among examples from many industries, the payments use case from financial services is one of the most illustrative: customers and merchants increasingly rely on real-time payments systems to carry out their day-to-day interactions.

This talk is based on real researched customer solutions focusing on open standards and open source tooling in the real-time payments domain as a foundation to discuss the problems of quickly identifying and remediating faults in complex business-critical applications. In such scenarios, it is not enough to simply collect, visualize, and aggregate data, but also to conquer the data overload and separate signal from noise.

Finally, the attendee will be presented with the observability pitfalls of running real-time payments at cloud native scale. We’ll share the challenges with observability data when using open source tooling like Prometheus, PromQL, OpenTelemetry and their open standards when your solution reaches true cloud native scale. Attendees take home actionable insights based on a real life large scale cloud native deployment and observability solutions.

Key takeaways

Attendees to this session will learn about the challenges to be faced when implementing solutions at cloud native scale, based on a real-time payments use case supported by multiple researched customer solutions. They will leave with actionable insights for their cloud native observability and solution architectures.

The second talk is with a new friend and colleague from Chronosphere, Paige Cruz, Senior Developer Advocate.

Tracing Adventures from Pull Request to Production in a Cloud Native World

Many things can go awry on the journey from pull request (PR) open to merge to production deployment. Issues can arise from the application code, layers of YAML configuration, underlying infrastructure or pipeline logic itself. How can distributed tracing and trace-derived metrics bring developers and operators together for troubleshooting paradise? We'll unpack a deploy gone bad from both vantage points, gaining an empathy for the engineer who needs to deploy their changes and an operations engineer who is responsible for keeping the CI/CD system up and running. With signals from openTelemetry we will demonstrate how increasing the observability of your deploy system can facilitate better collaboration and quicker troubleshooting.

Key takeaways

Attendees to this session will gain insights and actionable tips on how tracing has uses beyond production apps and is helpful to follow the path from PR to production. Also how developers and SRE have different understandings and mental models of that path which can make troubleshooting difficult and long-running. Finally, they'll see how tracing the deploy pipelines can help developers determine issues they can address and how they can help operators determine infrastructure/build system issues to address where it counts most.

Now it's wait and watch as the selection committee bows over all the great submissions they've received and we await their judgment!