303

Self-Healing Data Platforms – 3A’s: Automation, AI, and Alignment

Session Level 300
🕒 2024-10-03 13:00  •  📍 Room 1 - Keynote / Breakout  •  📄 Agenda: 13

In complex data platforms powered by AWS Glue, AppFlow, Airflow, and Step Functions, operational issues are inevitable. This talk shares how we built a self-healing system that automatically detects issues, tags them with context, uses Bedrock-based LLMs to suggest resolutions, and keeps stakeholders informed via GitLab and Slack. The result: a 40% drop in incident volume and significantly faster resolution times.

Key takeaways: • Using automation + AI for smarter incident workflows • Practical use of Bedrock in a data engineering context • Bridging operations and business with structured communication

Przemysław Mikulski
Przemysław Mikulski
Co-Founder, Data and Cloud Architect, Kodlot ApS
Speaker Profile →