ELEMENTARY CLOUDOne of the challenges data teams face is tracking and understand and collaborate on the status of data issues.
Tests fail daily, pipelines are executed frequently, alerts are sent to different channels.
There is a need for a centralized place to track:
What data issues are open? Which issues were already resolved?
Who is on it, and what’s the latest status?
Are multiple failures part of the same issue?
What actions and events happened since the incident started?
Did such issue happen before? Who resolved it and how?
In Elementary, these are solved with Incidents.A comprehensive view of all incidents can be found in the Incidents page.
Every failure or warning in Elementary will automatically open a new incident or be added as an event to an ongoing incident.
Based on grouping rules, different failures are grouped to the same incident.An incident has a status, assignee and severity.
These can be set in the Incidents page, or from an alert in integrations that support alert actions.
Each incident starts at the first failure, and ends when the status is changed manually or automatically to Resolved.
An incident is automatically resolved when the failing tests, monitors and / or models are successful again.