This repository has been archived by the owner on Jun 6, 2024. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 549
Inform the user when jobs status change #5337
Labels
Comments
the notification is also useful when the job succeeds. maybe the feature could be rephrased as: notifying the user when a job completes. |
Work ItemsPart 1: Job / User configuration
Part 2: monitor & trigger corresponding alerts P0
Part 3: alerts handling P0 #5492
Doc
|
This was referenced May 12, 2021
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
Motivation
Some jobs may fail unexpectedly.
If the users can be informed when the jobs fail, the users will be able to handle the issue in time.
This will save the users from checking their job status all the time.
Similar for other status changes.
Background:
Design
Workflow:
Part 1: Job / User configuration
extras
->jobStatusChangeNotification
email-user
action is available inalert-handler
Part 2: monitor & trigger corresponding alerts
design with DB
framework
table in DB:these columns are used to save job config & alerts state
framework-status-notification-poller
inalert-manager
, whichframework
tableframework
table after successfully sending alerts toalert-manager
Part 3: alerts handling
src/alert-manager/deploy/alert-manager-configmap.yaml
: add a newreceiver
and a newroute
alert-handler
: add an email templateinform-user-job-status-change
Archive
Problems of watching k8s
Framework
object: not stable, may miss certain status changeProposal 1
framework-status-notification-poller
inalert-manager
, whichProposal 2
Job Exporter:
Benefits: useful for
averageWaitingTime
,failingRate
, & other statisticsPrometheus:
The text was updated successfully, but these errors were encountered: