Metrics/GenevaActions for Clustersync #3785

rhamitarora · 2024-08-21T16:02:15Z

Which issue this PR addresses:

ARO-9545 and ARO-8659. Both Jira issues share common code.

What this PR does / why we need it:

Adds new clustersync metrics under the monitor package. SyncSets and SelectorSyncSets are merged into the same Geneva metric.
Adds a Geneva Action that shows the ClusterSync resource of a cluster.

Test plan for issue:

Unit tests added.
The corresponding metrics dashboard still needs to be created in Geneva.

Is there any documentation that needs to be updated for this PR?

TSGs will be created for the respective metrics.

How do you know this will function as expected in production?

By monitoring the Geneva dashboard.

LiniSusan

Changes look good to me.

bitoku · 2024-10-22T10:47:24Z

/azp run

azure-pipelines · 2024-10-22T10:47:39Z

Azure Pipelines successfully started running 2 pipeline(s).

github-actions · 2024-10-24T16:38:11Z

Please rebase pull request.

merging 8659 and 9545 Metrics for SyncSet and SelectorSyncSets

tsatam

Mostly looks good, just a few questions about the metric we emit to make sure the metric works for us downstream (dashboarding/alerting).

tsatam · 2024-11-20T20:04:12Z

pkg/monitor/cluster/clustersync.go

+		if clusterSync.Status.SyncSets != nil {
+			for _, s := range clusterSync.Status.SyncSets {
+				mon.emitGauge("hive.clustersync", 1, map[string]string{
+					"metric": "SyncSets",


nit: Having a dimension named "metric" on the metric might be a little confusing. Should we rename it to something else, for example syncType?
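To make the rename concrete, here is a minimal sketch of how the dimension map could be built with a syncType key. The helper name syncDimensions and the extra "name" and "result" keys are assumptions made for illustration; they are not taken from this PR.

```go
package main

// syncDimensions builds the dimension map for one hive.clustersync data point.
// Hypothetical helper: only the "syncType" key reflects the rename suggested
// in this comment; the "name" and "result" keys are assumed for the sketch.
func syncDimensions(syncType, name, result string) map[string]string {
	return map[string]string{
		"syncType": syncType, // e.g. "SyncSets" or "SelectorSyncSets"
		"name":     name,
		"result":   result,
	}
}
```

With a key like this, a dashboard can group or filter on syncType without a dimension that shares its name with the concept of the metric itself.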

tsatam · 2024-11-20T20:06:36Z

pkg/monitor/cluster/clustersync.go

+	if clusterSync != nil {
+		if clusterSync.Status.SyncSets != nil {
+			for _, s := range clusterSync.Status.SyncSets {
+				mon.emitGauge("hive.clustersync", 1, map[string]string{


question: Do we want to change what "value" we emit here based on the success/failure state of the syncset? For example, return 1 for Successful syncsets and 0 for failed syncsets?

This might make it easier for us to, for example, build downstream dashboards or alerts based off of this metric.
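A rough sketch of what that could look like, under stated assumptions: syncStatus is a stand-in for the per-syncset status entry, gaugeFunc stands in for the monitor's gauge emitter, and the "Success" result string and the dimension keys are assumed rather than taken from this PR.

```go
package main

// syncStatus is a hypothetical stand-in for one entry of
// clusterSync.Status.SyncSets or clusterSync.Status.SelectorSyncSets.
type syncStatus struct {
	Name   string
	Result string // assumed to be "Success" or "Failure"
}

// gaugeFunc is a hypothetical stand-in for the monitor's gauge emitter.
type gaugeFunc func(metric string, value int64, dims map[string]string)

// emitSyncStatuses emits one gauge per syncset: 1 when the result is
// "Success", 0 for anything else. Both kinds share the same metric name
// and are told apart by the syncType dimension.
func emitSyncStatuses(emit gaugeFunc, syncType string, statuses []syncStatus) {
	for _, s := range statuses {
		var value int64 // stays 0 unless the syncset succeeded
		if s.Result == "Success" {
			value = 1
		}
		emit("hive.clustersync", value, map[string]string{
			"syncType": syncType,
			"name":     s.Name,
			"result":   s.Result,
		})
	}
}
```

With values like these, an alert can fire when the minimum of the gauge drops to 0 for a cluster. As a side note, ranging over a nil slice in Go performs zero iterations, so a separate nil check before the loop is not required.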

tsatam · 2024-11-20T20:09:08Z

test/e2e/monitor.go

@@ -23,7 +23,9 @@ var _ = Describe("Monitor", func() {
 		wg.Add(1)
 		mon, err := cluster.NewMonitor(log, clients.RestConfig, &api.OpenShiftCluster{
 			ID: resourceIDFromEnv(),
-		}, &noop.Noop{}, nil, true, &wg)
+		}, &api.OpenShiftClusterDocument{


I think it might be worth adding E2E tests for both the monitor and Geneva Actions functionality. We have various contexts in which E2E runs against an RP with Hive enabled (production/release E2E, PR E2E after we move to the containerized implementation).

If it is too difficult to add the E2E tests in this PR, this can become follow-up work, however.

tsatam · 2024-11-20T20:11:34Z

pkg/monitor/cluster/cluster.go

 }

-func NewMonitor(log *logrus.Entry, restConfig *rest.Config, oc *api.OpenShiftCluster, m metrics.Emitter, hiveRestConfig *rest.Config, hourlyRun bool, wg *sync.WaitGroup) (*Monitor, error) {
+func NewMonitor(log *logrus.Entry, restConfig *rest.Config, oc *api.OpenShiftCluster, doc *api.OpenShiftClusterDocument, m metrics.Emitter, hiveRestConfig *rest.Config, hourlyRun bool, wg *sync.WaitGroup, hiveClusterManager hive.ClusterManager) (*Monitor, error) {


It's a little strange to me that we have both oc and doc here (as oc should be a subproperty of doc), and hiveRestConfig and hiveClusterManager.

I think there's an opportunity for us to deduplicate some of these dependencies, but that can be a follow-up refactor.
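One possible shape for that follow-up, sketched with stand-in types rather than the real api package: the constructor takes only the document and derives the cluster from it. All type and function names below are hypothetical, and the assumption that the document exposes the cluster as an OpenShiftCluster field is mine.

```go
package main

import "errors"

// Hypothetical stand-ins for api.OpenShiftCluster and
// api.OpenShiftClusterDocument; only the nesting matters here.
type openShiftCluster struct{ ID string }

type openShiftClusterDocument struct {
	OpenShiftCluster *openShiftCluster
}

// monitor keeps both references, but they come from a single argument.
type monitor struct {
	oc  *openShiftCluster
	doc *openShiftClusterDocument
}

// newMonitor derives oc from doc instead of accepting both, so callers
// cannot pass a cluster that disagrees with its document.
func newMonitor(doc *openShiftClusterDocument) (*monitor, error) {
	if doc == nil || doc.OpenShiftCluster == nil {
		return nil, errors.New("cluster document has no OpenShiftCluster")
	}
	return &monitor{oc: doc.OpenShiftCluster, doc: doc}, nil
}
```

The same idea could apply to hiveRestConfig and hiveClusterManager if one can be constructed from the other, but that depends on details not visible in this diff.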

rhamitarora force-pushed the rhamitarora/ARO-9545-syncset-metrics branch 5 times, most recently from 3cdbf8c to 8d1a6e9 Compare August 27, 2024 11:30

rhamitarora force-pushed the rhamitarora/ARO-9545-syncset-metrics branch 6 times, most recently from b5ac73b to 99fa8df Compare September 11, 2024 07:40

rhamitarora force-pushed the rhamitarora/ARO-9545-syncset-metrics branch 18 times, most recently from 08835f8 to 41a6e7c Compare September 17, 2024 12:43

rhamitarora marked this pull request as ready for review September 17, 2024 14:16

rhamitarora force-pushed the rhamitarora/ARO-9545-syncset-metrics branch 4 times, most recently from 8d33911 to afbaf3b Compare October 22, 2024 05:12

LiniSusan reviewed Oct 22, 2024

rhamitarora force-pushed the rhamitarora/ARO-9545-syncset-metrics branch from afbaf3b to 548e097 Compare October 22, 2024 09:24

rhamitarora requested a review from bitoku October 22, 2024 09:33

bitoku previously approved these changes Oct 22, 2024

rhamitarora dismissed bitoku’s stale review via 88d5029 October 23, 2024 09:27

rhamitarora force-pushed the rhamitarora/ARO-9545-syncset-metrics branch 2 times, most recently from 88d5029 to 99c51f8 Compare October 23, 2024 09:29

rhamitarora requested a review from bitoku October 23, 2024 10:13

bitoku reviewed Oct 23, 2024

rhamitarora force-pushed the rhamitarora/ARO-9545-syncset-metrics branch from 99c51f8 to a04c6bb Compare October 24, 2024 09:13

rhamitarora requested a review from bitoku October 24, 2024 09:14

bitoku previously approved these changes Oct 24, 2024

github-actions bot added the needs-rebase label Oct 24, 2024

rhamitarora dismissed bitoku’s stale review via 99c3f75 October 24, 2024 17:22

rhamitarora force-pushed the rhamitarora/ARO-9545-syncset-metrics branch from a04c6bb to 99c3f75 Compare October 24, 2024 17:22

github-actions bot removed the needs-rebase label Oct 24, 2024

rhamitarora requested a review from bitoku October 24, 2024 17:22

rhamitarora force-pushed the rhamitarora/ARO-9545-syncset-metrics branch 2 times, most recently from 60aee73 to e884d16 Compare October 25, 2024 03:45

Metrics for SyncSet and SelectorSyncSets

4522c6e

merging 8659 and 9545 Metrics for SyncSet and SelectorSyncSets

rhamitarora force-pushed the rhamitarora/ARO-9545-syncset-metrics branch from e884d16 to 4522c6e Compare October 28, 2024 06:28

bitoku approved these changes Nov 7, 2024

tsatam requested changes Nov 20, 2024
