Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Observability Platform using Grafana with Loki, instead of Grafana with Prometheus #8689

Open
SimonPPledger opened this issue Dec 5, 2024 · 0 comments

Comments

@SimonPPledger
Copy link
Contributor

User Story

As a Modernisation Platform Engineer or a Modernisation Platform user
I need to use Observability Platform/Tools
So that I can verify the health/cost/security/usage/capacity of the Modernisation Platform or the Application (or Application Infrastructure)

Value / Purpose

This is to investigate using using Grafana with Loki, instead of the currentl (AP) solution of using Grafana with Prometheus - ie is this a better solution?
We can compare it to other Grafana based solutions and understand advantages and disadvantages of them all.

The platform should be assessed and compared to other solutions based on the following criteria:

ease of use (this could be, but is not limited to: documentation, user friendliness/experience from both end user and engineering user point of view, configuration)
management overhead
price
scalability
expandability
limitations

Context / Background

The Modernisation Platform has a need to observe the platform and application teams have a need to observe their applications or application infrastructure.

Currently Modernisation Platform uses a number of monitoring and alerting tools/platforms: Observability Platform (by AP), slack alerting, AWS Cloud Watch -> SNS -> PagerDuty -> slack notifications, Security Hub data.
Application teams may or may not be using observability tools to meet their observability needs.

There is a need to have a centralised place where Modernisation Platform and Modernisation Platform users can observe their workloads. This PoC is one of a number of solutions we are going to explore. The solution should also take it into account that this may be the future Observability Platform for the whole MoJ.

Useful Contacts

Aaron

Additional Information

Useful documents:

State of observability in Modernisation Platform
Observability Tools research
AP Observability Platform code
AP Observability Platform tenant module

Definition of Done

#8679

#8680

#8681

#8682

#8683

@SimonPPledger SimonPPledger added firebreak Mod Platform skunk works and removed firebreak Mod Platform skunk works labels Dec 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: To Do
Development

No branches or pull requests

1 participant