-
Notifications
You must be signed in to change notification settings - Fork 298
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Observe and Prometheus/Alertmanager API queries broken after upgrade 4.12 -> 4.16 #1998
Comments
Observe page functions normally in 4.13.0-0.okd-2023-09-30-084937 |
Tested on 4.14.0-0.okd-2024-01-06-084517 Other than a bug with the plug-in rbac is working correctly for accessing the Observe page and querying the above endpoints. |
Breaks from 4.15.0-0.okd-2024-02-10-035534 |
A 4.16 cluster has now been built too and the problem also exists there. |
Thanks for testing. We'll take a look. |
hi guys In our case Grafana failed whith the exact same error message as @cannibalisticcow mentioned. Grafana used a custom SA and custom clusterrole with get,list,watch on all namespaces. This is because the cluster-monitoring-view clusterrole does not have list and watch.
which is not on older cluster. Hope this helps :) |
Hi folks, is this issue still broken when upgraded to 4.17? |
Describe the bug
A user that has access to view the Observe panel and make queries to the endpoints;
/api/prometheus/api/v1/rules
and
/api/alertmanager-user-workload/api/v2/alerts
On version 4.12 can no longer do this when the cluster is upgraded to 4.15.
The Observe panel shows Forbidden and when looking at the network response the problem seems to be;
Forbidden (user=robot, verb=get, resource=prometheuses, subresource=api)
When querying the user-workload endpoint the problem is
Forbidden (user=robot, verb=get, resource=alertmanagers, subresource=api)
I've been able to work around the problem by creating a new cluserrole and clusterrolebinding but feel like as this worked previously it should continue to work post upgrade. The user has a number of default custom roles applied to allow cluster metrics and state to be viewed.
Default Roles applied
Work around
Version
4.15.0-0.okd-2024-03-10-010116
How reproducible
We have multiple OKD clusters, all of the ones that have been upgraded from 4.12 to 4.15 are displaying this problem. The ones that are still at 4.12 continue to work.
I've not had the opportunity yet to see if this is a problem that was introduced in .13, .14, or .15
The text was updated successfully, but these errors were encountered: