Aug 2024

Introducing: The Monitors section

Release Date: Aug 1, 2024

Affected Sections: Workloads

Using the left navigation pane, you can now access a new section of the platform titled "Monitors". groundcover introduced Monitors a few months ago, adding a set of out-of-the-box alerts that act as a safety net that allows you to effortlessly stay on top of the most important issues and irregularities in your environment. Today, all of these alerts are centralized in one dedicated section of the platform, allowing you to instantly get an overview of the health and status of your environment.

The section is divided into two tabs:

Issues

Offers a quick view of all current issues in your environment. See the name and labels of the Monitor, as well as a timeline offering you a quick overview of the change in status of the Monitor, where each bar represents a time period (which changes according to the time range chosen in the time picker in the upper right corner of the screen) and the color of the status (red = Altering, grey = Normal). The percentage on the left of the timeline shows the percentage of the selected time range in which the Monitor was Alerting.

Clicking on any of the issues will offer you expanded information, including:

The name of the Monitor - which is comprised of the name of the workload, the criteria, the cluster's name, and the namespace.
A timeline - showing the Monitor's threshold (orange), the monitored value (purple), and the status of the Monitor (upper bar), over time. This can help you visualize what happened before or after the Monitor changed status. The ability to select a specific area on the timeline itself enables an even quicker drill-down into the problematic time window.
Additional related data for context - view Kubernetes events, logs related to the Monitor (including the ability to filter based on log level), and a map to visualize all interactions with the Monitor's workload.

Monitors

Centralizes all your Monitors in one place, in addition to the different tabs inside each workload's individual information page.

You can use this new section to answer critical questions:

Know if anything requires your attention - see which monitors are currently on "Alerting" status.
Know where the issue is taking place - see the name of the workload, cluster, and namespace, detailed in the "Name" column.
Know what the issue is - view the criteria detailed in the Name column (for example - "memory usage > 85%"), and the description and query details by clicking on the Monitor and opening the extended information screen.
Evaluate the scope of the issue - see the number of "Live issues" in that column. You can also see all the workloads affected by this issue and and their statuses.

Clicking on any Monitor will open its full details, including a description, the query details, its status over time, and a list of all related issues.

You can leverage the search bar and filters on the left to show only certain Monitors, based on multiple criteria, such as status, category, and whether or not the Monitor is silenced.

There are many additional developments ahead for this new section, which will increasingly become a central piece of the platform, where you will be able to create and edit Monitors, choose where to be notified about alerting Monitors, view API and infrastructure issues, and much more.

Last updated 1 year ago