Category

Research

Posted by

Sébastien Rannou

Sébastien Rannou

Research

Share

X (Twitter)

Linkedin

Share

How to monitor Ethereum validation keys at scale

June 22, 2022

How to monitor Ethereum validation keys at scale

Category

Research

Posted by

Sébastien Rannou

Sébastien Rannou

Research

Share

X (Twitter)

Linkedin

Share

‍

Our philosophy of Monitoring

‍

When operating production at scale, monitoring can be approached in two complementary ways:

‍

1️⃣ open-box monitoring, using internal signals from the stack,

2️⃣ closed-box monitoring, using external signals from the outside.

‍

The former is usually fine-grain but depends on the monitored stack itself, while the latter tends to be more generic but robust. Those two approaches are meant to be combined: for instance, if one of your internal stack stops to report metrics, chances are open-box monitoring will not catch it, while closed-box monitoring will. It also helps to have two pairs of independent eyes to watch a critical system and build up confidence in it.

‍

Landscape of Monitoring

‍

Most of the Ethereum validators today support Prometheus as a metric exporter, which can be used to pull-in metrics to build open-box dashboards and alerts, this approach is widely used and the most straightforward way to operate. For instance, the Prysm validators come with a ready-to-use Grafana dashboard build on top of their metrics.

‍

In large setups with client diversity, open-box monitoring tends to be complicated, because the signals on each validator type may be different, and finding a common set of signals that is meaningful and comparable between each type is challenging.

‍

Closed-box monitoring on the other hand is agnostic to the client type as it purely relies on metrics from the beaconchain: it implies to have a process that watches attestations on the beacon chain from the outside of your infrastructure. Tooling in this area is sparse and usually requires to build your own aggregator or instrument an existing one like Chaind to expose per-validator metrics to Prometheus.

‍

Enters rated-cli

‍

Another approach to closed-box monitoring is to use Rated Network, which offers statistics about validation keys via a public API.

With the help of the Rated Network team, we have developed a Prometheus exporter for Rated Network that can be configured to watch a set of validation keys to export them into a Prometheus stack.

‍

‍

The watcher is configured via a YAML config file containing validation keys to watch, it polls metrics about each validation key by using the Rated Network API every 24 hours and refreshes the exported Prometheus values. It also comes with a Grafana dashboard that can then be installed to watch key metrics:

‍

For now, the implementation supports the subset of signals from the Rated Network API:

uptime of a validator key
average correctness
attester effectiveness
proposer effectiveness
validator effectiveness

We have been using it internally for several weeks now and are happy to share it today with the community. Feel free to open PRs or issues!

‍

🙌 Thanks @ratedw3b @aimxhaisse and the whole engineering team

‍

Resources

Github repository: https://github.com/rated-network/rated-cli
Rated Network Discord: https://discord.gg/GCUfk8ryJS

‍

Recent news

What are Solana Network Extensions, and how are they different from Ethereum Layer-2s?

What are Solana Network Extensions, and how are they different from Ethereum Layer-2s?

Kiln and OSL partner to offer enterprise-grade ETH staking

Kiln and OSL partner to offer enterprise-grade ETH staking

Kiln Announces SOC 2 Type II Compliance Renewal

Kiln Announces SOC 2 Type II Compliance Renewal

io.finnet leverages Kiln Widget for enterprise-grade staking solutions

io.finnet leverages Kiln Widget for enterprise-grade staking solutions

Kiln joins the Tron Network as a Super Representative

Kiln joins the Tron Network as a Super Representative

Kiln welcomes Carlos Arena as VP of Sales – Americas

Kiln welcomes Carlos Arena as VP of Sales – Americas