Trace the end-to-end time spent on handling GMessages #945

masih · 2025-04-16T09:52:23Z

Add detailed timing metrics that trace exactly where time is spent, starting from sub.Next on the granite topic and ending when the message is either buffered elsewhere for post-processing or handed to gpbft.

Context
In passive testing at 50% scale, we see "subscriber too slow" logs increase while the quorum of senders in the QUALITY phase drops to ~50%. When the buffer size was doubled to 256, the log rate decreased and the quorum of senders in the QUALITY phase increased to ~65%.
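
To illustrate why a larger buffer would reduce these logs, here is a minimal sketch. It assumes the log is emitted when a bounded per-subscriber buffer is full and the incoming message is dropped rather than blocking the sender. The function name deliverBurst and the burst size of 200 are hypothetical and chosen only for illustration; they are not taken from go-f3 or the pubsub library.

```go
package main

import "fmt"

// deliverBurst tries to enqueue n messages into a buffer of the given size
// while no consumer drains it, and returns how many messages were dropped.
// Hypothetical helper: it only models a non-blocking send into a bounded buffer.
func deliverBurst(bufSize, n int) (dropped int) {
	ch := make(chan int, bufSize)
	for i := 0; i < n; i++ {
		select {
		case ch <- i:
			// Message fits in the buffer.
		default:
			// Buffer is full: under the stated assumption, this is the point
			// where a "subscriber too slow" style log would be emitted.
			dropped++
		}
	}
	return dropped
}

func main() {
	// 128 is the buffer size implied by "doubled to 256"; 200 is an assumed burst.
	for _, size := range []int{128, 256} {
		fmt.Printf("buffer=%d burst=200 dropped=%d\n", size, deliverBurst(size, 200))
	}
}
```

A larger buffer only absorbs longer bursts. If processing is slower than the arrival rate on average, drops will still occur eventually, which is why the timing metrics are needed.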

We need these metrics to understand exactly what is slow in processing.
We know that a fair chunk of time is spent on fetching the committee (though the tests were run with power override, which significantly reduces the time spent on fetching the committee).
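
The per-stage timing described above could be sketched roughly as follows. This is an illustration only, not go-f3 code: StageTimer, manualClock, and the stage names ("validate", "fetch_committee", "handoff") are hypothetical, and a real implementation would more likely record the durations into the project's existing metrics rather than a map.

```go
package main

import (
	"fmt"
	"strings"
	"time"
)

// manualClock is a hypothetical test clock so the example output is reproducible.
// In real code, pass time.Now to NewStageTimer instead.
type manualClock struct{ t time.Time }

func (c *manualClock) Now() time.Time { return c.t }

// Advance moves the clock forward by ms milliseconds.
func (c *manualClock) Advance(ms int64) { c.t = c.t.Add(time.Duration(ms) * time.Millisecond) }

// StageTimer attributes elapsed time to named stages of handling one message.
type StageTimer struct {
	now    func() time.Time
	last   time.Time
	stages []string // stage names in first-seen order
	spent  map[string]time.Duration
}

// NewStageTimer starts timing at the moment of creation,
// for example right after sub.Next returns a message.
func NewStageTimer(now func() time.Time) *StageTimer {
	return &StageTimer{now: now, last: now(), spent: make(map[string]time.Duration)}
}

// Mark charges the time since the previous mark (or creation) to the given stage.
func (t *StageTimer) Mark(stage string) {
	n := t.now()
	if _, seen := t.spent[stage]; !seen {
		t.stages = append(t.stages, stage)
	}
	t.spent[stage] += n.Sub(t.last)
	t.last = n
}

// Spent returns the time charged to one stage.
func (t *StageTimer) Spent(stage string) time.Duration { return t.spent[stage] }

// Total returns the end-to-end time across all stages.
func (t *StageTimer) Total() time.Duration {
	var total time.Duration
	for _, d := range t.spent {
		total += d
	}
	return total
}

// Report formats the per-stage durations in the order the stages were marked.
func (t *StageTimer) Report() string {
	parts := make([]string, 0, len(t.stages)+1)
	for _, s := range t.stages {
		parts = append(parts, fmt.Sprintf("%s=%s", s, t.spent[s]))
	}
	parts = append(parts, fmt.Sprintf("total=%s", t.Total()))
	return strings.Join(parts, " ")
}

func main() {
	clk := &manualClock{}
	st := NewStageTimer(clk.Now) // message received
	clk.Advance(2)               // simulated work
	st.Mark("validate")
	clk.Advance(40) // simulated committee fetch
	st.Mark("fetch_committee")
	clk.Advance(1) // simulated buffering or hand-off to gpbft
	st.Mark("handoff")
	fmt.Println(st.Report())
}
```

Charging each interval to the stage that just finished keeps the stages contiguous, so the per-stage durations always sum to the end-to-end total.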


BigLep · 2025-04-22T01:37:17Z

Slack context: https://filecoinproject.slack.com/archives/C0556MSR945/p1744794111019479

BigLep · 2025-04-22T13:49:36Z

2025-04-22 conversation: the coding is done, but collecting the traces still requires operational work. That is probably another day of effort.

Our guess is this won't affect parameters for activation.

BigLep · 2025-05-02T15:58:36Z

2025-05-02 conversation: deferring because it's clear that most of the time is spent in committee and proposal fetch (sometimes upwards of 10 seconds), and we already have data on those. Optimizing those is Lotus work, and that is where we should spend time rather than on full end-to-end tracing.

github-project-automation bot added this to F3 Apr 16, 2025

github-project-automation bot moved this to Todo in F3 Apr 16, 2025

BigLep assigned masih Apr 22, 2025

BigLep moved this from Todo to In progress in F3 Apr 22, 2025

BigLep added this to the M4: Important optimization post activation milestone Apr 22, 2025

BigLep moved this from In progress to Todo in F3 Apr 24, 2025

masih removed their assignment May 2, 2025

BigLep modified the milestones: M4: Important optimization post activation, MX: Priority and sequencing TBD May 2, 2025
