feat: extract server timings and measure individual retrievals #332

kylehuntsman · 2023-06-27T07:33:13Z

Pulls the Server-Timing response header logic into its own file and updates the retrieval event parsing to get timings for individual retrieval attempts. Events are now tracked per retrieval, whereas before we treated "retrieval" as a single phase and tracked all events under that phase. That was problematic because the second event of any given type, for example during a second concurrent retrieval, would overwrite the previous event and its timings.
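To illustrate the overwrite problem described above, here is a minimal sketch. The `event` type and the `byPhase` / `byRetrieval` helpers are hypothetical names invented for this example and are not the types used in the PR; they only show why keying by event type loses data when retrievals run concurrently.

```go
package main

import "time"

// event is a hypothetical, simplified stand-in for a retrieval event.
type event struct {
	Retrieval string        // identifier of the retrieval attempt, e.g. a peer ID
	Name      string        // event type, e.g. "connected-to-sp"
	At        time.Duration // offset from the start of the fetch
}

// byPhase keys timings by event type only, mimicking the old
// "retrieval as a phase" approach: a later event of the same type
// silently replaces the earlier one.
func byPhase(events []event) map[string]time.Duration {
	out := make(map[string]time.Duration)
	for _, e := range events {
		out[e.Name] = e.At
	}
	return out
}

// byRetrieval keys timings by retrieval first and event type second,
// so concurrent retrievals keep separate timings.
func byRetrieval(events []event) map[string]map[string]time.Duration {
	out := make(map[string]map[string]time.Duration)
	for _, e := range events {
		if out[e.Retrieval] == nil {
			out[e.Retrieval] = make(map[string]time.Duration)
		}
		out[e.Retrieval][e.Name] = e.At
	}
	return out
}
```

Given two `connected-to-sp` events from different peers, `byPhase` ends up with a single entry holding the later value, while `byRetrieval` keeps one entry per peer.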

The following is an example of the Server-Timing response header when fetching bafybeic56z3yccnla3cutmvqsn5zy3g24muupcsjtoyp3pu5pm5amurjx4/birb.mp4 via the daemon. I've added a newline after each comma-delimited item.

Server-Timing: 
started-finding-candidates;dur=0.1301;candidates-found=179140830;candidates-filtered=179186930,
retrieval-Bitswap;dur=179.38963,
retrieval-12D3KooWKGCcFVSAUXxe7YP62wiwsBvpCmMomnNauJCA67XbmHYj;dur=179.36533,
retrieval-12D3KooWPgBdZSbmKbD7ZQGjU7gZCcCKGvWSnBf1q4xAbpDdtJaJ;dur=179.44633;connected-to-sp=5900;first-byte-received=404370468;failed-retrieval=404419368,
retrieval-12D3KooWJ8YAF6DiRxrzcxoeUVjSANYxyxU55ruFgNvQB4EHibpG;dur=179.43173,
retrieval-QmUA9D3H7HeCYsirB3KmPSvZh3dNXMZas6Lwgr4fv1HTTp;dur=179.48633;connected-to-sp=151700;first-byte-received=498249083,
retrieval-12D3KooWDCXxiSsLi1NT9tsiyimwV6YstQkrjTjD2hAkz2KRVAGG;dur=179.49513;connected-to-sp=181700

Explanation

Because of the caveat described below, each "dur" field is actually the time elapsed since the started-fetch event, while each extra on a metric is the duration, in nanoseconds, since the beginning of that metric.
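For anyone consuming a header like the example above, a rough client-side parsing sketch follows. The `Metric` type and `ParseServerTiming` function are hypothetical names for illustration and are not part of this PR. The sketch assumes every extra is a plain integer (as in the example) and does not handle quoted parameter values.

```go
package main

import (
	"strconv"
	"strings"
)

// Metric is one comma-delimited entry of a Server-Timing header.
// This type is an assumption made for the example.
type Metric struct {
	Name   string
	Dur    float64          // value of the "dur" parameter
	Extras map[string]int64 // remaining name=value parameters, e.g. nanosecond durations
}

// ParseServerTiming splits a Server-Timing header value into metrics.
// It assumes unquoted values and integer extras.
func ParseServerTiming(value string) ([]Metric, error) {
	var metrics []Metric
	for _, item := range strings.Split(value, ",") {
		item = strings.TrimSpace(item)
		if item == "" {
			continue
		}
		parts := strings.Split(item, ";")
		m := Metric{Name: strings.TrimSpace(parts[0]), Extras: map[string]int64{}}
		for _, p := range parts[1:] {
			k, v, ok := strings.Cut(strings.TrimSpace(p), "=")
			if !ok {
				continue // parameter without a value; skipped in this sketch
			}
			if k == "dur" {
				d, err := strconv.ParseFloat(v, 64)
				if err != nil {
					return nil, err
				}
				m.Dur = d
				continue
			}
			n, err := strconv.ParseInt(v, 10, 64)
			if err != nil {
				return nil, err
			}
			m.Extras[k] = n
		}
		metrics = append(metrics, m)
	}
	return metrics, nil
}
```

Passing the header value from the example (without the `Server-Timing:` prefix) yields one `Metric` per line, with `connected-to-sp`, `first-byte-received`, and similar extras available in `Extras`.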

Caveats

We are unable to get the duration of the entire fetch or of successful retrievals because of when the headers are written. Since the headers are written before an HTTP Write occurs, we can only collect info about the retrievals up to the first-byte-received event that results in data being written to the client. The HTTP Write ends up occurring before the success and finished events are emitted, which cuts off the trailing events for any given retrieval. Because of this, the started-fetch, success, and finished events are not processed.
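The underlying constraint can be reproduced with Go's standard library alone. The sketch below is not code from this PR: `observedServerTiming` is a hypothetical helper that uses `httptest.NewRecorder` to show that a header added after the first body write does not appear in the response, which is why events emitted after that point cannot be reported.

```go
package main

import (
	"net/http"
	"net/http/httptest"
)

// observedServerTiming runs a handler that adds one Server-Timing value
// before the first body write and another one after it, then returns the
// values present in the recorded response.
func observedServerTiming() []string {
	handler := http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		w.Header().Add("Server-Timing", "before-write;dur=1")
		// The first Write implicitly commits the status and headers.
		_, _ = w.Write([]byte("first bytes"))
		// Too late: changes to the header map after Write have no effect
		// on regular (non-trailer) headers.
		w.Header().Add("Server-Timing", "after-write;dur=2")
	})

	rec := httptest.NewRecorder()
	req := httptest.NewRequest(http.MethodGet, "/example", nil)
	handler.ServeHTTP(rec, req)

	return rec.Result().Header.Values("Server-Timing")
}
```

Only the `before-write` value is returned, mirroring how the success and finished events arrive after the headers are already committed.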

codecov-commenter · 2023-06-27T07:43:21Z

Codecov Report

Merging #332 (05ec3c4) into feat/discreet-events (81a2850) will decrease coverage by 0.39%.
The diff coverage is 100.00%.

Additional details and impacted files

@@                   Coverage Diff                    @@
##           feat/discreet-events     #332      +/-   ##
========================================================
- Coverage                 76.40%   76.02%   -0.39%     
========================================================
  Files                        84       85       +1     
  Lines                      6361     6318      -43     
========================================================
- Hits                       4860     4803      -57     
- Misses                     1230     1244      +14     
  Partials                    271      271

Impacted Files	Coverage Δ
pkg/retriever/bitswapretriever.go	`93.42% <100.00%> (ø)`
pkg/server/http/ipfs.go	`65.96% <100.00%> (-3.52%)`	⬇️
pkg/server/http/servertimingssubscriber.go	`100.00% <100.00%> (ø)`

... and 10 files with indirect coverage changes

pkg/server/http/servertimingssubcriber.go

willscott · 2023-06-27T15:05:51Z

For the started timing on each of the retrievals, I would set that as the dur for the line. Having the first number named that way makes it render in standard browser tooling.
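One way to read this suggestion is sketched below. The `extra` type and `formatRetrievalMetric` function are hypothetical and not taken from the PR. The sketch assumes `dur` is expressed in milliseconds and that extras are emitted as integer nanoseconds, as in the example header in the description.

```go
package main

import (
	"strconv"
	"strings"
	"time"
)

// extra is a hypothetical name/value pair appended after dur.
type extra struct {
	Name  string
	Value time.Duration
}

// formatRetrievalMetric builds a single Server-Timing entry for one
// retrieval, using the retrieval's start offset as the dur parameter.
// Assumption: dur is in milliseconds, extras are in nanoseconds.
func formatRetrievalMetric(id string, started time.Duration, extras []extra) string {
	var b strings.Builder
	b.WriteString("retrieval-" + id)
	b.WriteString(";dur=")
	ms := float64(started) / float64(time.Millisecond)
	b.WriteString(strconv.FormatFloat(ms, 'f', -1, 64))
	// A slice keeps the extras in a stable, caller-defined order.
	for _, e := range extras {
		b.WriteString(";" + e.Name + "=" + strconv.FormatInt(e.Value.Nanoseconds(), 10))
	}
	return b.String()
}
```

For a retrieval that started 1.5ms into the fetch and connected after 5900ns, this produces `retrieval-Bitswap;dur=1.5;connected-to-sp=5900`.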

kylehuntsman requested review from willscott and rvagg June 27, 2023 07:33

kylehuntsman force-pushed the feat/server-timings branch from f66166c to 9e0518c Compare June 27, 2023 07:41

kylehuntsman mentioned this pull request Jun 27, 2023

feat: update event types to be discreet #321

Merged

willscott reviewed Jun 27, 2023

kylehuntsman force-pushed the feat/server-timings branch 2 times, most recently from c761c3f to c840136 Compare June 28, 2023 01:37

kylehuntsman requested a review from willscott June 28, 2023 01:38

kylehuntsman force-pushed the feat/discreet-events branch from 2879b02 to 81a2850 Compare June 28, 2023 03:11

fix: update failed bitswap event to failed retrieval

102eee3

kylehuntsman force-pushed the feat/server-timings branch 3 times, most recently from 8669bf4 to 5ae1fda Compare June 28, 2023 03:29

feat: extract server timings and measure individual retrievals

05ec3c4

kylehuntsman force-pushed the feat/server-timings branch from 5ae1fda to 05ec3c4 Compare June 28, 2023 03:34

willscott approved these changes Jun 28, 2023

kylehuntsman merged commit e51af12 into feat/discreet-events Jun 28, 2023

kylehuntsman deleted the feat/server-timings branch June 28, 2023 07:51

willscott mentioned this pull request Jul 6, 2023

Include per-provider bitswap interactions in response timing headers #348

Open
