core(script-elements): parallelize getting request contents #9713


Merged · 8 commits · Jan 14, 2020

Conversation

@connorjclark (Collaborator)

We don't need to request these resources in series.

Result: on average, cnn.com loads 1s faster.

Data:

[image: timing data comparing base vs. PR runs]

Data is from this script I have: https://github.com/GoogleChrome/lighthouse/blob/timings-script/lighthouse-core/scripts/timings.js

node lighthouse-core/scripts/timings.js --name base --collect -n 10 --lh-flags='--only-categories=performance' --urls https://www.buzzfeed.com/ https://www.cnn.com/  https://www.example.com/ https://www.nyt.com/ https://www.sfgate.com/
# apply changes, then ...
node lighthouse-core/scripts/timings.js --name pr --collect -n 10 --lh-flags='--only-categories=performance' --urls https://www.buzzfeed.com/ https://www.cnn.com/  https://www.example.com/ https://www.nyt.com/ https://www.sfgate.com/

# results
node lighthouse-core/scripts/timings.js --name base --summarize --measure-filter='afterPass:ScriptElements'
node lighthouse-core/scripts/timings.js --name pr --summarize --measure-filter='afterPass:ScriptElements'
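The gist of the change is swapping sequential awaits for a single Promise.all. A minimal sketch of the before/after shape (the `fetchContent` helper is a hypothetical stand-in for the real protocol call, not Lighthouse's actual gatherer code):

```javascript
// Hypothetical stand-in for a per-script protocol fetch (e.g. Network.getResponseBody).
async function fetchContent(id) {
  return `content-of-${id}`;
}

// Before: one request at a time; total time is the sum of all request latencies.
async function getContentsInSeries(ids) {
  const contents = [];
  for (const id of ids) {
    contents.push(await fetchContent(id));
  }
  return contents;
}

// After: all requests in flight at once; total time is roughly the slowest request.
// The per-request .catch isolates the artifact from any individual failure.
async function getContentsInParallel(ids) {
  return Promise.all(ids.map(id => fetchContent(id).catch(() => null)));
}
```

Both versions preserve result order, since `Promise.all` resolves positionally regardless of completion order.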

@brendankenny (Contributor) left a comment


These serial requests date from when we were wary of doing things out of order, so it's probably OK to parallelize. Ideally we could benchmark on a phone (e.g. WPT), since disk/memory contention will be much worse there, but we can hopefully trust the implementation to chunk appropriately; at least it's in afterPass. Looks like a good win on desktop, though!

@patrickhulce FYI, since the reasoning for going serial isn't discussed in #3950.

@connorjclark do you want to add ScriptElements to the byte-efficiency expectations (since unminified-javascript is already using it) and/or one of the other smoke tests so we can get coverage of it?

async: true,
defer: false,
source: 'body',
devtoolsNodePath: '2,HTML,1,BODY,1438,SCRIPT',
@connorjclark (Collaborator, Author)


This alternated between 1438 and 1437 until I changed the script to use an 8s delay (there's a 7s timer that adds an element).

@connorjclark (Collaborator, Author)

> do you want to add ScriptElements to the byte-efficiency expectations (since unminified-javascript is already using it) and/or one of the other smoke tests so we can get coverage of it?

Done. I verified the smoke test works on master too.

@patrickhulce (Collaborator)

> since disk/memory contention will be much worse there

This was the primary concern and the reason for doing it serially: if you flood the phone by requesting all the scripts at once while it's trying to serialize all of that JSON for the protocol, it adds memory pressure. If it's a device with low memory and it needs to start evicting things to handle the protocol traffic, then once a network file's contents are evicted from memory we basically lose out on them completely and get those Sentry errors ("Unable to read network response body" or whatever) that are otherwise very difficult to track down.

I don't have any data on how much more often this happens, and I imagine trace size with screenshots is a bigger memory-pressure concern that we don't do much about, so I won't protest too much here. But it'd be nice to keep some upper limit on the number of parallel requests whenever we try to read network file contents from the memory cache.

Love the .catch on it to isolate the artifact from individual failure though :)
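An upper limit like the one suggested here could be implemented as a small worker pool: N workers each pull the next unstarted task, so at most N protocol requests are ever in flight. A sketch under those assumptions (illustrative, not Lighthouse's implementation):

```javascript
// Run `tasks` (functions returning promises) with at most `limit` in flight at once.
// Results come back in the original task order; failures resolve to null.
async function runWithConcurrencyLimit(tasks, limit) {
  const results = new Array(tasks.length);
  let next = 0;
  // Each worker repeatedly claims the next unstarted task until none remain.
  // This is safe without locks because JS is single-threaded: the read of
  // `next` and its increment happen with no `await` in between.
  async function worker() {
    while (next < tasks.length) {
      const i = next++;
      results[i] = await tasks[i]().catch(() => null);
    }
  }
  const workers = Array.from({length: Math.min(limit, tasks.length)}, worker);
  await Promise.all(workers);
  return results;
}
```

With `limit` set to 1 this degenerates to the old serial behavior, and with `limit >= tasks.length` it behaves like plain `Promise.all`, so the same code path covers both extremes.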

@connorjclark (Collaborator, Author)

I needed some numbers to understand the memory footprint:
[image: memory usage measurements]

It seems reasonable that fetching the ScriptElements is the most memory-intensive burst of activity from LH, probably way more than screenshots, since those come in over a longer period of time. But do we really expect that devices are operating with only ~8MB of memory headroom?

> "Unable to read network response body"

FWIW I don't see these errors in Sentry.

@patrickhulce (Collaborator)

@connorjclark In Sentry it's the

  • Protocol error (Network.getResponseBody): No resource with given identifier found
  • Protocol error (Audits.getEncodedResponse): No resource with given identifier found
  • Protocol error (CSS.getStyleSheetText): No style sheet with given id found
  • Protocol error (Runtime.evaluate): Promise was collected
  • Protocol error (Network.getResponseBody): No data found for resource with given identifier

family of errors, which are all indicators that something was evicted at some point due to memory pressure, and all of which occur at a fairly high rate. To be clear, it's not the artifact sizes that are the memory-pressure concern. It's the entirety of network record bodies, decoded image data, parsed and compiled script, etc. stored by Chrome during page load that's at risk of being evicted as the totality of these things climbs. The errors occurring at all already indicate that Chrome decided it was under memory pressure in those situations, so if we increase our impact by 8MB I would expect us to lose out on 8MB more of assets in other gatherers. Requesting a lot of things at once over the protocol is just another way to increase this incidence rate, so if we can avoid it, it'd be nice to keep it limited, that's all :)

@connorjclark (Collaborator, Author)

connorjclark commented Oct 9, 2019

If we could detect that we are running in a memory-limited environment, would it be OK to branch behavior based on that?

Example...

if onMobileDevice or onLowMemoryDevice or benchmarkIndex < x .... etc.... *
    fetch in series
else
    fetch in parallel
end

* idk what this would be

@patrickhulce (Collaborator)

If we implement the max-parallelism approach I suggested, then we get the "branching" for free: we control concurrency based on benchmarkIndex (or whatever we like) without having to test multiple different codepaths :)
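For illustration, scaling the limit off BenchmarkIndex could be as simple as a lookup like this (the cutoff values and function name are invented for the example, not anything Lighthouse actually uses):

```javascript
// Map a BenchmarkIndex score to a request-concurrency limit.
// Cutoffs are illustrative only: a weak device collapses to the old
// serial behavior, while a fast desktop-class host goes wide.
function concurrencyForBenchmarkIndex(benchmarkIndex) {
  if (benchmarkIndex < 500) return 1;   // weak device: stay serial
  if (benchmarkIndex < 1000) return 4;  // mid-range: modest parallelism
  return 16;                            // desktop-class: go wide
}
```

The point of Patrick's comment is that this is a single code path: the same bounded-concurrency fetcher runs everywhere, and only the numeric limit changes.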

@connorjclark (Collaborator, Author)

@exterkamp can benchmark index be used reliably for this? I thought you concluded it was ... lacking

@patrickhulce (Collaborator)

> @exterkamp can benchmark index be used reliably for this? I thought you concluded it was ... lacking

@connorjclark It most definitely is lacking for performance-metric accuracy (see my conclusions at #9085), but here we're just scaling something totally nonessential. The scale of its inaccuracy is like "something that should be 1000 is 400", not "something that should be 25 is 1000", which wouldn't affect what we're talking about doing here at all :)

@connorjclark (Collaborator, Author)

@paulirish pointed out

const IsMobileHost = hostUserAgent.includes('Android') || hostUserAgent.includes('Mobile');

so we can do a binary parallel/serial split. We won't get the speedup for great mobile devices, but we won't be hurting them either.
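A sketch of that binary split, reusing the quoted UA check (the `getScriptContents` wrapper and `fetchContent` parameter are hypothetical names for illustration, not the actual gatherer API):

```javascript
// Decide serial vs. parallel fetching from the host's user agent,
// mirroring the check quoted above.
function isMobileHost(hostUserAgent) {
  return hostUserAgent.includes('Android') || hostUserAgent.includes('Mobile');
}

// fetchContent is an injected async (id) => content function.
async function getScriptContents(ids, hostUserAgent, fetchContent) {
  if (isMobileHost(hostUserAgent)) {
    // Mobile host: fetch one at a time to limit memory pressure.
    const contents = [];
    for (const id of ids) {
      contents.push(await fetchContent(id).catch(() => null));
    }
    return contents;
  }
  // Desktop host: fetch everything at once.
  return Promise.all(ids.map(id => fetchContent(id).catch(() => null)));
}
```

This trades the graduated BenchmarkIndex scaling discussed above for a simpler two-way decision keyed purely off the host UA.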

@patrickhulce (Collaborator) left a comment


LGTM

@patrickhulce (Collaborator)

@connorjclark still ready to land?

@connorjclark (Collaborator, Author)

yeah seems cool

@connorjclark connorjclark merged commit 56d1840 into master Jan 14, 2020
@connorjclark connorjclark deleted the faster-script-elements branch January 14, 2020 22:19