Implement Staged Tile Loading Pipeline #779

csciguy8 · 2023-12-15T20:26:44Z

When loading tiles, implement a new staged pipeline where network requests are dispatched separately from processing work.

This achieves a 25-32% loading time reduction in some test cases.

Closes #746. Also closes #473 as a side effect.

Performance Tests Summary (Cesium for Unreal Performance Tests)

SampleLocaleDenver (cold cache) - 21% slow down
SampleLocaleDenver (warm cache) -15% slow down
SampleLocaleMelbourne (cold cache) - 5% improvement
SampleLocaleMelbourne (warm cache) - about the same
GoogleTiles.LocaleDeathValley (cold cache) - 32% improvement
GoogleTiles.LocaleDeathValley (warm cache) - 15% slow down
GoogleTiles.LocaleChrysler (cold cache) - 25% improvement
GoogleTiles.LocaleChrysler (warm cache) - 9% slow down

Complete Testing Data

cesium-native main
--------------------------------------------------------------------------
SampleLocaleDenver (27 MB)
	Cold cache - 2.02, 2.03, 2.08 | 2.96, 2.45, 2.09, 2.15 - 2.04 avg (best 3)
	Warm cache - 0.81, 0.90, 0.91 | 0.98, 0.98, 0.93, 0.94 - 0.87 avg (best 3)
SampleLocaleMelbourne (89 MB)
	Cold cache - 3.38, 3.38, 3.45 | 3.54, 4.10, 4.26, 3.54 - 3.40 avg (best 3)
	Warm cache - 1.75, 1.76, 1.81 | 1.96, 2.09, 1.81, 1.83 - 1.77 avg (best 3)
GoogleTiles.LocaleDeathValley (33 MB)
	Cold cache - 4.63, 4.75, 4.91 | 5.17, 5.08, 4.96, 4.96 - 4.76 avg (best 3)
	Warm cache - 1.92, 1.96, 1.98 | 5.07, 2.00, 2.03, 2.02 - 1.95 avg (best 3)
GoogleTiles.LocaleChrysler (62 MB)
	Cold cache - 6.69, 6.84, 6.84 | 7.04, 7.09, 6.98, 6.96 - 6.79 avg (best 3)
	Warm cache - 3.06, 3.16, 3.21 | 3.62, 3.26, 3.25, 3.26 - 3.14 avg (best 3)

this PR
--------------------------------------------------------------------------
SampleLocaleDenver (27 MB)
	Cold cache - 2.43, 2.46, 2.52 | 2.99, 2.86, 2.77, 3.05 - 2.47 avg (best 3)
	Warm cache - 0.97, 1.01, 1.04 | 1.12, 1.08, 1.09, 1.05 - 1.00 avg (best 3)
SampleLocaleMelbourne (89 MB)
	Cold cache - 3.19, 3.25, 3.27 | 4.14, 5.17, 3.31, 3.84 - 3.23 avg (best 3)
	Warm cache - 1.78, 1.81, 1.88 | 1.94, 2.03, 2.06, 1.95 - 1.82 avg (best 3)
GoogleTiles.LocaleDeathValley (33 MB)
	Cold cache - 3.22, 3.24, 3.24 | 3.25, 3.32, 3.38, 3.57 - 3.23 avg (best 3)
	Warm cache - 2.24, 2.25, 2.26 | 2.28, 2.29, 2.37, 2.38 - 2.25 avg (best 3)
GoogleTiles.LocaleChrysler (62 MB)
	Cold cache - 5.05, 5.16, 5.17 | 5.20, 5.20, 5.37, 5.17 - 5.12 avg (best 3)
	Warm cache - 3.39, 3.42, 3.47 | 3.51, 3.53, 3.60, 3.57 - 3.42 avg (best 3)

In Depth

...

Simplified view of existing tile loads

This PR introduces this structure

The best place to start looking at the code changes would be in Tileset:: _processWorkerThreadLoadQueue.

Previous load requests were handled immediately, but now they are now passed to TilesetContentManager, which transforms the requests into new data that TileWorkManager can work with, a generalized TileLoadWork. TileWorkManager handles the critical logic to queue / throttle / and execute TileLoadWork objects.

TO DO

…k executes

- Change _maxSimultaneousRequests to 28 for testing - Put loadProgress calculation into ViewUpdateResult instead of being determined on the fly - Put loadProgress kick hack back in for testing

Add assertion to view results Rename some members

This was just a test. In practice, the view can change frequently. We want to drop the work that doesn't make it.

No reason to bump this. Latest tests show minimal improvement

(still much to do here)

…processLoadRequests) Previously this was only happening in dispatchMainThreadTasks, at the beginning of update_view

- Handle completed work for newly dispatched work that completes immediately (like unit tests) - Add done loading notify for tiles that fail too (but don't count towards used bytes

csciguy8 · 2024-03-12T22:35:04Z

Another update to the performance test numbers...

Good news: the performance wins from this PR went up! (now 25-32% improvements for the google tests)

Mediocre news: although the inexplicable "warm cache is really slow" problem is gone, warm cache results are still slightly slower in this PR (9-21% slower). Denver cold cache falls into this category too. I have a pretty good reason to believe that the problem is here, which coincides with @kring 's recommendation to look out for main thread continuations.

The fix would be to dispatch processing work immediately, rather than waiting to dispatch on the main thread. Not a trivial change, but not an unreasonable amount of work either.

A snippet

SampleLocaleDenver (cold cache) - 21% slow down
SampleLocaleDenver (warm cache) -15% slow down
SampleLocaleMelbourne (cold cache) - 5% improvement
SampleLocaleMelbourne (warm cache) - about the same
GoogleTiles.LocaleDeathValley (cold cache) - 32% improvement
GoogleTiles.LocaleDeathValley (warm cache) - 15% slow down
GoogleTiles.LocaleChrysler (cold cache) - 25% improvement
GoogleTiles.LocaleChrysler (warm cache) - 9% slow down

kring · 2024-05-09T11:59:19Z

Cesium3DTilesSelection/src/TileWorkManager.cpp

+ // A response code of 0 is not a valid HTTP code
+ // and probably indicates a non-network error.


No, it probably indicates a valid response from a file:/// URL. It's annoying that libcurl returns a status code of 0 for a successful file read, but it does.

…een released When unloading tiles, make sure raster mapped tiles aren't loading, if so, wait for them to finish. This case can happen when moving very quickly through the world, where a tile is unloaded before it is finished loading.

In LayerJsonTerrainLoader::getLoadWork, when no quadtree tile ID is detected, don't skip adding all work, just the url request work.

electrum-bowie · 2024-07-26T17:53:19Z

Any progress on this ?

I can't wait for these improvements to come !

kring · 2024-07-26T18:24:45Z

This is on hold for the time being, due to other higher-priority work and because the performance we saw with this approach was a bit mixed (some things got notably faster, but others got slower).

electrum-bowie · 2024-07-26T18:26:48Z

Is there any other work being done for performance ?

csciguy8 added 30 commits November 3, 2023 15:10

Baby steps - Flatten child / parent / raster work into one container

eee88eb

Move mapOverlaysToTile to parsing of work

f6bf021

Move raster load throttling to Tileset

8b34976

Fix variant query

2c55d34

Let new ::getRequestWork get the request urls before doTileContentWor…

28f0778

…k executes

Rename TileLoadTask -> TileLoadRequest. Introduce RequestDispatcher

c411929

Remove unused members

83c1f62

Remove unused members

480e065

Dispatch TileLoadWork properly

08e21b3

Put this back in

217ec58

Misc fixups

5327545

Simplify RequestDispatcher creation and thread exit logic

34db4a4

Fix RequestDispatcher shutdown issues (crash, freeze)

d4eba7a

Run the dispatcher logic in a worker thread

30896c1

Misc changes to support raster tile requests

2d0f4da

Support raster tile requests properly

90999f5

Fixup ::computeLoadProgress to take RequestDispatcher into account

c56f2f1

Add proper raster work throttling

51651fe

Misc tweaks

b06d22c

- Change _maxSimultaneousRequests to 28 for testing - Put loadProgress calculation into ViewUpdateResult instead of being determined on the fly - Put loadProgress kick hack back in for testing

Cleanup load stats a little more

555c044

Add logging to request dispatcher

3818f51

Add assertion to view results Rename some members

Run format

bf05e7a

Remove tile kick hack in ::ComputeLoadProgress

3824edb

Fix CWT not loading in Melbourne test

1c89183

Remove proposed solution for melbourne freeze (it was something else)

2eeb071

Don't allow unlimited queueing of request work

20f2b1b

This was just a test. In practice, the view can change frequently. We want to drop the work that doesn't make it.

Change back to default 20

ea7bc88

No reason to bump this. Latest tests show minimal improvement

WIP for removing asset accessor from processing code

a6de2c6

(still much to do here)

Fix work ownership issues. Misc improvements

b503c6d

auto format

7655bdc

csciguy8 marked this pull request as ready for review March 4, 2024 20:29

This comment was marked as outdated.

Sign in to view

csciguy8 added 8 commits March 5, 2024 11:08

Add chance to dispatch processing before we leave ::processLoadRequests

15ec811

Add additional chance for tile to be set up for finalization (during …

07a05e9

…processLoadRequests) Previously this was only happening in dispatchMainThreadTasks, at the beginning of update_view

Fix warning (unused capture)

14761c0

Fixes found from unit tests

e5f686f

- Handle completed work for newly dispatched work that completes immediately (like unit tests) - Add done loading notify for tiles that fail too (but don't count towards used bytes

Fix up unit tests

1082b79

Another warning fix

36cf86d

Update stats logging to include main thread queue length

a31290c

Add tracking of total main thread loads

a16c3d5

This comment was marked as outdated.

Sign in to view

csciguy8 mentioned this pull request Mar 14, 2024

Fix Tileset::ComputeLoadProgress incorrectly reporting done, when main thread loading exists #831

Merged

This was referenced Apr 9, 2024

Runtime Integrations Performance Plan #698

Closed

Support 3D Tile I3dm legacy tile format #854

Merged

csciguy8 added 3 commits April 23, 2024 10:41

Merge branch 'main' into request-gap-refactor

04c923a

Fixup new unit test from merge

75146ce

Merge branch 'main' into request-gap-refactor

e527425

This comment was marked as outdated.

Sign in to view

kring reviewed May 9, 2024

View reviewed changes

This comment was marked as outdated.

Sign in to view

csciguy8 added 2 commits May 13, 2024 13:18

Fix bug where upsampled tiles weren't creating any processing work

70df413

In LayerJsonTerrainLoader::getLoadWork, when no quadtree tile ID is detected, don't skip adding all work, just the url request work.

This comment was marked as outdated.

Sign in to view

kring mentioned this pull request Jun 13, 2024

Map Tiles Terrain suddenly stops trying to load at times CesiumGS/cesium-unity#466

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Staged Tile Loading Pipeline #779

Implement Staged Tile Loading Pipeline #779

csciguy8 commented Dec 15, 2023 •

edited

Loading

This comment was marked as outdated.

This comment was marked as outdated.

csciguy8 commented Mar 12, 2024

This comment was marked as outdated.

kring May 9, 2024

This comment was marked as outdated.

This comment was marked as outdated.

This comment was marked as outdated.

This comment was marked as outdated.

electrum-bowie commented Jul 26, 2024

kring commented Jul 26, 2024

electrum-bowie commented Jul 26, 2024

		// A response code of 0 is not a valid HTTP code
		// and probably indicates a non-network error.

Implement Staged Tile Loading Pipeline #779

Are you sure you want to change the base?

Implement Staged Tile Loading Pipeline #779

Conversation

csciguy8 commented Dec 15, 2023 • edited Loading

Performance Tests Summary (Cesium for Unreal Performance Tests)

In Depth

TO DO

This comment was marked as outdated.

This comment was marked as outdated.

csciguy8 commented Mar 12, 2024

This comment was marked as outdated.

kring May 9, 2024

Choose a reason for hiding this comment

This comment was marked as outdated.

This comment was marked as outdated.

This comment was marked as outdated.

This comment was marked as outdated.

electrum-bowie commented Jul 26, 2024

kring commented Jul 26, 2024

electrum-bowie commented Jul 26, 2024

csciguy8 commented Dec 15, 2023 •

edited

Loading