Combine draw commands to improve rendering performance #2421

douira · 2024-04-14T03:25:01Z

This PR combines draw commands that read from adjacent vertex data. This reduces the number of draw commands by around 30% and improves fps on my system (macOS with a 6900 XT) by up to 57%, depending on the scene and circumstances. As jellysquid stated on Discord, the improvement likely comes from reduced CPU overhead in the driver and better GPU occupancy.
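
To illustrate the idea, here is a minimal sketch of merging draw commands whose vertex ranges are adjacent. The `DrawCommandCombiner` class and `DrawCommand` record are hypothetical names made up for this example, not Sodium's actual types, and the sketch assumes each command is just a (first vertex, vertex count) pair in one shared buffer.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical illustration only; these are not Sodium's actual classes. */
final class DrawCommandCombiner {
    /** One draw command: a contiguous run of vertices in a shared vertex buffer. */
    record DrawCommand(int firstVertex, int vertexCount) {}

    /**
     * Merges each command into its predecessor when it starts exactly where the
     * predecessor ends. Order is preserved; only neighbours in the list are merged.
     */
    static List<DrawCommand> combine(List<DrawCommand> commands) {
        List<DrawCommand> combined = new ArrayList<>();
        for (DrawCommand next : commands) {
            int last = combined.size() - 1;
            if (last >= 0) {
                DrawCommand prev = combined.get(last);
                if (prev.firstVertex() + prev.vertexCount() == next.firstVertex()) {
                    // Adjacent in the buffer: extend the previous command instead of adding one.
                    combined.set(last, new DrawCommand(
                            prev.firstVertex(), prev.vertexCount() + next.vertexCount()));
                    continue;
                }
            }
            combined.add(next);
        }
        return combined;
    }

    public static void main(String[] args) {
        List<DrawCommand> input = List.of(
                new DrawCommand(0, 24),
                new DrawCommand(24, 12),  // adjacent to the previous range
                new DrawCommand(48, 8),   // gap: vertices 36..47 are not drawn
                new DrawCommand(56, 4));  // adjacent again
        System.out.println(combine(input)); // four commands become two
    }
}
```

Because only neighbouring commands are merged, how much can be combined depends on the order in which ranges end up in the buffer.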

Please test whether this results in a similar improvement or some other effect, as it's probably dependent on graphics card, memory bandwidth, and platform (OS, driver, vendor, etc.).

Here's a recording of the number of draw commands per pass:
ts on, before:
Draw total for pass Solid: 15531
Draw total for pass Cutout: 13277
Draw total for pass Translucent: 2298

ts on, after:
Draw total for pass Solid: 9571
Draw total for pass Cutout: 8306
Draw total for pass Translucent: 2298

ts off, before:
Draw total for pass Solid: 15531
Draw total for pass Cutout: 13277
Draw total for pass Translucent: 3812

ts off, after:
Draw total for pass Solid: 9571
Draw total for pass Cutout: 8306
Draw total for pass Translucent: 3645
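
As a quick check of the recording above, the relative reduction per pass can be computed directly from the listed counts. The `DrawCountReduction` class is a hypothetical helper written for this example; the numbers are copied from the "ts on" recording.

```java
import java.util.Locale;

/** Hypothetical helper for this example; the counts come from the recording above. */
final class DrawCountReduction {
    /** Percentage by which {@code after} is lower than {@code before}. */
    static double reductionPercent(int before, int after) {
        return 100.0 * (before - after) / before;
    }

    public static void main(String[] args) {
        String[] pass = {"Solid", "Cutout", "Translucent"};
        int[] before = {15531, 13277, 2298}; // ts on, before
        int[] after = {9571, 8306, 2298};    // ts on, after

        int totalBefore = 0;
        int totalAfter = 0;
        for (int i = 0; i < pass.length; i++) {
            System.out.printf(Locale.ROOT, "%s: %.1f%%%n",
                    pass[i], reductionPercent(before[i], after[i]));
            totalBefore += before[i];
            totalAfter += after[i];
        }
        System.out.printf(Locale.ROOT, "Total: %.1f%%%n",
                reductionPercent(totalBefore, totalAfter));
    }
}
```

For this recording that works out to roughly 38% fewer Solid commands, 37% fewer Cutout commands, and about 35% fewer commands across all three passes with ts on.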

Here are some screenshots taken without and with this patch. The fps numbers in them are outdated, since this branch has been updated in the meantime. See the newest comments at the bottom of this thread instead.

douira · 2024-05-08T03:00:54Z

Testing on Discord has shown that these changes can improve performance by around 35%, though the effect is highly variable and depends on specific combinations of many system and scene-related factors. There don't seem to have been any statistically significant regressions.

An even more radical optimization, which attempts to organize sections so that draw commands can also be combined across sections, did not yield useful results, but I suspect the implementation has a bug. It can be found here (link), but isn't included in this PR.

douira · 2024-05-08T03:01:44Z

I'm marking it as ready for review/merging

jellysquid3 · 2024-05-20T03:27:00Z

Because the changes from #2352 have been squashed into /dev, the pull request needs to be re-based to properly isolate the relevant changes.

douira · 2024-05-20T17:55:51Z

I think it's good to go now

# Conflicts:
#   common/src/main/java/net/caffeinemc/mods/sodium/client/render/chunk/compile/ChunkBuildBuffers.java
#   common/src/main/java/net/caffeinemc/mods/sodium/client/render/chunk/compile/tasks/ChunkBuilderMeshingTask.java
#   common/src/main/java/net/caffeinemc/mods/sodium/client/render/chunk/data/SectionRenderDataStorage.java
#   src/main/java/net/caffeinemc/mods/sodium/client/gl/util/VertexRange.java

douira · 2024-09-20T03:25:58Z

I updated the branch to dev, fixed some things, and changed others. The effect on performance seems to have increased. On my computer it now goes from 260 fps on dev to 410 fps (peak) on this branch. Note again that it will somewhat depend on what point of view each section was initially loaded from.

[Screenshots: this branch, then dev]

BlueGradientHorizon · 2024-09-21T11:27:31Z

I've tested the latest Sodium build artifact from your combine-draw-commands branch (575ecb5) and ran into some random texture glitches in one server's lobby. They don't always appear in the same spots.
Fabric loader: 0.16.5
FAPI: 0.104.0
No other mods present. I'm sorry if the problem is already known.

douira · 2024-09-26T23:24:42Z

The graphical corruption should be fixed now.

BlueGradientHorizon · 2024-09-28T09:56:32Z

Something's wrong with glass pane textures; this doesn't happen on beta 2.

And sometimes even

douira · 2024-09-29T20:13:56Z

That bug is fixed now.

douira · 2024-10-02T01:08:55Z

I can't think of anything else to add here. I would appreciate review/merging as appropriate. Testing happened on Discord, and generally there's been a significant performance improvement on some systems and at least no regressions on the rest. The latest few commits have also resulted in significant VRAM savings in specific scenarios and moderate savings in normal scenes.

Felix14-v2 · 2024-10-02T07:46:46Z

> and at least no regressions on the rest

What about the 5% regression on Nvidia?

douira · 2024-10-02T13:08:27Z

On your particular system (i5-8300H, GTX 1050) there seems to be a slight regression from 500 fps at RD 8, but this effect wasn't observed on other systems with Nvidia graphics cards.

# Conflicts:
#   common/src/main/java/net/caffeinemc/mods/sodium/client/gl/arena/GlBufferArena.java
#   common/src/main/java/net/caffeinemc/mods/sodium/client/gl/arena/GlBufferSegment.java
#   common/src/main/java/net/caffeinemc/mods/sodium/client/render/chunk/DefaultChunkRenderer.java
#   common/src/main/java/net/caffeinemc/mods/sodium/client/render/chunk/data/SectionRenderDataStorage.java
#   common/src/main/java/net/caffeinemc/mods/sodium/client/render/chunk/data/SectionRenderDataUnsafe.java

Radk6 · 2024-11-01T21:56:57Z

I don't know, but is this normal? It's like every block's textures are cash in a line. Turning on persistent mapping doesn't affect it. I'm just using Sodium with Sodium Extra.

Sodium doesn't support PojavLauncher so issues may happen there.

jellysquid3 · 2024-11-01T22:04:09Z

I'm locking this thread since it's been continually driven off-topic by Pojav Launcher users, despite the fact that we continue to tell them that their broken graphics drivers are not supported.

douira added 19 commits February 21, 2024 19:20

rename some things for clarity

d8f6517

fix waterlogged glass panes (once again, but more this time) by avoid…

e17aca9

…ing distance sorting through the detection of primary intersectors when geometry is intersecting and then sorting them in a fixed order

use Mth.clamp for clarity

0e9f45b

refactor buffer and sort result handling, buffers are now freed immed…

856f96d

…iately instead of keeping them to avoid memory usage buffer caching would be a better solution but that's complicated and doesn't currently work correctly

reduce number of unique triggers by around 5 percent without impactin…

f969b7f

…g sorting or building performance

importantly sort a little farther away, sort tasks are fast

e9c9062

use defer zero frames for important sort tasks by default

bee8d00

fix build

7d8587d

clarify authorship of BitArray

8ccba8c

fix bug with radix sort for SNR heuristic in BSP partition generating…

be07541

… wrong indexes

Merge branch 'dev' into ts-waterlogged-glass-panes

37f1f67

combine draw commands

2dd7f5e

correctly reset accumulated element count

d407a83

remove draw call combining for indexed rendering as it's broken and h…

2520c25

…ard to fix

skip heuristic if there's no quads

a84c18f

refactor primary intersector detection to handle large cases better,

298522f

also removed the warning message about unpartitionable geometry as it seems to not be a relevant problem

fix topo sorting in some situations where the dot product was wrongly…

6b7bc8f

… not recalculated when the normal is quantized. also fixed aligned quads not receiving the more accurate center based on the average of the unique vertexes.

reorder vertex ranges before uploaded to optimize for combined draw c…

4322aaf

…ommands

Merge branch 'ts-waterlogged-glass-panes' into combine-draw-commands

53c8e79

douira marked this pull request as ready for review May 8, 2024 03:01

douira added 3 commits May 9, 2024 05:44

tune primary intersector detection to handle situations where only a …

8953480

…small amount of geometry is intersecting

Merge branch 'dev' into ts-waterlogged-glass-panes

1907715

Merge branch 'dev' into ts-waterlogged-glass-panes

3da73cb

douira added 2 commits May 20, 2024 19:32

Merge branch 'ts-waterlogged-glass-panes' into combine-draw-commands

d4ac4c1

Merge branch 'dev' into combine-draw-commands

51fe61d

# Conflicts: # src/main/java/net/caffeinemc/mods/sodium/client/render/chunk/compile/ChunkBuildBuffers.java

Merge branch 'dev' into combine-draw-commands

47f11ac

douira mentioned this pull request Aug 11, 2024

[TS] Delete VertexRange, use isTranslucent, fix partition tree sorting on geometry with negative dot products #2655

Merged

douira added 2 commits September 20, 2024 02:09

fix draw command combining, remove aggressive non-empty command skipp…

575ecb5

…ing because it seems broken

douira added 2 commits September 27, 2024 00:53

fix graphical corruption when there's a lot of geometry by appropriat…

e936630

…ely picking the size of the required shared index buffer

cleanup unused and broken code

2af56fc

cleanup calculation of mask bit and element count

a395364

douira added 4 commits September 28, 2024 23:38

cleanup meshing, storage, and renderer

2163c9e

fix translucent rendering by correctly decoding vertex segments

c01a6bd

cleanup misc, remove unused code

fde4e60

refactor translucent AnyOrderData to not generate its own trivial ind…

59a9517

…ex buffer and instead share this type of data within regions

douira added 2 commits September 30, 2024 04:32

add Index Pool arena size

8334733

add arena content caching

35e374f

douira requested a review from jellysquid3 October 2, 2024 01:07

douira added 3 commits October 4, 2024 17:35

refactor storage to cope with larger amounts of geometry and use less…

85bc54e

… ugly hacks, rename a bunch of methods to be consistent and clearer

remove debug code

9b24127

This comment was marked as off-topic.

Merge branch 'dev' into combine-draw-commands

e962b37

This comment was marked as off-topic.

CaffeineMC locked as off-topic and limited conversation to collaborators Nov 1, 2024
