
samples: net: zperf: Optimize configuration for better performance #75281

Merged

Conversation

@rlubos (Contributor) commented Jul 1, 2024:

Increase the number of network packets and buffers for better out-of-the-box TCP performance in the sample.

Decrease the network buffer data size for better buffer management in the sample (less buffer space wasted on the L2 header). The only drawback is slightly reduced TCP TX performance, by less than 2 Mbps in my case.

Finally, enable speed optimizations for another small performance boost.

As the RAM requirements of the sample increase considerably with the new default configuration, add a note about this in the readme file, along with instructions on how to make the sample fit into smaller boards.

Tested on nucleo_h723zg:

  Before (current defaults):
    UDP      TX          RX
         76.47 Mbps  93.48 Mbps
    TCP      TX          RX
         76.18 Mbps  67.75 Mbps

  After (new defaults):
    UDP      TX          RX
         76.08 Mbps  93.62 Mbps
    TCP      TX          RX
         74.19 Mbps  85.51 Mbps
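
For reference, a minimal sketch of the kind of prj.conf fragment the description refers to. The option names are standard Zephyr networking and build Kconfig symbols; the packet/buffer counts echo values discussed in the review below, and the remaining values are illustrative rather than taken from the merged configuration:

# More network packets and buffers for better out-of-the-box TCP throughput
CONFIG_NET_PKT_RX_COUNT=40
CONFIG_NET_PKT_TX_COUNT=40
CONFIG_NET_BUF_RX_COUNT=160
CONFIG_NET_BUF_TX_COUNT=160
# Smaller buffer data size so less space is wasted on the L2 header
# (illustrative value, see the sample's prj.conf for the actual one)
CONFIG_NET_BUF_DATA_SIZE=64
# Compile with speed optimizations for another small boost
CONFIG_SPEED_OPTIMIZATIONS=y

On RAM-constrained boards, the packet and buffer counts can be reduced again to make the sample fit, trading away some throughput, as the readme note added in this PR describes.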

@pdgendt previously approved these changes on Jul 1, 2024
Comment on lines 11 to 14
CONFIG_NET_PKT_RX_COUNT=50
CONFIG_NET_PKT_TX_COUNT=50
CONFIG_NET_BUF_RX_COUNT=300
CONFIG_NET_BUF_TX_COUNT=300

Member:

Tried this with the NXP imxrt1050-evkb board; the Zephyr download throughput drops to 79.06 Mbps with this configuration.
With slightly lower values

CONFIG_NET_PKT_RX_COUNT=40
CONFIG_NET_PKT_TX_COUNT=40
CONFIG_NET_BUF_RX_COUNT=160
CONFIG_NET_BUF_TX_COUNT=160

the throughput rises to 94.4 Mbps.

@rlubos (Contributor, Author):

I'll check this with my STM board; I recall I couldn't reach the maximum TCP download throughput with a lower buffer count.

@rlubos (Contributor, Author):

Ok, both TCP TX and RX degraded a bit with those configs:

 TCP      TX          RX
     74.19 Mbps  85.51 Mbps

But they're still not bad, so I guess it's fine?

Member:

It is weird how the numbers change when having more buffers on NXP, while with STM the results are more in line with what to expect.

(A further review comment on samples/net/zperf/prj.conf was marked as resolved.)

@jukkar (Member) commented Jul 2, 2024:

@dleach02, any idea why increasing network buffers for NXP boards gives worse results than having a lower net buf count?

We could discuss the zperf issues next week in the network forum meeting.

@jukkar added this to the v3.7.0 milestone on Jul 4, 2024
@jukkar previously approved these changes on Jul 4, 2024
@jukkar requested a review from @aescolar on Jul 4, 2024

@ssharks (Collaborator) left a comment:

A slight inconsistency aside, great work; I'm quite amazed at how well the TCP stack is performing.


.. code-block:: cfg

   CONFIG_NET_PKT_RX_COUNT=50

@ssharks (Collaborator):

Ensure that these values match the prj.conf

@rlubos (Contributor, Author) replied Jul 5, 2024:

Thanks, updated

@ssharks (Collaborator) commented Jul 5, 2024:

Interesting things are going on lately. What I do not understand is why Zephyr's UDP TX performance is so poor in comparison. This was also very clear in the report that NXP released recently.

(The commit message repeats the PR description and test results above, followed by: Signed-off-by: Robert Lubos <[email protected]>)

@ssharks (Collaborator) commented Jul 6, 2024:

I just looked at the UDP TX path and recently had a mail conversation with @dleach02 on the performance. Is the fact that the transmitted packets always go through a separate thread a significant contributor to the lower (UDP) TX performance?

A reasonably simple way to test this would be to see whether playing with CONFIG_NET_TC_SKIP_FOR_HIGH_PRIO has a significant impact. See:

if ((IS_ENABLED(CONFIG_NET_TC_SKIP_FOR_HIGH_PRIO) &&

If this has a significant influence, we might need to look at whether this can be implemented in a different way.
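
A minimal overlay sketch for such an experiment, assuming the test traffic is marked with a high enough priority for the bypass to take effect (how that priority gets assigned is outside this fragment):

# Push high-priority packets directly to the network driver,
# bypassing the TX traffic-class queue/thread.
CONFIG_NET_TC_SKIP_FOR_HIGH_PRIO=y

Comparing the zperf UDP TX numbers with this option enabled and disabled should show whether the TX queuing path is a significant contributor.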

@aescolar merged commit e91c47d into zephyrproject-rtos:main on Jul 6, 2024 (17 checks passed)

@rlubos (Contributor, Author) commented Jul 8, 2024:

> I just looked at the UDP TX path and recently had a mail conversation with @dleach02 on the performance. Is the fact that the transmitted packets always go through a separate thread a significant contributor to the lower (UDP) TX performance?
>
> A reasonably simple way to test this would be to see whether playing with CONFIG_NET_TC_SKIP_FOR_HIGH_PRIO has a significant impact. See:
>
> if ((IS_ENABLED(CONFIG_NET_TC_SKIP_FOR_HIGH_PRIO) &&
>
> If this has a significant influence, we might need to look at whether this can be implemented in a different way.

Actually, when I was testing the impact of a separate TX thread in the past, I observed quite the contrary; see the table in #23302 (comment). But that was because the driver I used blocked the thread for the duration of the transmission (hence offloading it to a separate thread allowed the net stack to prepare the next packet instead of waiting).

@jukkar (Member) commented Jul 8, 2024:

> I just looked at the UDP TX path and recently had a mail conversation with @dleach02 on the performance. Is the fact that the transmitted packets always go through a separate thread a significant contributor to the lower (UDP) TX performance?

With default settings, there is no separate TX thread in the system. If userspace is enabled, then there needs to be one TX thread. See:

config NET_TC_TX_COUNT
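
For context, a sketch of what that means in configuration terms (my reading of the option based on the comment above; the Kconfig help text is authoritative):

# 0 TX traffic-class queues: packets are pushed to the network driver
# directly from the sender's context, i.e. no separate TX thread.
CONFIG_NET_TC_TX_COUNT=0
# With CONFIG_USERSPACE=y at least one TX queue (and thread) is required:
# CONFIG_NET_TC_TX_COUNT=1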
