
Support benchmarking of multiple HTTP(S) endpoints #76

Open
wants to merge 14 commits into base: master
Conversation

t-lo (Collaborator) commented Aug 11, 2019

This PR adds support for specifying and benchmarking multiple
HTTP(S) endpoints in a single wrk2 run.

Our main motivation for running a benchmark over multiple endpoints
is to allow benchmarking of, e.g., a whole web application instead
of individually benchmarking the pages and/or RESTful resources that
make up that application.

Most of the heavy lifting is done in a Lua script, multiple-endpoints.lua.
The script allows an arbitrary number of HTTP(S) endpoints to be included
in the benchmark. Endpoints are connected to in a random, evenly
distributed fashion. After a run finishes, the overall latency is reported
(i.e. there is currently no per-endpoint latency breakdown).
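
To make the selection mechanism concrete, here is a minimal sketch of the approach (not the PR's multiple-endpoints.lua verbatim), assuming the endpoint URLs reach the script through wrk's standard init(args) hook and using the stock wrk.format helper:

-- Illustrative sketch only, not the actual script.
local endpoints = {}

function init(args)
   -- Each command-line URL becomes one endpoint entry (an assumption
   -- about how the arguments are delivered to the script).
   for _, url in ipairs(args) do
      local host, path = url:match("^https?://([^/]+)(.*)$")
      table.insert(endpoints, { url = url, host = host,
                                path = (path ~= "" and path) or "/" })
   end
   math.randomseed(os.time())
end

function request()
   -- A uniformly random pick gives the even distribution described above.
   local ep = endpoints[math.random(#endpoints)]
   return wrk.format("GET", ep.path, { Host = ep.host })
end

Whether the real script switches endpoints via Host headers, via wrk.thread.addr (see below), or both is a detail of multiple-endpoints.lua itself; the sketch only shows the selection logic.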

Furthermore, this PR introduces a change in wrk.c that forces a thread
to reconnect (i.e. close its socket and reopen it using the current value
of wrk.thread.addr) each time wrk.thread.addr is set from a Lua script.
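
As a hedged illustration of what this enables, a script could resolve each endpoint once and then switch servers mid-run by assigning the address. wrk.lookup is part of wrk's stock Lua API; the addrs table and the switch-on-response pattern below are assumptions made for the sketch, not the PR's actual code:

-- Hypothetical use of the new reconnect hook.
addrs = {}

function init(args)
   for _, url in ipairs(args) do
      local host = url:match("^https?://([^/:]+)")
      -- Resolve once up front; take the first address returned.
      addrs[#addrs + 1] = wrk.lookup(host, "80")[1]
   end
end

function response(status, headers, body)
   -- With this patch, the assignment below closes the thread's socket
   -- and reconnects it to the chosen address before the next request.
   wrk.thread.addr = addrs[math.random(#addrs)]
end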

Lastly, the PR includes a patch by @janmejay to handle remote connection
close. @dongsupark identified this issue during our testing.

Known Limitations

Please note that benchmarking multiple endpoints currently requires threads == connections: we close and reconnect as soon as a thread assigns wrk.thread.addr, which interrupts any in-flight asynchronous requests. There are a number of ways to remove this limitation, and we are actively investigating them. However, we would like to start getting early feedback on our direction, hence this PR despite the known limitation.

janmejay and others added 13 commits May 8, 2019 12:40
When an EOF is received, the socket should be reconnected without
increasing the error counter. Otherwise we would see socket read errors
even in the case of ordinary reconnects.
This change forces a reconnect of all of a thread's connections
when wrk.thread.addr is set from a Lua script.

wrk.thread.addr has always been writable from Lua, but the
actual socket connection was never updated in wrk's C code.
This change enables Lua scripts to connect to multiple servers,
extending wrk's feature set.

Signed-off-by: Thilo Fromm <[email protected]>
Re-connect when peer closes the connection
…ddr-is-set

Lua API: force reconnect when wrk.thread.addr is set
This change makes multi-endpoint support more generic, with
the motivation of making this feature useful for upstream.

The Lua script 'multiple-endpoints.lua' allows an arbitrary
number of HTTP(S) endpoints to be included in the benchmark.
Endpoints are connected to in a random, evenly distributed
fashion. After a run finishes, the overall latency is reported
(i.e. there is currently no per-endpoint breakdown).

The main purpose of running a benchmark over multiple endpoints
is to allow benchmarking of, e.g., a whole web application instead
of individually benchmarking the pages and/or RESTful resources
that make up that application.

Signed-off-by: Thilo Fromm <[email protected]>
@t-lo changed the title from "T lo/upstream multi endpoint support" to "Support benchmarking of multiple HTTP(S) endpoints" on Aug 11, 2019
t-lo (Collaborator, Author) commented Aug 12, 2019

Example usage:

./wrk -s scripts/multiple-endpoints.lua -L -R10000 -t 30 -c 30 -d 60 \
                http://app.my-service.io/api/job-endpoint.json \
                http://app.my-service.io/api/data.json \
                http://app.my-service.io/static/page.html \
                http://app2.my-other-service.io/api/exec.json \
                http://app2.my-other-service.io/static.html
[...]
  Thread Stats   Avg      Stdev     Max   +/- Stdev
    Latency     3.99s     2.43s    8.77s    58.11%
    Req/Sec   292.07    123.44     1.60k    77.88%
  Latency Distribution (HdrHistogram - Recorded Latency)
 50.000%    3.78s
 75.000%    6.06s
 90.000%    7.48s
 99.000%    8.43s
 99.900%    8.69s
 99.990%    8.77s
 99.999%    8.77s
100.000%    8.78s

  Detailed Percentile spectrum:
       Value   Percentile   TotalCount 1/(1-Percentile)
[...]
#[Mean    =     3991.360, StdDeviation   =     2426.005]
#[Max     =     8773.632, Total count    =       415581]
#[Buckets =           27, SubBuckets     =         2048]
----------------------------------------------------------
  515126 requests in 1.00m, 2.14GB read
Requests/sec:   8591.59
Transfer/sec:     36.58MB
Total Requests: 515126
HTTP errors: 0
Requests timed out: 0
Bytes received: 2300037590
Socket connect errors: 0
Socket read errors: 0
Socket write errors: 0

URL call count
http://app.my-service.io/api/job-endpoint.json  : 105330
http://app.my-service.io/api/data.json  : 104250
http://app.my-service.io/static/page.html : 99840 
http://app2.my-other-service.io/api/exec.json : 103200 
http://app2.my-other-service.io/static.html : 103290
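
For reference, a per-URL tally like the one above could be produced with wrk's stock setup/done hooks. This is a hedged sketch that builds on the earlier snippet; the endpoints table and the counts name are illustrative, not the script's actual internals:

-- Illustrative accounting sketch; reuses `endpoints` from the
-- earlier snippet.
local threads = {}

function setup(thread)
   -- Runs once per thread in the main Lua state; keeping a handle
   -- lets done() read each thread's counters back afterwards.
   table.insert(threads, thread)
end

function request()
   local ep = endpoints[math.random(#endpoints)]
   counts = counts or {}  -- global, so thread:get("counts") can see it
   counts[ep.url] = (counts[ep.url] or 0) + 1
   return wrk.format("GET", ep.path, { Host = ep.host })
end

function done(summary, latency, requests)
   local totals = {}
   for _, t in ipairs(threads) do
      for url, n in pairs(t:get("counts") or {}) do
         totals[url] = (totals[url] or 0) + n
      end
   end
   print("URL call count")
   for url, n in pairs(totals) do
      print(string.format("%s : %d", url, n))
   end
end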

giltene (Owner) commented Aug 17, 2019

Let's open an issue for this to discuss before we pull it in...

One of my main concerns is "forking" too far from the point where we originally forked wrk, which would make catching up with wrk itself harder. And since I (personally) have not really tracked how wrk has evolved since then, I don't know how this PR relates to features there.

  • Has wrk added support for benchmarking multiple HTTP(S) endpoints?

t-lo (Collaborator, Author) commented Aug 18, 2019

I am happy to open an issue to discuss this if that's the preferred path - I did not see much benefit over discussing right here on the PR, so I did not cut an issue right away.

Regarding upstream, I'd argue that the feature introduced by this PR makes a lot more sense in the context of benchmarking with constant RPS - something upstream does not support. The main scenario we were aiming at when writing this code was to simulate constant RPS load on a cloud-native (i.e. clustered) web app (consisting of multiple micro-services with multiple URLs each), so basing our PR on wrk2 instead of wrk made more sense to us. To answer your question, no, I do not believe upstream currently has a comparable feature.

That said, I think I better understand the main concern of not diverging too far from upstream. Let me look into the latest upstream changes, with the goal of producing a PR to update this fork, before we continue discussing this PR.
