ci: Make more robust timer_test #3953

lgritz · 2023-08-21T15:14:36Z

On overloaded VMs, the timer_test is unreliable and fails spuriously too often.

Two strategies in this patch; hopefully some combination does the trick:

1. Increase the absolute error allowed for the timing tests when doing CI runs.
2. Add just a bit more relative error (via a new unit test macro, OIIO_CHECK_EQUAL_THRESH_REL).

Hopefully this will cut down on CI failures for the unit_timer test case.

On overloaded VMs, the timer_test is unreliable and fails spuriously too often. Two strategies in this patch, hope some combination does the trick: 1. Increase the absolute error allowed for the timing tests when doing CI runs. 2. Add just a bit more relative error (via a new unit test macro, OIIO_CHECK_EQUAL_THRESH_REL). Hopefully this will cut down on CI failures for unit_timer test case. Signed-off-by: Larry Gritz <[email protected]>

ThiagoIze

10% relative error could still be too small, since the error is related to the OS's tick size. On Windows this defaults to 15ms, so if a test tries to sleep for 100ms (which is what interval is set to, I think), 10% error is only 10ms and the check could still fail some of the time. 15% relative error should work on standard Windows machines if you want to go with this approach.

But I believe users can modify the tick size, so a better approach would be to have OS-specific queries for the tick size and then verify that the measured time was equal to the requested time or up to one tick (15ms by default) slower. Since this code is for a unit test, it's possible the quick 15% padding is good enough and we can revisit this in the future if anyone reports failures.

I don't believe the timer can return faster than the requested time, so you might want to get rid of the absolutes.
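The arithmetic in this comment can be sketched in a few lines of C++. This is only an illustration: `relative_pad_ms` and `within_one_tick` are hypothetical helpers, not OpenImageIO APIs, and the 100ms sleep and 15ms tick are the figures assumed in the comment rather than values read from the test.

```cpp
#include <cassert>

// Hypothetical helpers for illustration only; not part of OpenImageIO.

// Padding, in ms, granted by a relative tolerance given in whole percent.
constexpr int relative_pad_ms(int requested_ms, int percent)
{
    return requested_ms * percent / 100;
}

// One-sided check: a sleep may overshoot by up to one tick but never undershoot.
constexpr bool within_one_tick(int requested_ms, int measured_ms, int tick_ms)
{
    return measured_ms >= requested_ms
           && measured_ms <= requested_ms + tick_ms;
}

void demo_tick_padding()
{
    const int sleep_ms = 100;  // assumed interval from the comment
    const int tick_ms  = 15;   // assumed default Windows tick

    assert(relative_pad_ms(sleep_ms, 10) < tick_ms);   // 10 ms: less than one tick
    assert(relative_pad_ms(sleep_ms, 15) >= tick_ms);  // 15 ms: covers one tick

    assert(within_one_tick(sleep_ms, 115, tick_ms));   // a full tick late is accepted
    assert(!within_one_tick(sleep_ms, 99, tick_ms));   // an early return is rejected
}
```

Integer milliseconds are used so the comparisons at the boundary are exact; a real test would measure in floating-point seconds and would need a small epsilon.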

lgritz · 2023-08-21T20:47:20Z

In these examples, I'm using an absolute error PLUS relative, so I think it's going to be ok.

ThiagoIze

So abs + relative is giving you 20ms of padding? Yes, that might be OK for most machines. As I said before, this won't work on all machines but for unit tests it's probably not worth worrying about until we hit it.
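One way to read the 20ms figure, as a rough sketch: a 10ms absolute allowance plus 10% of a 100ms interval. All three numbers are inferred from this thread, not taken from the patch, and `allowed_error_ms` and `close_enough` are hypothetical stand-ins rather than the OIIO_CHECK_EQUAL_THRESH_REL macro, whose exact semantics are not shown here.

```cpp
#include <cassert>
#include <cstdlib>

// Hypothetical stand-ins for illustration; not the OpenImageIO test macros.

// Total allowed error: an absolute part plus a percentage of the expected value.
constexpr int allowed_error_ms(int expected_ms, int abs_ms, int rel_percent)
{
    return abs_ms + expected_ms * rel_percent / 100;
}

// Two-sided comparison against the combined allowance.
inline bool close_enough(int expected_ms, int measured_ms,
                         int abs_ms, int rel_percent)
{
    return std::abs(measured_ms - expected_ms)
           <= allowed_error_ms(expected_ms, abs_ms, rel_percent);
}

void demo_combined_tolerance()
{
    // Assumed values: 100 ms interval, 10 ms absolute, 10% relative.
    assert(allowed_error_ms(100, 10, 10) == 20);
    assert(close_enough(100, 115, 10, 10));   // one 15 ms tick late passes
    assert(!close_enough(100, 125, 10, 10));  // 25 ms late still fails
}
```

Under these assumptions the combined allowance exceeds a single default tick, which is why it would pass on most machines but not on one with a larger tick or a heavily loaded scheduler.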

lgritz · 2023-08-21T21:59:19Z

This takes a stab at improving our hit rate on this test in CI, and I don't see how it hurts anything. If it doesn't fully solve the problem and we have to revisit again, so be it.

jessey-git · 2023-08-21T22:17:16Z

I can at least verify that this patch allows the unit_timer test to pass for me locally on Windows :)

Also, here's output from a quick .exe I threw together to check the timer resolution from Sleep(0) through Sleep(100). Just an FYI to put the actual timings you might see into perspective:
win_sleep_resolution.txt
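A measurement of this kind can be approximated portably. The sketch below is not the program that produced the attached file: it swaps the Win32 Sleep call for `std::this_thread::sleep_for` and times each sleep with `std::chrono::steady_clock`. The names `SleepSample` and `measure_sleeps` are made up for the example.

```cpp
#include <chrono>
#include <thread>
#include <vector>

// Hypothetical record of one measurement: what was asked for and what elapsed.
struct SleepSample {
    int requested_ms;
    double measured_ms;
};

// Sleep for 0, step_ms, 2*step_ms, ... up to max_ms and time each sleep.
// step_ms must be positive, otherwise the loop never advances.
std::vector<SleepSample> measure_sleeps(int max_ms, int step_ms)
{
    using clock = std::chrono::steady_clock;
    std::vector<SleepSample> samples;
    for (int ms = 0; ms <= max_ms; ms += step_ms) {
        const auto start = clock::now();
        std::this_thread::sleep_for(std::chrono::milliseconds(ms));
        const std::chrono::duration<double, std::milli> elapsed
            = clock::now() - start;
        samples.push_back({ ms, elapsed.count() });
    }
    return samples;
}
```

Calling `measure_sleeps(100, 10)` and printing each `measured_ms - requested_ms` gives a rough picture of the overshoot on a given machine; the numbers depend on the OS tick size and on system load.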


ThiagoIze reviewed Aug 21, 2023


ThiagoIze approved these changes Aug 21, 2023


lgritz merged commit 0fca555 into AcademySoftwareFoundation:master Aug 21, 2023
23 checks passed

lgritz deleted the lg-timerfail branch August 23, 2023 21:32

lgritz mentioned this pull request Sep 27, 2023

[BUG] unit_timer test unreliable #2628

Closed
