-
Notifications
You must be signed in to change notification settings - Fork 331
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
🧑🌾 Flaky test demo_nodes_cpp.TestExecutablesTutorial.test_processes_output
failing variations from connextdds
#693
Comments
I did take a look at this flaky test issue with connextdds. 1st, this cannot be reproducible with my local dev environment... 2nd, 30 seconds should be long enough to receive the parameter events, both cases https://ci.ros2.org/view/nightly/job/nightly_linux_debug/3064/testReport/junit/(root)/projectroot/test_tutorial_parameter_events_async__rmw_connextdds/ and https://ci.ros2.org/view/nightly/job/nightly_win_rep/3362/testReport/junit/(root)/projectroot/test_tutorial_parameter_events__rmw_connextdds/ missed the last 2 events below. demos/demo_nodes_cpp/test/parameter_events.txt Lines 58 to 69 in 4a18995
3rd, QoS for parameter events is reliable enough, https://github.com/ros2/rmw/blob/22f59f8931944999864ef3b0d7aa75ab7258f028/rmw/include/rmw/qos_profiles.h#L77-L88 after all, i could not find why last 2 events are missing here only connextdds, probably connextdds misses those messages already. |
demo_nodes_cpp.TestExecutablesTutorial.test_processes_output : |
job_name | last_fail | first_fail | build_count | failure_count | failure_percentage |
---|---|---|---|---|---|
nightly_win_rep | 2024-09-08 | 2024-08-25 | 13 | 10 | 76.92 |
nightly_win_deb | 2024-09-08 | 2024-08-25 | 16 | 10 | 62.5 |
nightly_win_rel | 2024-09-03 | 2024-09-03 | 16 | 1 | 6.25 |
nightly_linux_repeated | 2024-09-03 | 2024-09-03 | 9 | 1 | 11.11 |
Flaky ratio of projectroot.test_tutorial_parameter_events_async__rmw_connextdds
:
projectroot.test_tutorial_parameter_events_async__rmw_connextdds
:job_name | last_fail | first_fail | build_count | failure_count | failure_percentage |
---|---|---|---|---|---|
nightly_win_deb | 2024-09-08 | 2024-08-25 | 16 | 8 | 50.0 |
nightly_win_rep | 2024-09-06 | 2024-08-31 | 13 | 2 | 15.38 |
Flaky ratio projectroot.test_tutorial_parameter_events__rmw_connextdds
:
projectroot.test_tutorial_parameter_events__rmw_connextdds
:job_name | last_fail | first_fail | build_count | failure_count | failure_percentage |
---|---|---|---|---|---|
nightly_win_rep | 2024-09-08 | 2024-08-25 | 13 | 9 | 69.23 |
nightly_win_deb | 2024-09-08 | 2024-08-26 | 16 | 5 | 31.25 |
nightly_linux_repeated | 2024-09-03 | 2024-09-03 | 9 | 1 | 11.11 |
I poked at this a bit, and while I'm not 100% sure of this, I think this was caused by ros2/rclcpp#2142 . At least, if I checkout a workspace back before that, I can't make it happen anymore. There is no way to revert that change at this point, so we'll have to do some additional poking at the executors and see what we can find here. |
Bug report
Required Info:
Steps to reproduce issue
Description
There is a parent test regression with different variations on ConnextDDS:
demo_nodes_cpp.TestExecutablesTutorial.test_processes_output
Failing test regressions:
Log output (test_tutorial_parameter_events_async__rmw_connextdds):
Log output (test_tutorial_parameter_events__rmw_connextdds):
Flakiness report (projectroot.test_tutorial_parameter_events_async__rmw_connextdds):
Flakiness report (projectroot.test_tutorial_parameter_events__rmw_connextdds):
I don't see any specific change that points to a reason why it started failing more in normal jobs, and not only repeated ones (package history).
projectroot.test_tutorial_parameter_events_async__rmw_connextdds
projectroot.test_tutorial_parameter_events__rmw_connextdds:
The text was updated successfully, but these errors were encountered: