🧑‍🌾 Flaky test test_graph__rmw_fastrtps_cpp on ros2 buildfarm #1016

Crola1702 opened this issue Oct 27, 2022 · 1 comment
Labels
bug Something isn't working

@Crola1702

Bug report

  • Operating System:
    • Ubuntu 20.04 and 22.04
  • Installation type:
    • Source
  • Version or commit hash:
    • Rolling
  • DDS implementation:
    • Fast-RTPS

Steps to reproduce issue

  1. Run a build in one of the following jobs:
    • Nightly repeated jobs (Windows, aarch64, Linux, RHEL)
    • Humble coverage or debug
    • Linux aarch64 Debug
  2. See it fail (if lucky)

Expected behavior

The test passes consistently.

Actual behavior

This test has been failing on repeated jobs for the past year. Because those jobs rerun every test until it fails, failures there are expected.
What I find odd is that it also fails on jobs other than the repeated ones, where a failing test is rerun (the opposite of what repeated jobs do).

This test has a 21% flaky ratio on Humble Coverage (7 of the 33 builds checked), although only 1 of those builds is marked unstable because of this test failure.
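
For reference, the ratio above is the number of builds where the test failed divided by the number of builds checked. A minimal sketch of that arithmetic follows; the function name `flaky_ratio` is made up for illustration and is not part of any buildfarm tooling.

```python
def flaky_ratio(failed_builds: int, checked_builds: int) -> float:
    """Fraction of checked builds in which the test failed at least once."""
    if checked_builds <= 0:
        raise ValueError("checked_builds must be positive")
    return failed_builds / checked_builds


# Numbers taken from the report: 7 failing builds out of 33 checked.
ratio = flaky_ratio(7, 33)
print(f"{ratio:.1%}")  # prints "21.2%"
```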

Additional information

Reference build: https://ci.ros2.org/job/nightly_linux_humble_coverage/187/

Test regression: rcl_action.test_graph__rmw_fastrtps_cpp.gtest.missing_result

Log output:

      Start 10: test_graph__rmw_fastrtps_cpp

10: Test command: /home/jenkins-agent/workspace/nightly_linux_humble_coverage/venv/bin/python3.10 "-u" "/home/jenkins-agent/workspace/nightly_linux_humble_coverage/ws/install/ament_cmake_test/share/ament_cmake_test/cmake/run_test.py" "/home/jenkins-agent/workspace/nightly_linux_humble_coverage/ws/build/rcl_action/test_results/rcl_action/test_graph__rmw_fastrtps_cpp.gtest.xml" "--package-name" "rcl_action" "--output-file" "/home/jenkins-agent/workspace/nightly_linux_humble_coverage/ws/build/rcl_action/ament_cmake_gtest/test_graph__rmw_fastrtps_cpp.txt" "--env" "RCL_ASSERT_RMW_ID_MATCHES=rmw_fastrtps_cpp" "RMW_IMPLEMENTATION=rmw_fastrtps_cpp" "--command" "/home/jenkins-agent/workspace/nightly_linux_humble_coverage/ws/build/rcl_action/test_graph__rmw_fastrtps_cpp" "--gtest_output=xml:/home/jenkins-agent/workspace/nightly_linux_humble_coverage/ws/build/rcl_action/test_results/rcl_action/test_graph__rmw_fastrtps_cpp.gtest.xml"
10: Test timeout computed to be: 180
10: -- run_test.py: extra environment variables:
10:  - RCL_ASSERT_RMW_ID_MATCHES=rmw_fastrtps_cpp
10:  - RMW_IMPLEMENTATION=rmw_fastrtps_cpp
10: -- run_test.py: invoking following command in '/home/jenkins-agent/workspace/nightly_linux_humble_coverage/ws/build/rcl_action':
10:  - /home/jenkins-agent/workspace/nightly_linux_humble_coverage/ws/build/rcl_action/test_graph__rmw_fastrtps_cpp --gtest_output=xml:/home/jenkins-agent/workspace/nightly_linux_humble_coverage/ws/build/rcl_action/test_results/rcl_action/test_graph__rmw_fastrtps_cpp.gtest.xml
10: Running main() from /home/jenkins-agent/workspace/nightly_linux_humble_coverage/ws/install/gtest_vendor/src/gtest_vendor/src/gtest_main.cc
10: [==========] Running 10 tests from 2 test suites.
10: [----------] Global test environment set-up.
10: [----------] 3 tests from TestActionGraphFixture__rmw_fastrtps_cpp
10: [ RUN      ] TestActionGraphFixture__rmw_fastrtps_cpp.test_action_get_client_names_and_types_by_node
10: -- run_test.py: return code -11
10: -- run_test.py: generate result file '/home/jenkins-agent/workspace/nightly_linux_humble_coverage/ws/build/rcl_action/test_results/rcl_action/test_graph__rmw_fastrtps_cpp.gtest.xml' with failed test
10: -- run_test.py: verify result file '/home/jenkins-agent/workspace/nightly_linux_humble_coverage/ws/build/rcl_action/test_results/rcl_action/test_graph__rmw_fastrtps_cpp.gtest.xml'
10/22 Test #10: test_graph__rmw_fastrtps_cpp ....................***Failed    0.31 sec
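
A note on reading `return code -11` in the log above: assuming run_test.py reports the child's status the way Python's `subprocess` module does, a negative value -N means the process was terminated by signal N, and signal 11 on Linux is SIGSEGV. So this failure looks like a crash rather than a failed assertion. The helper below is a hypothetical sketch for decoding such values, not something taken from run_test.py.

```python
import signal


def describe_return_code(code: int) -> str:
    """Turn a subprocess-style return code into a readable description."""
    if code < 0:
        # Negative codes follow the subprocess convention: -N means signal N.
        try:
            name = signal.Signals(-code).name
        except ValueError:
            # Signal number not known on this platform.
            name = f"signal {-code}"
        return f"terminated by {name}"
    return f"exited with status {code}"


print(describe_return_code(-11))  # on Linux: "terminated by SIGSEGV"
```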

First occurrence: Nightly Linux Repeated 2361 (1 year ago)

First 20 builds with this test regression in ros2 nightlies:

image

Last 20 builds with this test regression:

image

Jobs other than the repeated ones:

image

@fujitatomoya
Collaborator

A different test case also fails, if lucky: running the rcl test_graph__rmw_fastrtps_cpp test in a loop eventually hits a timeout.

root@tomoyafujita:~/ros2_ws/colcon_ws# while true; do colcon test --event-handlers console_direct+ --packages-select rcl --ctest-args -R test_graph__rmw_fastrtps_cpp; done
...<snip>
72: Test command: /usr/bin/python3.10 "-u" "/root/ros2_ws/colcon_ws/install/ament_cmake_test/share/ament_cmake_test/cmake/run_test.py" "/root/ros2_ws/colcon_ws/build/rcl/test_results/rcl/test_graph__rmw_fastrtps_cpp.gtest.xml" "--package-name" "rcl" "--output-file" "/root/ros2_ws/colcon_ws/build/rcl/ament_cmake_gtest/test_graph__rmw_fastrtps_cpp.txt" "--env" "RMW_IMPLEMENTATION=rmw_fastrtps_cpp" "--append-env" "LD_LIBRARY_PATH=/root/ros2_ws/colcon_ws/build/rcl" "--command" "/root/ros2_ws/colcon_ws/build/rcl/test/test_graph__rmw_fastrtps_cpp" "--gtest_output=xml:/root/ros2_ws/colcon_ws/build/rcl/test_results/rcl/test_graph__rmw_fastrtps_cpp.gtest.xml"
72: Test timeout computed to be: 120
72: -- run_test.py: extra environment variables:
72:  - RMW_IMPLEMENTATION=rmw_fastrtps_cpp
72: -- run_test.py: extra environment variables to append:
72:  - LD_LIBRARY_PATH+=/root/ros2_ws/colcon_ws/build/rcl
72: -- run_test.py: invoking following command in '/root/ros2_ws/colcon_ws/build/rcl/test':
72:  - /root/ros2_ws/colcon_ws/build/rcl/test/test_graph__rmw_fastrtps_cpp --gtest_output=xml:/root/ros2_ws/colcon_ws/build/rcl/test_results/rcl/test_graph__rmw_fastrtps_cpp.gtest.xml
72: Running main() from /root/ros2_ws/colcon_ws/install/gtest_vendor/src/gtest_vendor/src/gtest_main.cc
72: [==========] Running 20 tests from 2 test suites.
72: [----------] Global test environment set-up.
72: [----------] 16 tests from TestGraphFixture__rmw_fastrtps_cpp
72: [ RUN      ] TestGraphFixture__rmw_fastrtps_cpp.test_rcl_get_and_destroy_topic_names_and_types
[Processing: rcl]
[Processing: rcl]
[Processing: rcl]
[Processing: rcl]
1/1 Test #72: test_graph__rmw_fastrtps_cpp .....***Timeout 120.06 sec

0% tests passed, 1 tests failed out of 1

Label Time Summary:
gtest    = 120.06 sec*proc (1 test)

Total Test time (real) = 120.06 sec

The following tests FAILED:
         72 - test_graph__rmw_fastrtps_cpp (Timeout)
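
The `while true` loop above keeps going after a failure, so the failing run has to be spotted by eye. Below is a rough sketch of a loop that stops at the first failure instead. `repeat_until_fail` and its parameters are hypothetical names, and the sketch assumes the command's exit status is non-zero when the test fails, as it is when a gtest binary is run directly.

```python
import subprocess
import sys


def repeat_until_fail(cmd, max_runs=100, timeout=None):
    """Run cmd up to max_runs times and stop at the first failure.

    Returns (run_number, reason, output) for the first failing run,
    or None if every run succeeded.
    """
    for run in range(1, max_runs + 1):
        try:
            result = subprocess.run(
                cmd,
                stdout=subprocess.PIPE,
                stderr=subprocess.STDOUT,
                text=True,
                timeout=timeout,
            )
        except subprocess.TimeoutExpired:
            # Partial output is not collected here, to keep the sketch simple.
            return run, "timeout", ""
        if result.returncode != 0:
            return run, f"return code {result.returncode}", result.stdout
    return None


if __name__ == "__main__":
    # Stand-in command that always fails; replace it with the real test command.
    demo_cmd = [sys.executable, "-c", "import sys; sys.exit(3)"]
    print(repeat_until_fail(demo_cmd, max_runs=5))  # (1, 'return code 3', '')
```

To use it against the test in this thread, one option is to pass the test binary shown in the log (for example `/root/ros2_ws/colcon_ws/build/rcl/test/test_graph__rmw_fastrtps_cpp`) as `cmd`, with `timeout=120` to mirror the CTest timeout.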
