[Failing Test]: apache_beam.io.gcp.bigquery_test.PipelineBasedStreamingInsertTest is flaky #32069

tvalentyn · 2024-08-02T19:38:52Z

What happened?

The test_batch_size_with_auto_sharding scenario seems to become flaky recently; I encountered this error in coverage test suite on some seemingly unrelated PRs:

=================================== FAILURES ===================================
____ PipelineBasedStreamingInsertTest.test_batch_size_with_auto_sharding_0 _____
[gw5] linux -- Python 3.8.18 /runner/_work/beam/beam/sdks/python/test-suites/tox/py38/build/srcs/sdks/python/target/.tox-py38-cloudcoverage/py38-cloudcoverage/bin/python

a = (<apache_beam.io.gcp.bigquery_test.PipelineBasedStreamingInsertTest testMethod=test_batch_size_with_auto_sharding_0>,)
kw = {}

    @wraps(func)
    def standalone_func(*a, **kw):
>       return func(*(a + p.args), **p.kwargs, **kw)

target/.tox-py38-cloudcoverage/py38-cloudcoverage/lib/python3.8/site-packages/parameterized/parameterized.py:620: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
apache_beam/io/gcp/bigquery_test.py:2239: in test_batch_size_with_auto_sharding
    self.assertEqual(out1['colA_values'], ['value1', 'value3'])
E   AssertionError: Lists differ: ['value1', 'value5'] != ['value1', 'value3']
E   
E   First differing element 1:
E   'value5'
E   'value3'
E   
E   - ['value1', 'value5']
E   ?                  ^
E   
E   + ['value1', 'value3']
E   ?                  ^

=============================== warnings summary ===============================
https://github.com/apache/beam/actions/runs/10219876243/job/28279049265?pr=32066

Issue Failure

Failure: Test is flaky

Issue Priority

Priority: 1 (unhealthy code / failing or flaky postcommit so we cannot be sure the product is healthy)

Issue Components

The text was updated successfully, but these errors were encountered:

tvalentyn · 2024-08-07T00:34:16Z

cc: @damccorm

damccorm · 2024-08-22T23:18:50Z

Looks like this is likely just a bad test. The actual assertion here is based on an assumption that data will be processed in order, but that's not guaranteed. I'll clean the test up a bit

tvalentyn added bug failing test awaiting triage labels Aug 2, 2024

github-actions bot added python io tests P1 flake labels Aug 2, 2024

damccorm self-assigned this Aug 20, 2024

github-actions bot removed the awaiting triage label Aug 20, 2024

damccorm mentioned this issue Aug 22, 2024

Make autosharding test more robust #32293

Merged

3 tasks

damccorm added the good first issue label Aug 22, 2024

damccorm closed this as completed in #32293 Aug 25, 2024

github-actions bot added this to the 2.60.0 Release milestone Aug 25, 2024

damccorm mentioned this issue Sep 5, 2024

[Failing Test]: Python BigQuery Test - PipelineBasedStreamingInsertTest::tes_batch_size_with_auto_sharding_0 is flaky #31985

Closed

17 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Failing Test]: apache_beam.io.gcp.bigquery_test.PipelineBasedStreamingInsertTest is flaky #32069

[Failing Test]: apache_beam.io.gcp.bigquery_test.PipelineBasedStreamingInsertTest is flaky #32069

tvalentyn commented Aug 2, 2024

tvalentyn commented Aug 7, 2024

damccorm commented Aug 22, 2024

[Failing Test]: apache_beam.io.gcp.bigquery_test.PipelineBasedStreamingInsertTest is flaky #32069

[Failing Test]: apache_beam.io.gcp.bigquery_test.PipelineBasedStreamingInsertTest is flaky #32069

Comments

tvalentyn commented Aug 2, 2024

What happened?

Issue Failure

Issue Priority

Issue Components

tvalentyn commented Aug 7, 2024

damccorm commented Aug 22, 2024