Add coverage for NeuralSearch class #898

dbwiddis · 2024-09-07T05:28:01Z

Description

Adds test coverage for the main plugin class file

Related Issues

Part of #429

Check List

New functionality includes testing.
Commits are signed per the DCO using --signoff.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Daniel Widdis <[email protected]>

vibrantvarun · 2024-09-09T18:46:09Z

@dbwiddis can you check why gradle check is failing?

dbwiddis · 2024-09-09T19:52:31Z

@dbwiddis can you check why gradle check is failing?

Unrelated to my PR:

Tests with failures:
 - org.opensearch.neuralsearch.query.HybridQueryWeightTests.testSubQueries_whenMultipleEqualSubQueries_thenSuccessful
 - org.opensearch.neuralsearch.query.HybridQueryWeightTests.classMethod
 - org.opensearch.neuralsearch.search.query.HybridQueryPhaseSearcherTests.testQueryResult_whenMultipleTextSubQueriesWithSomeHits_thenHybridResultsAreSet
 - org.opensearch.neuralsearch.search.query.HybridQueryPhaseSearcherTests.classMethod
446 tests completed, 4 failed

I can keep force-pushing until it's green but generally it's easier for repo maintainers to retry just the failed tests.

vibrantvarun · 2024-09-10T00:18:27Z

@dbwiddis can you add

@ThreadLeakScope(ThreadLeakScope.Scope.NONE)

HybridQueryWeightTests and HybridQueryPhaseSearcherTests classes and add it into your PR ?

dbwiddis · 2024-09-10T00:25:47Z

@dbwiddis can you check why gradle check is failing?

Looking into the log, it seems it's timing out and it's likely related to thread locking. Multiple threads show

  2> 	Locked synchronizers:
  2> 	- java.util.concurrent.ThreadPoolExecutor$Worker@386f0da3

This makes me suspect some interaction with this PR which does mock a Threadpool and an ExcecutorService to be returned by the threadpool:

when(threadPool.executor(anyString())).thenReturn(executorService);

I expected that mock to be isolated only to this test, and of course when run in isolation the test passes. However, looking at the plugin createComponents, I see this line:

neural-search/src/main/java/org/opensearch/neuralsearch/plugin/NeuralSearch.java

Line 101 in f58d989

HybridQueryExecutor.initialize(threadPool);

That leads to this line:

neural-search/src/main/java/org/opensearch/neuralsearch/executors/HybridQueryExecutor.java

Line 60 in f58d989

    
           taskExecutor = new TaskExecutor(threadPool.executor(HYBRID_QUERY_EXEC_THREAD_POOL_NAME));

That sets the value of a static variable, which is therefore no longer isolated to that test: it remains stuck with a mocked value for any test run after this one.

neural-search/src/main/java/org/opensearch/neuralsearch/executors/HybridQueryExecutor.java

Line 31 in f58d989

private static TaskExecutor taskExecutor;

@dbwiddis can you add

@ThreadLeakScope(ThreadLeakScope.Scope.NONE)

I could replace the mock with a real thread pool and executor, somewhat like this pattern we use in Flow Framework and that I've carried over into ML Commons in a few places, and shut it down (the correct way to avoid the thread leak scope alert)

public class ProcessNodeTests extends OpenSearchTestCase {

    private static TestThreadPool testThreadPool;

    @BeforeClass
    public static void setup() {
        testThreadPool = new TestThreadPool(
            ProcessNodeTests.class.getName(),
            new ScalingExecutorBuilder(
                PROVISION_WORKFLOW_THREAD_POOL,
                1,
                Math.max(1, OpenSearchExecutors.allocatedProcessors(Settings.EMPTY) - 1),
                TimeValue.timeValueMinutes(5),
                FLOW_FRAMEWORK_THREAD_POOL_PREFIX + PROVISION_WORKFLOW_THREAD_POOL
            )
        );
    }

    @AfterClass
    public static void cleanup() {
        ThreadPool.terminate(testThreadPool, 500, TimeUnit.MILLISECONDS);
    }

But this works because the static variable is confined to this class and used only within this test. If I used this pattern in this test, it would still initialize the TaskExecutor in HybridQueryExecutor to this pool, and it would break tests when it was shut down.

I could leave it running for the entire duration of the test suite, but OpenSearchTestCase has leak detection mechanisms, which you've alluded to with the @ThreadLeakScope(ThreadLeakScope.Scope.NONE) annotation, but I submit this is just hiding the symptom of a real thread leak.

This still doesn't solve the underlying problem: there is no way to ever execute createComponents without a real thread pool you want to use for all the tests, that you can shut down at the end of your testing.

Putting aside the question of whether a static object initialized by a different class is even a good idea at all (which I don't think it is), at this point I would prefer to just remove the perfunctory test of createComponents() whose entire raison d'être is to exercise the method and cover those lines to increase an arbitrary coverage metric.

dbwiddis · 2024-09-10T00:27:50Z

TLDR: I'll push a commit removing the coverage for createComponents() and leave it to a more well thought out design.

Signed-off-by: Daniel Widdis <[email protected]>

dbwiddis · 2024-09-10T20:42:57Z

OpenSearch has a direct executor service that simulates the (Runnable::run) default behavior of the non-initialized taskExecutor, so I used that.

vibrantvarun · 2024-09-11T21:30:37Z

@martin-gaievski can you review this PR?

* Add coverage for NeuralSearch class Signed-off-by: Daniel Widdis <[email protected]> (cherry picked from commit 481a347)

* Add coverage for NeuralSearch class Signed-off-by: Daniel Widdis <[email protected]> (cherry picked from commit 481a347) Co-authored-by: Daniel Widdis <[email protected]>

Add coverage for NeuralSearch class

4ac9af0

Signed-off-by: Daniel Widdis <[email protected]>

dbwiddis requested review from heemin32, navneet1v, VijayanB, vamshin, jmazanec15, naveentatikonda, junqiu-lei, martin-gaievski, sean-zheng-amazon, model-collapse, zane-neo, ylwu-amzn, jngz-es, vibrantvarun and zhichao-aws as code owners September 7, 2024 05:28

dbwiddis added the skip-changelog label Sep 7, 2024

dbwiddis mentioned this pull request Sep 7, 2024

[BUG] The Changelog Verifier GitHub Action runs 16 times and sends 16 emails if changelog is omitted #899

Closed

vibrantvarun added the backport 2.x Label will add auto workflow to backport PR to 2.x branch label Sep 9, 2024

Don't test createComponents as it has side-effects on other tests

f11ba32

Signed-off-by: Daniel Widdis <[email protected]>

dbwiddis force-pushed the coverage branch from 9b44a22 to f11ba32 Compare September 10, 2024 01:30

Use a direct thread executor to restore existing default behavior

75d6ea0

Signed-off-by: Daniel Widdis <[email protected]>

dbwiddis force-pushed the coverage branch from f67c4a3 to 75d6ea0 Compare September 10, 2024 20:20

vibrantvarun approved these changes Sep 11, 2024

View reviewed changes

martin-gaievski approved these changes Sep 19, 2024

View reviewed changes

martin-gaievski added the Maintenance Add support for new versions of OpenSearch/Dashboards from upstream label Sep 19, 2024

martin-gaievski merged commit 481a347 into opensearch-project:main Sep 19, 2024
35 of 37 checks passed

opensearch-trigger-bot bot pushed a commit that referenced this pull request Sep 19, 2024

Add coverage for NeuralSearch class (#898)

0ba89da

* Add coverage for NeuralSearch class Signed-off-by: Daniel Widdis <[email protected]> (cherry picked from commit 481a347)

opensearch-trigger-bot bot mentioned this pull request Sep 19, 2024

[Backport 2.x] Add coverage for NeuralSearch class #912

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add coverage for NeuralSearch class #898

Add coverage for NeuralSearch class #898

dbwiddis commented Sep 7, 2024

vibrantvarun commented Sep 9, 2024

dbwiddis commented Sep 9, 2024

vibrantvarun commented Sep 10, 2024

dbwiddis commented Sep 10, 2024 •

edited

Loading

dbwiddis commented Sep 10, 2024

dbwiddis commented Sep 10, 2024 •

edited

Loading

vibrantvarun commented Sep 11, 2024

Add coverage for NeuralSearch class #898

Add coverage for NeuralSearch class #898

Conversation

dbwiddis commented Sep 7, 2024

Description

Related Issues

Check List

vibrantvarun commented Sep 9, 2024

dbwiddis commented Sep 9, 2024

vibrantvarun commented Sep 10, 2024

dbwiddis commented Sep 10, 2024 • edited Loading

dbwiddis commented Sep 10, 2024

dbwiddis commented Sep 10, 2024 • edited Loading

vibrantvarun commented Sep 11, 2024

dbwiddis commented Sep 10, 2024 •

edited

Loading

dbwiddis commented Sep 10, 2024 •

edited

Loading