Onboard basic sentiment analysis with defaults #350

ohltyler · 2024-09-06T22:57:05Z

Description

Adds a new sentiment analysis preset, with defaults for ingest/search pipelines and index configurations. Loosely based on this documented example: https://opensearch.org/docs/latest/search-plugins/search-pipelines/ml-inference-search-request/#example-externally-hosted-model

This use case is intended to be used with a specialized sentiment analysis model (or LLM with tuned prompt) that takes in text and returns a sentiment/category (generally Positive/Neutral/Negative). One basic example is for storing and analyzing website reviews. This particular preset is two-fold:

Ingest: take in a document with text, and process the text with an ML ingest processor to generate and store a label field, holding the returned sentiment, as part of the document.
Search: take in a plaintext query, and augment it with an ML search request processor that generates the sentiment and replaces the label field's value in the request, so that only results with the matching sentiment are returned.
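
The two halves of the preset can be sketched as a pair of pipeline definitions. This is a minimal illustration, not the preset JSON added in this PR: the helper names, the `review_text` field, the model input key `inputs`, the model output key `response`, and the model ID are all assumptions. Only the `ml_inference` processor and its `model_id`, `input_map`, and `output_map` settings are taken from the OpenSearch documentation linked above.

```typescript
// Hypothetical shapes for illustration; not the types used in this PR.
type FieldMap = Record<string, string>;

interface MlInferenceProcessor {
  ml_inference: {
    model_id: string;
    input_map: FieldMap[];
    output_map: FieldMap[];
  };
}

interface IngestPipeline {
  description: string;
  processors: MlInferenceProcessor[];
}

interface SearchPipeline {
  description: string;
  request_processors: MlInferenceProcessor[];
}

// Ingest side: send the document's text to the model and store the
// returned sentiment in a label field on the document.
function buildIngestPipeline(
  modelId: string,
  textField: string,
  labelField: string
): IngestPipeline {
  return {
    description: 'Label incoming documents with a sentiment',
    processors: [
      {
        ml_inference: {
          model_id: modelId,
          // model input name -> document field ("inputs" is an assumed input name)
          input_map: [{ inputs: textField }],
          // new document field -> model output name ("response" is an assumed output name)
          output_map: [{ [labelField]: 'response' }],
        },
      },
    ],
  };
}

// Search side: read the plaintext from the term query on the label field,
// send it to the model, and write the returned sentiment back to the same path.
function buildSearchPipeline(modelId: string, labelField: string): SearchPipeline {
  const queryPath = `query.term.${labelField}.value`;
  return {
    description: 'Replace the queried label value with the predicted sentiment',
    request_processors: [
      {
        ml_inference: {
          model_id: modelId,
          input_map: [{ inputs: queryPath }],
          output_map: [{ [queryPath]: 'response' }],
        },
      },
    ],
  };
}

const ingestPipeline = buildIngestPipeline('my-model-id', 'review_text', 'label');
const searchPipeline = buildSearchPipeline('my-model-id', 'label');
console.log(JSON.stringify(ingestPipeline, null, 2));
console.log(JSON.stringify(searchPipeline, null, 2));
```

Because the search processor reads from and writes to the same query path, the stored label and the rewritten query value come from the same model, which is what lets the term query match only documents with the same sentiment.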

Overall, this use case could be tuned and enhanced in many different ways. Users may want to persist more than just a label. For example, one reasonable use case is a hybrid search over a text's vector, its sentiment/label, and its plaintext, trying out different weights in the hybrid query.

More details:

adds quick-configure presets and form inputs for sentiment analysis
adds logic in quick configure modal to inject quick configure values into the config (the "label" field)
adds metadata defaults and a new preset JSON resource for this use case
adds default query inputs for vector search use cases (query.term.${text_field}.value) for the ML models. This may be tuned later, depending on the default queries or on whether the query editing experience changes.
removes the noisy toast when editing transforms, since the transform now runs automatically instead of requiring explicit user input
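
The default input path `query.term.${text_field}.value` listed above is a dot-delimited path into the search request body. As a rough sketch of how such a path could be built from a quick-configure field name and resolved against a term query (the function names, the `review_text` field, and the sample query are hypothetical, and this is not the plugin's actual implementation):

```typescript
// Build the default model input path for a given text field.
function defaultQueryInputPath(textField: string): string {
  return `query.term.${textField}.value`;
}

// Walk a dot-delimited path through a nested object.
// Returns undefined if any segment is missing.
// Assumes field names that do not themselves contain dots.
function resolvePath(source: unknown, path: string): unknown {
  let current: unknown = source;
  for (const key of path.split('.')) {
    if (typeof current !== 'object' || current === null) {
      return undefined;
    }
    current = (current as Record<string, unknown>)[key];
  }
  return current;
}

const textField = 'review_text';

// A sample term query of the shape the default path expects.
const searchRequest = {
  query: {
    term: {
      [textField]: { value: 'the checkout flow was painless' },
    },
  },
};

const inputPath = defaultQueryInputPath(textField);
const modelInput = resolvePath(searchRequest, inputPath);
console.log(inputPath, '->', modelInput);
```

If the default query is changed to something other than a term query, the same path no longer resolves, which is one reason the description notes that these defaults may need tuning later.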

Demo video, showing a basic use case with a SageMaker sentiment analysis model. It also shows the default values set in the ML search request processor for a vector search use case. Note that when using all defaults, no further input is needed on this search request processor.

screen-capture.14.webm

Check List

Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Tyler Ohlsen <[email protected]>

Signed-off-by: Tyler Ohlsen <[email protected]> (cherry picked from commit 9b644de)

Signed-off-by: Tyler Ohlsen <[email protected]> (cherry picked from commit 9b644de) Co-authored-by: Tyler Ohlsen <[email protected]>

Onboard basic sentiment analysis with defaults

36311f5

Signed-off-by: Tyler Ohlsen <[email protected]>

ohltyler added backport 2.x rapid new workflow Roadmap:Ease of Use Project-wide roadmap label v2.18.0 labels Sep 6, 2024

ohltyler requested review from dbwiddis, owaiskazi19, joshpalis, amitgalitz, jackiehanyang, minalsha and saimedhi as code owners September 6, 2024 22:57

ohltyler marked this pull request as draft September 6, 2024 22:57

ohltyler added 2 commits September 9, 2024 09:31

cleanup

e2439e9

Signed-off-by: Tyler Ohlsen <[email protected]>

more cleanup; remove noisy toast

197abc9

Signed-off-by: Tyler Ohlsen <[email protected]>

ohltyler marked this pull request as ready for review September 9, 2024 16:44

dbwiddis approved these changes Sep 9, 2024

ohltyler merged commit 9b644de into opensearch-project:main Sep 9, 2024
6 checks passed

ohltyler deleted the sentiment-analysis branch September 9, 2024 16:57

opensearch-trigger-bot bot pushed a commit that referenced this pull request Sep 9, 2024

Onboard basic sentiment analysis with defaults (#350)

4133edb

Signed-off-by: Tyler Ohlsen <[email protected]> (cherry picked from commit 9b644de)

opensearch-trigger-bot bot mentioned this pull request Sep 9, 2024

[Backport 2.x] Onboard basic sentiment analysis with defaults #353

Merged

ohltyler added a commit that referenced this pull request Sep 9, 2024

Onboard basic sentiment analysis with defaults (#350) (#353)

bd02a3d

Signed-off-by: Tyler Ohlsen <[email protected]> (cherry picked from commit 9b644de) Co-authored-by: Tyler Ohlsen <[email protected]>
