-
Notifications
You must be signed in to change notification settings - Fork 17
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add a new folder of updated e2e tests with assertions (#46)
* update some e2e_tests to the new assertions format * adapt 2 correction test cases * adapt all cancellations, chitchat, disambiguation * adapt test cases in invalid_path, invalid_user_inputs, negations, potential_bugs * adapt digressions, flow_guards, skip_question * adapt tests for corrections * adapt all happy_path test cases * fix error in running datetime validation in the custom actions * add generative assertion test cases * update gitignore * Add assertions for failing and flaky tests * udate rasa-pro to 3.10.0rc1 * add mlflow optional dependency, add new workflow and new make commands * fix CI deprecation warning, fix failing test case * update threshold to prevent flakiness * update button payload and test case * fix flaky passing e2e test --------- Co-authored-by: Maksim Moiseikin <[email protected]>
- Loading branch information
Showing
94 changed files
with
4,061 additions
and
205 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -142,3 +142,6 @@ models/ | |
prompts/ | ||
tests/ | ||
qdrant_storage/ | ||
|
||
# mlflow | ||
mlruns/ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
32 changes: 32 additions & 0 deletions
32
e2e_tests_with_assertions/failing/potential_bugs/user_tries_to_use_abstract_values.yml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,32 @@ | ||
test_cases: | ||
- test_case: user tries to use abstract values | ||
steps: | ||
- user: send money | ||
assertions: | ||
- bot_uttered: | ||
utter_name: utter_ask_transfer_money_recipient | ||
# the llm is extracting "good friend" as the recipient | ||
- user: to a good friend | ||
assertions: | ||
- slot_was_not_set: | ||
- name: transfer_money_recipient | ||
- bot_uttered: | ||
utter_name: utter_ask_transfer_money_recipient | ||
- user: okay, to Mary | ||
assertions: | ||
- slot_was_set: | ||
- name: transfer_money_recipient | ||
value: Mary | ||
- bot_uttered: | ||
utter_name: utter_ask_transfer_money_amount_of_money | ||
- user: "50" | ||
assertions: | ||
- slot_was_set: | ||
- name: transfer_money_amount_of_money | ||
value: "50" | ||
- bot_uttered: | ||
utter_name: utter_ask_transfer_money_final_confirmation | ||
- user: "yes" | ||
assertions: | ||
- bot_uttered: | ||
utter_name: utter_transfer_complete |
23 changes: 23 additions & 0 deletions
23
...ns/failing/skip_question/user_tries_to_skip_a_question_and_then_cancels_the_flow_hard.yml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,23 @@ | ||
test_cases: | ||
- test_case: user tries to skip a question and then cancels the flow (hard) | ||
steps: | ||
- user: send money to John | ||
assertions: | ||
- slot_was_set: | ||
- name: transfer_money_recipient | ||
value: John | ||
- bot_uttered: | ||
utter_name: utter_ask_transfer_money_amount_of_money | ||
- user: i don't want to answer this | ||
assertions: | ||
# cancel flow instead of skip question is predicted | ||
- bot_uttered: | ||
utter_name: utter_skip_question_answer | ||
- bot_uttered: | ||
utter_name: utter_ask_transfer_money_amount_of_money | ||
- user: i don't want to continue | ||
assertions: | ||
- bot_uttered: | ||
utter_name: utter_flow_cancelled_rasa | ||
- bot_uttered: | ||
utter_name: utter_can_do_something_else |
34 changes: 34 additions & 0 deletions
34
...th_assertions/failing/skip_question/user_tries_to_skip_a_question_multiple_times_hard.yml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,34 @@ | ||
test_cases: | ||
- test_case: user tries to skip a question multiple times (hard) | ||
steps: | ||
- user: send money to John | ||
assertions: | ||
- slot_was_set: | ||
- name: transfer_money_recipient | ||
value: John | ||
- bot_uttered: | ||
utter_name: utter_ask_transfer_money_amount_of_money | ||
- user: i don't want to answer this for now | ||
assertions: | ||
- bot_uttered: | ||
utter_name: utter_skip_question_answer | ||
- bot_uttered: | ||
utter_name: utter_ask_transfer_money_amount_of_money | ||
- user: i don't want to answer this | ||
# cancel flow instead of skip question is predicted | ||
assertions: | ||
- bot_uttered: | ||
utter_name: utter_skip_question_answer | ||
- bot_uttered: | ||
utter_name: utter_ask_transfer_money_amount_of_money | ||
- user: okay 50 | ||
assertions: | ||
- slot_was_set: | ||
- name: utter_ask_transfer_money_amount_of_money | ||
value: "50" | ||
- bot_uttered: | ||
utter_name: utter_ask_transfer_money_final_confirmation | ||
- user: "yes" | ||
assertions: | ||
- bot_uttered: | ||
utter_name: utter_transfer_complete |
28 changes: 28 additions & 0 deletions
28
e2e_tests_with_assertions/failing/skip_question/user_tries_to_skip_a_question_once_hard.yml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,28 @@ | ||
test_cases: | ||
- test_case: user tries to skip a question once (hard) | ||
steps: | ||
- user: send money to John | ||
assertions: | ||
- slot_was_set: | ||
- name: transfer_money_recipient | ||
value: John | ||
- bot_uttered: | ||
utter_name: utter_ask_transfer_money_amount_of_money | ||
- user: i don't want to answer this | ||
assertions: | ||
# cancel flow instead of skip question is predicted | ||
- bot_uttered: | ||
utter_name: utter_skip_question_answer | ||
- bot_uttered: | ||
utter_name: utter_ask_transfer_money_amount_of_money | ||
- user: okay 50 | ||
assertions: | ||
- slot_was_set: | ||
- name: utter_ask_transfer_money_amount_of_money | ||
value: "50" | ||
- bot_uttered: | ||
utter_name: utter_ask_transfer_money_final_confirmation | ||
- user: "yes" | ||
assertions: | ||
- bot_uttered: | ||
utter_name: utter_transfer_complete |
19 changes: 19 additions & 0 deletions
19
e2e_tests_with_assertions/flaky/disambiguation/user_sends_short_noun_only_message.yml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,19 @@ | ||
# instead of clarify a start flow command for "list_contact" is predicted | ||
|
||
fixtures: | ||
- route_to_calm: | ||
- route_session_to_calm: True | ||
|
||
test_cases: | ||
- test_case: user sends short noun only message | ||
fixtures: | ||
- route_to_calm | ||
steps: | ||
- user: contact | ||
assertions: | ||
- bot_uttered: | ||
utter_name: utter_clarification_options_rasa | ||
- user: add | ||
assertions: | ||
- bot_uttered: | ||
utter_name: utter_ask_add_contact_handle |
16 changes: 16 additions & 0 deletions
16
e2e_tests_with_assertions/flaky/happy_path/user_sets_up_recurrent_payment.yml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
# recurrent_payment_type is not mapped to standing order | ||
|
||
test_cases: | ||
- test_case: user wants to set up a new recurrent payment, but specifies the type incompletely, example 3 | ||
steps: | ||
- user: I want to set up a new recurrent payment | ||
assertions: | ||
- bot_uttered: | ||
utter_name: utter_ask_recurrent_payment_type | ||
- user: stand order | ||
assertions: | ||
- slot_was_set: | ||
- name: recurrent_payment_type | ||
value: standing order | ||
- bot_uttered: | ||
utter_name: utter_ask_recipient |
15 changes: 15 additions & 0 deletions
15
e2e_tests_with_assertions/flaky/happy_path/user_wants_to_register_to_vote.yml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,15 @@ | ||
fixtures: | ||
- non_california_resident: | ||
- based_in_california: False | ||
- route_session_to_calm: True | ||
|
||
test_cases: | ||
- test_case: Register to vote for non-California resident (should not trigger) | ||
fixtures: | ||
- non_california_resident | ||
steps: | ||
- user: I want to register to vote | ||
# ChitChat is predicted instead of no command being predicted | ||
assertions: | ||
- bot_uttered: | ||
utter_name: utter_cannot_answer |
Oops, something went wrong.