AI assistants to determine if the context of the conversation #787

Hugo-Calero · 2024-05-03T14:54:23Z

If your PR is related to a contribution to the taxonomy, please, fill
out the following questionnaire. If not, replace this whole text and the
following questionnaire with whatever information is applicable to your PR.

Describe the contribution to the taxonomy

This contribution may help to identify if a user is changing topic in a multiturn conversation.
This is helpful in the context of building AI assistants, to decide when to keep context of the conversation or not.
...

Input given at the prompt

   A list of previous user queries, and the new query to compare.

Response from the original model

...

Response from the fine-tuned model

...

Contribution checklist

The contribution was tested with ilab generate
No errors or warnings were produced by ilab generate
All commits are signed off (DCO)
The qna.yaml file contains at least 5 seed_examples
The qna.yaml file was linted and prettified (yaml-validator can do both)
An attribution.txt file in the same folder as the qna.yaml file
Content does not include PII or otherwise sensitive or confidential information
Content does not include anything documented in the project's Avoid these Topics guidelines

Signed-off-by: Hugo Carlos Calero Díaz <[email protected]>

instruct-lab-bot · 2024-05-03T14:54:43Z

Beep, boop 🤖, Hi, I'm @instructlab-bot and I'm going to help you with your pull request. Thanks for you contribution! 🎉

I support the following commands:

@instructlab-bot precheck -- Check existing model behavior using the questions in this proposed change.
@instructlab-bot generate -- Generate a sample of synthetic data using the synthetic data generation backend infrastructure.
@instructlab-bot generate-local -- Generate a sample of synthetic data using a local model.
@instructlab-bot help -- Print this help message again.

Note

Results or Errors of these commands will be posted as a pull request check in the Checks section below

Note

Currently only maintainers belongs to [[taxonomy-triagers taxonomy-approvers taxonomy-maintainers labrador-org-maintainers instruct-lab-bot-maintainers]] teams are allowed to run these commands.

Signed-off-by: Hugo Carlos Calero Díaz <[email protected]>

mingxzhao · 2024-05-09T01:05:14Z

You will also need to sign off on your commits as outlined here

instruct-lab-bot · 2024-05-09T01:05:31Z

Beep, boop 🤖, Hi, I'm @instructlab-bot and I'm going to help you with your pull request. Thanks for you contribution! 🎉

I support the following commands:

@instructlab-bot precheck -- Check existing model behavior using the questions in this proposed change.
@instructlab-bot generate -- Generate a sample of synthetic data using the synthetic data generation backend infrastructure.
@instructlab-bot generate-local -- Generate a sample of synthetic data using a local model.
@instructlab-bot help -- Print this help message again.

Note

Results or Errors of these commands will be posted as a pull request check in the Checks section below

Note

Currently only maintainers belongs to [[taxonomy-triagers taxonomy-approvers taxonomy-maintainers labrador-org-maintainers instruct-lab-bot-maintainers]] teams are allowed to run these commands.

mingxzhao · 2024-05-09T01:05:53Z

@instructlab-bot precheck

instruct-lab-bot · 2024-05-09T01:05:54Z

Beep, boop 🤖, Generating test data for your PR with the job type: precheck. Your Job ID is 277. The results will be presented below in the pull request status box. This may take several minutes...

instruct-lab-bot · 2024-05-09T01:06:23Z

Results for job ID: 277 using the model merlinite-7b!

Results can be found here.

Hugo-Calero · 2024-05-09T07:07:40Z

Hello @mingxzhao I can see the commits are signed off already. Eg: "Signed-off-by: Hugo Carlos Calero Díaz [email protected]". Is there anything else I need to do?

mingxzhao · 2024-05-09T15:55:04Z

Ah apologies I seemed to have missed that. If you could just update the attribution file and fix the linting issues, I can approve! It seems there are some spacing issues in your file.

Hugo-Calero · 2024-05-13T06:54:10Z

Hello! I have checked the listing in https://www.yamllint.com, and I see no error, is there any specific guideline I need to follow or recommended software to check?
Also, what is missing in the attribution file?
Thanks

jjasghar

Please resolve the linting issues.

Signed-off-by: Hugo Carlos Calero Díaz <[email protected]>

instruct-lab-bot · 2024-05-16T14:20:19Z

Beep, boop 🤖, Hi, I'm @instructlab-bot and I'm going to help you with your pull request. Thanks for you contribution! 🎉

I support the following commands:

@instructlab-bot precheck -- Check existing model behavior using the questions in this proposed change.
@instructlab-bot generate -- Generate a sample of synthetic data using the synthetic data generation backend infrastructure.
@instructlab-bot generate-local -- Generate a sample of synthetic data using a local model.
@instructlab-bot help -- Print this help message again.

Note

Results or Errors of these commands will be posted as a pull request check in the Checks section below

Note

Currently only maintainers belongs to [[taxonomy-triagers taxonomy-approvers taxonomy-maintainers labrador-org-maintainers instruct-lab-bot-maintainers]] teams are allowed to run these commands.

Hugo-Calero · 2024-05-16T14:21:22Z

Hi, I commited modifications to the yaml file, I hope there are no spacing issues now. Please review! Many thanks :)

jjasghar

Please fix the linting issue.

Signed-off-by: Hugo Carlos Calero Díaz <[email protected]>

instruct-lab-bot · 2024-05-18T07:14:17Z

Beep, boop 🤖, Hi, I'm @instructlab-bot and I'm going to help you with your pull request. Thanks for you contribution! 🎉

I support the following commands:

@instructlab-bot precheck -- Check existing model behavior using the questions in this proposed change.
@instructlab-bot generate -- Generate a sample of synthetic data using the synthetic data generation backend infrastructure.
@instructlab-bot generate-local -- Generate a sample of synthetic data using a local model.
@instructlab-bot help -- Print this help message again.

Note

Results or Errors of these commands will be posted as a pull request check in the Checks section below

Note

Currently only maintainers belongs to [[taxonomy-triagers taxonomy-approvers taxonomy-maintainers labrador-org-maintainers instruct-lab-bot-maintainers]] teams are allowed to run these commands.

jjasghar · 2024-05-20T22:37:41Z

@instructlab-bot precheck

instruct-lab-bot · 2024-05-20T22:37:44Z

Beep, boop 🤖, Generating test data for your PR with the job type: precheck. Your Job ID is 324. The results will be presented below in the pull request status box. This may take several minutes...

instruct-lab-bot · 2024-05-20T22:38:31Z

Results for job ID: 324 using the model instructlab/granite-7b-lab!

Results can be found here.

Hugo-Calero · 2024-05-27T09:19:50Z

Hello, I see the merging is still blocked. Is there anything left to do on my side? I already fixed the linting issues I identified, I hope I didn't miss any.
Thanks

jjasghar · 2024-05-28T19:25:47Z

@instructlab-bot generate

instruct-lab-bot · 2024-05-28T19:25:50Z

Beep, boop 🤖, Generating test data for your PR with the job type: sdg-svc. Your Job ID is 346. The results will be presented below in the pull request status box. This may take several minutes...

jjasghar · 2024-05-28T19:27:11Z

Looking at the precheck it seems to be very far off the rails which is good for this PR. Now the SDG generate will give us an understanding of the data that is generated and if it will add possible value to the model.

Assuming it does, then we will tag it as approved, and it will be upstreamed, then we will see from the engineering team if the model improves, only then we will merge.

instruct-lab-bot · 2024-05-28T19:27:14Z

Results for job ID: 346 using the model sdg service backend!

Results can be found here.

jjasghar

Please fix the version: 2 then we are ready the next steps.

compositional_skills/linguistics/Conversation_orchestration/identify_different_themes/qna.yaml

Signed-off-by: Hugo Carlos Calero Díaz <[email protected]>

instruct-lab-bot · 2024-06-02T09:33:29Z

Beep, boop 🤖, Hi, I'm @instructlab-bot and I'm going to help you with your pull request. Thanks for you contribution! 🎉

I support the following commands:

@instructlab-bot precheck -- Check existing model behavior using the questions in this proposed change.
@instructlab-bot generate -- Generate a sample of synthetic data using the synthetic data generation backend infrastructure.
@instructlab-bot generate-local -- Generate a sample of synthetic data using a local model.
@instructlab-bot help -- Print this help message again.

Note

Results or Errors of these commands will be posted as a pull request check in the Checks section below

Note

Currently only maintainers belongs to [[taxonomy-triagers taxonomy-approvers taxonomy-maintainers labrador-org-maintainers instruct-lab-bot-maintainers]] teams are allowed to run these commands.

jjasghar · 2024-06-03T21:21:04Z

@instructlab-bot precheck

instruct-lab-bot · 2024-06-03T21:21:07Z

Beep, boop 🤖, Generating test data for your PR with the job type: precheck. Your Job ID is 362. The results will be presented below in the pull request status box. This may take several minutes...

instruct-lab-bot · 2024-06-03T21:21:42Z

Results for job ID: 362 using the model instructlab/granite-7b-lab!

Results can be found here.

jjasghar · 2024-06-03T21:29:08Z

With what I read through the pre-check the model already seems to do this quite well. @mingxzhao can you do a sanity check for me?
I think we should reject this as something the model already does.

mingxzhao · 2024-06-03T21:35:53Z

It does seem to get several of the answers wrong when compared to the user provided answers. I think this could be a good skill, but the context part of the question may need to be placed in the "context" field of the yaml for good SDG. At the moment there does seem to be several wrong answers though.

jjasghar · 2024-06-03T21:39:32Z

Wait really? I thought the precheck answer what we wanted. That it knew the questions didn't match the context, for instance: https://instruct-lab-bot.s3.us-east-2.amazonaws.com/precheck-pr-787-1dee610dbcdeb1f9cc76357c360aa8e63ebd1e8c-job-362/chat_2024-06-03T21_21_14.log

While the questions you've provided are related to the topic of food and dining
in Madrid, the question about the Retiro park is unrelated to the previous
conversation.

mingxzhao · 2024-06-03T21:42:40Z

Not quite, the question is about whether the following provided question is in the same context as the previous provided questions. The model seems to have difficulty parsing this and even answers the question separately from the context entirely.

Hugo-Calero · 2024-06-06T15:25:04Z

Hi, is there anything I can do to advance the status of this contribution?

jjasghar · 2024-06-06T15:30:22Z

We are in progress here. I believe we need this to be put in the next run, updates will be added to the PR as they arrive.

jjasghar · 2024-07-10T17:28:35Z

Hi! 👋
It’s been a while since you’ve seen any movement on this PR. We haven’t forgotten about you! We’ve run into some logistical issues, hence this delay. We absolutely want your PR, and being marketed as e2e-ready is still the last stop before we get it into the upstream model.

We are thankful for your patience and ask that you please keep this PR open. As soon as we finish all our behind-the-scenes work, we’ll test the full model against your submissions and, ideally, accept your amazing contribution(s)!

Your Community Maintainer Team.

P.S. if you have any specific questions or thoughts, don’t hesitate to comment on pull request this or email [email protected] and [email protected], and we’ll get back to you as soon as possible.

jjasghar · 2024-08-26T16:58:22Z

@mcorbin-ibm where should this one live?

mcorbin-ibm · 2024-08-26T17:32:34Z

I think that this applies more to the "AI" side of things than "linguistics" and after doing some research in Wikipedia, in conjunction with our Dewey Decimal System reference document, I recommend placing this here in the taxonomy:

compositional_skills/technology/computer_science/ai/nlp/conversation_orchestration

Note: I did not see a context specified in the qna, so I did not put this under the compositional_skills/grounded directory, but if the qna requires a context, it should be moved there instead?

Hugo-Calero added 3 commits May 3, 2024 16:19

Create qna.yaml

6d871c4

Signed-off-by: Hugo Carlos Calero Díaz <[email protected]>

Update qna.yaml

8a6628e

Signed-off-by: Hugo Carlos Calero Díaz <[email protected]>

Create attribution.txt

1c3b85a

Signed-off-by: Hugo Carlos Calero Díaz <[email protected]>

github-actions bot added triage-needed (Auto labeled) skill is ready to be triaged skill (Auto labeled) labels May 3, 2024

Update qna.yaml

03bb82d

Signed-off-by: Hugo Carlos Calero Díaz <[email protected]>

mingxzhao added triage-requested-changes skill has been reviewed; changes requested from contributor and removed triage-needed (Auto labeled) skill is ready to be triaged labels May 9, 2024

jjasghar requested changes May 13, 2024

View reviewed changes

Update qna.yaml

3aa0df5

Signed-off-by: Hugo Carlos Calero Díaz <[email protected]>

github-actions bot added the triage-needed (Auto labeled) skill is ready to be triaged label May 16, 2024

jjasghar requested changes May 17, 2024

View reviewed changes

jjasghar removed the triage-needed (Auto labeled) skill is ready to be triaged label May 17, 2024

Update qna.yaml linting

6cf4ec3

Signed-off-by: Hugo Carlos Calero Díaz <[email protected]>

github-actions bot added the triage-needed (Auto labeled) skill is ready to be triaged label May 18, 2024

jjasghar changed the title ~~Add skill~~ AI assistants to determine if the context of the conversation May 20, 2024

jjasghar added the precheck-generate-ready PR is ready for precheck or generate step label May 20, 2024

jjasghar approved these changes May 28, 2024

View reviewed changes

jjasghar requested changes May 28, 2024

View reviewed changes

compositional_skills/linguistics/Conversation_orchestration/identify_different_themes/qna.yaml Show resolved Hide resolved

jjasghar added the triage-requested-changes skill has been reviewed; changes requested from contributor label May 28, 2024

Added version: 2

1dee610

Signed-off-by: Hugo Carlos Calero Díaz <[email protected]>

github-actions bot added the triage-needed (Auto labeled) skill is ready to be triaged label Jun 2, 2024

jjasghar removed triage-needed (Auto labeled) skill is ready to be triaged triage-requested-changes skill has been reviewed; changes requested from contributor labels Jun 3, 2024

jjasghar approved these changes Jun 7, 2024

View reviewed changes

jjasghar added community-build-ready Triage Team has signed off for synthetic data generation and removed precheck-generate-ready PR is ready for precheck or generate step labels Jun 7, 2024

AI assistants to determine if the context of the conversation #787

Are you sure you want to change the base?

AI assistants to determine if the context of the conversation #787

Conversation

Hugo-Calero commented May 3, 2024 • edited Loading

instruct-lab-bot bot commented May 3, 2024

mingxzhao commented May 9, 2024

instruct-lab-bot bot commented May 9, 2024

mingxzhao commented May 9, 2024

instruct-lab-bot bot commented May 9, 2024

instruct-lab-bot bot commented May 9, 2024

Hugo-Calero commented May 9, 2024

mingxzhao commented May 9, 2024

Hugo-Calero commented May 13, 2024

jjasghar left a comment

Choose a reason for hiding this comment

instruct-lab-bot bot commented May 16, 2024

Hugo-Calero commented May 16, 2024

jjasghar left a comment

Choose a reason for hiding this comment

instruct-lab-bot bot commented May 18, 2024

jjasghar commented May 20, 2024

instruct-lab-bot bot commented May 20, 2024

instruct-lab-bot bot commented May 20, 2024

Hugo-Calero commented May 27, 2024

jjasghar commented May 28, 2024

instruct-lab-bot bot commented May 28, 2024

jjasghar commented May 28, 2024

instruct-lab-bot bot commented May 28, 2024

jjasghar left a comment

Choose a reason for hiding this comment

instruct-lab-bot bot commented Jun 2, 2024

jjasghar commented Jun 3, 2024

instruct-lab-bot bot commented Jun 3, 2024

instruct-lab-bot bot commented Jun 3, 2024

jjasghar commented Jun 3, 2024

mingxzhao commented Jun 3, 2024

jjasghar commented Jun 3, 2024

mingxzhao commented Jun 3, 2024

Hugo-Calero commented Jun 6, 2024

jjasghar commented Jun 6, 2024

jjasghar commented Jul 10, 2024

jjasghar commented Aug 26, 2024

mcorbin-ibm commented Aug 26, 2024

Hugo-Calero commented May 3, 2024 •

edited

Loading