
Issue #49: Use the pydantic model type in the dspy Signature #51

Open · wants to merge 4 commits into main
Conversation

snakedye (Contributor)

This PR doesn't fully resolve #49, but rather serves as a stepping stone toward using annotations on the Pydantic models in the future.

We now leverage TypedChainOfThought for better integration of the Pydantic schema in DSPy. It lets us simplify the Signature and set the Pydantic model as the output type, meaning DSPy must return an object that matches the schema.

The GPT-related internals were also refactored to match the changes in the API.
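
For reference, a minimal sketch of the typed-predictor pattern this relies on (the Answer model and its field names below are illustrative placeholders, not the actual schema from our pipeline):

    import dspy
    from pydantic import BaseModel, Field

    # Illustrative output schema; the real model in the pipeline differs.
    class Answer(BaseModel):
        summary: str = Field(description="Short answer text")
        confidence: float = Field(description="Score between 0 and 1")

    class QA(dspy.Signature):
        """Answer the question as a structured object."""
        question: str = dspy.InputField()
        answer: Answer = dspy.OutputField()

    # TypedChainOfThought parses and validates the LM output against the
    # Pydantic schema, so the program must return an object matching it.
    predictor = dspy.TypedChainOfThought(QA)
    result = predictor(question="What does this PR change?")
    print(result.answer.summary)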

Related to #49

@snakedye self-assigned this on Oct 16, 2024
@Endlessflow (Contributor) left a comment:

WARNING about the Phoenix-related observability code added here:
Tread with caution, as it will break the pipeline if the given endpoint is not valid (it will keep retrying to send the traces in an infinite loop).

Also, we forgot to remove everything related to the performance assessment from this PR. Therefore, all the changes to the following files need to be removed from the PR:

  • .gitignore
  • performance_assessment.py
  • test_data/*

Otherwise, looks good.

)

DSPyInstrumentor().instrument(tracer_provider=tracer_provider)

Contributor:

I recommend exercising caution when adding lines 8-17 to the main branch. If the endpoint isn’t valid (i.e., Phoenix isn’t set up), it WILL cause the entire pipeline to stop functioning (I had that happen to me this morning).

To mitigate this, I suggest either removing the observability code for now or commenting it out by default. We can uncomment it when we need to use traces for debugging.

Additionally, I’m unsure if this feature should have its own issue and PR, or if it’s acceptable to include it in this PR.
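
One way to make the instrumentation opt-in rather than removing it outright, as a sketch (this assumes the setup uses phoenix.otel and openinference as in the diff above; gating on the PHOENIX_COLLECTOR_ENDPOINT variable is my suggestion, not something already in the PR):

    import os

    from openinference.instrumentation.dspy import DSPyInstrumentor
    from phoenix.otel import register

    # Only wire up tracing when an endpoint is explicitly configured, so a
    # missing or unreachable Phoenix instance cannot stall the pipeline
    # with endless retry attempts.
    endpoint = os.getenv("PHOENIX_COLLECTOR_ENDPOINT")
    if endpoint:
        tracer_provider = register(endpoint=endpoint)
        DSPyInstrumentor().instrument(tracer_provider=tracer_provider)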

test_logs/
reports/
test_data/
performance_assessment.py
Contributor:

I don't think adding test_data/ and performance_assessment.py to the .gitignore was intended (or desirable).

Contributor:

We forgot to remove this from this PR.

Contributor:

Same here, test_data stuff should not be merged into main yet.

Contributor:

idem - should not be merged into main yet.

Successfully merging this pull request may close these issues:

As a dev, I want to explore if annotating Pydantic models can improve GPT performance in our pipeline