Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feature: Teach the combiner to keep track of a running inferred schema and log whenever it changes. #1128

Merged
merged 17 commits into from
Aug 10, 2023

Conversation

jshearer
Copy link
Contributor

@jshearer jshearer commented Jul 28, 2023

This is the second step of #1115.


This change is Reviewable

@jshearer jshearer force-pushed the feature/combiner_emit_schemas branch from e6eb307 to 483f11e Compare August 4, 2023 16:21
@jshearer jshearer changed the base branch from feature/shape_widening to master August 4, 2023 16:22
@jshearer jshearer force-pushed the feature/combiner_emit_schemas branch 2 times, most recently from 713009d to f185673 Compare August 4, 2023 16:47
@jshearer jshearer changed the base branch from master to refactor/schema_builder August 4, 2023 16:48
@jshearer jshearer force-pushed the feature/combiner_emit_schemas branch 2 times, most recently from ec9aec8 to 25b6025 Compare August 4, 2023 17:30
@jshearer
Copy link
Contributor Author

jshearer commented Aug 4, 2023

I'm also done force-pushing here too, sorry 🤦

Copy link
Member

@jgraettinger jgraettinger left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Two major notes:

  • We want to selectively enable this widening / logging. Capture tasks definitely should do it, but materializations don't need to and also don't want to pay the overhead. How about a new boolean to enable the behavior in the protobuf config API?

  • The logs must include the collection name, which also needs to be plumbed through.

Other than that and comments below LGTM 👍

crates/derive/src/combine_api.rs Outdated Show resolved Hide resolved
crates/ops/src/tracing.rs Outdated Show resolved Hide resolved
crates/derive/src/combine_api.rs Outdated Show resolved Hide resolved
@jshearer
Copy link
Contributor Author

jshearer commented Aug 4, 2023

Ok, I added collection_name and enable_schema_inference to the Config message. Thoughts?

@jshearer jshearer force-pushed the feature/combiner_emit_schemas branch from 73e1103 to 9b2066c Compare August 8, 2023 14:50
@jshearer jshearer changed the base branch from refactor/schema_builder to master August 8, 2023 14:50
@github-actions
Copy link

github-actions bot commented Aug 8, 2023

PR Preview Action v1.4.4
🚀 Deployed preview to https://estuary.github.io/flow/pr-preview/pr-1128/
on branch gh-pages at 2023-08-08 14:52 UTC

@jshearer jshearer force-pushed the feature/combiner_emit_schemas branch from 9b2066c to 0f50e31 Compare August 10, 2023 17:09
@jshearer jshearer force-pushed the feature/combiner_emit_schemas branch from 0f50e31 to f536e96 Compare August 10, 2023 18:26
crates/derive/src/combine_api.rs Show resolved Hide resolved
crates/derive/src/combine_api.rs Outdated Show resolved Hide resolved
crates/doc/src/lib.rs Outdated Show resolved Hide resolved
Copy link
Member

@jgraettinger jgraettinger left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

crates/derive/src/combine_api.rs Outdated Show resolved Hide resolved
@jgraettinger jgraettinger merged commit 886e308 into master Aug 10, 2023
3 checks passed
@jgraettinger jgraettinger deleted the feature/combiner_emit_schemas branch August 10, 2023 20:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants