Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adds data source properties to git connectors #1280

Merged
merged 45 commits into from
Oct 10, 2023

Conversation

rvztz
Copy link
Contributor

@rvztz rvztz commented Sep 1, 2023

Adds data source properties to git connectors:

  • data_created
  • date_modified
  • version
  • record_locator
    These properties are instantiated when supported by the connector.

Separates the logic between fetching the file from source and get_file. Retrieves file metadata when any of the properties are called.

Adds logic to check if file exists in the remote source. For connectors that don't directly support it, adds exception handling to check any issues while retrieving the file.

@ryannikolaidis ryannikolaidis added the ingest Issues related to the Ingest CLI or unstructured.ingest modules label Sep 1, 2023
@rvztz rvztz marked this pull request as ready for review September 8, 2023 10:23
CHANGELOG.md Outdated Show resolved Hide resolved
Copy link
Contributor

@ryannikolaidis ryannikolaidis left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

see note on record_locator

rvztz and others added 2 commits September 20, 2023 23:35
… update (#1491)

This pull request includes updated ingest test fixtures.
Please review and merge if appropriate.

Co-authored-by: rvztz <[email protected]>
@@ -1,28 +0,0 @@
[
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this looks like a bug, this shouldn't be removed. try re-running fixtures.

Copy link
Contributor

@ryannikolaidis ryannikolaidis left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm!

@ryannikolaidis ryannikolaidis added this pull request to the merge queue Oct 10, 2023
Merged via the queue into main with commit 7fd61e3 Oct 10, 2023
39 checks passed
@ryannikolaidis ryannikolaidis deleted the data-source-properties-git branch October 10, 2023 23:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ingest Issues related to the Ingest CLI or unstructured.ingest modules
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants