Integrate with syft #28

TTitcombe · 2020-06-19T11:31:30Z

What?

The current implementation of data loaders and datasets is quite hacky.
We should integrate existing syft functionality and extend it to make Vertically-partitioned dataset
a robust class, making it easy for anyone to apply PyVertical to any dataset

Breakdown

Build on syft.fl.BaseDataset to create a dataset which holds partitions and may hold either data or targets. This should extend PyVertical's VerticalDataset to include syft functionality of ownership Extend syft federated datasets #47
Create a function dataset_partition which partitions a dataset, sends the partitioned datasets to the correct worker, and returns a syft.fl.FederatedDataset of partitioned datssets. This builds on the current partition_dataset function in PyVertical, and is similar to syft.fl.dataset_federate Create partition function for federated datasets #48
Replace PartitionDistributingDataLoader with a dataloader which takes a syft.fl.FederatedDataset. This should extend syft.fl.FederatedDataLoader to account for datasets which may not contain data or targets Create syft-like federated dataloader #49
Integrate with PSI Integrate PSI with workers #50
Encrypt unique IDs Encrypt IDs #54

Additional Context

This will developed simultaneously with the extended PyVertical demonstration (#25), so to avoid breaking changes existing dataloaders/data splitters should be kept until this issue is complete

The text was updated successfully, but these errors were encountered:

TTitcombe · 2020-06-19T11:48:42Z

cc @tudorcebere from PySyft

TTitcombe · 2020-11-24T17:26:29Z

closing as issues do not align with current roadmap for syft 0.3.0

TTitcombe added Priority: 3 - Medium 😒 Should be fixed soon, but there may be other pressing matters that come first Type: Refactor 🔨 A complete overhaul of a file, feature, or codebase labels Jun 19, 2020

TTitcombe added this to the Generalised Vertically Partitioned Training milestone Jun 19, 2020

TTitcombe added Priority: 2 - High 😰 Should be fixed as quickly as possible, ideally within the current or following sprint and removed Priority: 3 - Medium 😒 Should be fixed soon, but there may be other pressing matters that come first labels Jul 4, 2020

TTitcombe changed the title ~~Make partitioned datasets more syft-like~~ Integrate with syft Jul 11, 2020

TTitcombe added Type: Epic 🤙 Describes a large amount of functionality that will likely be broken down into smaller issues and removed Type: Refactor 🔨 A complete overhaul of a file, feature, or codebase labels Jul 11, 2020

TTitcombe removed this from the Extended example milestone Jul 11, 2020

This was referenced Jul 11, 2020

Extend syft federated datasets #47

Closed

Create partition function for federated datasets #48

Closed

Create syft-like federated dataloader #49

Closed

Integrate PSI with workers #50

Closed

TTitcombe pinned this issue Jul 15, 2020

TTitcombe added this to the Syft 3.2 milestone Aug 5, 2020

TTitcombe closed this as completed Nov 24, 2020

TTitcombe unpinned this issue Nov 24, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrate with syft #28

Integrate with syft #28

TTitcombe commented Jun 19, 2020 •

edited

Loading

TTitcombe commented Jun 19, 2020

TTitcombe commented Nov 24, 2020

Integrate with syft #28

Integrate with syft #28

Comments

TTitcombe commented Jun 19, 2020 • edited Loading

What?

Breakdown

Additional Context

TTitcombe commented Jun 19, 2020

TTitcombe commented Nov 24, 2020

TTitcombe commented Jun 19, 2020 •

edited

Loading