Refactor synchronization #62

alexandermorozov · 2016-04-19T21:09:08Z

I've implemented memory access API and syncronization based on bitmasks. Tesnsor/TensorView and decoupling aren't implemented.

Native and CUDA pass all tests. OpenCL compiles but segfaults on my machine, both with this PR and without it.

PR isn't ready to be merged yet -- I'd like to fix plugins and Leaf first to see that there are no unexpected problems.

Remove methods `sync()`, `get()`, `get_mut()`, `remove_copy()` of `SharedTensor` and introduce new set of methods: `read()`, `read_write()`, `write_only()`, `drop_device()`. Signature of `SharedTensor::new()` has also changed. New API has following benefits: - limited checks of use of uninitialized memory, - better memory tracking: several memories can be simultaneously marked as up-to-date, so some synchronization operations might be skipped, - backing memory is automatically allocated on the first use and added to `SharedTensor`, even if it's immutable. Mutability is required only for reshaping, modifying actual data and dropping memories. Rationale and design decisions are discussed at the corresponding bugtracker issue. BREAKING CHANGE: sync and memory management API of `SharedTensor` CLOSE: autumnai#37

…NGELOG]

REFERENCE: autumnai#37

Implementation of SharedTensor uses `unsafe` to extend lifetime of memory references that are returned by read/write family of methods. Those tests verify that attempts to create dangling pointers or otherwise misuse API fail at compile time.

During refactoring (autumnai#37) several error were upgraded into panics. Those errors may happen only if internal logic of `SharedTensor` is incorrect and leads to inconsistent state and broken invariants.

…_CHANGELOG] REFERENCE: autumnai#37

Since plugin operations rely on `SharedTensor`'s memory access and allocation API, they need to proxy errors. This commit adds new error enum entry to plugin::Error, and removes deprecated plugin::Error::MissingMemoryForDevice. It also adds autoconversion from plugin::Error::SharedTensor for convenient use of `try!`.

…..>` Allocation of `SharedTensor` may fail only on OOM, so returning `Result` type is redundant.

Refactor code CUDA and Native backend to match #autumnai/collenchyma/62 that provides enchanced memory management and syncronization. Since memory management is now automatic, `*_plain` variants of functions are removed. BREAKING CHANGE: *_plain versions of API functions are removed, arguments of their counterpart functions may have changed in mutablity. REFERENCE: autumnai/collenchyma#37, autumnai/collenchyma#62 refactor/native: convert to the new memory management API Convert Native backend. Code now compiles.

Refactor code CUDA and Native backend to match #autumnai/collenchyma/62 that provides enchanced memory management and syncronization. Since memory management is now automatic, `*_plain` variants of functions are removed. BREAKING CHANGE: *_plain versions of API functions are removed, arguments of their counterpart functions may have changed in mutablity. REFERENCE: autumnai/collenchyma#37, autumnai/collenchyma#62

Use .read()/.write_only()/.read_write() instead of .sync()/.add_device()/.get() calls. REFERENCE: autumnai/collenchyma#37, autumnai/collenchyma#62

…I [SKIP_CHANGELOG] REFERENCE: autumnai/collenchyma#37, autumnai/collenchyma#62

Fix SharedTensor::new() usage. REFERENCE: autumnai/collenchyma#37, autumnai/collenchyma#62

alexandermorozov added 8 commits April 19, 2016 02:10

refactor/tests: fix tests after breaking change autumnai#37 [SKIP_CHA…

9c4918b

…NGELOG]

test/tensor: add specific test for new sync/access API [SKIP_CHANGELOG]

f76ced4

REFERENCE: autumnai#37

refactor/tensor: remove unused error types

947d966

During refactoring (autumnai#37) several error were upgraded into panics. Those errors may happen only if internal logic of `SharedTensor` is incorrect and leads to inconsistent state and broken invariants.

fix/benches: fix benches after refactoring of memory access API [SKIP…

926f1e3

…_CHANGELOG] REFERENCE: autumnai#37

refactor/tensor: return SharedTensor from new instead of `Result<…

6a21c7f

…..>` Allocation of `SharedTensor` may fail only on OOM, so returning `Result` type is redundant.

alexandermorozov force-pushed the refactor-sync branch from c128a9b to 6a21c7f Compare April 23, 2016 16:46

alexandermorozov mentioned this pull request Apr 23, 2016

Refactor for new memory syncronization API autumnai/collenchyma-blas#15

Open

alexandermorozov mentioned this pull request Apr 28, 2016

Refactor sync autumnai/collenchyma-nn#48

Open

alexandermorozov added a commit to alexandermorozov/leaf that referenced this pull request Apr 30, 2016

refactor/tests: convert tests and benches to the new memory access AP…

f506a2c

…I [SKIP_CHANGELOG] REFERENCE: autumnai/collenchyma#37, autumnai/collenchyma#62

alexandermorozov mentioned this pull request Apr 30, 2016

Convert Leaf to the new memory access API autumnai/leaf#103

Open

alexandermorozov added a commit to alexandermorozov/leaf-examples that referenced this pull request Apr 30, 2016

refactor/tensor: convert to the new memory API

704e45e

Fix SharedTensor::new() usage. REFERENCE: autumnai/collenchyma#37, autumnai/collenchyma#62

This was referenced Apr 30, 2016

refactor/tensor: convert to the new memory API autumnai/leaf-examples#17

Open

Simplify SharedTensor syncing #37

Open

jonysy mentioned this pull request Feb 2, 2017

Merge refactored collenchyma code jonysy/parenchyma#2

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor synchronization #62

Refactor synchronization #62

alexandermorozov commented Apr 19, 2016

Refactor synchronization #62

Are you sure you want to change the base?

Refactor synchronization #62

Conversation

alexandermorozov commented Apr 19, 2016