
Stack cache and acceleration #767

Closed · wants to merge 6 commits into dev

Conversation

olivierlabayle
Collaborator

@olivierlabayle olivierlabayle commented May 12, 2022

Following up on: #759

This adds two fields to the Stack to enable/disable submachine caching and to set the network's acceleration mode:

  • cache: whether submachines cache data.
  • acceleration: acceleration mode of the learning network.

added by @ablaom: This PR allows one to train a learning network node using multithreading, as in fit!(node, acceleration=CPUThreads()).
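The two new options might be used as in the following sketch (assumes MLJ and the component-model packages are installed; the particular models and data are illustrative, not part of this PR):

```julia
using MLJ

# Load two illustrative component models:
Tree = @load DecisionTreeClassifier pkg=DecisionTree verbosity=0
KNN  = @load KNNClassifier pkg=NearestNeighborModels verbosity=0

# The two new fields: disable submachine caching, and train the
# learning network using multithreading:
stack = Stack(metalearner=KNN(),
              resampling=CV(nfolds=3),
              cache=false,
              acceleration=CPUThreads(),
              tree=Tree(),
              knn=KNN())

X, y = @load_iris
mach = machine(stack, X, y)
fit!(mach)
```

Equivalently, at the learning-network level, any node can now be trained with `fit!(node, acceleration=CPUThreads())`, as noted above.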

@olivierlabayle olivierlabayle marked this pull request as draft May 12, 2022 16:53
@olivierlabayle olivierlabayle mentioned this pull request May 12, 2022
@codecov-commenter

codecov-commenter commented May 15, 2022

Codecov Report

Merging #767 (dd2478b) into dev (4d1ed14) will increase coverage by 0.11%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##              dev     #767      +/-   ##
==========================================
+ Coverage   85.85%   85.96%   +0.11%     
==========================================
  Files          36       36              
  Lines        3451     3471      +20     
==========================================
+ Hits         2963     2984      +21     
+ Misses        488      487       -1     
Impacted Files Coverage Δ
src/composition/learning_networks/machines.jl 91.95% <100.00%> (ø)
src/composition/learning_networks/nodes.jl 71.24% <100.00%> (+1.37%) ⬆️
src/composition/models/stacking.jl 94.66% <100.00%> (+0.14%) ⬆️
src/resampling.jl 91.60% <0.00%> (+0.44%) ⬆️

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@ablaom
Member

ablaom commented May 16, 2022

@olivierlabayle This is great, thank you.

Since this adds CPUThreads as an option for training any learning network node, it's probably prudent to add some more tests for this. I would like to see that the CPUThreads analogue of these tests also passes:

## EXTRA TESTS FOR TRAINING SEQUENCE

Note that @test_mach_sequence already provides a list of multiple acceptable training sequences, to handle the case of asynchronous training. So they "should" pass.

Probably we can just wrap these tests in an @test_accelerated block, as we have elsewhere, but exclude CPUProcesses.

I don't think we need to check both options give same results. You already do this for Stack. What do you think?

Member

@ablaom ablaom left a comment

My comments and test suggestions notwithstanding, I'm already happy with this "draft" PR.

@olivierlabayle
Collaborator Author

@ablaom Thank you for this early review. I had only marked this as a draft because I wanted to be a bit more thorough about testing the new multithreading feature.

I will need to have a proper look at the @test_accelerated and @test_mach_sequence macros that you suggest. If the models in the sequence are completely deterministic, I think I should make sure the results are concordant, at least to some extent.

@olivierlabayle
Collaborator Author

I have extended the testset you suggested with CPUThreads() as a resource for the custom learning network. Unfortunately, I couldn't make use of the @testset_accelerated macro with the exclude keyword. Besides, what is the advantage over the classic @testset for accel in accelerations ... end? (This is what I have used.)

@olivierlabayle olivierlabayle marked this pull request as ready for review May 19, 2022 00:17
@ablaom
Member

ablaom commented May 19, 2022

Besides, what is the advantage over the classic @testset for accel in accelerations end? (This is what I have used)

👍🏾 Probably that macro was written before that syntax was available?? I didn't know about it myself.
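For reference, the plain `@testset ... for` loop mentioned above can be sketched as follows (a self-contained illustration using symbols as stand-ins; the real tests iterate over MLJ resources such as CPU1() and CPUThreads(), with CPUProcesses excluded as discussed):

```julia
using Test

# Stand-ins for the acceleration resources exercised by the real tests:
accelerations = (:CPU1, :CPUThreads)

# One nested testset per resource, thanks to @testset's for-loop syntax:
@testset "training with $accel" for accel in accelerations
    result = sum(1:10)   # placeholder for training the learning network
    @test result == 55   # the same result is expected for every resource
end
```

The for-loop form of `@testset` creates a separately reported testset per iteration, which is essentially what a dedicated macro with an `exclude` keyword would otherwise provide.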

Member

@ablaom ablaom left a comment

I'm happy with this, @olivierlabayle

I'm going to recommend that @OkonSamuel also reviews this, as he is more familiar with multithreading issues. He is pretty busy, but I think it's worth getting an expert to look over this.

@ablaom ablaom requested a review from OkonSamuel May 19, 2022 05:24
@ablaom
Member

ablaom commented May 30, 2022

Following suggestion of @OkonSamuel we should test multi-threading works for a variety of component models. I am working on some enhancements of MLJTestIntegration.jl to automate such a test, and will check back here when that is ready.

@olivierlabayle
Collaborator Author

Following suggestion of @OkonSamuel we should test multi-threading works for a variety of component models. I am working on some enhancements of MLJTestIntegration.jl to automate such a test, and will check back here when that is ready.

Great idea, I'll be waiting then!

@@ -54,7 +58,7 @@ const Stack{modelnames, inp_scitype, tg_scitype} =
ProbabilisticStack{modelnames, inp_scitype, tg_scitype}}

"""
Stack(;metalearner=nothing, resampling=CV(), name1=model1, name2=model2, ...)
Stack(;metalearner=nothing, resampling=CV(), name1=model1, cache=true, acceleration=CPU1(), name2=model2, ...)
Member

This doc-string got a little mangled, with model1 and model2 being separated. As this has grown past the length of the line, how about we not list all the options here:

Suggested change
Stack(;metalearner=nothing, resampling=CV(), name1=model1, cache=true, acceleration=CPU1(), name2=model2, ...)
Stack(;metalearner=nothing, name1=model1, name2=model2, keyword_options...)

ablaom added a commit to JuliaAI/MLJTestIntegration.jl that referenced this pull request Jun 9, 2022
add stack_evaluation; needs JuliaAI/MLJBase.jl#767

rm target_scitype arg from stack_evaluation

put stack test into test()

oops

fix some bugs

separate out :accelerated_stack_evaluation test

more tweaks

oops
@ablaom
Member

ablaom commented Jun 9, 2022

@olivierlabayle I needed to rebase this and am closing in favour of #785. The rebase includes a fix for the doc string marked above.

@ablaom ablaom closed this Jun 9, 2022