[c++] Fix `dump_model()` information for root node #6569

neNasko1 · 2024-07-24T14:01:46Z

This PR corrects the output of dump_model() and other dump-related functions like trees_to_dataframe(). There are 2 fixes implemented:

The current Tree::Split implementation incorrectly saves the old leaf output value in the internal_value_ array when called on the root node. This in turn makes inspecting the whole training process from python incomplete.

Before:

(Pdb) booster_.trees_to_dataframe()
     tree_index  node_depth node_index left_child right_child parent_index  ... decision_type  missing_direction missing_type     value weight count
0             0           1       0-S0       0-S1        0-S2         None  ...            ==              right         None  0.000000      0   200
1             0           2       0-S1       0-S5        0-S4         0-S0  ...            <=               left         None  0.106573    113   113
2             0           3       0-S5       0-L0        0-L6         0-S1  ...            ==              right         None  0.082122     56    56
3             0           4       0-L0       None        None         0-S5  ...          None               None         None  0.064612     26    26
4             0           4       0-L6       None        None         0-S5  ...          None               None         None  0.097297     30    30

After:

(Pdb) booster_.trees_to_dataframe().head()
   tree_index  node_depth node_index left_child right_child parent_index  ... decision_type  missing_direction missing_type     value weight count
0           0           1       0-S0       0-S1        0-S2         None  ...            ==              right         None  0.081757    200   200
1           0           2       0-S1       0-S5        0-S4         0-S0  ...            <=               left         None  0.106573    113   113
2           0           3       0-S5       0-L0        0-L6         0-S1  ...            ==              right         None  0.082122     56    56
3           0           4       0-L0       None        None         0-S5  ...          None               None         None  0.064612     26    26
4           0           4       0-L6       None        None         0-S5  ...          None               None         None  0.097297     30    30

Stump has no leaf_count inside dump_model() output #5962

neNasko1 · 2024-07-29T11:11:17Z

Currently the CI is not passing as #6574 is blocking.

neNasko1 · 2024-07-29T11:49:02Z

I am open to ideas of ways to test related functionalities.

Tests should now be sufficient for the change.

…ix-root-values

neNasko1 · 2024-07-30T16:47:43Z

@jameslamb
Could you take a look at the PR, now that the CI is passing?

jameslamb

@shiyu1994 or @guolinke could you help with a review of this?

I'm not sure if this will correctly handle these cases:

custom init_score provided (via Dataset)
boost_from_average=False passed

@neNasko1 could you also look at #5962 and let us know if you think this change would fix the issue @thatlittleboy reported there?

neNasko1 · 2024-08-03T19:02:06Z

Thank you for taking the time to look into the PR and linking a relevant issue.

I'm not sure if this will correctly handle these cases:

custom init_score provided (via Dataset)

boost_from_average=False passed

I think those cases are handled as the results are consistent with what leaf values report, I also remade the test to boost from average.

@neNasko1 could you also look at #5962 and let us know if you think this change would fix the issue @thatlittleboy reported there?

I took the liberty to merge @thatlittleboy's WIP code into mine, additionally fixing the issues that they reported. I will also change the description of the PR to reflect both of the fixes.

guolinke · 2024-08-14T18:52:41Z

Sorry for the late response. This PR looks good to me.

jameslamb

Thanks very much for the review @guolinke !

I've left a few other small requests.

tests/python_package_test/test_dask.py

tests/python_package_test/test_engine.py

src/io/tree.cpp

tests/python_package_test/test_engine.py

jameslamb

Thanks, I left a few questions for your consideration.

tests/python_package_test/test_dask.py

src/treelearner/cuda/cuda_single_gpu_tree_learner.cpp

borchero

This looks good to me! 🚀

neNasko1 · 2024-09-02T10:07:36Z

@jameslamb
Just to recap: the tests/python_package_test/test_dask.py seems to have previously been a no-op since the 2 models produced with and without an init scores are the same for the classifier case. This however is not related to the current changes. Can you tell me whether I am missing something?

jameslamb · 2024-09-05T02:44:25Z

the tests/python_package_test/test_dask.py seems to have previously been a no-op since the 2 models produced with and without an init scores are the same for the classifier case.

I'll investigate this when I can, hopefully in the next few days. In the interim, you can help move this forward by resolving merge conflicts and pulling in the latest changes on master.

StrikerRUS

LGTM!

But I'll keep following the discussion about Dask Ranker test (#6569 (comment)).

neNasko1 requested review from guolinke, jameslamb, shiyu1994, jmoralez, borchero and StrikerRUS as code owners July 24, 2024 14:01

neNasko1 and others added 3 commits July 24, 2024 17:46

Fix value calculation in root node

12102cc

Fix dask tests

c933399

Merge branch 'master' into fix-root-values

c240016

Create proper tests

2f1de57

Merge branch 'master' into fix-root-values

273a1df

jameslamb added awaiting review fix labels Jul 29, 2024

jameslamb changed the title ~~[c++] Root internal_value_ is not calculated properly~~ [c++] Fix calculation of internal_value_ for root node Jul 29, 2024

neNasko1 added 3 commits July 30, 2024 02:10

Test only on cpu

208df85

Merge branch 'fix-root-values' of github.com:neNasko1/LightGBM into f…

130879b

…ix-root-values

Disable new tests for CUDA

48e6b96

jameslamb requested changes Aug 2, 2024

View reviewed changes

neNasko1 added 3 commits August 3, 2024 19:10

Merge with microsoft#5964

26b9859

Finish merging with dump_model unification

88e3dec

Improve tests

e1274dc

neNasko1 changed the title ~~[c++] Fix calculation of internal_value_ for root node~~ [c++] Fix dump_model() information for root node Aug 3, 2024

neNasko1 and others added 4 commits August 4, 2024 20:44

Add linear test for stump

38ee92c

Fix CUDA compilation

3b423de

Merge branch 'master' into fix-root-values

c89e257

Merge branch 'master' into fix-root-values

3de14d9

guolinke approved these changes Aug 14, 2024

View reviewed changes

Merge branch 'master' into fix-root-values

fc42c1c

jameslamb requested changes Aug 14, 2024

View reviewed changes

tests/python_package_test/test_dask.py Outdated Show resolved Hide resolved

tests/python_package_test/test_engine.py Outdated Show resolved Hide resolved

src/io/tree.cpp Outdated Show resolved Hide resolved

StrikerRUS reviewed Aug 14, 2024

View reviewed changes

tests/python_package_test/test_engine.py Outdated Show resolved Hide resolved

neNasko1 and others added 15 commits August 15, 2024 01:27

Comments after code review

3ffcac6

Fix test

d5a82c4

Reenable cuda testing

be7675d

Tests

f616e03

Merge branch 'microsoft:master' into fix-root-values

6c6bc33

test cuda

c28a2cf

.

6113f90

Fix warning

94cf7f0

reenable tests

01aa952

.

fadaa83

Merge branch 'fix-cuda' into fix-root-values

b9c681b

fix cuda

a323acb

Fix compilation error

0fd0c59

Fix weight

4cc5dd4

Fix numerical

a743a87

neNasko1 requested a review from jameslamb August 15, 2024 23:57

jameslamb requested changes Aug 16, 2024

View reviewed changes

tests/python_package_test/test_dask.py Outdated Show resolved Hide resolved

src/treelearner/cuda/cuda_single_gpu_tree_learner.cpp Show resolved Hide resolved

Make tests more robust

031c945

neNasko1 requested a review from jameslamb August 18, 2024 20:19

borchero approved these changes Sep 2, 2024

View reviewed changes

Merge branch 'master' into fix-root-values

91993a9

Merge branch 'master' into fix-root-values

f744f64

StrikerRUS approved these changes Sep 5, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[c++] Fix `dump_model()` information for root node #6569

[c++] Fix `dump_model()` information for root node #6569

neNasko1 commented Jul 24, 2024 •

edited

Loading

neNasko1 commented Jul 29, 2024

neNasko1 commented Jul 29, 2024

neNasko1 commented Jul 30, 2024

jameslamb left a comment

neNasko1 commented Aug 3, 2024 •

edited

Loading

guolinke commented Aug 14, 2024

jameslamb left a comment

jameslamb left a comment

borchero left a comment

neNasko1 commented Sep 2, 2024 •

edited

Loading

jameslamb commented Sep 5, 2024

StrikerRUS left a comment

[c++] Fix dump_model() information for root node #6569

Are you sure you want to change the base?

[c++] Fix dump_model() information for root node #6569

Conversation

neNasko1 commented Jul 24, 2024 • edited Loading

neNasko1 commented Jul 29, 2024

neNasko1 commented Jul 29, 2024

neNasko1 commented Jul 30, 2024

jameslamb left a comment

Choose a reason for hiding this comment

neNasko1 commented Aug 3, 2024 • edited Loading

guolinke commented Aug 14, 2024

jameslamb left a comment

Choose a reason for hiding this comment

jameslamb left a comment

Choose a reason for hiding this comment

borchero left a comment

Choose a reason for hiding this comment

neNasko1 commented Sep 2, 2024 • edited Loading

jameslamb commented Sep 5, 2024

StrikerRUS left a comment

Choose a reason for hiding this comment

[c++] Fix `dump_model()` information for root node #6569

[c++] Fix `dump_model()` information for root node #6569

neNasko1 commented Jul 24, 2024 •

edited

Loading

neNasko1 commented Aug 3, 2024 •

edited

Loading

neNasko1 commented Sep 2, 2024 •

edited

Loading