WIP: Add support for NaN and mixed types in infer_labels() #13

Open · wants to merge 2 commits into main

Conversation

hagenw
Member

@hagenw hagenw commented Dec 15, 2021

Closes #12

This adds support for having NaN in truth or prediction and for having mixed types like ['a', 0] in truth and prediction.
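
For illustration, here is a minimal sketch of the intended behavior (the function name comes from the PR title; the actual signature and implementation in audmetric/core/utils.py may differ):

import math

def infer_labels(truth, prediction):
    # Collect unique labels from truth and prediction,
    # skipping NaN entries; NaN is always a float, so the
    # isinstance guard keeps math.isnan() away from strings
    labels = set()
    for value in list(truth) + list(prediction):
        if isinstance(value, float) and math.isnan(value):
            continue
        labels.add(value)
    # Mixed types like ['a', 0] cannot be compared directly,
    # so sort by string representation instead
    return sorted(labels, key=str)

>>> infer_labels(['a', 0], ['a', float('nan')])
[0, 'a']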

@codecov

codecov bot commented Dec 15, 2021

Codecov Report

Merging #13 (513244d) into master (279806b) will not change coverage.
The diff coverage is 100.0%.

Impacted Files Coverage Δ
audmetric/core/utils.py 100.0% <100.0%> (ø)

@frankenjoe
Collaborator

frankenjoe commented Dec 15, 2021

I see the use-case for supporting NaN. But does it make sense to have mixed types? Isn't that rather a sign of misuse? E.g. the user passes strings for truth but IDs for prediction. I wonder if it makes more sense to raise an error in that case.

@frankenjoe
Collaborator

frankenjoe commented Dec 15, 2021

Btw: how do we treat NaN in our metrics? E.g. how do we account for those labels in a confusion matrix?

@hagenw
Member Author

hagenw commented Dec 15, 2021

Btw: how do we treat NaN in our metrics? E.g. how do we account for those labels in a confusion matrix?

We have not really thought much about it. For some metrics it just works:

>>> import numpy as np
>>> import audmetric
>>> audmetric.unweighted_average_recall(['a', 'b'], ['a', 'a'])
0.5
>>> audmetric.unweighted_average_recall(['a', 'b'], ['a', np.NaN])
0.5

Others did strange stuff in master:

>>> audmetric.precision_per_class([0, 0, 2, 1], [0, 1, np.NaN, np.NaN])
{0: 1.0, 1: 0.0, 2: 0.0, nan: 0.0}

Now it would look like this:

>>> audmetric.precision_per_class([0, 0, 2, 1], [0, 1, np.NaN, np.NaN])
{0: 1.0, 1: 0.0, 2: 0.0}
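
A rough sketch of how this new behavior could come about (not the actual diff; it only reproduces the output above, assuming labels are inferred without NaN and a NaN prediction never matches any class):

import math

def precision_per_class(truth, prediction):
    # Hypothetical sketch, not the code from this PR:
    # infer class labels without NaN (mirroring the new
    # infer_labels() behavior), then compute precision per class
    labels = sorted(
        {
            v for v in list(truth) + list(prediction)
            if not (isinstance(v, float) and math.isnan(v))
        },
        key=str,
    )
    precision = {}
    for label in labels:
        # Truth values where this label was predicted;
        # NaN == label is always False, so NaN predictions drop out
        matched = [t for t, p in zip(truth, prediction) if p == label]
        precision[label] = (
            sum(t == label for t in matched) / len(matched)
            if matched else 0.0
        )
    return precision

>>> precision_per_class([0, 0, 2, 1], [0, 1, float('nan'), float('nan')])
{0: 1.0, 1: 0.0, 2: 0.0}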

@hagenw
Member Author

hagenw commented Dec 15, 2021

I see the use-case for supporting NaN. But does it make sense to have mixed types? Isn't that rather a sign of misuse? E.g. the user passes strings for truth but IDs for prediction. I wonder if it makes more sense to raise an error in that case.

I'm not so sure about this. If you have audformat with its fixed types in mind, maybe. But I would find it valid if, for example, the labels are IDs collected from users, and some users enter an integer while others enter a string.

@frankenjoe
Collaborator

frankenjoe commented Dec 15, 2021

I'm not so sure about this. If you have audformat with its fixed types in mind, maybe. But I would find it valid if, for example, the labels are IDs collected from users, and some users enter an integer while others enter a string.

But how do you map between strings and integers? Or do you assume they represent different classes? Why would a user do that? I still see the risk that integers and strings are accidentally mixed and the user will not notice it if we allow that.

@frankenjoe
Collaborator

frankenjoe commented Dec 15, 2021

We have not really thought much about it. For some metrics it just works:

I guess we either have to come up with a proper solution, or it would be safer to raise an error if we encounter NaN.
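
The stricter alternative suggested here could look roughly like this (a sketch only; the helper name is hypothetical):

import math

def assert_no_nan(values, name):
    # Fail fast on NaN instead of silently handling it
    for value in values:
        if isinstance(value, float) and math.isnan(value):
            raise ValueError(f'{name} contains NaN')

>>> assert_no_nan([0, float('nan')], 'prediction')
Traceback (most recent call last):
  ...
ValueError: prediction contains NaN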

@hagenw changed the title from "Add support for NaN and mixed types in infer_labels()" to "WIP: Add support for NaN and mixed types in infer_labels()" on Dec 15, 2021
@hagenw
Member Author

hagenw commented Dec 15, 2021

OK, I created #14 and set this pull request to WIP.

Development

Successfully merging this pull request may close these issues.

audmetric.utils.infer_labels() can fail for NaN