Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clustering prechecks #7

Open
aretaon opened this issue Sep 4, 2023 · 1 comment
Open

Clustering prechecks #7

aretaon opened this issue Sep 4, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

@aretaon
Copy link
Contributor

aretaon commented Sep 4, 2023

Migrated from Gitea

In #2 a

submodule that checks if an object is usable for a certain kind of clustering

was requested.
So far, both HCA and KMeans require plain MxN numpy arrays without NaNs as input. This is checked in the init procedure.
This needs further clarification.

@aretaon aretaon added the enhancement New feature or request label Sep 4, 2023
@aretaon
Copy link
Contributor Author

aretaon commented Sep 4, 2023

As stated in the documentation (https://docs.scipy.org/doc/scipy/reference/generated/scipy.cluster.hierarchy.linkage.html) not all "linkage" models work with every "methode" (see Note 2.). The same is true for the kMeans clustering. So the question is if we include some warning for certain combinations.

The same question is true for other functions which rely on data transformation (e.g. vsn works only with log transformed values).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant