Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Vertebrate-level brain atlas, questions about evolutionary distanct integration #62

Open
DiracZhu1998 opened this issue Jun 17, 2024 · 4 comments

Comments

@DiracZhu1998
Copy link

Dear authors,

Thank you for giving us such a wonderful toolkit!
If I want to build an evolutionary distance atlas, should we use major cell type level as "cell_type" label?
Is there any other parameters you would recommend we could tune to make the atlas better since the default results doesn't integrated well.

Thank you for your help!

Best wishes,
Yuanzhen
Screenshot 2024-06-17 at 09 04 39

@DiracZhu1998
Copy link
Author

In addition, I checked but couldn't find any relevant code and parameter usage related to the frog-zebrafish integration in your paper. The Jupyter notebook you provided is not the version you generated for the paper, the graph and integration in your paper are great but frog-zebrafish with default parameters is not that good.

@Yanay1
Copy link
Collaborator

Yanay1 commented Jun 17, 2024

The jupyter notebook is the version used in the paper (same hyperparameters, the random seed will be slightly different but this shouldn't make a hude difference), how were the results different?

How are you judging how well the species are integrated? You should try transferring labels between species and measuring accuracy.

@DiracZhu1998
Copy link
Author

Hi Yanay, thank you for your quick response!
Probably you are right, I just compared them with naked eye. so not that accurate but looks quite different from your paper.
I assume that the same major clusters (cell types) from different species should be close to each other rather than separate.
I also tested for human and mouse whole brain atlas, It also doesn't integrated well.
Screenshot 2024-06-17 at 20 37 06
Screenshot 2024-06-17 at 20 36 14
Screenshot 2024-06-17 at 20 41 42

@DiracZhu1998
Copy link
Author

I checked about distance between your generated protein embeddings and mine, the corresponding genes had the lowest distance so no problem with the step of protein embedding.
The problem seems to be related to the scRNA and snRNA datasets, once I removed the snRNA human dataset and only integrated mouse and lizard (both are scRNA datasets), they integrated much better than before. I was wondering do you have some recommendations to give more "force" on integration to make snRNA human better integrate with other scRNA datasets, for example, maybe increasing the pretrain numbers? Many thanks!
Screenshot 2024-06-19 at 20 15 16

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants