Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Addition of Deuterium to model vocab #145

Open
abazabaaa opened this issue Oct 10, 2024 · 2 comments
Open

Addition of Deuterium to model vocab #145

abazabaaa opened this issue Oct 10, 2024 · 2 comments

Comments

@abazabaaa
Copy link

Hi,

Thanks for putting together the code and documentation. I’ve been enjoying getting into the code and exploring with it a bit.

Our group has some compounds that have CD3 or CD2 in them and some of our models rely on those features. As far as I am aware REINVENT doesn’t tolerate isotopes. I wanted to see if you had gone done this road at all and maybe had a few thoughts. We would be happy to give a shot and contribute to the project — but didn’t want to rush into something you already abandoned.

Thanks for your time,
Tom

@halx
Copy link
Contributor

halx commented Oct 10, 2024

Hi,

many thanks for your interest in REINVENT and welcome to the community!

I was wondering why your models depend on deuterated compounds.

In preparation of all prior models we strip out all isotopes. The vocabulary is fixed after training and cannot be extended. This means that you would have to train a new model keeping [2H].

Cheers,
Hannes.

@abazabaaa
Copy link
Author

Some of our models are trained on compounds containing deuterium. Metabolism can be influenced by the substitution of CH for CD. Without accounting for D it would be hard to use those models with it. My colleague will follow up with a question about how best to modify the code. I think she has some ideas but we want to make sure we aren’t breaking anything.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants