Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Normalize family names #22

Open
cthoyt opened this issue Mar 24, 2020 · 2 comments
Open

Normalize family names #22

cthoyt opened this issue Mar 24, 2020 · 2 comments

Comments

@cthoyt
Copy link
Member

cthoyt commented Mar 24, 2020

I'm not sure how many special cases you want to add in to Gilda, but there are lots of times when text comes with the suffix "family" like "RAS family" that would be easy to normalize to an appropriate term like fplx:RAS

I'm manually curating PID right now for bio2bel/bio2bel#27 in this spreadsheet that has some examples.

@bgyori
Copy link
Member

bgyori commented Mar 24, 2020

Yes, this could be handled in two ways: either when generating lookup strings, for an input like "RAS family", we could also look up "RAS". Alternatively, * family and * complex could be added to all FamPlex entries as synonyms.

@cthoyt
Copy link
Member Author

cthoyt commented Mar 24, 2020

Hmm i think adding family and complex into the synonyms would probably be the simplest and also most informative

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants