Inconsistent format representation for assembly graph edges #160

Open
anuradhawick opened this issue Dec 2, 2019 · 1 comment
Comments

@anuradhawick

In one of my experiments involving the assembly graph, I noticed that ED lines such as ED read2 read1 0 46 50 3 49 50 0 0 are not fully tab-delimited. Only the ED tag is followed by a tab; every other field is separated by spaces. This makes downstream analysis (especially in C or C++) a bit inefficient. I would appreciate it if this could be corrected to match the format definition, or alternatively documented in the wiki. It took a good amount of time to figure this out by relying only on the format specification. Keep up the good work! Cheers :)

@jts
Owner

jts commented Dec 2, 2019

This was intentional but probably not a good idea. The coordinates part (0 46 50 3 49 50) is an SGA object that has its own serialization function, which uses spaces as a delimiter. Fixing this would be too large a change, and this project is largely deprecated anyway. I suggest you use a GFA variant for your experiment; there are converters from ASQG to GFA, I think.

Jared
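
For anyone parsing these files as they are, one workaround is to split ED lines on any run of whitespace rather than on tabs alone. The Python sketch below is an illustration only and is not code from SGA. The field names in EdgeRecord are guesses at what the ten values mean, the helper names are hypothetical, and it assumes read IDs contain no whitespace.

```python
from typing import NamedTuple


class EdgeRecord(NamedTuple):
    # Field names are illustrative assumptions, not taken from the SGA source.
    id1: str
    id2: str
    start1: int
    end1: int
    len1: int
    start2: int
    end2: int
    len2: int
    is_rc: int
    num_diff: int


def parse_ed_line(line: str) -> EdgeRecord:
    """Parse an ED record whose fields mix tab and space delimiters."""
    # str.split() with no argument splits on any run of whitespace, so the
    # tab after "ED" and the spaces between the other fields are treated alike.
    fields = line.split()
    if len(fields) != 11 or fields[0] != "ED":
        raise ValueError(f"not an ED record with 10 fields: {line!r}")
    numbers = [int(value) for value in fields[3:]]
    return EdgeRecord(fields[1], fields[2], *numbers)


def ed_to_tab_delimited(line: str) -> str:
    """Rewrite an ED record so that every field is separated by a tab."""
    # Only meant for ED lines; other record types may carry values that
    # legitimately contain spaces.
    return "\t".join(line.split())
```

Running ed_to_tab_delimited over the ED lines once, before handing the file to a tab-based C or C++ parser, avoids having to special-case the delimiter there.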
