-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to obtain the RS ID from Minimac4 results #34
Comments
You need to add |
Hi Jonathon, |
I'm assuming you are referring to the 1000 genomes panel. This panel does not have RS IDs. You should be able to get them from the 1000 genomes VCFs on the 1000 genomes FTP site. |
I used the Minimac3 to Convert 1000 genomes panel VCF to M3VCF, I can not find the RS IDs, only obtain Chr:pos as SNP |
If the RS IDs exist in the VCF but not the M3VCF, then I would suggest using https://github.com/Santy-8128/m3vcftools to compress to M3VCF. This tool will copy over the ID column. I don't know whether the VCFs on our site include RS IDs, but the VCFs on the 1000 genomes site do. |
Hi, |
Hi all! We have a similar problem with missing IDs in the imputation output. We use minimac3 v.2.0.1 with the We checked the msav file with the |
@steffenom, thanks for reporting this. The earlier conversation was regarding v4.0.x. You are using v4.1.x, and this feature was missing from the new version. I just pushed a fix to the master branch. Please try the latest from the master branch to generate a new msav file. |
Hi @jonathonl, Minor drawback is that now all variants have an ID. The ones that don't have an ID in the reference panel now have an ID given by CHR:POS, but that is not a problem for us. Might be unexpected for other users. |
For the IDs with CHR:POS, are these variants that exist only in the target file (not in reference)? If using --all-typed-sites, IDs for such variants are carried over from the target VCF instead of the reference panel. If the variant exists in the reference panel and has a missing ID in the reference panel, then the ID for that variant should also be missing in the imputed results. |
No, for me all variants without an ID in the initial reference panel have the CHR:POS ID in the final output (without using |
I see. FYI, you can generate an msav directly from a VCF, BCF, or SAV file with |
Thanks for the hint! I tried Should the results of the imputation with a reference panel created with |
There may be a small difference with smaller reference panels (tens of thousands of samples). By default, |
input.vcf.gz has 1052764 chromosome 20 variants (rows). The code line where I'm kicked out is: Minimac4/src/unique_haplotype.cpp Line 211 in 99ce06c
It also failed for 4, 14, 15 - all other chromosomes works. |
@buegelbeatz , this should be fixed with 6f9f140 |
Just tested it - works now! - Thanx for the quick fix. |
Hi,
How to obtain the RS ID from Minimac4 results? Thanks a lot!
Best,
Bo
The text was updated successfully, but these errors were encountered: