发现搜索引擎是个神奇的东西,偶然想起的关键词一搜索,获得的就是意想不到的结果,我以imputation+qc搜索,就找到了snpQT(发音Snip Cute)这样一个神奇的工具/流程。这个流程的目的是让你的SNP cute,为处理人类基因变异提供了帮助:
如是你有如下的想法,它也许是有用的:
好像并没有发表在好的杂志上,康奈尔大学团队做的。Vasilopoulou C, Wingfield B, Morris AP and Duddy W. snpQT: flexible, reproducible, and comprehensive quality control and imputation of genomic data [version 1; peer review: 2 approved with reservations]. F1000Research 2021, 10:567 (https://doi.org/10.12688/f1000research.53821.1)
GPL3,流程离不开以下第三方软件,看起来还版本还挺新的:
Software | Version | Reference | License |
---|---|---|---|
EIGENSOFT | 7.2.1 | Price, Alkes L., et al. "Principal components analysis corrects for stratification in genome-wide association studies." Nature genetics 38.8 (2006): 904-909. | Custom open source |
impute5 | 1.0 | Rubinacci, Simone, Olivier Delaneau, and Jonathan Marchini. "Genotype imputation using the positional burrows wheeler transform." PLoS Genetics 16.11 (2020): e1009049.APA | Academic use only |
nextflow | 20.10.0 | Di Tommaso, Paolo, et al. "Nextflow enables reproducible computational workflows." Nature biotechnology 35.4 (2017): 316-319. | GPL3 |
picard | 2.24.0 | MIT | |
PLINK | 1.90b6.18 | Purcell, Shaun, et al. "PLINK: a tool set for whole-genome association and population-based linkage analyses." The American journal of human genetics 81.3 (2007): 559-575. | GPL3 |
PLINK2 | 2.00a2.3 | Chang CC, Chow CC, Tellier LCAM, Vattikuti S, Purcell SM, Lee JJ (2015) Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience, 4. | GPL3 |
samtools | 1.11 | Danecek, P.et al.(2021). Twelve years of SAMtools and BCFtools.GigaScience,10(2), 1–4 | MIT |
shapeit4 | 4.1.3 | Delaneau, Olivier, et al. "Accurate, scalable and integrative haplotype estimation." Nature communications 10.1 (2019): 1-10. | MIT |
snpflip | 0.0.6 | https://github.com/biocore-ntnu/snpflip | MIT |
还用了R语言, tidyverse包等。怎样,有没有兴趣用起来呀?