A tool to select tagging-SNPs (tSNPs). To comprehensively test the role of a candidate gene in an association study the selection of informative SNPs is paramount. Specifically, it is important to select tSNPs that represent a large portion (>90%) of the common genetic variation of a gene. PCAtag performs tSNP selection using principal component analysis (PCA) as described in Horne and Camp (2004). The advantage of PCA analysis for tSNP selection is that LD groups do not need to be contiguous and can be overlapping. This flexible framework does not impose over-simplified assumptions on the genetic architecture structure, and likely fits reality much better.
- Manuscripts (Horne and Camp, 2004, Naiman et al, 2010)
- Code
- Documentation
Affiliations
- Department of Internal Medicine
- Division of Hematology and Hematologic Malignancies
- Cancer Control and Population Sciences Cancer Center Program
- Breast and Gynecologic Cancers Center
- Department of Human Genetics
- Department of Biomedical Informatics
- Department of Family and Preventive Medicine
- Utah Population Database
- Molecular Biology and Biochemistry Program
- Biostatistics MSTAT Program