Human Genome Analysis Reflections : Analysis of SNP data using SNPedia and Promethease
Learning Objectives
- View information from a personal genetic test from the perspective of the rsID
- To understand the concepts of Magnitude and Repute in scoring the effect of a SNP
Background
SNP Analysis using SNPedia
In this lab we will look SNP data from the perspective of SNPedia. The project is described more fully in this publication SNPedia: a wiki supporting personal genome annotation, interpretation and analysis. SNPedia is a MediaWiki site with information on SNP data. Much of the information is reformatted from NCBI’s dbSNP database based on the Reference SNP ID (rs#). This is the rs# that you have already seen in the 23andMe data files you have been working with. “dbSNP contains population-specific frequency and genotype data, experimental conditions, molecular context, and mapping information for both neutral variations and clinical mutations”. The rs# cand be used to link to genome wide association studies(GWAS) and other information. SNPedia creates ‘rs-pages’ “defining individual SNPs with genotype-specific pages are primarily only created for variants that have significant medical or genealogical consequences based on published meta-analyses, studies of at least 500 patients or two or more independent studies (i.e. replicated findings), or other historic, statistical or medical significance.
The are two concepts implemented in SNPedia that are used to sort an individual’s personal genetic testing results. The first is Magnitude. Magnitude is a subjective measure given to a result by the curators of SNPedia. It generally varies from 0 to 10.
The second concept is Repute. This is a perspective applied to a (single) genotype, and it can be either Good, Bad, or Not Set (blank). “‘Good’ and ‘Bad’ are mostly self explanatory. However, it’s worth stating that many genotypes have a bit of good and a bit of bad. These and many other genotypes should remain ‘Not Set’. Sometimes that’s because they’re not distinctly Good or Bad (like eye color) or Ancestry, but usually it means no human has come along to do the classification yet. Often this is the right thing to do, because the published studies are still too new and unreplicated to be convincing.”
Generating of a personal genetic test using Promethease
Promethease builds a personal DNA report based on connecting a file of DNA genotypes in SNPedia. If have your data back you can upload it in Promethease (this will cost $12) or else you can use the Promethease results provided on Moodle. The Promethease site includes a short tutorial - and information on “How to read a Promethease Report”. You are not required to use this service for this class, but if you do you are welcome to include it in your final report.
Exercise
Here is are two example reports Person 1 report and Person 2 report.
Compare the Promethease report with your 23andMe or Ancestry reports. Include a discussion which results they present and the concepts of Magnitude and Repute. Do you agree with using these concepts in a genetic testing report? How might you change or create a different system to highlight particular results. Discuss not only medical related SNPs, but some of the popular ones listed in 23andMe and on the SNPedia page.There is no code to upload for this lab, but write the report in rMarkdown and knit to produce the html file to upload onto Moodle.
Note In 2019 MyHeritage bought Promethease and SNPedia - https://blog.myheritage.com/2019/09/myheritage-acquires-promethease-and-snpedia/. I assume the eventually plan is to incorporate them into the DNA reports they provide to their users, but I haven’t seen this feature released yet.