The goal of this assignment is to develop basic requirements for a data model fo
Question
The goal of this assignment is to develop basic requirements for a data model for sequence variation data. To limit the assignment, we will consider sequence variation data for inbred strains of model organisms, such as the laboratory mouse. Here is an example record from the Mouse Phenome Database for a set of genotypes for a SNP. Your data model would be used to store genotypes for any number of inbred strains for any number of SNPs. Keep in mind that designing a flexible system is desirable. For example, many strains have been genotyped for only some SNPs, while others have been genotyped for nearly all known SNPs.

Column headings in the record: dbSNP 142 functional; Chr location (GRCm38, mm10); RS number; dbSNP; Requested target; More info

Chr location | RS number | Alleles | Functional annotation | Genotypes (one call per strain)
2:129364986 | rs225862268 | A/C
2:129365041 | rs27447593 | A/G | Cs:Il1b:S:266 | GGGGGGAGGGGGGGGGGGAGGGGGGAGGGGGGAGGGG
2:129365098 | rs27447592 | C/T | Cs:Il1b:K:247 | C C C CCCCC CCC C C C CCCC CC CCC CC CC CCC T C CCCT C
2:129365101 | rs232919644 | A/G | Cs:Il1b:H:246 | G GG G G G GG G G G GG G G G G G G G G G G G G G G G G G G G G G G A G
2:129365197 | rs27447591 | C/T | Cs:Il1b:K:214 | TTT T T TCT T T T T TT T T TTCTT TTT TCTTT TCTCCTCT
2:129365272 | rs217285410 | A/G | U3:Il1b | A A A AA ACAAA A A A A A AA A CA AA A A A C AA A ACACAACA
Gene: Il1b

(30 points) What are the attributes you would define for each object in order to represent the information in the image above? In your answer, also think about which attributes are required for every record and which are optional as they could be missing. Keep in mind that the system you are designing would ideally work with any number of SNPs from any number of strains. Please list the attributes as a table with the name of the object at the

Explanation / Answer
2:129364986
Attribute names
Gene symbol
Gene coordinates
rs number
Non-synonymous
2:129365041
Attribute names
Gene symbol
Gene coordinates
rs number
2:129365098
Attribute names
Gene symbol
Gene coordinates
rs number
2:129365101
Attribute names
Gene symbol
Gene coordinates
rs number
2:129365197
Attribute names
Gene symbol
Gene coordinates
rs number
2:129365272
Attribute names
Gene symbol
Gene coordinates
rs number
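The attribute lists above can be turned into a small data model. The following is only a minimal sketch in Python, not a definitive design: the class names Snp and GenotypeStore, the choice of chromosome and position as the only required attributes, and the strain names strain_1 and strain_7 are all assumptions made for illustration (the strain names in the original image are not available in the text). Optional attributes default to None so that a record such as 2:129364986, which shows no functional annotation and no genotypes, can still be stored.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class Snp:
    # Assumed required for every record: where the SNP is.
    chromosome: str
    position: int  # GRCm38/mm10 coordinate
    # Assumed optional: any of these may be missing in a record.
    rs_number: Optional[str] = None
    alleles: Optional[Tuple[str, ...]] = None
    functional_annotation: Optional[str] = None
    gene_symbol: Optional[str] = None

    @property
    def location(self) -> str:
        """Key in the same 'chr:pos' form used in the record."""
        return f"{self.chromosome}:{self.position}"


@dataclass
class GenotypeStore:
    """Hypothetical container: any number of SNPs, any number of strains."""
    snps: Dict[str, Snp] = field(default_factory=dict)
    # (location, strain name) -> observed allele; absent key = not genotyped
    calls: Dict[Tuple[str, str], str] = field(default_factory=dict)

    def add_snp(self, snp: Snp) -> None:
        self.snps[snp.location] = snp

    def add_call(self, location: str, strain: str, allele: str) -> None:
        if location not in self.snps:
            raise KeyError(f"unknown SNP: {location}")
        self.calls[(location, strain)] = allele

    def get_call(self, location: str, strain: str) -> Optional[str]:
        # None means this strain has no genotype for this SNP.
        return self.calls.get((location, strain))

    def strains_for(self, location: str) -> List[str]:
        return sorted(s for (loc, s) in self.calls if loc == location)


store = GenotypeStore()
# SNP with no annotation and no genotypes in the example record.
store.add_snp(Snp("2", 129364986, rs_number="rs225862268", alleles=("A", "C")))
# SNP with annotation; strain names below are placeholders, not real strains.
store.add_snp(Snp("2", 129365041, rs_number="rs27447593", alleles=("A", "G"),
                  functional_annotation="Cs:Il1b:S:266", gene_symbol="Il1b"))
store.add_call("2:129365041", "strain_1", "G")
store.add_call("2:129365041", "strain_7", "A")
```

Storing calls sparsely, keyed by SNP and strain, is one way to handle strains genotyped for only some SNPs: a missing key simply means "not genotyped", so no placeholder values are needed.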