Question

Use the PopSet (population study data sets) database in ENTREZ to retrieve the coding sequence (CDS) of the amylase-related gene (amyrel) generated for Drosophila yakuba by Cariou et al. (2001). You should retrieve 5 Drosophila yakuba sequences from different population strains. ***Make sure all your sequences are Drosophila yakuba.***

(a) Report the GenBank accession numbers.

(b) Align the coding regions of the 5 sequences. Which alignment software did you use?

(c) Look at the first 200 bases of the alignment. Count the number of segregating sites (K), the number of segregating sites per site (k), and the nucleotide diversity (π).

Explanation / Answer

Go to https://www.ncbi.nlm.nih.gov/Web/Search/entrezfs.html and select ENTREZ.

Then open https://www.ncbi.nlm.nih.gov/gquery/ and select PopSet.

Enter amyrel in the search box.

The record https://www.ncbi.nlm.nih.gov/popset/12620152 contains the Cariou et al. (2001) amyrel sequences for different strains of Drosophila yakuba.

(a)

1) GenBank: AF280878.1 : Drosophila yakuba strain LO4

2) GenBank: AF280877.1 : Drosophila yakuba strain LBV2 clone 4

3) GenBank: AF280876.1 : Drosophila yakuba strain LBV2 clone 1

4) GenBank: AF280875.1 : Drosophila yakuba strain SA3 clone 8

5) GenBank: AF280874.1 : Drosophila yakuba strain SA3 clone 6
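
The five records listed above could also be downloaded in one request rather than one page at a time. The following is a minimal sketch, assuming the NCBI E-utilities EFetch endpoint and its `fasta_cds_na` return type (which returns the annotated CDS features of a nucleotide record). The function names and the output file name are hypothetical, and if a record carries no CDS annotation, `rettype="fasta"` would return the full sequence instead.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# NCBI E-utilities EFetch endpoint
EFETCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"

# Accession numbers reported in part (a)
ACCESSIONS = ["AF280878.1", "AF280877.1", "AF280876.1",
              "AF280875.1", "AF280874.1"]


def efetch_url(accessions, rettype="fasta_cds_na"):
    """Build an EFetch URL requesting the given nucleotide records."""
    params = {
        "db": "nuccore",               # GenBank nucleotide database
        "id": ",".join(accessions),    # comma-separated accession list
        "rettype": rettype,            # CDS nucleotide FASTA by default
        "retmode": "text",
    }
    return EFETCH + "?" + urlencode(params)


def fetch_cds_fasta(accessions):
    """Download the records as FASTA text (requires network access)."""
    with urlopen(efetch_url(accessions)) as response:
        return response.read().decode("utf-8")


# Example use (hypothetical file name; needs an internet connection):
#   fasta = fetch_cds_fasta(ACCESSIONS)
#   with open("amyrel_yakuba_cds.fasta", "w") as handle:
#       handle.write(fasta)
```

The resulting multi-FASTA text can be pasted or uploaded directly into an alignment tool. The organism of each record should still be checked on its GenBank page, since the sketch does not verify it.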

(b) Download the FASTA sequence of each strain and align them with the multiple sequence alignment tool Clustal Omega:

https://www.ebi.ac.uk/Tools/msa/clustalo/
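
For part (c), once the aligned sequences are available, the three statistics can be computed from the first 200 alignment columns. The sketch below is one possible way to do it, not a reference implementation. It takes K as the number of columns with more than one base, k as K divided by the number of sites analyzed, and π as the mean number of pairwise differences divided by the number of sites analyzed. Skipping any column that contains a gap or an ambiguous base is an assumption made here, the function name is hypothetical, and the toy sequences are made up for illustration (they are not amyrel data).

```python
from itertools import combinations


def diversity_stats(aligned, window=200):
    """Return K, k and pi for the first `window` columns of an alignment."""
    seqs = [s[:window].upper() for s in aligned]
    if len(seqs) < 2 or len({len(s) for s in seqs}) != 1:
        raise ValueError("need at least two aligned sequences of equal length")

    # Assumption: analyze only columns where every sequence has A, C, G or T
    columns = [col for col in zip(*seqs) if set(col) <= set("ACGT")]
    n_sites = len(columns)
    if n_sites == 0:
        raise ValueError("no gap-free columns to analyze")

    # K: columns in which at least two different bases occur
    K = sum(1 for col in columns if len(set(col)) > 1)

    # Mean number of differences over all pairs of sequences
    pairs = list(combinations(range(len(seqs)), 2))
    total_diffs = sum(col[i] != col[j] for col in columns for i, j in pairs)
    mean_pairwise = total_diffs / len(pairs)

    return {"K": K, "k": K / n_sites, "pi": mean_pairwise / n_sites,
            "sites": n_sites}


# Toy alignment (invented data): 5 sequences, 10 columns, 2 variable columns
toy = ["ACGTACGTAC",
       "ACGTACGTAC",
       "ACGAACGTAC",
       "ACGAACGTCC",
       "ACGTACGTAC"]
stats = diversity_stats(toy)
print(stats)
```

With the real data, the five aligned sequences from Clustal Omega would be passed in place of `toy`, keeping the default `window=200`.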

Results: