Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

Define recommended dietary allowance for folate (rda), Define Recommended D...

Define Recommended Dietary Allowance for Folate (RDA)? Folate requirements are the intake levels necessary to prevent deficiency with clinical symptoms. The requirements are ex

What is isonietric exercise in dynamic auscultation, What is Isonietric Exe...

What is Isonietric Exercise in dynamic auscultation ? Can be carried out by doing handgrip exercise which is sustained over 20-30 second. It results in transient increase in SV

Photosynthesis carbon dioxide, Q. Why is it said that during photosynthesis...

Q. Why is it said that during photosynthesis carbon dioxide is improved to form glucose? During photosynthesis carbon dioxide is energetically improve with hydrogen from water.

Anatomy of the brain, In order to be able to study the neural basis of perc...

In order to be able to study the neural basis of perceptual and cognitive functions, it is necessary to have some knowledge of brain anatomy (i.e., what the names of different brai

Major conventional symbols & signs used in genetic family, What are the maj...

What are the major conventional symbols and signs used in genetic family trees? In the genetic family trees the male sex is usually represented by a square and the female by a

Endosperm with micropylar haustorium, Endosperm with micropylar Haustorium ...

Endosperm with micropylar Haustorium A very prominent and aggressive micropylar haustorium is seen in Impatiens. Here the division of the primary endosperm nucleus is followed

What are enzyme cofactors, What are enzyme cofactors? Some enzymes requ...

What are enzyme cofactors? Some enzymes require other associated molecules to work. These molecules are known as enzyme cofactors and they can be, for example, organic ions lik

Importance of intestinal bacterial synthesis - vitamin k, Define Importance...

Define Importance of Intestinal Bacterial Synthesis as a Source of Vitamin K? Intestinal microflora synthesizes large amounts of menaquinones, which are potentially available a

Determine improper fit at the abutment - implant interface, Improper fit at...

Improper fit at the abutment/- implant interface It is very vital that the fit of the abutment is crosschecked radiographically prior to final delivery of the prosthesis. The m

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd