Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

Tissue Cells, How is an Epithelial cell made?

How is an Epithelial cell made?

Explain the classes of enzymes, System of classification The Enzyme Com...

System of classification The Enzyme Commission divided enzymes into 6 main classes, on the basis of the total reaction catalyzed. Each enzyme was assigned a code number; consis

What are examples of the ecological and economic, What are examples of the ...

What are examples of the ecological and economic importance of molluscs? Molluscs are significant players in several food chains in ecosystems. Many marine molluscs are part of

Explain adaptive radiation, Adaptive radiation: The growth of a variety of...

Adaptive radiation: The growth of a variety of species from the single ancestral form; occurs when a new habitat becomes accessible to a population. The evolutionary pattern of di

State the term cercaria, State the term Cercaria? A stage in the life c...

State the term Cercaria? A stage in the life cycle of trematode flukes. Cercaria develops from redia found in intermediate host. This tadpolelike organism is released from inte

What is the pigment responsible for this perception, What are the plant org...

What are the plant organs responsible for the perception of light variation? What is the pigment responsible for this perception? Leaves are mostly responsible for perception o

Function of adenosine in brain, Q. Function of Adenosine in brain? Aden...

Q. Function of Adenosine in brain? Adenosine has four different receptor subtypes (A1, A2A, A2B and A3). Adenosine A2A receptors are concentrated in striatum. Adenosine recepto

Important conditions for malabsorption syndrome, Q. Important conditions fo...

Q. Important conditions for malabsorption syndrome? Let us now discuss a few important conditions grouped collectively under the term of malabsorption syndrome. These are:

Evaluation of heart failure, The initial evaluation of new onset heart fail...

The initial evaluation of new onset heart failure should include an electrocardiogram, chest radiograph, and B-type natriuretic peptide assay. The cardiac rhythm may be normal sinu

Dfine integrated child development services (icds), Dfine Integrated Child ...

Dfine Integrated Child Development Services (ICDS)? The adolescent girl scheme under ICDS intends to cover school dropout girls, 11-18 years in age with a view to meet their ne

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd