Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

Decomposers, 5 beneficial effects of decomposers and 2 harmful effects

5 beneficial effects of decomposers and 2 harmful effects

What is the name of the terminal portion of the axon, What is the name of t...

What is the name of the terminal portion of the axon? The terminal portion of the axon is known presynaptic membrane. Through this membrane neurotransmitters are released into

Urea, #questionwhy is urea the major nitrogenous excretory product..

#questionwhy is urea the major nitrogenous excretory product..

Hormones secreted by adrenal gland, Hormone s . Cortical steroids (cortico...

Hormone s . Cortical steroids (corticoids-hormones of adrenal cortex) are grouped into three catagories : mineralocorticoids, glucocorticoids and gonadocorticoids. (i) Mine

What are the risk factors of cataractogenesis, What are the risk factors of...

What are the risk factors of cataractogenesis? Risk Factors of Cataractogenesis: a. Hereditary b. Exposure to ultra violet radiation: The ultra violet radiation of 290

Explain about the osteomalacia - deficiency of vitamin d, Explain about the...

Explain about the Osteomalacia - Deficiency of Vitamin D? It occurs when there is a lack of vitamin D and calcium, in women who have had many pregnancies, who subsist on a meag

Characteristic features of coelom, Characteristic Features of Coelom ...

Characteristic Features of Coelom Sensory system consisting of eyes, photoreceptor cells, statocysts, taste buds and tactile organs. Respiration by skin, gills or par

Human physiology, which bones forms lhe non moving muscle attachment in

which bones forms lhe non moving muscle attachment in

Transpiration, Most of the water taken up by a plant passes through it and ...

Most of the water taken up by a plant passes through it and is evaporated to the atmosphere.What use is made of the tiny fraction of this water which is retained by the plant?

Coarse fish reach or lowland course zone, Coarse fish reach or lowland cour...

Coarse fish reach or lowland course zone This zone corresponds to the lower course of the river. Here the river is deep and slow moving. Its sluggish flow results in the depos

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd