Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

Explain night blindness - micronutrient deficiencies, Explain Night Blindne...

Explain Night Blindness - micronutrient deficiencies? Night blindness is the earliest symptom of Vitamin 'A' deficiency. The reduction in the supply of vitamin A aldehyde i.e.

Define natural adulteration - types of adulteration, Define Natural Adulter...

Define Natural Adulteration - Types of Adulteration? These are the chemicals, organic compounds or radicals which are naturally present in the food and are harmful to the healt

Define food quality factors and their measurement, Define Food quality fact...

Define Food quality factors and their measurement? Appearance; textural, flavour, nutritional, sanitary and keeping factors; quality standards, objective and organoleptic evalu

Pathophysiology of aortic regurgitation, Q. Pathophysiology of aortic regur...

Q. Pathophysiology of aortic regurgitation? Left ventricle responds to chronic aortic regurgitation by chamber dilatation and an increase in its compliance so that end diastoli

How coacervates have facilitated emergence of life on earth, How could coac...

How could coacervates have facilitated the emergence of life on earth? Coacervates probably provided a nitid separation between an internal and an external environment and thus

Water, Water Water is the most important constituent of all living tis...

Water Water is the most important constituent of all living tissue. It forms up to 95% of the fresh weight of some animals. We all know that water is lost through sweat, excre

Why does the recombination frequency of genes vary, Why does the recombinat...

Why does the recombination frequency of genes vary with the distance between them in the chromosome? The farther the distance among the loci of two genes in a chromosome the hi

Determine the purpose of glucometer, Glucometer mainly for the following pu...

Glucometer mainly for the following purpose. (i) In acute and chronic care facilities at the patient's bedside, in clinics or hospitals. (ii) In physician's clinics. (iii

Explain whipping foam method, Explain Whipping foam method? Whipping is...

Explain Whipping foam method? Whipping is the most common method used as it forms bubbles by cutting the surface and introducing air into liquid.  Repeated action makes the

What is mitosis, What is mitosis? What is the importance of mitosis? Mi...

What is mitosis? What is the importance of mitosis? Mitosis is the process in which one eukaryotic cell separates into two cells identical to the parent cell (generally identic

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd