Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

Name the three main arthropod classes, How are the three main arthropod cla...

How are the three main arthropod classes characterized according to the body division? In crustaceans and arachnids the head is fused with the thorax forming the cephalothorax.

What are the major negative ions found in living beings, Q. What are the ...

Q. What are the major negative ions found in living beings? The main anions found in living beings are the chlorine anion (Cl - ), the sulphate anion (SO - ), the nitrate ani

Neuropsychological understanding of behavioural deficits, Neuropsychologica...

Neuropsychological understanding of behavioural deficits Behaviour is an outcome of the interaction of the brain with the environment. A composite of multiple psychological pro

Explain the rotary dryers, Explain the Rotary Dryers? Rotary dryers are...

Explain the Rotary Dryers? Rotary dryers are widely used to dry relatively large throughputs of granular products and by products in a number of industries, including the food

What is the phototropism, What is the phototropism? The Phototropism is...

What is the phototropism? The Phototropism is the movement of plant structures in response to light. The Phototropism may be negative or positive. The Positive phototropism is

Risk analysis - animal studies, Animal studies Most toxicological data ...

Animal studies Most toxicological data for risk assessment are derived from the animal studies and  it  is, therefore, essential that these studies be conducted following widel

What is congenital mitral stenosis, What is Congenital Mitral Stenosis ? ...

What is Congenital Mitral Stenosis ? The onset of symptoms or signs depends on severity of the stenosis, severe the stenosis earlier the presentation. With significant MS, the

Mechanism of enzyme action, Mechanism  of Enzyme Action  The action of...

Mechanism  of Enzyme Action  The action of enzymes is to lower  the activation energy  or threshold  of their substrates which. Therefore  become  activated and   react  with

Db is a standard abbreviation, DB is a standard abbreviation used for the q...

DB is a standard abbreviation used for the quantitative expression of: 1. the density of bacteria in a medium 2. a particular pollutant 3. the dominant Bacillus in a cultu

Describe st segment depression, Q. Describe ST Segment Depression? The ...

Q. Describe ST Segment Depression? The development of ST-segment depression with exercise is probably the most reliable sign if myocardial ischaemia and appears to be the most

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd