Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

Explain the characteristics of agar- agar, Explain the Characteristics of A...

Explain the Characteristics of Agar- Agar? There are many characteristics, which make agar-agar, an ideal solidifying agent. These are: (a) It has high gel strength, so it c

Metabolic reactions in the hmp pathway, Metabolic Reactions  in the HMP Pa...

Metabolic Reactions  in the HMP Pathway The hexose monophosphate pathway is responsible for  the generation of a substantialfraction of the cytoplasmic NADPH required  for bios

Lyme disease, Lyme disease It is a metazoonoses caused by a spirochaete, B...

Lyme disease It is a metazoonoses caused by a spirochaete, Borrelia burgdorferi. The disease based on the specific symptoms, is also known as erythema migrans (ECM) or lyme  arthr

What is genetic engineering, Q. What is genetic engineering? Genetic en...

Q. What is genetic engineering? Genetic engineering, It is also known as molecular cloning or gene cloning, is artificial recombination of nucleic acid molecules in a test tube

How did darwin reach the principle of natural selection, How did Darwin rea...

How did Darwin reach the principle of natural selection from the observation of differences among individuals of the same species? The Darwin recognized that in a same species

Define prevalence and incidence for anorexia nervosa, Define Prevalence and...

Define Prevalence and Incidence for anorexia nervosa? The disorder occurs most commonly in adolescent girls and young women, but adolescent boys and young men may be affected m

Polyphyletic theory - metazoa, Polyphyletic Theory - Metazoa This theo...

Polyphyletic Theory - Metazoa This theory was suggested by Greenheig (1959) and some other workers. According to this theory, sponges, coelenterates, ctenophores and flatworms

Human impact on nitrogen cycle , Human Impact on Nitrogen Cycle Human...

Human Impact on Nitrogen Cycle Human activities are profoundly affecting the cycling of nitrogen in nature. Over 30 x 10 6 metric tons/yr. of N 2 is fixed in the commercial

Cardiac branches and cardiac plexuses, The vagus gives of one superior cerv...

The vagus gives of one superior cervical cardiac branch and an inferior cervical cardiac branch in the neck. These branches descend into the thorax and take part in performing the

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd