Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

Difference between plants and animals, DIFFERENCE BETWEEN PLANTS  AND ANIM...

DIFFERENCE BETWEEN PLANTS  AND ANIMALS - S . N o . C h a ra c ter P l an ts A n i m al s 1 Nutrition

Define dietary and non dietary factors - causation of cancer, Define Dietar...

Define Dietary and Non Dietary Factors - Causation of Cancer? Several dietary and non-dietary factors (including genetics) can increase the risk in the causation of cancer. Som

Acellular and cellular organisms, Acellular and Cellular Organisms you...

Acellular and Cellular Organisms you should be able to characterise living and non- living matter. However, the boundary-line between the two is not very precise. As you have

Hygroscopic coeflicient-retention of soil moisture, Hygroscopic Coeflicient...

Hygroscopic Coeflicient To have a more complete picture of soil moisture relations, let us take a soil sample and oven dry it for 24 hours at 11 o C. Now if this soil is kept i

Cause of the immunodeficiency presented by aids patients, Q. What is the ca...

Q. What is the cause of the immunodeficiency presented by AIDS patients? The cause of the immunodeficiency obtainable by AIDS patients is the destruction of CD4 T helper lympho

Homologous proteins - amino acid sequence in fasta format, Please answer th...

Please answer the following question on Sequence Y: Protein databases 1. What sequence from which organism is this sequence most similar to? 2. Can you find any homo

using a two-sample t-test , After stinging its victim, the honeybee leaves...

After stinging its victim, the honeybee leaves behind the barbed stinger, poison sac, and muscles that continue to pump venom into the wound. A study compared two methods of removi

Define autoclave - food microbiology, Define Autoclave - Food Microbiology?...

Define Autoclave - Food Microbiology? For sterilization, steam under pressure is generally employed using an instrument called autoclave. Figure illustrates the autoclave. Auto

What are the hormones produced by the testicles, What are the hormones prod...

What are the hormones produced by the testicles and the ovaries? The testicles make androgenic hormones, the major of them being testosterone. The ovaries make estrogen and pro

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd