Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

Animal kingdom, i need information about speen of animals and stuff like th...

i need information about speen of animals and stuff like that for example a cheetah speed

Exercise- heat production, Exercise- Heat Production During physical ...

Exercise- Heat Production During physical activity heat production by exercise can to some extent substitute for heat generated by shivering. However, exercise also tends to

Mechanism of cleavage, Mechanism of Cleavage Such as the mitotic divi...

Mechanism of Cleavage Such as the mitotic division in any cell, cleavage is the result of two events: mitotic nuclear division (Karyokinesis) followed via cytoplasmic divisio

Calcium requirements of school children and adolescents, Determine Calcium ...

Determine Calcium requirements of school children and adolescents? Calcium requirements in children and adolescents can be calculated on the basis of Calcium accretion during g

Cause of the immunodeficiency presented by aids patients, Q. What is the ca...

Q. What is the cause of the immunodeficiency presented by AIDS patients? The cause of the immunodeficiency obtainable by AIDS patients is the destruction of CD4 T helper lympho

Which is right of the resting membrane potential, Which is true of the rest...

Which is true of the resting membrane potential? A. It requires few ions to be distributed unevenly. B. It has the same value in all cells C. Only nerve and muscle cells h

Implantation - pre-embryonic development, Implantation - Pre-Embryonic Deve...

Implantation - Pre-Embryonic Development After entering the uterus and formation of ICM, the blastocyst starts to embed in the endometrium of the uterine wall. By one week aft

1, what are the variations of the digestive system in animals? what musthav...

what are the variations of the digestive system in animals? what musthave caused these variation

Explain radiology, Radiology is a medical specialty that employs the use of...

Radiology is a medical specialty that employs the use of imaging to both diagnose and treat disease visualised within the human body. Radiologists use an array of imaging technolog

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd