Examine multiple variation parameters for a genomic region, Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNA sequences for a genomic region. See below for how to count SNP variation. The output file (Your_name_snp.txt) should have two columns of numbers. The first column gives the total number of SNP sites per species, and the second gives the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNA sequences for a genomic region. The output file (Your_name_in_del.txt) should have two columns of numbers. The first column gives the total number of in-del sites per species, and the second gives the percent of sequences/species having that same number of in-dels.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNA sequences for a genomic region. The output file (Your_name_both.txt) should have two columns of numbers. The first column gives the total number of variant sites (SNP and in-del) per species, and the second gives the percent of sequences/species having that same number of variant sites. This will generate the same data used for the figure on page 3.
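The two-column output described in the three tasks can be sketched as follows. This is a minimal sketch, not the assignment's reference solution: the function names `count_distribution` and `write_two_columns`, the tab separator, and the two-decimal percent format are all assumptions, and the input is simply a list holding one variant count per sequence, however those counts were obtained.

```python
import sys
from collections import Counter


def count_distribution(counts):
    """Return sorted (count, percent) rows for a list of per-sequence counts.

    'percent' is the share of sequences that have exactly that count.
    """
    if not counts:
        return []
    tally = Counter(counts)
    total = len(counts)
    return [(value, 100.0 * freq / total) for value, freq in sorted(tally.items())]


def write_two_columns(handle, rows):
    """Write rows as two tab-separated columns (separator is an assumption)."""
    for value, percent in rows:
        handle.write(f"{value}\t{percent:.2f}\n")


# Example input: the Seq1-versus-each-sequence totals from the sample alignment.
rows = count_distribution([0, 3, 4, 7, 8])
write_two_columns(sys.stdout, rows)
```

Passing an open file handle for Your_name_both.txt instead of `sys.stdout` would produce the output file; the same two functions would serve for the SNP-only and in-del-only files.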

Sample Alignment: 48 bases (differences were highlighted in the original)

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation: Compare Seq1 to Seq2, Seq3, Seq4, and Seq5 and count the differences (SNPs and in-dels).

Seq1:Seq1 = 0 changes
Seq1:Seq2 = 3 changes
Seq1:Seq3 = 4 changes
Seq1:Seq4 = 7 changes
Seq1:Seq5 = 8 changes
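The Seq1 comparison above can be reproduced with a short column-by-column count. This is a sketch under stated assumptions: the function name `count_differences` is hypothetical, a column where exactly one sequence has `-` is treated as one in-del site, and any other mismatching column is treated as one SNP. A gap spanning several columns would therefore count once per column, which the source does not confirm or rule out.

```python
# Sample alignment copied from the text above.
SEQS = {
    "Seq1": "ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC",
    "Seq2": "AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC",
    "Seq3": "AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC",
    "Seq4": "AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC",
    "Seq5": "AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC",
}


def count_differences(a, b, gap="-"):
    """Return (snps, indels) for two aligned sequences of equal length."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    snps = indels = 0
    for x, y in zip(a, b):
        if x == y:
            continue          # identical column (including gap vs gap)
        if x == gap or y == gap:
            indels += 1       # exactly one side is a gap
        else:
            snps += 1         # two different bases
    return snps, indels


for name, seq in SEQS.items():
    snps, indels = count_differences(SEQS["Seq1"], seq)
    print(f"Seq1:{name} = {snps + indels} changes ({snps} SNP, {indels} in-del)")
```

Keeping the SNP and in-del counts separate means the same pass can feed all three output files.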

Repeat using each of the other sequences as the basis for comparison:

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes
Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 change
Seq2:Seq3 = 1 change                   Seq3:Seq3 = 0 changes
Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes
Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes
Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes
Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes
Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 change
Seq4:Seq5 = 1 change                   Seq5:Seq5 = 0 changes
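The all-against-all comparison can be sketched as a small matrix of total changes. The helper name `total_changes` is hypothetical, and the sketch assumes that every mismatching column (SNP or in-del) counts as one change, which is consistent with the numbers listed above.

```python
# Sample alignment copied from the text above, in order Seq1..Seq5.
seqs = [
    "ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC",
    "AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC",
    "AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC",
    "AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC",
    "AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC",
]


def total_changes(a, b):
    """Count alignment columns where the two sequences differ."""
    return sum(1 for x, y in zip(a, b) if x != y)


# matrix[i][j] is the number of changes between sequence i+1 and sequence j+1.
matrix = [[total_changes(a, b) for b in seqs] for a in seqs]

for i, row in enumerate(matrix, start=1):
    print(f"Seq{i}", *row, sep="\t")
```

The matrix is symmetric, so a real data set with many sequences would only need to compute half of it.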

 

The input file is a FASTA-format file containing all sequences/species, previously aligned and trimmed. The file contains some odd characters, so the program will have to handle them.
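Reading the aligned FASTA input might look like the sketch below. The source does not say which odd characters occur, so everything about their handling here is an assumption: the sketch treats anything other than A, C, G, T, N, and `-` as unexpected and replaces it with `N`, so that alignment columns keep their positions. The function name `read_fasta` and its parameters are hypothetical.

```python
import io

VALID = set("ACGTN-")  # assumed alphabet; the source does not define it


def read_fasta(handle, valid=VALID, placeholder="N"):
    """Parse FASTA text into {name: sequence}, masking unexpected characters."""
    records = {}
    name = None
    for raw in handle:
        line = raw.strip()  # also drops stray carriage returns
        if not line:
            continue
        if line.startswith(">"):
            fields = line[1:].split()
            name = fields[0] if fields else f"seq{len(records) + 1}"
            records[name] = []
        elif name is not None:
            compact = "".join(line.split()).upper()  # remove inner whitespace
            # Replace rather than delete, so column positions are preserved.
            records[name].append(
                "".join(ch if ch in valid else placeholder for ch in compact)
            )
    return {key: "".join(parts) for key, parts in records.items()}


# Small in-memory example with a Windows line ending and two stray symbols.
example = io.StringIO(">Seq1 demo\nATGC\r\natg*\n>Seq2\nA-G?\n")
print(read_fasta(example))
```

If the odd characters turn out to be something else (for example, ambiguity codes that should be kept), only the `VALID` set would need to change.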



All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd