Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

What is the general cognitive ability, What is the General Cognitive Abilit...

What is the General Cognitive Ability General cognitive ability is usually assessed using standardised intelligence tests, such as Wechsler Intelligence Scale for Children Thir

What are the symptoms and signs of pericardial effusion, Q. What are the Sy...

Q. What are the Symptoms and Signs of pericardial effusion? Asymptomatic: Slowly accumulating small to moderate pericardial effusion may not cause significant elevation of intr

Explain threaded implants, Q. Explain threaded implants? Cylindrical no...

Q. Explain threaded implants? Cylindrical non-threaded implants poorly distribute compressive forces and generate shears forces that may fragment and break the bone surrounding

Microbiology quiz question, Each of the following are true of antimicrobial...

Each of the following are true of antimicrobial therapeutic drugs except A. Non-toxic to host B. Easily broken down by host C. Easily administered D. Limited capacity to elicit

Substances that have to be excreted from the body, Name four substances tha...

Name four substances that have to be excreted from the body. Substances that have to be excreted from the body are:- A) Carbon dioxide, B) urea, C) uric acid, D) s

Explain the term of monosaccharides, Explain the term of Monosaccharides? ...

Explain the term of Monosaccharides? Monosaccharides :  The simplest carbohydrates are Monosaccharides, simple sugars. These generally have three to six carbon atoms, and con

What is nadp and nadph, What is NADP and NADPH? NADP is the abbreviatio...

What is NADP and NADPH? NADP is the abbreviation of the nicotinamide adenine dinucleotide phosphate cation, a hydrogen acceptor. NADPH is made when NADP binds to one hydrogen a

How many different molecules composed, How many different molecules compose...

How many different molecules composed of (A) two (B) three, and (C) four amino acids, linked together by peptide bonds, can be made from the set of 20 naturally occurring amino aci

What is simple transposition in neonates, What is Simple Transposition in N...

What is Simple Transposition in Neonates ? A baby with this malformation needs to be operated without delay. A very cyanosed infant will require palliation by balloon arteries

Dormancy - plant growth substances, Dormancy - Plant Growth Substances ...

Dormancy - Plant Growth Substances Dormancy can be defined as a state of suspended growth and metabolism. When most plants are exposed to seasonal periods of very inclement we

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd