Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

Define nutrition counseling - management of eating disorders, Define Nutrit...

Define Nutrition Counseling - Management of Eating Disorders? Nutrition counseling can be used to accomplish a variety of goals, such as reducing behaviours related to the eati

Implementation of nursing care, Implementation of Nursing Care   Admin...

Implementation of Nursing Care   Administration of Appropriate Drugs  If  the surgery is not necessary or if  it is postponed for some time then diuretics and digoxin are

Modification of mitosis, MODIFICATION OF MITOSIS 1 .      CRYPTOMITOS...

MODIFICATION OF MITOSIS 1 .      CRYPTOMITOSIS OR PROMITOSIS : Primitive type of mitosis. In this mitosis nuclear membrane not disappear (remain intact throughout the div

Non-modifiable risk factors for coronaru heart diseases, Q. Non-Modifiable ...

Q. Non-Modifiable Risk Factors for coronaru heart diseases? Non-Modifiable Risk Factors 1. Age 2. Sex 3. Heredity 4. Endomorphic Body Build Family history: Pe

Extracellular aging, Extracellular Aging The basic components of extra...

Extracellular Aging The basic components of extracellular space are mucopolysaccharides and fibrous proteins, particularly collagen and elastin. These proteins are synthesised

Amazonian butterflies, The Amazon rainforest in South America is a biodiver...

The Amazon rainforest in South America is a biodiverse ecosystem. There are large numbers of plant and animal species making up the food web, including over 350 species of predator

Illustrate hypertrophic cardiomyopathy?, Q. Illustrate Hypertrophic cardiom...

Q. Illustrate Hypertrophic cardiomyopathy? It is a genetic disorder due to mutations in the gene that encodes for β-Cardiac myosin heavy chain (Localised to chromosome 14). It

Feeding on liquids, Feeding on Liquids Animals feeding on liquids are...

Feeding on Liquids Animals feeding on liquids are generally highly specialised for their feeding habits. Certain protozoa, endoparasites and aquatic invertebrates take up nut

Define exclusion chromatography - basic separation technique, Define Exclus...

Define Exclusion chromatography - basic separation technique? It is a chromatographic process, in which separation of the sample components takes place according to the molecul

Who was charles darwin, Who was Charles Darwin? The Charles Darwin was ...

Who was Charles Darwin? The Charles Darwin was an English naturalist born in 1809 and considered the father of the theory of evolution. By the end of the year 1831, before turn

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd