Compute pairwise distances between sequences

Assignment Help Advanced Statistics
Reference no: EM131011075

Please assist with Higgins methods problem

Problem 1
Derive weights for sequences

ACTA
ACTT
CGTT
AGAT

using Thompson, Higgins, and Gibson method

Use the outline below (a-d) to solve this problem

a) compute pairwise distances between sequences

b) applyUPGMA method to join sequences and consequently the clusters)

c) build phylogenetic tree

d) derive sequence weights

Problem 2

We assumed additive property when constructed UPGMA tree in problem 1.

What is limitation of this assumption (if any)?

Problem 3

The protein sequence of bacterial species "B3" was used to blast against swissprot protein database. The query returned significant hits to four other bacterialproteins (B1,B2,B4, B5), and one protein in human genome (H). No other mammalian species have shown presence of protein that is similar to B3. Phylogenetic tree construction by several methods resulted in a tree shown below. Explain the presence of this gene in humans.

2358_Protein sequence of bacterial.jpg

Problem 4

Describe technical and theoretical challenges associated with building phylogenetic trees.

Problem 5

Compare and contrast parsimony, maximum likelihood, UPGMA, and neighbor-joining methods

Problem 6

Create multiple sequence alignment and phylogenetic tree in Rusing ape and clustalwby following steps below:

1. Install clutalw (depending you your OS) on your computer using https://www.clustal.org/clustal2/ link

2. Open R. (all of the following steps will be implemented in R)

3. Set a working directory

4. Install package "ape" from your R session by typing:

intall.packages("ape ")

5. Load "ape" package by typing

library("ape ")

6. Read accession numbers of sequences you downloaded for Homework 2 from GenBank; this step rather for exercising purposes since you have already downloaded these sequences.

7. Save the result from step 6 as <new.fas>file

8. Run clustalw by typing:
system(paste('"path_to_YOUR_clustalw/clustalw2.exe" new.fas'))

9. Read alignment file (*aln) it should be in your working directory

10. Create phylogenetic tree using neighbor-joining method

11. Plot the tree

Submit working R-code in a separate file

Reference no: EM131011075

Questions Cloud

Net present value of project : G Corporation is considering acquiring a newer, more modern machine. The machine, which requires an initial outlay of $4.5 million, will generate cash flows of $1.1 million at the end of each year for 5 years. Investors could earn 7.5 percent else..
How will information presented about emotional intelligence : At this point in the course you should have a good idea as to the topic area you will be considering for your dissertation. How will the information presented in this course guide your next steps? The topic that I am considering for my topic is emoti..
Provide a brief overview about why calculating roi : Provide a brief overview about why calculating ROI is strategically important and list common types of items and services that would be included in an ROI analysis.
Write c function to perform complex addition and subtraction : Write C functions to perform complex addition, subtraction, multiplication, and division using the complex structure dis­ cussed in this chapter. Add these functions to the calculator program. You will have to allow the user to specify a complex v..
Compute pairwise distances between sequences : compute pairwise distances between sequences - apply UPGMA method to join sequences and consequently the clusters) and build phylogenetic tree
The history of psychology in policing : In preparation for a PowerPoint- The history of psychology in policing and The role that the Americans with Disabilities Act of 1990 plays in the hiring and evaluation process of police officers
Determine the mean for the all the numerical columns : Divide the Costs by the Qty to develop a column of Cost per Unit, Use $ and two decimal points.
The ethics of euthanasia or physician assisted suicide : Here is the topic- The Ethics of Euthanasia or Physician Assisted Suicide- pro and con. Write a 3 - 4 page paper with the cover sheet, work cited page and in text citations
Charting to pick and keep an investment : Answer the following questions in 300+ words. Cite sources used in APA format. 1. Is it best to use technical analysis or charting to pick and keep an investment? Why or why not?

Reviews

Write a Review

Advanced Statistics Questions & Answers

  Relationship between speed, flow and geometry

Write a project proposal on relationship between speed, flow and geometry on single carriageway roads.

  Logistic regression model

Compute the log-odds ratio for each group in Logistic regression model.

  Logistic regression

Foundations of Logistic Regression

  Probability and statistics

The tubes produced by a machine are defective. If six tubes are inspected at random , determine the probability that.

  Solve the linear model

o This is a linear model. If your model needs a different engine, then you need to rethink your approach to the model. Remember, there are no IF, Max, or MIN statements in linear models.

  Plan the analysis

Plan the analysis

  Quantitative analysis

State the hypotheses that you are going to test.

  Modelise as a markov chain

modelise as a markov chain

  Correlation and regression

What are the degrees of freedom for regression

  Construct a frequency distribution for payment method

Construct a frequency distribution for Payment method

  Perform simple linear regression

Perform simple linear regression

  Quality control analysis

Determining the root causes

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd