Determine the probability of dna fragment

Assignment Help Basic Statistics
Reference no: EM131749403

Question:

a) Define zero and first order Markov models for seqeuence1.txt. The sequence is Mycobacterium tuberculosis gene mtb48

Hints:
- Zero order Markov model is defined by P(i), where i= {A,T,G,C}
For this you simply need the nucleotide counts and total number of nucleotides.
Zero order Markov model for DNA sequence should have four parameters
- First order Markov Model is defined by P(i|j), where i,j ={A,T,G,C}. For example P(A|T) is probability of observing A after T in DNA sequence
For this you'll need the number of occurrences of di-nucleotides and the total number of di-nucleotides
First order Markov model for DNA sequence should have sixteen parameters.
- To implement this, it would be easiest to write a small script in R using a alphabetFrequency() and dinucleotideFrequency() function of the Biostrings package. Or you can use perl or any other programming language of your choice. Otherwise, if you really have to (you exhausted all the options, see no other way and hopelessly behind on your schedule) you can use Microsoft Word or Excel substitute function or MS word's find/replace.

b) Using models you derived in (a) determine the probability of DNA fragment AGTAGCTTCCAG.

Verified Expert

The assignment was focused on bioinformatics based problem which is related to Markov model.Both zero and first order Markov model are calculated and shown 4 and 16 parameters respectively.

Reference no: EM131749403

Questions Cloud

Calculate the center line and control limits for chart : Control limits for a meat-packaging process. A meat-packaging company produces 1-pound packages of ground beef by having a machine slice a long circular.
Compute the breakeven point in sales dollars for story : Compute the breakeven point in sales dollars for Story, Story Park competes with Splash World by providing a variety of rides
Discuss the costs charged to a department : How the costs were assigned to the output. Physical transfers for a department. The costs charged to a department
What is the expected cost of fabric : Kathy likes to have half of the next month's fabric needs in ending inventory. What is the expected cost of fabric to be purchased in August
Determine the probability of dna fragment : Define zero and first order Markov models for seqeuence1.txt (attached). The sequence is Mycobacterium tuberculosis gene mtb48 - determine the probability
What should tax reform do : At the very least, a broader conversation about this country's tax policy is necessary, and that means asking a simple question: What should tax reform do
Discuss considerations when accounting for inventory : Management must confront which of the following considerations when accounting for inventory
What are the lessons from the case for general managers : Mount Everest - 1996 (by Michael Roberto and Gina Cariogga). What are the lessons from this case for general managers in business enterprises?
Determine the ending inventory cost : Determine the ending inventory cost and the cost of goods sold by three methods. Round interim calculations to one decimal

Reviews

Write a Review

Basic Statistics Questions & Answers

  What percent of vehicles have better gas mileage

The 2010 Chevrolet Camaro with an eight-cylinder engine and automatic transmission has a combined gas mileage of 19 mpg. What percent of all vehicles have better gas mileage than the Camaro?

  Test whether the mean population of apps is greater than 40

according to a 2010 nielsen survey the mean number of apps downloaded by iphone users is 40. suppose a sample of 36

  Confidence interval for the mean price of regular gasoline

a) Find a 95% confidence interval for the mean price of regular gasoline in that region. ( __, __ ) (Round to three decimal places as needed.)

  The central limit theorem says that when a simple sample of

the central limit theorem says that when a simple sample of size n is drawn from any population with mean ? and

  Data to determine the important factors

A single replicate of a 24 Exercise 10.2 factorial is run. The results in standard order are 1.106, 2.295, 7.074, 6.931, 4.132, 2.148, 10.2, 10.12, 3.337, 1.827, 8.698, 6.255, 3.755, 2.789, 10.99, and 11.85.

  Calculate the summary statistics relevant to this problem

questionthe effect of exercise on the amount of lactic acid in the blood of sportsmen was examined in a study. eight

  Academic approach to factoring polynomials

Factor the polynomial completely. Factor each completely given that the binomial following it is a factor of the polynomial.

  An arbitrary customer goes to server

Customers arrive according to a Poisson process with rate λ = 5 per hour in a system with two servers. The probability that an arbitrary customer goes to server no. 1 (respectively, no. 2) is equal to 3/4 (resp., 1/4).

  Describe the differences in brand preferences

Teen smokers, regionally. The Monitoring the Future study of teen smoking described in Exercise 9 looked at brand preferences by region of the country.

  Analyse data to answer a specific business problem

BUS708 Statistics and Data Analysis - Explain how you collect the data and discuss whether or not your sample is biased. Is this primary or secondary data?

  Calculate and interpret the p-value of the test

Give H0 and HA for testing whether the average July 2006 price was $25,000. Calculate and interpret the p-value of the test. State the appropriate conclusion for the test if α = 5%

  Find the margin of error

In a surgey of 280 adults over 50.80%; said they were taking vitamin supplements. Find the margin of error for this survey if we want a 99% confidence in our estimate of the percent of adults over 50 who take vitamin supplements.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd