Implement the Chou-Fasman algorithm

Reference no: EM13705784

The Chou-Fasman Method for Secondary Structure Prediction:

The basic idea is that each amino acid residue is assigned three numbers that describe its propensity to be part of α-helices, β-sheets and tight β-turns. A value above 100 for a particular structure, e.g. α-helix, means the amino acid has a high preference for that structure, which is typically coupled to a low propensity for the other structures. These Chou-Fasman parameters can be determined from how often the different amino acids occur in each type of secondary structure in known protein structures. The prediction algorithm consists of the following steps:

1. Assign propensities to all residues in the sequence.

2. Scan the peptide and identify regions where 4 out of 6 contiguous residues have P(α) > 100, i.e. a high propensity for α-helix. These regions nucleate helices. Extend each nucleation region in both directions until a set of four contiguous residues has an average P(α) < 100.

3. Scan the peptide and identify regions where 3 out of 5 contiguous residues have P(β) > 100. These regions nucleate β-strands. Extend each one in both directions until a set of four contiguous residues has an average P(β) < 100, which ends the β-strand. The procedure is almost exactly the same as for helices, but the α and β segments may now overlap.

4. Any region containing overlapping α and β assignments is taken to be helical or β depending on which of the average P(α) and the average P(β) for that region is larger. If this reduces an α- or β-region to fewer than 5 residues, the α- or β-assignment for that region is removed.

5. To identify a tight β-turn starting at residue number i, calculate the product f(turn) = f(i)f(i+1)f(i+2)f(i+3), where each factor is read from the column for that position and the row for the residue found there. Remember that a tight turn uses two amino acids, with one more residue before the turn and another after it. These numbers are simply proportional to the frequency with which each amino acid occurs at each position of a β-turn.

6. To predict a β-turn, the following three conditions have to be simultaneously fulfilled:

(a) f(turn) > 0.000075; (b) the average turn propensity P(turn) of the four amino acids is above 100; and (c) the average P(turn) is larger than both the average P(α) and the average P(β), i.e. a turn is more probable than any other structure.

7. The remaining parts of the sequence are considered coils.
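
Steps 1 to 3 above can be sketched as follows. The assignment asks for awk; Python is used here only to illustrate the logic, which relies on nothing more than array lookups and loops. The function name `find_regions`, the half-open `(start, end)` convention, and the exact extension rule are assumptions: "4 out of 6" is read as "at least 4", and a region grows by one residue as long as the four residues at its new edge average 100 or more. Other readings of the extension rule are possible.

```python
# Residue -> (P_alpha, P_beta), copied from the Chou-Fasman parameter table.
P = {
    "A": (142, 83), "R": (98, 93), "D": (101, 54), "N": (67, 89),
    "C": (70, 119), "E": (151, 37), "Q": (111, 110), "G": (57, 75),
    "H": (100, 87), "I": (108, 160), "L": (121, 130), "K": (114, 74),
    "M": (145, 105), "F": (113, 138), "P": (57, 55), "S": (77, 75),
    "T": (83, 119), "W": (108, 137), "Y": (69, 147), "V": (106, 170),
}


def find_regions(seq, column, window, needed, threshold=100):
    """Return merged half-open (start, end) regions for one structure type.

    column: 0 for alpha-helix, 1 for beta-strand.
    window, needed: 6 and 4 for helices, 5 and 3 for strands.
    """
    p = [P[aa][column] for aa in seq]  # step 1: assign propensities
    n = len(p)
    regions = []
    for i in range(n - window + 1):
        # Nucleation: enough residues in the window exceed the threshold.
        if sum(v > threshold for v in p[i:i + window]) < needed:
            continue
        start, end = i, i + window
        # Extend left while the four residues at the new left edge
        # (the candidate plus the first three of the region) average >= threshold.
        while start > 0 and sum(p[start - 1:start + 3]) / 4 >= threshold:
            start -= 1
        # Extend right with the mirrored four-residue window.
        while end < n and sum(p[end - 3:end + 1]) / 4 >= threshold:
            end += 1
        regions.append((start, end))
    # Merge regions that overlap or touch.
    merged = []
    for s, e in sorted(regions):
        if merged and s <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], e))
        else:
            merged.append((s, e))
    return merged
```

For the made-up sequence `GGGGEEEEEEGGGG`, `find_regions(seq, 0, 6, 4)` returns `[(2, 12)]`: the glutamate core nucleates a helix, and the averaging rule carries it two glycines outward on each side. The same call with `column=1, window=5, needed=3` returns `[]`, since no residue in that sequence has P(β) > 100.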

Your task for this activity is to write a script in awk that implements the Chou-Fasman algorithm. The values of the Chou-Fasman parameters are given in the table at the end of this page.
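
The overlap rule in step 4 is the part most easily gotten wrong, so here is a minimal sketch of one way to handle it, again in Python for illustration rather than awk. It assumes the helix and strand regions are already available as half-open `(start, end)` index pairs and that per-residue propensities are passed in as lists. The helper names `runs` and `resolve_overlaps` are hypothetical, and giving ties to the helix is an arbitrary choice the source does not specify.

```python
def runs(flags):
    """Yield half-open (start, end) runs of consecutive True values."""
    start = None
    for i, flag in enumerate(list(flags) + [False]):
        if flag and start is None:
            start = i
        elif not flag and start is not None:
            yield (start, i)
            start = None


def resolve_overlaps(p_alpha, p_beta, helix, strand, min_len=5):
    """Apply step 4: settle alpha/beta overlaps, then drop short regions."""
    n = len(p_alpha)
    in_helix = [False] * n
    in_strand = [False] * n
    for s, e in helix:
        for i in range(s, e):
            in_helix[i] = True
    for s, e in strand:
        for i in range(s, e):
            in_strand[i] = True

    # Each maximal stretch assigned to both structures goes to the one
    # with the larger average propensity over that stretch.
    both = [h and b for h, b in zip(in_helix, in_strand)]
    for s, e in runs(both):
        avg_alpha = sum(p_alpha[s:e]) / (e - s)
        avg_beta = sum(p_beta[s:e]) / (e - s)
        # Assumption: ties are resolved in favour of the helix.
        loser = in_strand if avg_alpha >= avg_beta else in_helix
        for i in range(s, e):
            loser[i] = False

    # Regions trimmed below min_len residues lose their assignment.
    helix_out = [(s, e) for s, e in runs(in_helix) if e - s >= min_len]
    strand_out = [(s, e) for s, e in runs(in_strand) if e - s >= min_len]
    return helix_out, strand_out
```

Working on per-residue flags rather than on intervals means a losing region that is cut in the middle simply falls apart into two runs, each of which is then checked against the five-residue minimum.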

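| 3-letter code | 1-letter code | P(α) | P(β) | P(turn) | f(i) | f(i+1) | f(i+2) | f(i+3) |
|---|---|---|---|---|---|---|---|---|
| ALA | A | 142 | 83 | 66 | 0.060 | 0.076 | 0.035 | 0.058 |
| ARG | R | 98 | 93 | 95 | 0.070 | 0.106 | 0.099 | 0.085 |
| ASP | D | 101 | 54 | 146 | 0.147 | 0.110 | 0.179 | 0.081 |
| ASN | N | 67 | 89 | 156 | 0.161 | 0.083 | 0.191 | 0.091 |
| CYS | C | 70 | 119 | 119 | 0.149 | 0.050 | 0.117 | 0.128 |
| GLU | E | 151 | 37 | 74 | 0.056 | 0.060 | 0.077 | 0.064 |
| GLN | Q | 111 | 110 | 98 | 0.074 | 0.098 | 0.037 | 0.098 |
| GLY | G | 57 | 75 | 156 | 0.102 | 0.085 | 0.190 | 0.152 |
| HIS | H | 100 | 87 | 95 | 0.140 | 0.047 | 0.093 | 0.054 |
| ILE | I | 108 | 160 | 47 | 0.043 | 0.034 | 0.013 | 0.056 |
| LEU | L | 121 | 130 | 59 | 0.061 | 0.025 | 0.036 | 0.070 |
| LYS | K | 114 | 74 | 101 | 0.055 | 0.115 | 0.072 | 0.095 |
| MET | M | 145 | 105 | 60 | 0.068 | 0.082 | 0.014 | 0.055 |
| PHE | F | 113 | 138 | 60 | 0.059 | 0.041 | 0.065 | 0.065 |
| PRO | P | 57 | 55 | 152 | 0.102 | 0.301 | 0.034 | 0.068 |
| SER | S | 77 | 75 | 143 | 0.120 | 0.139 | 0.125 | 0.106 |
| THR | T | 83 | 119 | 96 | 0.086 | 0.108 | 0.065 | 0.079 |
| TRP | W | 108 | 137 | 96 | 0.077 | 0.013 | 0.064 | 0.167 |
| TYR | Y | 69 | 147 | 114 | 0.082 | 0.065 | 0.114 | 0.125 |
| VAL | V | 106 | 170 | 50 | 0.062 | 0.048 | 0.028 | 0.053 |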
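
With the full parameter table in hand, the β-turn test from steps 5 and 6 can be sketched as below. This is Python for illustration only (the assignment itself asks for awk), and the names `TABLE`, `PARAMS` and `predict_turns` are hypothetical. Positions are reported as 0-based indices of the first of the four residues, which is an assumption about output format rather than something the assignment specifies.

```python
# Columns: residue, P(alpha), P(beta), P(turn), f(i), f(i+1), f(i+2), f(i+3)
TABLE = """\
A 142  83  66 0.060 0.076 0.035 0.058
R  98  93  95 0.070 0.106 0.099 0.085
D 101  54 146 0.147 0.110 0.179 0.081
N  67  89 156 0.161 0.083 0.191 0.091
C  70 119 119 0.149 0.050 0.117 0.128
E 151  37  74 0.056 0.060 0.077 0.064
Q 111 110  98 0.074 0.098 0.037 0.098
G  57  75 156 0.102 0.085 0.190 0.152
H 100  87  95 0.140 0.047 0.093 0.054
I 108 160  47 0.043 0.034 0.013 0.056
L 121 130  59 0.061 0.025 0.036 0.070
K 114  74 101 0.055 0.115 0.072 0.095
M 145 105  60 0.068 0.082 0.014 0.055
F 113 138  60 0.059 0.041 0.065 0.065
P  57  55 152 0.102 0.301 0.034 0.068
S  77  75 143 0.120 0.139 0.125 0.106
T  83 119  96 0.086 0.108 0.065 0.079
W 108 137  96 0.077 0.013 0.064 0.167
Y  69 147 114 0.082 0.065 0.114 0.125
V 106 170  50 0.062 0.048 0.028 0.053
"""

PARAMS = {}
for line in TABLE.splitlines():
    aa, *values = line.split()
    PARAMS[aa] = [float(v) for v in values]


def predict_turns(seq, cutoff=0.000075):
    """Return 0-based start positions of predicted tight beta-turns."""
    turns = []
    for i in range(len(seq) - 3):
        rows = [PARAMS[aa] for aa in seq[i:i + 4]]
        # f(turn): the k-th residue contributes its f(i+k) value (columns 3..6).
        f_turn = 1.0
        for k, row in enumerate(rows):
            f_turn *= row[3 + k]
        avg_alpha = sum(r[0] for r in rows) / 4
        avg_beta = sum(r[1] for r in rows) / 4
        avg_turn = sum(r[2] for r in rows) / 4
        # Conditions (a), (b) and (c) must all hold.
        if (f_turn > cutoff and avg_turn > 100
                and avg_turn > avg_alpha and avg_turn > avg_beta):
            turns.append(i)
    return turns
```

For example, the four residues `NPDG` give f(turn) = 0.161 × 0.301 × 0.179 × 0.152 ≈ 0.0013 and an average P(turn) of 152.5, so `predict_turns("NPDG")` returns `[0]`, whereas `predict_turns("AAAA")` returns `[]`.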

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd