Analyse next generation sequencing data

Assignment Help Other Subject
Reference no: EM133121988

Task - Data analysis Genomics

For your coursework, you will analyse next generation sequencing data and spectrometry data produced by different techniques.

The coursework has two sections:

Section A tests your learning in genomics (learning outcome 1).
Section B tests your learning in proteomics (learning outcome 3).
You will be given an individual dataset for each section.

This document contains the brief for Section A. Dr David Smith will provide the brief for Section B.

You must perform the analyses yourself and write methods, results and answers in your own words. Doing the analysis yourself and writing in your own words is important because we are assessing whether you are able to perform the analysis and if you have understood the ideas you have been learning about.

Section A - Genomics

Your data are from Illumina paired-end sequencing of one human. You should analyse these resequencing data to
- get a list of genomic variants
- summarise the variants

Part 1: Producing a list of the individual's genomic variants Instructions
Start by creating a directory for this coursework inside your home directory. All the files you produce should go into this folder. You should not copy the raw data files into this folder, use the raw data files as input by providing their file path.
Make a text file (in MS Wordpad, Apple TextEdit, notepad++ or another software). On the first line of the text file state which directory you are working in. Copy each command into the text file as you run them. Check that the command is written exactly as you gave it on the command line. Make sure there are no typos and no spelling has been autocorrected.

This is very similar analysis that you did with a practise dataset in the computer practicals. You should do the computational steps that are needed to produce a list of variants (some exericses we did in the practicals were to help you understand the format and contents of file types, you do not need to repeat those exercises).

Part 2: Summary of the variants detected Instructions
Investigate the types of variants that were found and report how many were of each type (e.g. the number that were SNVs and the number that were indels). There are many ways of classifying variants and you should decide yourself how to do this (by thinking about what aspects of variants is most interesting). ANNOVAR can annotate variants with a lot of information. In human genomics, we are usually interested in the variants that are most likely to cause disease.

You should present your results in a table and you can classifiy variants a few ways but the table should take up no more than half a page. The counting can be done in Excel, which will be demonstrated in class. Credit will be given for working out a bioinformatics method to do this (such as using software or writing a script).

Attachment:- Data analysis.rar

Reference no: EM133121988

Questions Cloud

What is the bond nominal yield to call : The bond has a 5% nominal yield to maturity, but it can be called in 5 years at a price of $1,052.04. What is the bond's nominal yield to call
What is the change in capital : You are running a hedge fund with a long position of 4,000 shares of IBM, and a short position of 6,000 shares of Intel. IBM is currently trading at $190 per sh
Analyze the cash ?ow and retained earnings : To analyze the cash ?ow and retained earnings impact of a cyclical upturn in paper and linerboard prices (i.e., demonstrating the power of operating leverage in
Present the retained earnings reconciliation : Assuming that the change in policy was implemented retrospectively, present the retained earnings reconciliation that would appear
Analyse next generation sequencing data : Analyse next generation sequencing data and spectrometry data produced by different techniques - Producing a list of the individual's genomic variants
Calculate the payback period for project : a. Calculate the payback period for each project. Rank the projects by payback period.
Find malaysian island resort : Theresa Nunn is planning a 30-day vacation on Pulau Penang, Malaysia, one year from now. The present charge for a luxury suite plus meals in Malaysian ringgit (
Calculate the sharpe ratio : ABC Mutual Fund's total return (geometric mean over the past 5 years: 10.25%
How much will she have five years from today : She plans to save $4,000 per year at the end of each year for five years. If her savings earn 6% interest, how much will she have 5 years from today

Reviews

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd