Cancer Genome Analysis Presented by Karen Xu Introduction
Cancer Genome Analysis Presented by Karen Xu Introduction Cancer is commonly referred to as the disease of the genes Cancer may be favored by genetic predisposition, but it is thought to be primarily caused by mutations in specific tissues
that accumulate over time Difference between cancer genome analysis and GWAS GWAS use large cohorts of cases to analyze the relationship between the disease and thousands or millions of mutations across the entire genome The study of cancer genome is different. During the lifetime of the organism variants only accumulate in the tumor or the affected
tissue and they are not transmitted from generation to generation-----somatic mutations Types of cancer genome analysis May focus on the cancer type or the patient 1. examining a cohort of patients suffering from a particular type of cancer and is used to identify biomarkers, characterize cancer subtypes with clinical or therapeutic implications or to simply advance our
understanding of the tumorigenic process 2. examining the genome of a particular cancer patient in the search for specific alterations that may be susceptible to tailored therapy Figure 1. Idealized cancer analysis pipeline. Vazquez M, de la Torre V, Valencia A (2012) Chapter 14: Cancer Genome Analysis. PLoS Comput Biol 8(12): e1002824. doi:10.1371/journal.pcbi.1002824 http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1002824
Sequencing, Alignment and Variant calling After samples are sequenced, sequencing reads are aligned to a reference genome and all differences are identified through a process known as variant calling. The output of the variant calling is a list of genomic variations that is organized according to their genomic location (chromosome and position) and the variant allele. They may be accompanied by scores measuring the sequencing quality over that region or the prevalence of the variant allele in the
samples. The workflow employed for this type of analysis is commonly known as a primary analysis. Consequence, Recurrence analysis and candidate drivers DNA mutations are translated into mutations in RNA transcripts, and from RNA into proteins, potentially altering their amino acid sequence. The impact of these amino acid alterations on protein function can range from largely irrelevant to highly deleterious
Severity of these alterations can be assessed using specialized software tools known as protein mutation pathogenicity predictors Mutations are also examined to identify recurrence, which may point to key genes and mutational hotspots Not all mutations that have deleterious consequences for protein function are necessarily involved in cancer Pathways and Functional Analysis Genes recurrently mutated in cancer tend to be easily identifiable. Examples, TP53 and KRAS
However, most often mutations are more widely distributed and the probability of finding the same gene mutated in several cases is low, making it difficult to identify common functional features associated with a given cancer Pathway analysis offers a means to overcome this challenge by associating mutated genes with known signaling pathways Cancer is not only a disease of the genes but also a disease of the pathways Integration, Visualization and Interpretation
Gene expression and alterations in the copy number of each gene, a very common phenomenon in cancer Mutations in promoters and enhancers Variation in the affinity of transcription factors and DNA binding proteins Dysregulation of epigenetic control Current Challenges 1. The heterogeneity of the data to be
analyzed, which ranges from genomic mutations in coding regions to alterations in gene expression or epigenetic marks 2. The range of databases software resources required to analyse and interpret the results 3. The comprehensive expertise required to understand the implications of such varied experimental data Critical Bioinformatics Tasks in Cancer Genome Analysis
4 Critical Bioinformatics Tasks in Cancer Genome Analysis Mapping between coordinate systems Driver Mutations and Pathogenicity Prediction Functional Interpretation Actionable results: patient stratification and drug targets Mapping between Coordinate systems Translating mutational information derived
from genomic coordinates to other data types is the first step. Example: point mutations in coding regions can be mapped to different transcripts by finding the exon affected, the offset of the mutation inside that exon and the position of the exon inside the transcript Driver Mutations and Pathogenicity Prediction driver----mutations that drive cancer onset and
progesssion passenger----mutations that play little or no role in tumorigenic process but are propagated by their coexistence with driver mutations Experimental assays of activity are one means of testing the tumorigenic potential of mutations, although such assays are difficult to perform to scale. Statistical approaches seek to identify traces of mutation selection during tumor formation by looking at the prevalence of mutations in particular genes in sample cohorts, or the ratios of synonymous versus nonsynonymous mutations in particular candidate genes.
Functional Intrepretation Frequently genomic data reveals the presence of mutated genes that are far less prevalent, and the significance of these genes must be considered in the context of the functional units they are part of. The involvement of genes in specific biological, metabolic and signaling pathways is the type of functional annotation most commonly considered and thus, functional analysis is often termed pathway analysis. The current systems for functional interpretation have
been derived from the systems previously developed to analyze expression arrays, and they have been adapted to analyze lists of cancer-related genes. Applicable results: diagnosis, patient stratification and drug therapies For clinical applications, the results of cancer genome analysis need to be translated into practical advice for clinicians, providing potential drug therapies, better tumor
classification or early diagnostic markers. Resources for Genome Analysis in Cancer Databases Some databases describe entities and their properties, such as: proteins and the drugs that target them; germline variations and the diseases with which they are associated; or genes along with the factors that regulate their transcription. Other databases are repositories of experimental data, such as the Gene Expression Omnibus and ArrayExpress, which contain data from microarray
experiments on a wide range of samples and under a variety of experimental conditions. Software In cancer analysis pipelines, several tasks must be performed that require supporting software. These range from simple database searches to cross-check lists of germline mutations with lists of known SNPs, to running complex computational methods to identify protein-protein interaction sub-networks affected by mutations. Workflow Enactment Tools and Visual Interfaces Given the complexity of cancer genome analysis, it
is worth discussing how to design and execute (enact) workflows, which may become very elaborate. Workflows can be thought of as analysis recipes, whereby each analysis entails enacting that workflow using new data. Ideally a workflow should be comprehensive and cover the complete analysis process from the raw data to the final results. --Improve Efficiency Limitations of Visual interfaces: overly complex, inflexible, and limited utility compared w/ general purpose programming language
That's why we cannot just say an RDA element and know which MARC tag it is - sometimes there may be more than one There are some consistent associations for many fields, but not always. With RDA we are adding...
An Interruptionby Robert Foote. A boy had stopped his carTo save a turtle in the road;I was not farBehind, and slowed,And stopped to watch as he beganTo shoo it off into the undergrowth—This wild reminder of an ancient past,Lumbering to...
This leads to a larger Bowen Ratio and may lead to an overall different effect on convection initiation. Fig. 4: Area mean of ISFF measurements and corresponding WRF outputs. (a) Time series of water vapor mixing ratio around solar noon,...
9th Lit Daily Agenda PPT. Mrs. Minich. ... (1 textual reference to support each point) - you can paraphrase instead of looking up quotes, or you can use quotes. ... Hand out TKAM chapter vocabulary (keep this in your book...
Department of Medicine Quality Program Medical ICU Morbidity and Mortality Conference ... Patients were classified into three groups: low (LRINEC score ≤ 5, <50% risk for nec fasc), moderate (LRINEC score 6-7, 50-75% risk for nec fasc), and high risk...
Grace Nichols talks about her poem 'Praise Song for my Mother'. Using strong visual images of Guyana, she explores how the poem reflects her family life, cultural background and approach to writing poetry and reads extracts from the poem.
angular velocity . related to the . tangential velocity . of an object moving in a circle? (Tangential velocity is the speed at which the object is moving at an instant in time during the circular motion—the vector is drawn...
THE MIDDLE EAST Persian Gulf & Eastern Mediterranean Natural Environments Landforms and Rivers Tectonic forces have shaped the physical features of this region Rugged Mountains Narrow gulfs and seas Turkey lies in Europe and Asia, most lies in Asia and...
Ready to download the document? Go ahead and hit continue!