1、TCGA癌症数据癌症数据库库介介绍专题绍专题1.前言前言2.数据数据产生生历程程5.目前已有的癌症种目前已有的癌症种类3.barcode4.Data types and data levels6.数据下数据下载解解读目录前言01http:/ CANCER GENOME ATLAScancerGenomeTranscriptomeClinicEpigenomeProteome癌症种类丰富,样本量大癌症种类丰富,样本量大34 kinds of cancer325 samples on averagehttp:/cancergenome.nih.gov/数据产生历程02http:/ TYPES AN
2、D LEVELS04http:/ TYPESData LevelLevel TypeDescription1RawLow-level data for single sample单个样本的低级数据Not normalized未标准化2ProcessedNormalized single sample data标准化的单个样本Interpreted for presence or absence of specific molecular abnormalities解释异常的个体3Segmented/InterpretedAggregate of processed data from sing
3、le sample单个样本整合在了一起Grouped by probed loci to form larger contiguous regions(in some cases)根据probe的位置分组4Summary/Regions of Interest(ROI)Quantified association across classes of samples量化关联类的样本Associations based on two or more两个或多个的关联Molecular abnormalities分子水平的异常Sample characteristics样本特性Clinical var
4、iables临床变异DATA LEVLES注意:低水平的测序数据存储在CGHub https:/cghub.ucsc.edu/,申请下载时需要DUNS number.The Cancer Genomics Hub(CGHub)is a secure repository for storing,cataloging,and accessing cancer genome sequences,alignments,and mutation information from the Cancer Genome Atlas(TCGA)consortium and related projects.目
5、前已有的癌症种类05癌症种类丰富,样本量大癌症种类丰富,样本量大34 kinds of cancer325 samples on average详细见:详细见:TCGA publication guideline,http:/cancergenome.nih.gov/publications/publicationguidelines数据下载及解读06http:/ 第第1封邮件通知下载申请已经提交封邮件通知下载申请已经提交第第2封给出下载链接封给出下载链接Step4Step 4 文件内容文件内容File_manifest.txt,对所下载文件的说明,对所下载文件的说明l 临床数据解读临床数据解读CDE:Common Data Elements https:/tcga-data.nci.nih.gov/docs/dictionary/THANKShttp:/