Skip to content

alhafidzhamdan/GLASS

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GLASS_logo_072219_08

The GLASS consortium

Overview

The Glioma Longitudinal AnalySiS (GLASS) consortium consists of clinical, bioinformaticians, and basic science researchers from leading institutions across the world striving to better understand glioma tumor evolution and to expose its therapeutic vulnerabilities. For more information, please read our position paper: The Glioma Longitudinal Analysis Consortium. Glioma Through the Looking GLASS: Molecular Evolution of Diffuse Gliomas and the Glioma Longitudinal AnalySiS Consortium. Neuro Oncol. 2018 PMID: 29432615.

Analytical contributors

Floris Barthel*, Kevin Johnson*, Frederick Varn, Anzhela Moskalik, Georgette Tanner, Emre Kocakavuk, Hoon Kim, Roel Verhaak

Data contributors

In no particular order: Mustafa Khasraw (U Sydney), Joe Costello (UCSF), Tom Mikkelsen (Henry Ford Hospital), Dohyun Nam (Samsung Medical Center), Pieter Wesseling (VUMC), Antonio Iavarone (Columbia), Gaetano Finochiaro (Instituto Besta), Lucy Stead (Leeds U), Adelheid Wöhrer (Vienna), Hiromichi Suzuki (Kyoto U), Rameen Beroukhim (DFCI), Bernhard Radlwimmer (DKFZ), Jill S. Barnholtz-Sloan (CWRU), Priscilla Brastianos (MGH), Jason Huse (MD Anderson), John de Groot (MD Anderson), Kristin Alfaro-Munoz (MD Anderson), and NIH/NCI The Cancer Genome Atlas.

The GLASS Data Model

The GLASS data model was built to carefully take into the account the temporal nature of the dataset. The layers of the dataset is best explained using the diagram below. Here we see a single hypothetical patient (left-most column) from which we have obtained tumor samples at three different time points (second column), including a second tumor sub-sample (eg. multisector sample) from the primary tumor (third column). Moreover, a simultaneous DNA-RNA extraction was performed on 1st tumor recurrence and both were sequenced (fourth column). Lastly, some of the DNA aliquots were split off and separately sent off for whole genome and whole exome sequencing (fifth column).

The gray colored blocks in the diagram represent the abstraction layers we are using to represent this complexity. Firstly, the case level represents patient in the cohort. Secondly, the sample level represent bulk samples (tumor and control) in the cohort and finally the aliquot level is the aggregate of the sample portion used for differentiating between multi-sector samples, the analyte used for differentiating between DNA and RNA extractions and finally the analysis to which the aliquot was subjected, such as whole genome, whole exome or RNA sequencing.

data-model

Internally, we are using the PostgreSQL database management system to manage the complex relationships between data entities. More information on the database scheme can be found here.

GLASS Barcode

The GLASS barcodes are inspired by the TCGA barcodes and are constructed in a recognizable fashion:

barcode

Key parts of the data model are represented in the barcode.

Data Release version March 2019.

The data available here marks the second release of the GLASS project dataset, which is managed internally using the PostgreSQL database management system. The data available here represents the March-2019 snapshot of our dataset. These data are under active curation so future versions will include additional data as well as correct potential errors. Please report any inconsistencies you find on the discussion board here or on our Github issue tracker https://github.com/TheJacksonLaboratory/GLASS.

Current Data by the Numbers

The following infographic shows the some summary statistics of the March-2019 release of the GLASS dataset.

glass-numbers

Data Download

The GLASS data can be downloaded from the Tables page here. It is also possible to query the data directly using the the API by using queries. You can read more about that here.

About

GLASS consortium

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • R 65.1%
  • Python 17.7%
  • TSQL 7.6%
  • Jupyter Notebook 5.6%
  • Shell 3.8%
  • Perl 0.1%
  • PLpgSQL 0.1%