GCAT CATALOGUE DATA

The shared use of the GCAT data is one of the objectives of the GCAT project. For this reason, you can find the GCAT project on the Maelstrom Catalogue. Maelstrom Research's activities are founded on the data harmonization methodology developed under the DataSHaPER project, the open-source software developed by the OBiBa team, and the federated data analysis methodology developed under the DataSHIELD project. See our project on the Maelstrom Catalogue here.

Moreover, the GCAT data variables definition, as well as summary data, can be browsed on the GCAT variable catalogue on MICA to help identify variables of interest to support your collaborative research projects. A short video tutorial showing how to use the variable browser is available below (use CC button for video captions).

GCAT variable search from GCAT Genomes for Life on Vimeo


Avalaible GCAT data

For more details, here you can find an interactive summary table of all available GCAT data and a summary table with Baseline Questionnaire and Follow up Variables.

GCAT health data

For health data avalaible on the GCAT,  we have used the ICD-9 disease classification. Here you can find interactive tables summarizing autoreported diseases and family history diseases at Baseline.

Please download the main descriptive GCAT paper for more information. Cohort Profile: GCAT Genomes for Life. A Prospective Cohort study of the Genomes of Catalonia. Obón-Santacana M, Vilardell M, Carreras A, Durán X, Velasco J,Galván-Femenía I, Alonso T, Puig L, Sumoy L, Duell, E.J, Perucho M, Moreno V, de Cid R (link)
Summary of all available GCAT data

Data Type

Number of participants

Details

Date of adquisition

Date first available for research

Baseline  assessment

Whole cohort

Questionnaires, Physical measures, samples

2014-2017

2018

Repeat of baseline assessment

Whole cohort

Questionnaire follow-up every 2 years

2018

2019

Genotyping (baseline samples)

5.459 (GCATcore)

Dense genotyping array with 666.695 markers after quality control

2016

2018

Genotyping extended (baseline samples)

5.459 (GCATcore)

Dense genotyping map with 15,078.461 variants by in silico imputation (IMPUTE)

2017-2018

2018

Food frequency web questionnaire (follow-up)

Whole cohort

Participants are invited by email to provide additional information about diet; estimates of nutrient intake

2017-2018

2018

Biochemical assay (baseline
samples)

6.000

Glycated haemoglobin (haemoglobin
A1c)

2016-2017

2018

Metabolome (baseline
samples)

5.000 (GCATcore)

Biomarkers with known disease
association (lipids and vascular
disease)

2017-2018

2018

Chronotype web
questionnaire (follow-up)

Whole cohort

Participants are invited by email to provide additional information (ie, sleep behavior, circadian rhythm, and work shift)

2017-2018

2018

Exposome (baseline)

Whole cohort

Map of environmental exposures
acquired with geographical information
system (GIS) technology

2017-2018

2018

Other web-based
questionnaire data (follow-up)

Whole cohort

Participants are invited by email to provide additional information via web about working places. Information
will be integrated with exposome assessment

2017-2018

2018

Exome

200 (GCATcore)

Clinic custom exome of hereditary cancer in 126 hereditary cancer genes (400×)

2017

2018

Whole-genome sequencing

808

30× whole genome sequencing from 1000 volunteers, 20% from GCATcore

2017-2018

2018

Epigenome

150

DNA methylation epigenomic profile using Infinum Methylation EPIC 850K beadarray assay

2018

2019

Health record linkage

       

Primary care

Whole cohort

ICD/ATC/ OPCS procedures/laboratory

2017-2018

2018

Death registrations

Whole cohort

ICD-coded cause specific mortality

2017-2018

2018

Hospital inpatient

Whole cohort

ICD/ATC/OPCS procedures/laboratory

2017-2018

2018

Hospital outpatient

Whole cohort

ICD (few)/OPCS

2018

2018

Other

Whole cohort

National mental healthcare/
national social healthcare

2018

2018