Jump to: Authorized Access | Attribution | Authorized Requests

Study Description

The causal mechanisms of common diseases and their therapies have been only marginally illuminated by genetic variants identified in genome wide association studies (GWAS) utilizing single nucleotide polymorphism (SNPs). Platelet activation pathways reflecting hemostasis and thrombosis are the underlying substrate for many cardiovascular diseases and related acute events. To overcome GWAS limitations, genomic studies are needed that integrate molecular surrogates for platelet-related phenotypes assayed in cell-based models derived from individuals of known genotypes and phenotypes. In our GWAS study of native platelet aggregation phenotypes and aggregation in response to low dose aspirin in 2200 subjects (GeneSTAR, Genetic Study of Aspirin Responsiveness), important genome wide "signals" (p<5x10-8) associated with native platelet aggregation and important "signals" associated with platelet responsiveness to aspirin were identified and replicated. Although we are currently performing functional genomics studies to elucidate our most promising findings in known genes (PEAR1, MET, PIKC3G), most "signals" occurred in intergenic regions or in introns. Mechanistic interpretation is limited by uncertainty as to which gene(s) are up- or down-regulated in the presence of most SNP modifications. In this 3 phase proposal, we will (1) create pluripotent stem cells (iPS) from peripheral blood mononuclear cells, and then differentiate these stem cells into megakaryocytes (2) develop an efficient strategy to produce iPS and megakaryocytes using a novel pooling method, and (3) produce iPS and megakaryocytes from 250 subjects in GeneSTAR (European Americans and African Americans), selected based on specific hypotheses derived from GWAS signals in native and post aspirin platelet function; characterize genetic mRNA transcripts using a comprehensive Affymetrix array; measure protein expression for transcripts of interest using mass spectrometry; examine mRNA and protein expression patterns for each GWAS signal to determine the functional pathway(s) involved in native platelet phenotypes; and examine the functional genomics of variations in responsiveness to aspirin using our prior genotyped and phenotyped population. Precise information about the exact functional processes in megakaryocytes and platelets may lead to innovative and tailored approaches to risk assessment and novel therapeutic targets to prevent first and recurrent cardiovascular and related thrombotic events.

Authorized Access
Publicly Available Data
  Link to other NCBI resources related to this study
Study Inclusion/Exclusion Criteria

Subjects were included if they were members of the original GeneSTAR Study population and were over 21 years of age. Subjects were excluded if they developed diabetes, cardiovascular disease, and any new known bleeding disorder, AIDS, advanced cancer, cancer under treatment or autoimmune diseases. Women who were pregnant when first recruited with consent will be deferred until 6 months after delivery, so they were not excluded, but did not give a blood sample during their pregnancy. Subjects with certain conditions that might be influenced by taking a 100cc volume of blood have already been pre-screened in GeneSTAR and were not eligible if they had blood dyscrasias, anemia, bleeding disorder or any contraindications to phlebotomy.

Molecular Data
TypeSourcePlatformNumber of Oligos/SNPsSNP Batch IdComment
Targeted Genotyping Illumina Human1Mv1_C N/A N/A
Targeted Genotyping Illumina Human Exome BeadChip v1.2 N/A N/A
Targeted Genotyping Illumina Infinium Multi-Ethnic Genotyping Array (MEGA) N/A N/A In process, expected late March 2016
Study History

Phases 1-2: During Phases 1 and 2 the Cheng lab optimized production of iPS from PBMCs and differentiation of Mks from the reprogrammed iPS and developed high throughput efficient methods for both. The key features of iPS production, leading to a 10 fold increase in efficiency with reduced costs, included (1) the use of episomal vectors for reprogramming, (2) selection and sorting of cells expressing TRA-1-60 by MACS (>5000 cells) with expansion of 2 pools of cells, replacing hand picking and expansion of individual clones, (3) use of E8 culture medium and replacement of mouse embryonic feeder cells with matrigel or vitronectin for expansion. The key features of improved Mk differentiation included (1) the use of feeder cell free and serum free culture conditions, (2) no need for a period of hypoxia during differentiation, and (3) use of FDA approved human compatible preparations, including Romiplostium (a thrombopoietin analogue), Oprelvekin (recombinant IL-11), and Plasbumin (human albumin, replacing bovine albumin). This technique robustly generates >4M Mks from 2M iPS in 20 days, a saving of about 10 days compared to the standard method. This improved method for Mk differentiation has been submitted for publication.

We genotyped the original PBMCs, the reprogrammed iPS and the derived Mks from 6 subjects (total of 24 samples) using the HumanOmniExpressExome-8v1 genotyping array (946,674 SNPs). There was a very low rate of discordance between pairs of samples from the same subject (0.0001%-0.01%), far below the genotyping error rate of 0.37% and most likely due to genotyping error. In the 6 subjects, only 0-8 discordant genotypes were passed from the original PBMCs to the transformed iPS to the derived Mks. Even if these 'transmitted' discordant genotypes reflect a true mutation in the iPS line and not a genotyping error, we estimate this 'mutation rate' to be only 0-0.0008% or ~1 in 10-6 , which is within the normal expected somatic mutation rate. Thus, the process of iPS reprogramming and Mk differentiation does not produce significant genetic mutations, and the genotype of the derived Mks remains faithful to the original genotype of the subject.

Phase 3: Comparison of mRNA transcripts in iPS and Mks: Originally we proposed to measure mRNA transcripts in derived Mks using microarrays. However, with the fall in prices and proven superiority of whole transcriptome RNA sequencing (RNAseq) for detecting all transcripts and splice isoforms, we have decided to use RNAseq to analyze gene expression related to differences in target genotypes associated with differences in platelet aggregation. RNA is extracted from derived Mk cell pellets, after which cDNA is prepared for polyadenylated mRNA. Sequencing is then performed using an Illumina HS2500 instrument with 100 base paired-end reads, with about 100 million independent reads per sample. The short read output files are aligned to the reference genome using the Bowtie/TopHat software pipeline. Full-length transcript reads are assembled and quantified using Cufflinks. Differential gene expression is determined using Cuffmerge, a 'meta-assembler', which merges transfrags together parsimoniously. To calculate differential gene expression and to test the statistical significance of each observed change in expression between iPS and Mk cells, we used the program Cuffdiff.

To identify genes that are differentially regulated in iPS cells and megakaryocytes and pathways that are turned on or off during the process of iPS cell differentiation to Mks, we measured differential gene expression in at least two colonies each from iPS and megakaryocyte cell lines from 6 GeneSTAR participants (total of 24 samples). Expression values were calculated for each sample based on the number of fragments per kilobase of exon per million fragments mapped (FPKM) (<0.3 FPKM is considered to be not expressed). Both cell types shared 12,713 genes while 1,701 genes were expressed in only iPS cells and 1,587 genes were expressed only in Mks. The most highly expressed genes in iPS were almost all ribosomal proteins, while Mks had high expression of hematopoietic and platelet genes, such as GP9.

Pathway analysis found that genes involved in developmental biology, cell-cell communication, cell-cell junction organization, and axon guidance pathways were up-regulated in iPS cells as compared to megakaryocytes. On the other hand, genes involved in hemostasis, platelet activation, signaling, and aggregation, immune system, cytokine signaling, and interferon gamma signaling pathways were up-regulated in megakaryocytes. These results demonstrate a fundamental biological shift in gene expression patterns accompanying the transformation of iPS into Mks. The Mks express greater levels of platelet and immune function genes, while the iPS express more ribosomal protein genes.

Comparison of Gene Expression by Genotypes and Phenotypes

One of the major goals of this study is to better understand the mechanisms by which genetic variants identified in GWAS studies are associated with platelet aggregation. Since most of the genetic loci identified to date are intronic or intergenic, our approach is to examine differential gene expression in derived Mks by genotype for the highest priority variants to obtain clues as to which genes and pathways the variants control, both as cis and trans effects.

Current genes of interest include (1) PEAR1 (platelet endothelial aggregation receptor, activated by platelet-platelet contact, rs12041331 in intron 1 is robustly associated with platelet aggregation to multiple agonists in both whites and African Americans, G allele is associated with higher expression of platelet PEAR1 protein), (2) MET (an oncogene which also is a platelet membrane receptor, agonist is HGF contained in platelet granules, activation reduces thrombin mediated aggregation, rs10243024 in intron 2 is associated with aggregation to collagen in whole blood), (3) PIK3CG (phosphatidylinositol bisphosphate 3-kinase, modulator of E-cadherin mediated cell-cell adhesion, known to be involved in platelet signaling, rs342275 is associated with platelet aggregation to epinephrine), and (4) MRVI1 (endoplasmic reticulum protein, interacts with inositol triphosphate receptor and cGMP-dependent protein kinase, rs4909945 is associated with aggregation to ADP).

We measured differential gene expression in Mks by PEAR1 rs12041331 genotype in the first 42 subjects. We required a >2 fold difference on a log2 scale at an FDR of 10%. The GG genotype is characterized by greater platelet aggregation; interestingly, a number of neutrophil granule genes were overexpressed with this genotype, including azurocidin, myeloperoxidase, elastase, and lactoferrin. The reason for this is unknown, but one possibility is delayed maturation of the Mks, with a less differentiated phenotype. The AA genotype was associated with overexpression of several HLA immune function genes. There was a trend for the GA and GG genotypes to express more PEAR1 transcripts than the AA genotype, consistent with a dominant G allele model, but the results were not statistically significant. There was no evidence for alternatively spliced isoforms. We have previously shown that rs12041331 has a higher MAF and possibly a stronger effect on platelet aggregation in African Americans. We therefore also examined differential gene expression by race, and found only a few genes (including ESPN, which codes an actin binding protein, affecting cell movement, and SLC4A1, coding for an anion exchange protein expressed in erythrocyte cell membranes) that were differentially expressed by genotype in both races. We will further explore this racial overlap in gene expression with a larger sample size.

Gene Ontology pathway analysis demonstrated that with the GG genotype, genes in cell adhesion, biologic adhesion, extracellular matrix organization, extracellular structure organization and collagen fibril organization pathways were upregulated, while genes in antigen processing presentation, innate immune response, and cellular responses to interferon gamma and cytokines were down regulated relative to the AA genotype. Similar results were obtained with the Kyoto Encyclopedia of Genes and Genomics (KEGG Pathway mapping).

Gene expression analyses are ongoing.

Selected Publications
Diseases/Traits Related to Study (MeSH terms)
Links to Related Genes
Authorized Data Access Requests
See research articles citing use of the data from this study
Study Attribution
  • Principal Investigator
    • Lewis C. Becker, MD. Johns Hopkins University School of Medicine, Baltimore, MD, USA.
  • Co-Principal Investigator
    • Linzhao Cheng, PhD. Johns Hopkins University School of Medicine, Baltimore, MD, USA.
  • Funding Sources
    • 5U01 HL107446. National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA.