Introduction and Data files

This dataset contains nine tissues (heart, hippocampus, hypothalamus, kidney, liver, prefrontal cortex, skeletal muscle, small intestine, and spleen) from C57BL/6J mice that were fed 2-deoxyglucose (6g/L) through their drinking water for 96hrs or 4wks. 96hr mice were given their 2DG treatment 2 weeks after the other cohort started the 4 week treatment. The organs from the mice were harvested and processed for metabolomics and transcriptomics. The data in this document pertains to the transcriptomics data only. The counts that were used were FPKM normalized before being log transformed. It was determined that sample A113 had low RNAseq quality and through further analyses with PCA, MA plots, and clustering was an outlier and will be removed for the rest of the analyses performed. This document will determine the contribution of each main effect or their combination to each pathway identified, across modules.

needed.packages <- c("tidyverse", "here", "functional", "gplots", "dplyr", "GeneOverlap", "R.utils", "reshape2","magrittr","data.table", "RColorBrewer","preprocessCore", "ARTool","emmeans", "phia", "gProfileR", "WGCNA","plotly", "pheatmap", "pander", "GSVA", "kableExtra","downloadthis","DT")
for(i in 1:length(needed.packages)){library(needed.packages[i], character.only = TRUE)}

source(here("source_files","WGCNA_source.R"))
source(here("source_files","plot_theme.R"))

tdata.FPKM.sample.info <- readRDS(here("Data","20190406_RNAseq_B6_4wk_2DG_counts_phenotypes.RData"))

tdata.FPKM <- readRDS(here("Data","20190406_RNAseq_B6_4wk_2DG_counts_numeric.RData"))

log.tdata.FPKM <- log(tdata.FPKM + 1)
log.tdata.FPKM <- as.data.frame(log.tdata.FPKM)

log.tdata.FPKM.sample.info <- cbind(log.tdata.FPKM, tdata.FPKM.sample.info[,27238:27240])

log.tdata.FPKM.sample.info <- log.tdata.FPKM.sample.info %>% rownames_to_column() %>% filter(rowname != "A113") %>% column_to_rownames()

log.tdata.FPKM.subset <- log.tdata.FPKM[,colMeans(log.tdata.FPKM != 0) > 0.5] 

log.tdata.FPKM.sample.info.subset <- cbind(log.tdata.FPKM.subset,tdata.FPKM.sample.info[,27238:27240])
log.tdata.FPKM.sample.info.subset <- log.tdata.FPKM.sample.info.subset %>% rownames_to_column() %>% filter(rowname != "A113") %>% column_to_rownames()

log.tdata.FPKM.sample.info.subset.kidney <- log.tdata.FPKM.sample.info.subset %>% rownames_to_column() %>% filter(Tissue == "Kidney") %>% column_to_rownames()

log.data.FPKM.subset.kidney <- log.tdata.FPKM.sample.info.subset.kidney %>%
  dplyr::select(-c("Time", "Tissue", "Treatment")) %>%
  t()

modules <- readRDS(here("Data","Kidney","log.tdata.FPKM.sample.info.subset.kidney.WGCNA.module.membership.RData"))

Gene Set Variation Analysis

Gene Set Variation Analysis is a non-parametric, unsupervised method that assesses gene set enrichment. The method is performed using the R package GSVA. It is performed without phenotypes, which allows for complex experimental studies to be assessed for pathway enrichment. Gene sets can be provided or pulled down from websites that store pre-defined gene sets. The gene sets used for this dataset were determined by assessing pathways present within each module through WGCNA. To obtain these pathways WGCNA was performed to cluster genes into modules. Overrepresented pathways were identified within each module using gProfiler2. To assess enrichment of pathways, Gene set variation analysis (GSVA) was performed. Genes of interest need to be compared to a gene set. To create this gene set, a complete ensembl gene list was assessed using gProfiler2 to identify pathways, regardless of significance. Each set of genes within each module was then assessed against the gene set to perform GSVA.

ANOVA

Aligned rank transformation was used to determine which factors significantly altered each pathway. If a factor or interaction with more than two factors is identified as significant then contrast statements are performed using emmeans() or testInteraction() to identify which factors are significantly different.

Blue

print.gsva.table(w = 1, gsva.table.list$table.module, gsva.table.list$tES)

Cyan

print.gsva.table(w = 2, gsva.table.list$table.module, gsva.table.list$tES)

Darkgreen

print.gsva.table(w = 3, gsva.table.list$table.module, gsva.table.list$tES)

Darkgrey

print.gsva.table(w = 4, gsva.table.list$table.module, gsva.table.list$tES)

Darkred

print.gsva.table(w = 5, gsva.table.list$table.module, gsva.table.list$tES)

Darkturquoise

print.gsva.table(w = 6, gsva.table.list$table.module, gsva.table.list$tES)

Greenyellow

print.gsva.table(w = 7, gsva.table.list$table.module, gsva.table.list$tES)

Grey60

print.gsva.table(w = 8, gsva.table.list$table.module, gsva.table.list$tES)

Lightcyan

print.gsva.table(w = 9, gsva.table.list$table.module, gsva.table.list$tES)

Lightgreen

print.gsva.table(w = 10, gsva.table.list$table.module, gsva.table.list$tES)

Lightyellow

print.gsva.table(w = 11, gsva.table.list$table.module, gsva.table.list$tES)

Magenta

print.gsva.table(w = 12, gsva.table.list$table.module, gsva.table.list$tES)

Midnightblue

print.gsva.table(w = 13, gsva.table.list$table.module, gsva.table.list$tES)

Pink

print.gsva.table(w = 14, gsva.table.list$table.module, gsva.table.list$tES)

Purple

print.gsva.table(w = 15, gsva.table.list$table.module, gsva.table.list$tES)

Red

print.gsva.table(w = 16, gsva.table.list$table.module, gsva.table.list$tES)

Salmon

print.gsva.table(w = 17, gsva.table.list$table.module, gsva.table.list$tES)

Analysis performed by Ann Wells

The Carter Lab The Jackson Laboratory 2023

ann.wells@jax.org

Gene Set Variation Analysis Kidney

Ann Wells

April 23, 2023