needed.packages <- c("tidyverse", "here", "functional", "gplots", "dplyr", "GeneOverlap", "R.utils", "reshape2","magrittr","data.table", "RColorBrewer","preprocessCore", "ARTool","emmeans", "phia", "gProfileR","rlist", "pheatmap","WGCNA")
for(i in 1:length(needed.packages)){library(needed.packages[i], character.only = TRUE)}
source(here("source_files","WGCNA_source.R"))
There were 48 samples (4 per time and treatment for each tissue). Mice were fed 2DG (6g/L) or nothing in their water for 96hrs or 2 weeks. WGCNA analysis was performed to determine gene clustering. A soft-thresholding power was chosen so that the scale-free topology correlation attempts to hit 0.9. The power chosen for hippocampus, hypothalamus, and pre-frontal cortex was 9, which allowed the correlations to hover around 0.88. This power clustered the genes into 195 modules originally, however, modules that were highly correlated (0.75 or higher) were further merged together to create 41 modules.
modules<-read.csv(here("Data","Brain","log.tdata.FPKM.sample.info.subset.hip.hyp.cortex.WGCNA.module.membership.csv"), header=T)
module_barplot(modules)
Pathways were assessed for each module. Out of 41 modules, only 28 modules had pathway hits of any kind. The pathways that had the highest frequency across modules were metabolism of proteins, RNA, and arachidonic acid. The brown3, cyan, darkgrey, green3, green4, honeydew, lavenderblush1, lightcoral, magenta1, and orangered4 modules all contained these pathways.
60 pathways were identified for the antiquewhite module. The pathways identified were involved in amino acid metabolism, cell signaling, neuronal signaling, insulin regulation, cancer, and immunity.
17 pathways were identified for the blanchedalmond module. The pathways identified were involved in neuronal signaling and calcium signaling.
38 pathways were identified for the brown3 module. The pathways identified were involved in metabolism (macro-nutrients, TCA cycle, mitochondrial metabolism, respiratory electron transport), thermogenesis, and neurodegenerative diseases.
102 pathways were identified for the cyan module. The pathways identified were involved in cell cycle, metabolism, translation, rRNA, ribosomal, and ubiquitin.
50 pathways were identified for the darkgrey module. The pathways identified were involved in cell cycle, neurodegenerative diseases, thermogenesis, RNA, and cellular responses to stress and external stimuli.
52 pathways were identified for the darkseagreen module. The pathways were related to cAMP, insulin signaling, interleukin-17, cell-cell interactions, MAPK, MyD88, p53, T cell receptor signaling, and TLR cascades.
7 pathways were identified for the deeppink2 module. The pathways identified were involved in signaling and osteoclast differentiation.
24 pathways were identified in the green3 module. The pathways were related to autoimmune diseases, mRNA, mitochondrial metabolism, rRNA, and translation.
5 pathways were identified in the green4 module. The pathways identified were involved in mRNA.
17 pathways were identified in the grey module. The pathways were related to immunity, complement, bile acids, gamma-carboxylation, steroid hormones, and phenylalanine metabolism.
22 pathways were identified in the honeydew module. The pathways were related to RNA polymerase II, mRNA, SUMOylation, and PERK.
183 pathways were identified in the lavenderblush1 module. The pathways related to APC, cellular responses, antigen processing-cross presentation, MHC I mediated antigen processing and presentation, DNA, metabolism, mitochondrial metabolism, nonsense mediated decay, p53, RNA polymerase I and II, RUNX3, rRNA, transcriptional regulation, thermogenesis, and TCR signaling.
3 pathways were identified in the lavenderblush2 module: fatty acids; incretin synthesis, secretion, and activation: and synthesis, secretion and inactivation of glucagon-like peptide-1 (GLP-1)
1 pathway (axon guidance) was identified in the lavenderblush3 module.
35 pathways were identified in the lightblue1 module. The pathways were related to beta oxidation, antigen processing and presentation, fatty acid metabolism, MET, extracellular matrix, mitochondrial metabolism, and amino acid metabolism.
42 pathways were identified in the lightcoral module. The pathways were related to amino acids, AMPK, fatty acid metabolism, gluconeogenesis, glycolysis, insulin resistance, metabolism, sodium/chloride transporters, PPAR signaling, and transport.
23 pathways were identified in the magenta1 module. The pathways were related to collagen, extracellular matrix, fatty acid metabolism, protein digestion and absorption, and arachidonic acid metabolism.
18 pathways were identified in the magenta3 module. The pathways were related to calcium and calmodulin.
5 pathways were identified in the magenta4 module. The pathways identified were related to RNA transcription and Herpes simplex virus 1 infection.
9 pathways were identified in the mediumorchid3 module. The pathways identified were related to potassium and calcium channels, and GLP-1 regulates insulin secretion.
43 pathways were identified in the medium orchid3 module. The pathways were related to axon guidance, growth hormone, MAPK signaling, thyroid hormone signaling, TLR signaling, VEGF, and osteoclast differentiation.
6 pathways were identified in the orange3 module. The pathways identified were related to muscle contraction.
2 pathways were identified in the orangered1 module. The pathways identified were herpes simplex virus 1 infection and ubiquitin mediated proteolysis.
16 pathways were identified in the orangered4 module. The pathways were related to keratan, complement, sodium/chloride transporter, platelet degranulation, and arachidonic acid metabolism.
3 pathways were identified in the plum module. The pathways identified were ether lipid metabolism, metabolic pathways, and sphingolipid metabolism.
62 pathways were identified in the royalblue module. The pathways were related to adaptive immunity, B cell receptor signaling, chemokine receptors, complement, cytokine-cytokine, NF-kappa B, NOD-like receptor signaling, TLR cascade, natural killer cells, and autoimmune disease.
14 pathways were identified in the thistle3 module. The pathways were related to defensins, digestion, immune system, neutrophil degranulation, NOD-like receptor signaling, and pancreatic secretion.
tdata.FPKM.sample.info <- readRDS(here("Data","20190406_RNAseq_B6_4wk_2DG_counts_phenotypes.RData"))
tdata.FPKM <- readRDS(here("Data","20190406_RNAseq_B6_4wk_2DG_counts_numeric.RData"))
log.tdata.FPKM <- log(tdata.FPKM + 1)
log.tdata.FPKM <- as.data.frame(log.tdata.FPKM)
log.tdata.FPKM.sample.info <- cbind(log.tdata.FPKM, tdata.FPKM.sample.info[,27238:27240])
log.tdata.FPKM.sample.info <- log.tdata.FPKM.sample.info %>% rownames_to_column() %>% filter(rowname != "A113") %>% column_to_rownames()
log.tdata.FPKM.subset <- log.tdata.FPKM[,colMeans(log.tdata.FPKM != 0) > 0.5]
log.tdata.FPKM.subset <- log.tdata.FPKM.subset %>% rownames_to_column() %>% filter(rowname != "A113") %>% column_to_rownames()
log.tdata.FPKM.sample.info.subset.hip.hypo.cortex <- log.tdata.FPKM.sample.info %>% rownames_to_column() %>% filter(Tissue %in% c("Hippocampus","Hypothanamus","Pre-frontal Cortex")) %>% column_to_rownames()
log.tdata.FPKM.subset <- subset(log.tdata.FPKM.sample.info.subset.hip.hypo.cortex, select = -c(Time,Treatment,Tissue))
WGCNA.pathway <-readRDS(here("Data","Brain","Chang_B6_96hr_4wk_gprofiler_pathway_annotation_list_hip_hyp_cortex_WGCNA.RData"))
Matched<-readRDS(here("Data","Brain","Annotated_genes_in_hip_hyp_cortex_WGCNA_Chang_B6_96hr_4wk.RData"))
module.names <- Matched$X..Module.
name <- str_split(module.names,"_")
samples <-c()
for(i in 1:length(name)){
samples[[i]] <- name[[i]][2]
}
name <- str_split(samples,"\"")
name <- unlist(name)
Treatment <- unclass(as.factor(log.tdata.FPKM.sample.info.subset.hip.hypo.cortex[,27238]))
Tissue <- unclass(as.factor(log.tdata.FPKM.sample.info.subset.hip.hypo.cortex[,27239]))
Time <- unclass(as.factor(log.tdata.FPKM.sample.info.subset.hip.hypo.cortex[,27237]))
Treat.Tissue <- paste0(Treatment,Tissue)
Treat.Time <- paste0(Treatment, Time)
Tissue.Time <- paste0(Tissue, Time)
Treat.Tissue.Time <- paste0(Treatment,Tissue, Time)
phenotype <- data.frame(cbind(Tissue, Treatment, Time, Treat.Tissue, Treat.Time, Tissue.Time, Treat.Tissue.Time))
nSamples <- nrow(log.tdata.FPKM.sample.info.subset.hip.hypo.cortex)
MEs0 <- read.csv(here("Data","Brain","log.tdata.FPKM.sample.info.subset.hip.hyp.cortex.WGCNA.module.eigens.csv"),header = T, row.names = 1)
name <- str_split(names(MEs0),"_")
samples <-c()
for(i in 1:length(name)){
samples[[i]] <- name[[i]][2]
}
name <- str_split(samples,"\"")
name <- unlist(name)
colnames(MEs0) <-name
MEs <- orderMEs(MEs0)
moduleTraitCor <- cor(MEs, phenotype, use = "p");
moduleTraitPvalue <- corPvalueStudent(moduleTraitCor, nSamples)
#sizeGrWindow(10,6)
# Will display correlations and their p-values
textMatrix = paste(signif(moduleTraitCor, 2), "\n(",
signif(moduleTraitPvalue, 1), ")", sep = "");
dim(textMatrix) = dim(moduleTraitCor)
# Display the correlation values within a heatmap plot
heat <- pheatmap(moduleTraitCor, main = paste("Module-trait relationships"), display_numbers = textMatrix,color=colorRampPalette(brewer.pal(n = 11, name = "Paired"))(10), cluster_rows = F, cluster_cols = F, fontsize_number = 4, angle_col = 45, number_color = "black", border_color = "white")
heat
The top 10 genes that were significant for tissue within the lavenderblush3 module and had the highest correlation with lavenderblush3 were:
When assessing the overall sample contribution through their summary eigengenes it appears that for darkslateblue all hippocampus had a positive effect while hypothalamus and prefrontal cortex had a negative effect. Hypothalumus and prefrontal cortex had under-expressed genes while hippocampus had over-expressed genes.
The top 10 genes that were significant for tissue within the chocolate3 module and had the highest correlation with chocolate3 were:
The top 10 genes that were significant for treatment within the royalblue module and had the highest correlation with royalblue were:
When assessing the overall sample contribution through their summary eigengenes it appears that for royalblue most controls had a negative effect while most 2DG had a positive effect. Controls had under-expressed genes while 2DG had over-expressed genes.
None
The top 10 genes that were significant for treatment by tissue within the royalblue module and had the highest correlation with royalblue were:
When assessing the overall sample contribution through their summary eigengenes it appears that for royalblue most controls had a negative effect while most 2DG had a positive effect, independent of tissue. Controls had under-expressed genes while 2DG had over-expressed genes.
The top 10 genes that were significant for treatment by time within the royalblue module and had the highest correlation with royalblue were:
When assessing the overall sample contribution through their summary eigengenes it appears that for royalblue most controls had a negative effect while most 2DG had a positive effect, independent of time. Controls had under-expressed genes while 2DG had over-expressed genes.
The top 10 genes that were significant for tissue by time within the lavenderblush3 module and had the highest correlation with lavenderblush3 were:
The top 10 genes that were significant for tissue by time within the chocolate3 module and had the highest correlation with chocolate3 were:
The top 10 genes that were significant for treatment by tissue by time within the royalblue module and had the highest correlation with royalblue were:
When assessing the overall sample contribution through their summary eigengenes it appears that for royalblue most controls had a negative effect while most 2DG had a positive effect, independent of tissue and time. Controls had under-expressed genes while 2DG had over-expressed genes.
Analysis performed by Ann Wells
The Carter Lab The Jackson Laboratory 2023
ann.wells@jax.org