Onecut3 in mouse hypothalamus development

Last updated: 2022-10-04

Checks: 7 0

Knit directory: Zupancic_2022/

This reproducible R Markdown analysis was created with workflowr (version 1.7.0). The Checks tab describes the reproducibility checks that were applied when the results were created. The Past versions tab lists the development history.

R Markdown file: up-to-date

Great! Since the R Markdown file has been committed to the Git repository, you know the exact version of the code that produced these results.

Environment: empty

Great job! The global environment was empty. Objects defined in the global environment can affect the analysis in your R Markdown file in unknown ways. For reproduciblity it’s best to always run the code in an empty environment.

Seed: set.seed(20220105)

The command set.seed(20220105) was run prior to running the code in the R Markdown file. Setting a seed ensures that any results that rely on randomness, e.g. subsampling or permutations, are reproducible.

Session information: recorded

Great job! Recording the operating system, R version, and package versions is critical for reproducibility.

Cache: none

Nice! There were no cached chunks for this analysis, so you can be confident that you successfully produced the results during this run.

File paths: relative

Great job! Using relative paths to the files within your workflowr project makes it easier to run your code on other machines.

Repository version: 7f4395e

Great! You are using Git for version control. Tracking code development and connecting the code version to the results is critical for reproducibility.

The results in this page were generated with repository version 7f4395e. See the Past versions tab to see a history of the changes made to the R Markdown and HTML files.

Note that you need to be careful to ensure that all relevant files for the analysis have been committed to Git prior to generating the results (you can use wflow_publish or wflow_git_commit). workflowr only checks the R Markdown file, but you know if there are other scripts or data files that it depends on. Below is the status of the Git repository when the results were generated:


Ignored files:
    Ignored:    .Rproj.user/
    Ignored:    data/oc3_fin.h5seurat
    Ignored:    data/rar2020.srt.cont.oc2or3.raw.h5seurat

Unstaged changes:
    Modified:   output/figures/stat-corr-plt_oc3-rna-data-Onecut3-Gad1_.pdf
    Modified:   output/figures/stat-corr-plt_oc3-rna-data-Onecut3-Gad2_.pdf
    Modified:   output/figures/stat-corr-plt_oc3-rna-data-Onecut3-Nav1_.pdf
    Modified:   output/figures/stat-corr-plt_oc3-rna-data-Onecut3-Nav2_.pdf
    Modified:   output/figures/stat-corr-plt_oc3-rna-data-Onecut3-Onecut2_.pdf
    Modified:   output/figures/stat-corr-plt_oc3-rna-data-Onecut3-Slc17a6_.pdf
    Modified:   output/figures/stat-corr-plt_oc3-rna-data-Onecut3-Slc32a1_.pdf
    Modified:   output/figures/stat-corr-plt_oc3-rna-data-Onecut3-Th_.pdf
    Modified:   output/figures/stat-corr-plt_oc3-rna-data-Onecut3-Trh_.pdf
    Modified:   output/figures/stat-corr-plt_oc3-rna-data-Onecut3-Trio_.pdf
    Modified:   output/figures/stat-corr-plt_oc3-rna-data-Slc32a1-Onecut3_.pdf
    Modified:   output/figures/stat-corr-plt_oc3-rna-data-Slc32a1-Th_.pdf
    Modified:   output/figures/stat-corr-plt_oc3-rna-data-Slc32a1-Trh_.pdf
    Modified:   output/figures/stat-corr-plt_oc3-rna-data-Th-Onecut3_.pdf
    Modified:   output/figures/stat-corr-plt_oc3-rna-data-Th-Trh_.pdf

Note that any generated files, e.g. HTML, png, CSS, etc., are not included in this status report because it is ok for generated content to have uncommitted changes.

These are the previous versions of the repository in which changes were made to the R Markdown (analysis/dynamic_oc3-pop.Rmd) and HTML (docs/dynamic_oc3-pop.html) files. If you’ve configured a remote Git repository (see ?wflow_git_remote), click on the hyperlinks in the table below to view the files as they were in that past version.

File	Version	Author	Date	Message
Rmd	7f4395e	Evgenii O. Tretiakov	2022-10-04	workflowr::wflow_publish("analysis/dynamic_oc3-pop.Rmd", update = T,
html	e928499	EugOT	2022-01-29	Build site.
html	da268b6	EugOT	2022-01-29	rebuild with fixed output device to png and pdf
Rmd	3049d38	EugOT	2022-01-29	fix output device to png and pdf
Rmd	b21b61a	EugOT	2022-01-29	save final subset of cells
html	b21b61a	EugOT	2022-01-29	save final subset of cells
html	fc91ebf	EugOT	2022-01-29	Build site.
Rmd	1df6d48	EugOT	2022-01-29	wflow_publish("*")
html	d106e47	EugOT	2022-01-29	Build site.
Rmd	6a77e95	EugOT	2022-01-29	wflow_publish("*")
html	6a77e95	EugOT	2022-01-29	wflow_publish("*")

# Load tidyverse infrastructure packages
library(here)
# here() starts at /home/etretiakov/src/Zupancic_2022
library(tidyverse)
# ── Attaching packages
# ───────────────────────────────────────
# tidyverse 1.3.2 ──
# ✔ ggplot2 3.3.6      ✔ purrr   0.3.4 
# ✔ tibble  3.1.8      ✔ dplyr   1.0.10
# ✔ tidyr   1.2.1      ✔ stringr 1.4.1 
# ✔ readr   2.1.2      ✔ forcats 0.5.2 
# ── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
# ✖ dplyr::filter() masks stats::filter()
# ✖ dplyr::lag()    masks stats::lag()
library(magrittr)
# 
# Attaching package: 'magrittr'
# 
# The following object is masked from 'package:purrr':
# 
#     set_names
# 
# The following object is masked from 'package:tidyr':
# 
#     extract
library(zeallot)
library(future)
# 
# Attaching package: 'future'
# 
# The following objects are masked from 'package:zeallot':
# 
#     %->%, %<-%

# Load packages for scRNA-seq analysis and visualisation
library(sctransform)
library(Seurat)
# Attaching SeuratObject
# Attaching sp
library(SeuratWrappers)
library(SeuratDisk)
# Registered S3 method overwritten by 'SeuratDisk':
#   method            from  
#   as.sparse.H5Group Seurat
library(UpSetR)
library(patchwork)
library(Nebulosa)

src_dir    <- here("code")
data_dir   <- here("data")
output_dir <- here("output")
plots_dir  <- here(output_dir, "figures")
tables_dir <- here(output_dir, "tables")

source(here(src_dir, "functions.R"))
source(here(src_dir, "genes.R"))

reseed <- 42
set.seed(seed = reseed)

# available cores
n_cores <- available_cores(prop2use = .5)
# Parameters for parallel execution
plan("multicore", workers = n_cores)
# Warning in supportsMulticoreAndRStudio(...): [ONE-TIME WARNING] Forked
# processing ('multicore') is not supported when running R from RStudio
# because it is considered unstable. For more details, how to control forked
# processing or not, and how to silence this warning in future R sessions, see ?
# parallelly::supportsMulticore
options(future.globals.maxSize = Inf,
        future.rng.onMisuse = "ignore")
plan()
# multicore:
# - args: function (..., workers = 16, envir = parent.frame())
# - tweaked: TRUE
# - call: plan("multicore", workers = n_cores)

Read data

rar2020_ages_all     <- c("E15", "E17", "P00", "P02", "P10", "P23")
rar2020_ages_postnat <-                      c("P02", "P10", "P23")
samples_df <- read_tsv(here("data/samples.tsv"))
# Rows: 8 Columns: 14
# ── Column specification ────────────────────────────────────────────────────────
# Delimiter: "\t"
# chr (8): sample, age, condition, fullname, name, sex, date, date10x
# dbl (6): ncells, libbatch, seqbatch, perfussed, nt, sn
# 
# ℹ Use `spec()` to retrieve the full column specification for this data.
# ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
colours_wtree <- setNames(read_lines(here(data_dir, "colours_wtree.tsv")),
                          1:45)

onecut  <- LoadH5Seurat(here(data_dir, "rar2020.srt.cont.oc2or3.raw.h5seurat"))
# Validating h5Seurat file
# Initializing RNA with data
# Adding counts for RNA
# Adding scale.data for RNA
# Adding variable feature information for RNA
# Adding reduction pca
# Adding cell embeddings for pca
# Adding feature loadings for pca
# Adding miscellaneous information for pca
# Adding reduction tsne
# Adding cell embeddings for tsne
# Adding miscellaneous information for tsne
# Adding reduction umap
# Adding cell embeddings for umap
# Adding miscellaneous information for umap
# Adding command information
# Adding cell-level metadata
# Adding miscellaneous information
# Adding tool-specific results
onecut3 <- subset(onecut, subset = Onecut3 > 0)

Derive and filter matrix of Onecut3

mtx_oc3 <-
    onecut3 %>%
    GetAssayData("data", "RNA") %>%
    as.data.frame() %>%
    t()
rownames(mtx_oc3) <- colnames(onecut3)

# Filter features
filt_low_genes <-
    colSums(mtx_oc3) %>%
    .[. > quantile(., 0.4)] %>%
    names()
mtx_oc3 %<>% .[, filt_low_genes]

min_filt_vector <-
    mtx_oc3 %>%
    as_tibble() %>%
    select(all_of(filt_low_genes)) %>%
    summarise(across(.fns = ~ quantile(.x, .1))) %>%
    as.list %>%
    map(as.double) %>%
    simplify %>%
    .[colnames(mtx_oc3)]

# Prepare table of intersection sets analysis
content_mtx_oc3 <-
    (mtx_oc3 > min_filt_vector) %>%
    as_tibble() %>%
    mutate_all(as.numeric)

Correlation analysis visualisation between different genes

p_corrs <- list(
    ggstatsplot::ggscatterstats(as.data.frame(mtx_oc3),
        x = Onecut3,  y = Trh, xfill = "#ffc400", yfill = "#e22ee2"),
    ggstatsplot::ggscatterstats(as.data.frame(mtx_oc3),
        x = Slc32a1,  y = Onecut3,  xfill = "#0000da",   yfill = "#ffc400"),
    ggstatsplot::ggscatterstats(as.data.frame(mtx_oc3),
        x = Th,  y = Onecut3,  xfill = "#006eff",   yfill = "#ffc400"),
    ggstatsplot::ggscatterstats(as.data.frame(mtx_oc3),
        x = Slc32a1,  y = Trh, xfill = "#0000da",   yfill = "#e22ee2"),
    ggstatsplot::ggscatterstats(as.data.frame(mtx_oc3),
        x = Th,  y = Trh, xfill = "#006eff",   yfill = "#e22ee2"),
    ggstatsplot::ggscatterstats(as.data.frame(mtx_oc3),
        x = Slc32a1,  y = Th,  xfill = "#0000da",   yfill = "#006eff"),
    ggstatsplot::ggscatterstats(as.data.frame(mtx_oc3),
        y = Slc32a1,  x = Onecut3,  yfill = "#0000da",   xfill = "#ffc400"),
    ggstatsplot::ggscatterstats(as.data.frame(mtx_oc3),
        y = Th,  x = Onecut3,  yfill = "#006eff",   xfill = "#ffc400"),
    ggstatsplot::ggscatterstats(as.data.frame(mtx_oc3),
        y = Slc17a6,  x = Onecut3,  yfill = "#ff0000",   xfill = "#ffc400"),
    ggstatsplot::ggscatterstats(as.data.frame(mtx_oc3),
        y = Gad1,  x = Onecut3,  yfill = "#a50202",    xfill = "#ffc400"),
    ggstatsplot::ggscatterstats(as.data.frame(mtx_oc3),
        y = Gad2,  x = Onecut3,  yfill = "#4002a5",    xfill = "#ffc400"),
    ggstatsplot::ggscatterstats(as.data.frame(mtx_oc3),
        y = Onecut2,  x = Onecut3,  yfill = "#6402a5",    xfill = "#ffc400"),
    ggstatsplot::ggscatterstats(as.data.frame(mtx_oc3),
        y = Nav1,  x = Onecut3,  yfill = "#2502a5",    xfill = "#ffc400"),
    ggstatsplot::ggscatterstats(as.data.frame(mtx_oc3),
        y = Nav2,  x = Onecut3,  yfill = "#4002a5",    xfill = "#ffc400"),
    ggstatsplot::ggscatterstats(as.data.frame(mtx_oc3),
        y = Trio,  x = Onecut3,  yfill = "#2502a5",    xfill = "#ffc400")
)
# Registered S3 method overwritten by 'ggside':
#   method from   
#   +.gg   ggplot2
n_corrs <- list(
    "oc3-rna-data-Onecut3-Trh",
    "oc3-rna-data-Slc32a1-Onecut3",
    "oc3-rna-data-Th-Onecut3",
    "oc3-rna-data-Slc32a1-Trh",
    "oc3-rna-data-Th-Trh",
    "oc3-rna-data-Slc32a1-Th",
    "oc3-rna-data-Onecut3-Slc32a1",
    "oc3-rna-data-Onecut3-Th",
    "oc3-rna-data-Onecut3-Slc17a6",
    "oc3-rna-data-Onecut3-Gad1",
    "oc3-rna-data-Onecut3-Gad2",
    "oc3-rna-data-Onecut3-Onecut2",
    "oc3-rna-data-Onecut3-Nav1",
    "oc3-rna-data-Onecut3-Nav2",
    "oc3-rna-data-Onecut3-Trio"
)

walk2(n_corrs, p_corrs, save_my_plot, type = "stat-corr-plt")
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
# `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

Visualise intersections sets that we are going to use (highlighted)

upset(as.data.frame(content_mtx_oc3),
  order.by = "freq",
  sets.x.label = "Number of cells",
  text.scale = c(2, 1.6, 2, 1.3, 2, 3),
  nsets = 15,
  sets = c("Gad1", "Gad2", "Slc32a1", "Slc17a6"),
  queries = list(
    list(
      query = intersects,
      params = list("Gad1", "Gad2", "Slc32a1"),
      active = T
    ),
    list(
      query = intersects,
      params = list("Slc17a6"),
      active = T
    )
  ),
  nintersects = 60,
  empty.intersections = "on"
)

Version	Author	Date
da268b6	EugOT	2022-01-29
fc91ebf	EugOT	2022-01-29
6a77e95	EugOT	2022-01-29

upset(as.data.frame(content_mtx_oc3),
  order.by = "freq",
  sets.x.label = "Number of cells",
  text.scale = c(2, 1.6, 2, 1.3, 2, 3),
  nsets = 15,
  sets = c("Th", "Trh", "Slc32a1", "Slc17a6"),
  queries = list(
    list(
      query = intersects,
      params = list("Th", "Slc32a1"),
      active = T
    ),
    list(
      query = intersects,
      params = list("Trh", "Slc17a6"),
      active = T
    )
  ),
  nintersects = 60,
  empty.intersections = "on"
)

Regroup factor by stages for more balanced groups

onecut3$age %>% forcats::fct_count()
# # A tibble: 6 × 2
#   f         n
#   <fct> <int>
# 1 E15     156
# 2 E17     104
# 3 P00      77
# 4 P02      74
# 5 P10     174
# 6 P23       7
onecut3$stage <-
  onecut3$age %>%
  forcats::fct_collapse(`Embrionic day 15` = "E15",
                        `Embrionic day 17` = "E17",
                        Neonatal = c("P00", "P02"),
                        Postnatal = c("P10", "P23"))
onecut3$stage %>% forcats::fct_count()
# # A tibble: 4 × 2
#   f                    n
#   <fct>            <int>
# 1 Embrionic day 15   156
# 2 Embrionic day 17   104
# 3 Neonatal           151
# 4 Postnatal          181

Make subset of stable neurons

onecut3$gaba_status <-
  content_mtx_oc3 %>%
  select(Gad1, Gad2, Slc32a1) %>%
  mutate(gaba = if_all(.fns = ~ .x > 0)) %>%
  .$gaba

onecut3$gaba_occurs <-
  content_mtx_oc3 %>%
  select(Gad1, Gad2, Slc32a1) %>%
  mutate(gaba = if_any(.fns = ~ .x > 0)) %>%
  .$gaba

onecut3$glut_status <-
  content_mtx_oc3 %>%
  select(Slc17a6) %>%
  mutate(glut = Slc17a6 > 0) %>%
  .$glut

oc3_fin <-
  subset(onecut3,
    cells = union(
      WhichCells(onecut3,
        expression = gaba_status == TRUE & glut_status == FALSE),
      WhichCells(onecut3,
        expression = glut_status == TRUE & gaba_occurs == FALSE)))

Check contingency tables for neurotransmitter signature

oc3_fin@meta.data %>%
  janitor::tabyl(glut_status, gaba_status)
#  glut_status FALSE TRUE
#        FALSE     0  102
#         TRUE    92    0

By age

oc3_fin@meta.data %>%
  janitor::tabyl(age, gaba_status)
#  age FALSE TRUE
#  E15    29    9
#  E17    12    4
#  P00    12    6
#  P02    16    2
#  P10    21   81
#  P23     2    0

By stage

oc3_fin@meta.data %>%
  janitor::tabyl(stage, gaba_status)
#             stage FALSE TRUE
#  Embrionic day 15    29    9
#  Embrionic day 17    12    4
#          Neonatal    28    8
#         Postnatal    23   81

Make splits of neurons by neurotransmitter signature

oc3_fin$status <- oc3_fin$gaba_status %>%
  if_else(true = "GABAergic",
    false = "glutamatergic")
Idents(oc3_fin) <- "status"
SaveH5Seurat(
  object    = oc3_fin,
  filename  = here(data_dir, "oc3_fin"),
  overwrite = TRUE,
  verbose   = TRUE
)
# Warning: Overwriting previous file /home/etretiakov/src/Zupancic_2022/data/
# oc3_fin.h5seurat
# Creating h5Seurat file for version 3.1.5.9900
# Adding counts for RNA
# Adding data for RNA
# Adding scale.data for RNA
# Adding variable features for RNA
# No feature-level metadata found for RNA
# Adding cell embeddings for pca
# Adding loadings for pca
# No projected loadings for pca
# Adding standard deviations for pca
# No JackStraw data for pca
# Adding cell embeddings for tsne
# No loadings for tsne
# No projected loadings for tsne
# No standard deviations for tsne
# No JackStraw data for tsne
# Adding cell embeddings for umap
# No loadings for umap
# No projected loadings for umap
# No standard deviations for umap
# No JackStraw data for umap

## Split on basis of neurotrans and test for difference
oc3_fin_neurotrans <- SplitObject(oc3_fin, split.by = "status")

## Split on basis of age and test for difference
oc3_fin_ages       <- SplitObject(oc3_fin, split.by = "age")

DotPlots grouped by age

Expression of GABA receptors in GABAergic Onecut3 positive cells

DotPlot(object = oc3_fin_neurotrans$GABAergic,
        features = gabar,
        group.by = "age",
        cols = c("#adffff", "#0084ff"),
        col.min = -1, col.max = 1
) + RotatedAxis()

Version	Author	Date
da268b6	EugOT	2022-01-29
fc91ebf	EugOT	2022-01-29
6a77e95	EugOT	2022-01-29

Expression of GABA receptors in glutamatergic Onecut3 positive cells

DotPlot(object = oc3_fin_neurotrans$glutamatergic,
        features = gabar,
        group.by = "age",
        cols = c("#ffc2c2", "#ff3c00"),
        col.min = -1, col.max = 1
) + RotatedAxis()

Version	Author	Date
da268b6	EugOT	2022-01-29
fc91ebf	EugOT	2022-01-29
6a77e95	EugOT	2022-01-29

Expression of glutamate receptors in GABAergic Onecut3 positive cells

DotPlot(object = oc3_fin_neurotrans$GABAergic,
        features = glutr,
        group.by = "age",
        cols = c("#adffff", "#0084ff"),
        col.min = -1, col.max = 1
) + RotatedAxis()

Version	Author	Date
da268b6	EugOT	2022-01-29
fc91ebf	EugOT	2022-01-29
6a77e95	EugOT	2022-01-29

Expression of glutamate receptors in glutamatergic Onecut3 positive cells

DotPlot(object = oc3_fin_neurotrans$glutamatergic,
        features = glutr,
        group.by = "age",
        cols = c("#ffc2c2", "#ff3c00"),
        col.min = -1, col.max = 1
) + RotatedAxis()

Version	Author	Date
da268b6	EugOT	2022-01-29
fc91ebf	EugOT	2022-01-29
6a77e95	EugOT	2022-01-29

DotPlots grouped by stage