First steps into reproducible bioinformatics

Seminar at MBIM, Nov 30, 2023

Stephan König

Skylight, University of British Columbia

Mini lesson:
Reproducible reporting of computational analyses

Learning outcomes

  • Describe why combining natural language and code enables reproducible analysis and research.

  • Discover how symbols and text convey formatting and the structure of a document.

Microbiomes at different lake depths

Sequence barcode marker of each sample

Group similar sequences
defining
amplicon sequence variants (ASVs)
ASV ≅ species

ASV counts table

Species richness at 10 m depth

Table 1: ASV counts
name depth_10_m depth_20_m
ASV_1 28 14
ASV_2 5 12
ASV_3 0 3
ASV_4 14 41
ASV_5 18 28
asv_counts |> 
  select(depth_10_m) |>
  filter(depth_10_m >= 1) |> 
  nrow()
[1] 4

How to report the results?

Your Turn: Think-pair-share

How would ensure the correct reporting of your analysis?

  • Think for one minute about possible solutions by yourself.
  • Then discuss with your neighbour your solution.

02:00

Do you want your grades to be reproducible?

Your Turn: Worksheet

  • Review the report of a fellow student.
  • Discover how symbols and text structure the document.

Plain text

analysis.qmd
---
title: "Species richness analysis"
format: typst
---

# Identifying species in microbial communities

To describe the species found in a microbial community, we isolate the DNA of a sample and then amplify and sequence a suitable barcoding marker, typically the 16S rRNA gene. We then group all similar sequences into so-called **amplicon sequence variants (ASV)**. For our purposes here, we assume that *each ASV corresponds to a bacterial species*. By counting the number of each ASV in each sample, we get an ASV counts table (@tbl-asv-counts).

```{r}
#| label: tbl-asv-counts
#| echo: false
#| tbl-cap: "ASV counts of samples collected at different depths of a lake."

suppressMessages(library(dplyr))

asv_counts <- read.csv("asv-counts.csv")
asv_counts
```

PDF

Your Turn: Worksheet

  • Take about 2 minute to highlight differences between plain text and the PDF output (exercise 1).
  • Compare your results with your neighbour and work with them on exercise 2–4.

05:00

Markdown

  • Simple expressive formatting with symbols and text.
  • Used in many apps and services, for example .
Markdown syntax

# Heading 1

## Heading 2

**strong**
*emphasized*

How Quarto works

index.qmd
---
title: "MICB 2XX: Data science teaching demo"
subtitle: "Dr. Stephan Koenig"
execute:
  freeze: false
---

```{r}
#| label: setup
#| echo: false

library(conflicted)
library(dplyr)
library(fontawesome)
library(fs)

Next lab: Create your own website!

  • Build a professional ePortfolio online.
  • Reflect on and record your growth across the MICB program.
  • Demonstrate your coding and collaboration skills.
  • Showcase your work as a developing microbiologist and immunologist to future employers.

Universal Design for Learning:
Embracing open source

Representation

  • Open source tools support alt text for images.
  • Easy to provide headings and descriptive hyperlinks.
  • Support for different file formats and integrating other media.

Engagement

  • Follow the mastery learning model, i.e., frequent practice, assessment and feedback.
  • Use online assessment platform (e.g., PrairieLearn) to support mastery learning at scale.
    • Academic integrity by randomization.
    • Use questions in practices and exams.

Engagement

  • Use authentic examples and assessments (for example, ePortfolio).
  • Use relevant, widespread technologies and industry standards, such as HTML, CSS, Git and GitHub, in a framework that can be adapted to any bioinformatics pipeline.
  • Model inclusive practices in the way course is run and materials are provided.

Action and expression

  • Alternative ways of participating, e.g., discussion section on course website.
  • Increased expression and communication (open source platforms offer a lot of flexibility).
  • Accommodations supported by online assessment platforms.

We are not alone: UDL community of practice

Universal design for learning fellows program—2024 cohort.

Teaching philosphy: Lifelong learner

Acknowlegements

  • Tracy Kion and the search committee
  • Hannah Wong
  • Karen Smith