Molecular case studies (MCSs) provide educational opportunities to explore biomolecular structure and function using data from public bioinformatics resources. The conceptual basis for the design of MCSs has yet to be fully discussed in the literature, so we present molecular storytelling as a conceptual framework for teaching with case studies. Whether the case study aims to understand the biology of a specific disease and design its treatments or track the evolution of a biosynthetic pathway, vast amounts of structural and functional data, freely available in public bioinformatics resources, can facilitate rich explorations in atomic detail. To help biology and chemistry educators use these resources for instruction, a community of scholars collaborated to create the Molecular CaseNet. This community uses storytelling to explore biomolecular structure and function while teaching biology and chemistry. In this article, we define the structure of an MCS and present an example. Then, we articulate the evolution of a conceptual framework for developing and using MCSs. Finally, we related our framework to the development of technological, pedagogical, and content knowledge (TPCK) for educators in the Molecular CaseNet. The report conceptualizes an interdisciplinary framework for teaching about the molecular world and informs lesson design and education research.
As high-throughput methods become more common, training undergraduates to analyze data must include having them generate informative summaries of large datasets. This flexible case study provides an opportunity for undergraduate students to become familiar with the capabilities of R programming in the context of high-throughput evolutionary data collected using macroarrays. The story line introduces a recent graduate hired at a biotech firm and tasked with analysis and visualization of changes in gene expression from 20,000 generations of the Lenski Lab’s Long-Term Evolution Experiment (LTEE). Our main character is not familiar with R and is guided by a coworker to learn about this platform. Initially this involves a step-by-step analysis of the small Iris dataset built into R which includes sepal and petal length of three species of irises. Practice calculating summary statistics and correlations, and making histograms and scatter plots, prepares the protagonist to perform similar analyses with the LTEE dataset. In the LTEE module, students analyze gene expression data from the long-term evolutionary experiments, developing their skills in manipulating and interpreting large scientific datasets through visualizations and statistical analysis. Prerequisite knowledge is basic statistics, the Central Dogma, and basic evolutionary principles. The Iris module provides hands-on experience using R programming to explore and visualize a simple dataset; it can be used independently as an introduction to R for biological data or skipped if students already have some experience with R. Both modules emphasize understanding the utility of R, rather than creation of original code. Pilot testing showed the case study was well-received by students and faculty, who described it as a clear introduction to R and appreciated the value of R for visualizing and analyzing large datasets.