Gene expression in stem cells
This dataset and content is provided our by our friends at Data Nuggets. Visit DataNuggets.org to see the original activity and additional materials
Featured scientist: Adam Heck from Colorado State University
Background:
Every cell in your body contains the same DNA. How is it that genetically identical skin, brain, and muscle cells can look very different and perform very different functions from each other? Cells differentiate, or become different from one another, by turning certain genes on and off. This process is called gene expression. For example, when you spend time in the sun your skin cells turn on the gene for pigment, which protects your cells from bright sunlight. In the winter when there is less sunlight, your cells turn off this gene. This process your body uses to turn genes on and off is the same one it uses to develop from one cell into the many different cell types that make up your body. Stem cells have the ability to turn into any other type of cell in the body, an ability known as pluripotency. Your body retains some stem cells for your entire life.
Some genes are only turned on in specific types of cells because they have specialized jobs for those cell types, like muscle or brain cells. Other genes are more like managers, controlling which genes are turned on and off. The activity of these manager genes may be more common in stem cells because they could control which type of cell the stem cell will become. In recent years, scientists discovered they could reprogram specialized cells back into non-specialized stem cells, simply by turning on several manager genes. They call these reprogrammed cells induced pluripotent, or iPS.
Adam was working as a biologist in Colorado when he learned that many cool medical advances in regenerative and personalized medicine will happen when we figure out which genes are turned on, and which are turned off, in pluripotent stem cells. In his research, Adam wanted to look at gene expression for two genetically identical cell lines, those that have specialized and those that have been reprogrammed to be iPS stem cells. He was interested to see which genes are expressed by both types of cells and which genes are only expressed in one type of cell.
He decided to work with fibroblast cells because they are easy to grow in the lab. Fibroblasts cells are mainly responsible for production and maintenance of the extracellular matrix (including joints, ligaments, tendons and connective tissues), which is critical in holding the body’s tissues together. From reading the work of other scientists, Adam learned how to transform fibroblast cells into iPS stem cells. This knowledge lead him to two genetically identical types of cells – (1) specialized fibroblast cells and (2) unspecialized iPS cells. When fibroblast cells are transformed into unspecialized iPS cells, their function changes and they become responsible for wound healing and generating new tissues, acting like a reserve set of cells. Because fibroblast and iPS cells perform very different functions, Adam thinks it is likely that each cell line will expresses genes that are specific to its individual function.
Adam looked at expression in 10 different genes that are thought to have important functions for fibroblast or iPS cells. Adam measured the expression for each gene by looking at RNA abundance of each gene in the different cell types. RNA is the intermediate between DNA (the genetic blueprint) and protein (the functional worker of the cell). Adam chose to look at RNA, because it is often representative of how much protein is present in a cell, which is very difficult to measure directly. Adam analyzed three replicates for each cell type. He replicated in order to get a more accurate representation for each cell type. This is important in case the samples were in slightly different conditions, like warmer or cooler temperatures, which could alter gene expression. This experiment allowed Adam to figure out which genes are turned on in iPS cells, allowing him to better understand how stem cells work.
Scientific Question: How does gene expression differ between human fibroblasts and induced pluripotent stem cells (iPS)?
Scientific Data:
Click on the graph tab, and explore the available variables. What data will you graph to explore the first hypothesis?
Independent variable:
Dependent variable:Visualize gene expression levels for the 10 genes in fibroblasts and iPSc. Create your graph and paste below: You may have to group the data by the Z-variable
3. Since Adam evaluated gene expression across multiple samples, we might want to look at average gene expression levels across all genes in each cell type. To plot the average values, select the descriptive statistics tab to visualize mean and SD of the distribution. Then hide dots and make sure the mean and SD is visualized as a bar by changing the options in the appearance tab.
4. Identify any changes, trends, or differences you see in your graph. Include your graph and specifically refer to it when describing those changes, trends, or differences.
Interpret the Data:
5. Make a claim that answers the scientific question.
6. What evidence was used to write your claim? Reference specific parts of the tables or graph.
7. Explain your reasoning and why the evidence supports your claim. Connect the data back to what you learned about the role of gene expression in cell differentiation.
Your next steps as a scientist:
7. Science is an ongoing process. What new question do you think should be investigated? What future data should be collected to answer your question?
These questions are a digital extension of the original Data Nuggets activity. The data manipulation and graphing tasks within are best completed here on DataClassroom.
9. Although we can see that expression levels vary across different genes and cell types, there are ways to test whether these differences are meaningful or not. Based on the data plotted in Q.3 and 4 what test of significance would you conduct? Explain why that test is appropriate and paste the results below. What do you find?
10. Plot a graph below that describes overall distribution of gene expression levels regardless of cell type or gene type, along with both mean and median based statistics (mean +- SD and box and whisker plots). Describe what you see in the data. You may change the bin width according to your preference to see detailed information.
11. How will you test if this data is normally distributed? Paste results from your test here and explain what it means.
12. Now compare patterns of gene expression between fibroblasts and pluripotent cells, Plot the appropriate graph, group by the Z variable along with descriptive statistics such as median as well as the mean with 95% Confidence intervals and paste it below and describe what you find.
13. Typically when we have significant interaction effects in the data as might be the case in Q9, it is difficult to interpret the main effects of each category on the variable of interest. To demonstrate this, conduct a test of significance but this time considering only one variable at a time (for eg: gene or only cell type) as an explanatory variable. Pick an appropriate test, paste the results below and explain how this may differ from results found in Q9. Remember to choose non-parametric tests in case the data is non-normally distributed and conduct post-hoc tests where appropriate.
14. Based on the following table of gene function associated with the annotated genes, explain how fibroblasts and iPS cells may differ in their functionality based on expression level differences.
Want an Answer Key? Fill out the form below.
This dataset and content is provided our by our friends at Data Nuggets.
Visit DataNuggets.org to see the original activity and additional materials