Welcome

Welcome to the Course Website for EN.580.428 Genomic Data Visualization!

As the primary mode through which analysts and audience members alike consume data, data visualization remains an important hypothesis generating and analytical technique in data-driven research to facilitate new discoveries. However, if done poorly, data visualization can also mislead, bias, and slow down progress. This hands-on course will cover the principles of perception and cognition relevant for data visualization and apply these principles to genomic data, including large-scale spatially-resolved omics datasets, using the R statistical programming language. Students will be expected to complete class readings, create weekly data visualizations as homework assignments, and make a major class presentation.

Course Information

Course Staff: Prof. Jean Fan and Suki
Lectures: 8:00am-9:50am Monday, Wednesday, and Friday. See Canvas for location details.
Office Hours: 10:00am-10:50am Monday, Wednesday, and by request. See Canvas for location details.

Course Details
☞ see Course tab


All Visualizations

HW3

Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been doing). Write a description to...

HW 2

1. How do the gene loadings on the first PC relate to features of the genes such as its mean or variance? For context, the data was log-transformed, but not...

HW2 Submission

2. How do the genes with high versus low loadings relate to each other? How are they patterned relative to each other in the tissue? Genes with high loadings on...

A multipanel data visualization of PC1 gene loadings against gene features

1. What data types are you visualizing? Categorical- PCA unscaled and PCA scaled Quantitative data- PC1 gene loading Gene expression variability (variance) Mean gene expression

HW2

Question explored: “How do tSNE coordinates change as you increase or decrease the perplexity?”

Kidney PCA vs Loading Size Analysis(Prompt 2)

1. What data types are you visualizing?

Comparing high vs. low PC1 loading genes

Aim: How do the genes with high versus low loadings relate to each other? How are they patterned relative to each other in the tissue? Note: Panel numbers reference the...

Visualizing Spatial Gene Expression with PCA and tSNE

1. How do the gene loadings on the first PC relate to features of the genes such as its mean or variance? The first principal component captures the direction of...

HW2: How Distance Metric in tSNE affects Spatial Tissue Structure

1. What data types are you visualizing? Spatial data: X and Y coordinates representing physical tissue location Quantitative data: t-SNE embeddings (continuous numerical values) and euclidean distances to centroids Categorical...

Using PCA to visualize spatial patterns in high-dimensional gene expression within coronal kidney section

1. What data types are you visualizing? The represented data type is quantitative. I am visualizing the x and y spatial positions of cells in the coronal kidney section (all...

HW2

1. Write a description explaining what you are trying to make salient This visualization shows the expression of the five genes that contribute most positively and most negatively to PC1....

Spatial Organization of Genes with Extreme PCA Loadings

1. What data types are you visualizing? I’m visualizing both quantitative and categorical data. The dataset has quantitative spatial information of x and y coordinates for each spot in the...