Welcome

Welcome to the Course Website for EN.580.428 Genomic Data Visualization!

As the primary mode through which analysts and audience members alike consume data, data visualization remains an important hypothesis generating and analytical technique in data-driven research to facilitate new discoveries. However, if done poorly, data visualization can also mislead, bias, and slow down progress. This hands-on course will cover the principles of perception and cognition relevant for data visualization and apply these principles to genomic data, including large-scale spatially-resolved omics datasets, using the R statistical programming language. Students will be expected to complete class readings, create weekly data visualizations as homework assignments, and make a major class presentation.

Course Information

Course Staff: Prof. Jean Fan and Suki
Lectures: 8:00am-9:50am Monday, Wednesday, and Friday. See Canvas for location details.
Office Hours: 10:00am-10:50am Monday, Wednesday, and by request. See Canvas for location details.

Course Details
☞ see Course tab


All Visualizations

Influence of Gene Mean and Gene Variance on PC1

1. What data types are you visualizing? I am visualizing quantitative data of: 1) the mean expression of each gene, averaged across all of all spatial spots within the data...

Comparing High and Low Loading Genes Across Spatial and PCA Spaces

Write a description explaining what you are trying to make salient and why you believe your data visualization is effective, using vocabulary terms from Lesson 1. (Question 2: How do...

HW 2 How Gene Loadings on the First PC Relates to Mean and Variance

1. What data types are you visualizing? I am visualizing quantitative data for the genes, including the PC1 loadings, mean expression per gene, and variance per gene. I also visualized...

HW 2

###Summary PC1 loadings correlate positively with both mean expression (r = 0.52-0.58) and variance (r = 0.44-0.48). This indicates PC1 primarily captures overall expression magnitude - highly expressed genes dominate...

Impact of Principal Component Selection on t-SNE Coordinates

1. What about the data would you like to make salient?

HW2

1. Write a description explaining what you are trying to make salient and why you believe your data visualization is effective using vocabulary terms from Lesson 1.

How do tSNE coordinates change as increasing the number of PCs?

1. What data types are you visualizing? I am answering how do tSNE coordinates change as increasing the number of PCs. I computed PCA on the log-transformed, normalized gene expression...

How PC1 Gene Loadings Relate to Gene Mean Expression and Variance

1. What data types are you visualizing? I am visualizing both categorical and quantitative data. The categorical data is the set of gene names in the left panel, since each...

HW2

external resources: prompt: make setwd as output path, improve the layout structure. 1. What data types are you visualizing? I visualize (1) quantitative gene-level data: PC1 loadings, mean expression, and...

HW1

1. What data types are you visualizing? I am plotting 2D spatial coordinates and quantitative data through ACOX1 expression counts.

A data visualization for sequencing spatial transcriptomics data

1. What data types are you visualizing? Categorical- Gene type Aqp1, Slc34a1 (in the second plot) Quantitative data- Expression levels of genes Aqp1 and Slc34a1 in each cell The euclidean...

HW1

1. What about the data would you like to make salient through this data visualization? I wanted to make the total level of fibrosis-related gene expression for each spot apparent...