The past decade has seen a vast increase in the amount of data available to biologists, driven by the dramatic decrease in cost and concomitant rise in throughput of various next-generation sequencing technologies, such that a project unimaginable 10 years ago was recently proposed, the Earth BioGenomes Project, which aims to sequence the genomes of all eukaryotic species on the planet within the next 10 years. So while data are no longer limiting, accessing and interpreting those data has become a bottleneck. One important aspect of interpreting data is data visualization. This course introduces theoretical topics in data visualization through mini-lectures, and applied aspects in the form of hands-on labs. The labs use both web-based tools and R, so students at all computer skill levels can benefit. Syllabus may be viewed at https://tinyurl.com/DataViz4GenomeBio.
Data Visualization for Genome Biology
Instructor: Nicholas James Provart
2,431 already enrolled
Included with
(20 reviews)
Recommended experience
Details to know
Add to your LinkedIn profile
12 assignments
See how employees at top companies are mastering in-demand skills
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review
There are 6 modules in this course
In this module we'll cover 3 straightforward approaches for generating simple plots. As we'll see in the lab, often visualizing datasets can help us see the overall shape of the data that might not be captured in descriptive statistics like mean and standard deviation. Plotting datasets is also a useful way to identify outliers. In the mini-lectures we go over some common biological data visualization paradigms and more generally what the common chart types are, and we also talk about the context and grammar of data visualization.
What's included
5 videos5 readings2 assignments1 ungraded lab
In this week's module we explore ways of displaying biological variation and a little bit of background about track viewers. We also cover visual perception, Gestalt principles, and issues related to colour perception, important for accessibility-related reasons. In the lab we'll use an online app, PlotsOfDifferences, to generate some charts that display variation nicely, and we'll also use R to generate some box plots, histograms, and violin plots. Last but not least, we'll try adjusting some of the settings in JBrowse to help assess gene expression levels in a more intuitive manner. Thanks to Dr. Joachim Goedhart, University of Amsterdam, Netherlands for permission to use PlotsOfDifferences in the lab.
What's included
4 videos4 readings2 assignments1 ungraded lab
In this week's module we explore ways of visualizing gene expression data after briefly covering how we can measure gene expression levels with RNA-seq and identify significantly differentially expressed genes using statistical tests. We also cover design thinking. In the lab we'll use an online platform, Galaxy, to generate a volcano plot for visualizing significantly differentially expressed genes, and we'll also use R to generate some heatmaps of gene expression. Last but not least, we'll create our own "electronic fluorescent pictographs" for a gene expression data set.
What's included
3 videos3 readings2 assignments1 ungraded lab
In this week's module we cover how the Gene Ontology can be used to make sense of often overwhelmingly long lists of genes from transcriptomic and other kind of 'omic experiments, especially through Gene Ontology enrichment analyses. We'll also look at Agile Development and User Testing and how these can help improve data visualization tools. In the lab, we'll try our hand at 3 online Gene Ontology analysis apps, and create some nice overview charts for GO enrichment results in R. Thanks to Dr. Roy Navon, Technion University, Israel, for permission to use GOrilla in the lab. Thanks to Dr. Juri Reimand of the University of Toronto for permission to use g:Profiler. And thanks to Dr. Zhen Su of the China Agricultural University for permission to use AgriGO.
What's included
3 videos3 readings2 assignments1 ungraded lab
In this week's module, we explore tools for displaying and analyzing graph networks, notably those created when we generate protein-protein interactions, especially in a high-throughput manner. These PPIs are deposited in online databases like BioGRID, and can be retrieved on-the-fly via web services for display in powerful network visualization apps like Cytoscape. We'll talk about other web services/APIs that are available for biology in one of the mini-lectures, and in the lab we'll use Cytoscape to explore interactors of BRCA2. We'll also use a plug-in called BiNGO to do Gene Ontology enrichment analyses of its interactors, continuing our exploration of GO that we started last week. Last, we'll try using D3 to display an interaction network in a web page.
What's included
3 videos3 readings2 assignments
In this module we cover methods for generating and making sense of ever bigger biological data sets. The growth in sequencing capacity has enabled projects that we unimaginable even a few years ago, such as the Earth Biogenomes Project, which aims to sequence the genome of a representative of every eukaryotic species on the planet. In order to make sense of these large data sets, it is often useful to use dimentionality reduction methods, like t-SNE, PCA, and UMAP, to help visualize how similar samples are. Logic diagrams (Venn-Euler or Upset plots) are also useful for displaying how sets of genes are similar one to another. Thanks to Dr. Tim Hulsen (Philips Research, the Netherlands) for permission to use the DeepVenn app in the lab.
What's included
3 videos3 readings2 assignments1 ungraded lab
Instructor
Offered by
Recommended if you're interested in Data Analysis
Johns Hopkins University
Fred Hutchinson Cancer Center
Why people choose Coursera for their career
Learner reviews
20 reviews
- 5 stars
75%
- 4 stars
25%
- 3 stars
0%
- 2 stars
0%
- 1 star
0%
Showing 3 of 20
Reviewed on May 25, 2024
Great course, especially appreciated the UX design approach to data visualisation
Reviewed on Sep 10, 2024
Found it very informative. Glad to be aware now that there could be color-blind people present in the room.
New to Data Analysis? Start here.
Open new doors with Coursera Plus
Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription
Advance your career with an online degree
Earn a degree from world-class universities - 100% online
Join over 3,400 global companies that choose Coursera for Business
Upskill your employees to excel in the digital economy
Frequently asked questions
Access to lectures and assignments depends on your type of enrollment. If you take a course in audit mode, you will be able to see most course materials for free. To access graded assignments and to earn a Certificate, you will need to purchase the Certificate experience, during or after your audit. If you don't see the audit option:
The course may not offer an audit option. You can try a Free Trial instead, or apply for Financial Aid.
The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you purchase a Certificate you get access to all course materials, including graded assignments. Upon completing the course, your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile. If you only want to read and view the course content, you can audit the course for free.
You will be eligible for a full refund until two weeks after your payment date, or (for courses that have just launched) until two weeks after the first session of the course begins, whichever is later. You cannot receive a refund once you’ve earned a Course Certificate, even if you complete the course within the two-week refund period. See our full refund policy.