-
Notifications
You must be signed in to change notification settings - Fork 4
/
index.Rmd
64 lines (38 loc) · 2.49 KB
/
index.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
---
title: "Introduction to R for Cancer Scientists"
---
# Introduction
This site contains the materials for an R course run by the Bioinformatics Core
at the Cancer Research UK Cambridge Institute.
(Contributions : Matthew Eldridge, Chandra Chilamakuri, Mark Fernandes, Ashley Sawle, Kamal Kishore, Sergio Martinez Cuesta, Rory Stark)
October 2021
### Instructors
* Chandra Chilamakuri
* Mark Fernandes
# Description
**R** is one of the leading programming languages in **Data Science** and the
most widely used within CRUK CI for interacting with, analyzing and visualizing
cancer biology datasets.
In this training course, we aim to provide a friendly introduction to R pitched
at a beginners level but also for those who have been on R training courses
previously and would like a refresher or to consolidate their skills.
The course will be run over **5 sessions** with the following structure:
* **Each session lasts between 45 to 90 minutes
* the instructor will share his/her screen featuring an RStudio window
* More **in-depth material** covering the concepts introduced in the lesson to go through in your own time
* An **assignment** consisting of exercises to practice some of the concepts covered in that and previous lessons
# Schedule
0. [Getting set up](week0.html) - installing R and RStudio
1. [Introduction to R](week1.html) - Interacting with R using RStudio and introducing objects, data types and functions
2. [Working with data](week2.html) - Creating R scripts, working with tabular data and other types of objects in R, reading data into R
3. [Data visualization with ggplot2](week3.html) - A common grammar to create scatter plots, bar charts, boxplots, histograms and line graphs for time series data
4. [Data manipulation using dplyr](week4.html) - Filtering and modifying tabular data, computing summary values, faceting with ggplot2
5. [Grouping and combining data](week5.html) - Advanced grouping and summarization operations, joining data from different tables, customizing ggplot2 plots
6. [Restructuring data for analysis](week6.html) - The concept of 'tidy data', pivoting and separating operations, ggplot2 extras
7. **Option Capstone project** – you should try putting it all together in a typical data analysis including:
* reading in a data set
* handling missing values
* selecting and filtering subsets of interest
* creating plots
* generating summary statistics
* saving data transformed into a tidy format as a csv file for later analysis