Visualisation and Data: Analysing cell data for Jim 20141021

Tuesday, 21 October 2014

Ok, so I have been developing some R-scripts to play with Jim’s cellular data.

The basic data is made up of 57,290 observations of 18 variables.

It is currently imported as a data.frame called “cells”.

I reduced the number of columns in the dataframe to those I believed are the key.

CODE:

#reduce the number of columns in the dataframe

#I have selected these as the Key variables

keyvar <- c('Section','Target','WC.area','WC.488_Int', 'WC.546_Int', 'Nuc.647_Int' )

# This command creates a dataframe that is a subset with fewer columns

subset <- cells[,keyvar]

This reduces the data to just six columns.

Next step is to split the data into individual treatments.

I did this using a map of the plate, prepared in Excel and saved as a csv file.

N.B. Need to be careful to have a space after “A - 1 “ - otherwise first sample includes “A - 10”, “A - 11” and “A - 12”

There should be a way to do this automatically for all of the conditions but I currently have to do some of this by hand.

Arghh - There has to be a loop or some kind of control structure to do this!!!

I need to remove missing values from each of the data sets…

Then draw the density curves and plot them so that they can be output as a nice looking PDF.

Script Name: AllDensityPlots20141021.R

USING ggplot

So the next step is to do the dot plots.

I can get this to work well with the ppErk and the HA staining.

However, the EdU staining is more awkward and I am going to have to ask Jim how he did that.

Script Name: DotPlots20141021.R

Visualisation and Data