Ok, so I have been developing some R-scripts to play with Jim’s cellular data.
The basic data is made up of 57,290 observations of 18 variables.
It is currently imported as a data.frame called “cells”.
I reduced the number of columns in the dataframe to those I believed are the key.
CODE:
#reduce the number of columns in the dataframe
#I have selected these as the Key variables
keyvar <- c('Section','Target','WC.area','WC.488_Int', 'WC.546_Int', 'Nuc.647_Int' )
# This command creates a dataframe that is a subset with fewer columns
subset <- cells[,keyvar]
This reduces the data to just six columns.
Next step is to split the data into individual treatments.
I did this using a map of the plate, prepared in Excel and saved as a csv file.
N.B. Need to be careful to have a space after “A - 1 “ - otherwise first sample includes “A - 10”, “A - 11” and “A - 12”
There should be a way to do this automatically for all of the conditions but I currently have to do some of this by hand.
Arghh - There has to be a loop or some kind of control structure to do this!!!
Next
I need to remove missing values from each of the data sets…
Then draw the density curves and plot them so that they can be output as a nice looking PDF.
Script Name: AllDensityPlots20141021.R
USING ggplot
So the next step is to do the dot plots.
I can get this to work well with the ppErk and the HA staining.
However, the EdU staining is more awkward and I am going to have to ask Jim how he did that.
Script Name: DotPlots20141021.R
No comments:
Post a Comment