README.mkdn 2.4 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354
  1. % Frozen RMA implementation for TGI PAX and BX samples
  2. % Ryan Thompson
  3. % Thu Mar 19, 2015
  4. # Results #
  5. The scripts below were used to evaluate the consistency of the fRMA
  6. normalization vectors by repeating the training process with 5
  7. different random samples and then comparing a random selection of
  8. arrays normalized by all five trained vectors as well as by ordinary
  9. RMA. [This folder](fRMA_consistency_results) shows the results.
  10. # Scripts #
  11. There are two pairs of scripts. The first pair, `train.R` and
  12. `test.R`, handle the tasks of (respectively) generating/training the
  13. main fRMA vectors and ensuring that they work by normalizing all the
  14. data with them. The second pair, `consistency-train.R` and
  15. `consistency-evaluate.R`, handle (respectively) training five separate
  16. fRMA vector sets and testing their consistency.
  17. ## [`train.R`](train.R): Creating the fRMA vectors ##
  18. This script reads the sample metadata tables, assembles the full file
  19. lists for BX and PAX tissues, and trains a set of fRMA vectors for
  20. each tissue. It exports each of these vector sets to an installable R
  21. package.
  22. ## [`test.R`](test.R): Testing the fRMA vectors ##
  23. This script simply loads all the arrays and normalizes them using the
  24. appropriate fRMA vectors that were generated by `train.R`. It should
  25. be run after installing the packages produced by `train.R`. It is
  26. simply used for testing to make sure the fRMA vectors work.
  27. ## [`consistency-train.R`](consistency-train.R): Train several vector sets for each tissue ##
  28. This script essentially does the same thing as `train.R`, only it does
  29. it five times with five different subsamplings of the arrays to
  30. generate five different fRMA vector sets and saves them all in an R
  31. data file.
  32. ## [`consistency-evaluate.R`](consistency-evaluate.R): Verify consistency of fRMA vectors ##
  33. This script loads the data file from `consistency-train.R`, then loads
  34. 20 random arrays from each tissue and normalizes them with all five
  35. fRMA vector sets, and also by ordinary RMA. It then produces plots of
  36. M vs A for every pair of normalizations. Unlike regular MA plots,
  37. these are *not* plotting arrays against each other, but rather arrays
  38. against themselves, but normalized using two different methods. So if
  39. two normalizations were perfectly consistent, the MA plot would be a
  40. flat horizontal line at M=0. It also produces boxplots and violin
  41. plots showing the M distribution for each of the pairwise comparisons.