README.mkdn 1.7 KB

1234567891011121314151617181920212223242526272829
  1. This is a series of diagnostic plots that were used to evaluate how
  2. well a particular statistical model fits the data and explains the
  3. sources of variation in an Illumina 450k dataset.
  4. - [`mean-var-model.pdf`](mean-var-model.pdf) shows the variance trend
  5. modeling performed by voom, a method originally designed for
  6. mean-variance modeling in RNA-seq data. In this case, it models the
  7. mean-variance dependency induced by the logistic transform used for
  8. converting beta values (i.e. percent methylation) to M-values (i.e.
  9. ratio of methylated to unmethylated signal) in methylation data.
  10. Page 2 shows the mean-variance trend after fitting the model with
  11. the voom weights to cancel out the trend.
  12. - [`sample-weights.pdf`](sample-weights.pdf) Shows the results of
  13. limma's `arrayWeights` method, which detects and down-weights
  14. outlier samples, plotted against all known clinical covariates for
  15. those samples. Diabetes status had a significant association with
  16. the sample weights, indicating that the Type I diabetes samples were
  17. overall more consistent and had fewer outlier observations that Type
  18. II diabetes samples.
  19. - [`pcoa.pdf`](pcoa.pdf) shows a Principle Coordinate Plot (similar to
  20. a PCA plot) of all the samples after subtracting out the effects of
  21. known covariates. Points are sized by their sample weight, and a
  22. crosshair shows the center of mass of each group.
  23. - [`pval-histograms.pdf`](pval-histograms.pdf) and
  24. [`pval-cdf.pdf`](pval-cdf.pdf) show the p-value distributions for
  25. each contrast of interest, presented as a histogram and as an
  26. empirical cumulative distribution function. Each is annotated with
  27. asymptotes indicating the estimated fraction of probes affected by
  28. that contrast.