presentation.mkdn 11 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343
  1. % Bioinformatic analysis of complex, high-throughput genomic and epigenomic data in the context of $\mathsf{CD4}^{+}$ T-cell differentiation and diagnosis and treatment of transplant rejection
  2. % Ryan C. Thompson \
  3. Su Lab \
  4. The Scripps Research Institute
  5. % October 24, 2019
  6. ## Organ transplants are a life-saving treatment for many
  7. \LARGE <!-- This should probably be my default bullet point size -->
  8. ::: incremental
  9. * 36,528 transplants performed in the USA in 2018[^1]
  10. * 100 transplants every day!
  11. * Over 113,000 people on the national transplant waiting list as of
  12. July 2019
  13. :::
  14. [^1]: Source: https://www.organdonor.gov/statistics-stories/statistics.html
  15. ## Organ transplants are a life-saving treatment for many
  16. ![Organ donation statistics for the USA in 2018[^2]](graphics/presentation/transplants-organ-CROP-RASTER.png){ height=70% }
  17. [^2]: Source: https://www.organdonor.gov/statistics-stories/statistics.html
  18. ## Rejection is an adaptive immune response against a graft
  19. <!-- TODO: Need a graphic for this -->
  20. \LARGE
  21. ::: incremental
  22. * The host's adaptive immune system identifies and attacks cells
  23. bearing non-self antigens
  24. * An allograft contains differnet genetic variants from the host,
  25. resulting in protein-coding differences
  26. * Left unchecked, the host immune system eventually notices these
  27. alloantigens and begins attacking (rejecting) the graft
  28. * Rejection is the major long-term threat to organ allografts
  29. :::
  30. ## Allograft rejection remains a major long-term problem
  31. ![Kidney allograft survival rates in children by transplant year[^3]](graphics/presentation/kidney-graft-survival.png){ height=65% }
  32. [^3]: Kim & Marks. "Long-term outcomes of children after solid organ transplantation". In: Clinics (2014)
  33. ## Rejection is treated with immune suppressive drugs
  34. <!-- TODO: Need a graphic, or maybe a table of common drugs +
  35. mechanisms, or a diagram for periodic checking. -->
  36. \LARGE
  37. ::: incremental
  38. * To prevent rejection, a graft recipient must take immune suppressive
  39. drugs for the rest of their life
  40. * The graft is periodically checked for signs of rejection, and immune
  41. suppression dosage is adjusted accordingly
  42. * Immune suppression is a delicate balance: too much leads to immune
  43. compromise; too little leads to rejection.
  44. :::
  45. ## My thesis topics
  46. ### Topic 1: Immune memory
  47. Genome-wide epigenetic analysis of H3K4 and H3K27 methylation in naïve
  48. and memory $\mathsf{CD4}^{+}$ T-cell activation
  49. ### Topic 2: Diagnostics for rejection
  50. Improving array-based diagnostics for transplant rejection by
  51. optimizing data preprocessing
  52. ### Topic 3: Blood profiling during treatment
  53. Globin-blocking for more effective blood RNA-seq analysis in primate
  54. animal model for experimental graft rejection treatment
  55. ## Today's focus
  56. ### Chapter 2
  57. \Large
  58. Genome-wide epigenetic analysis of H3K4 and H3K27 methylation in naïve
  59. and memory $\mathsf{CD4}^{+}$ T-cell activation
  60. ## Memory cells: faster, stronger, and more independent
  61. <!-- TODO: Implement progressive reveal -->
  62. ![Memory T-cells proliferate and respond more quickly](graphics/presentation/T-cells-SVG.png)
  63. ## Memory cells are a problem for immune suppression
  64. \large
  65. Compared to naïve cells, memory cells:
  66. \normalsize
  67. * respond to a lower antigen concentration
  68. * respond more strongly at any given antigen concentration
  69. * require less co-stimulation
  70. * are somewhat independent of some types of co-stimulation required by
  71. naïve cells
  72. * evolve over time to respond even more strongly to their antigen
  73. . . .
  74. \large
  75. Result:
  76. \normalsize
  77. * Memory cells require progressively higher doses of immune suppresive
  78. drugs
  79. * Dosage cannot be increased indefinitely without compromising the
  80. immune system's ability to fight infection
  81. ## We need a better understanding of immune memory
  82. * Cell surface markers of naïve and memory $\mathsf{CD4}^{+}$ T-cells
  83. are fairly well-characterized
  84. * But internal mechanisms that allow memory cells to respond
  85. differently to the same stimulus (antigen presentation) are not
  86. well-understood
  87. . . .
  88. * A reasonable hypothesis is that some of these mechanisms are
  89. epigenetic: using histone marks or DNA methylation to regulate the
  90. expression of certain genes
  91. * We can test this hypothesis by measuring gene expression (using
  92. RNA-seq) and histone methylation (using ChIP-seq) in naïve and
  93. memory T-cells before and after activation
  94. ## Experimental design
  95. * Separately isolate naïve and memory $\mathsf{CD4}^{+}$ T-cells from
  96. 4 donors
  97. * Activate with CD3/CD28 beads
  98. * Take samples at 4 time points: Day 0 (pre-activation), Day 1 (early
  99. activation), Day 5 (peak activation), and Day 14 (post-activation)
  100. * Do RNA-seq + ChIP-seq for 3 histone marks (H3K4me2, H3K4me3, &
  101. H3K27me3) for each sample.
  102. Data generated by Sarah Lamere, published in GEO as
  103. [GSE73214](https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE73214)
  104. ## A few intermediate analysis steps are required
  105. ![Flowchart of workflow for data analysis](graphics/CD4-csaw/rulegraphs/rulegraph-all-RASTER300.png)
  106. ## Histone modifications are observed on consecutive histones
  107. ![Strand cross-correlation plots](graphics/CD4-csaw/csaw/CCF-plots-PAGE2-CROP-RASTER.png)
  108. ## Histone modifications are observed on consecutive histones
  109. ![ChIP-seq coverage in IL2 gene[^5]](graphics/presentation/LaMere-thesis-fig3.9-SVG-CROP.png){ height=70% }
  110. [^5]: Sarah LaMere. "Dynamic epigenetic regulation of CD4 T cell activation and memory formation". PhD thesis. TSRI, 2015.
  111. ## Finding enriched regions across the genome
  112. * Scan across the genome looking for regions with read coverage above
  113. background level in each donor using SICER peak caller
  114. * Use Irreducible Discovery Rate framework to identify peaks that are
  115. called consistently across multiple donors
  116. ![Peak-calling summary statistics](graphics/presentation/RCT-thesis-table2.2-SVG-CROP.png)
  117. ## Each histone mark has an "effective promoter radius"
  118. ![Enrichment of peaks near promoters](graphics/CD4-csaw/Promoter-Peak-Distance-Profile-PAGE1-CROP-RASTER.png)
  119. ## Peaks in promoters are correlated with gene expression
  120. <!-- TODO: Pull out one column and rotate it -->
  121. ![Expression distributions of genes with and without promoter peaks](graphics/CD4-csaw/FPKM-by-Peak-Violin-Plots-CROP-RASTER.png)
  122. ## The story so far
  123. * H3K4me2, H3K4me3, and H3K27me3 occur on many consecutive histones in
  124. broad regions across the genome
  125. * These enriched regions occur more commonly within a certain radius
  126. of gene promoters
  127. * This "effective promoter radius" is consistent across all samples
  128. for a given histone mark, but differs between histone marks
  129. * Presence or absence of a peak within this radius is correlated with
  130. gene expression
  131. . . .
  132. Next: Does the position of a histone modification within a gene
  133. promoter matter to that gene's expression, or is it merely the
  134. presence or absence anywhere within the promoter?
  135. ## H3K4me2 promoter neighborhood K-means clusters
  136. ![(Insert figure legend)](graphics/CD4-csaw/ChIP-seq/H3K4me2-neighborhood-clusters-CROP.png)
  137. ## H3K4me2 promoter neighborhood cluster PCA
  138. ![(Insert figure legend)](graphics/CD4-csaw/ChIP-seq/H3K4me2-neighborhood-PCA-CROP.png)
  139. ## H3K4me2 promoter neighborhood cluster expression
  140. ![(Insert figure legend)](graphics/CD4-csaw/ChIP-seq/H3K4me2-neighborhood-expression-CROP.png)
  141. ## H3K4me3 promoter neighborhood K-means clusters
  142. ![(Insert figure legend)](graphics/CD4-csaw/ChIP-seq/H3K4me3-neighborhood-clusters-CROP.png)
  143. ## H3K4me3 promoter neighborhood cluster PCA
  144. ![(Insert figure legend)](graphics/CD4-csaw/ChIP-seq/H3K4me3-neighborhood-PCA-CROP.png)
  145. ## H3K4me3 promoter neighborhood cluster expression
  146. ![(Insert figure legend)](graphics/CD4-csaw/ChIP-seq/H3K4me3-neighborhood-expression-CROP.png)
  147. ## H3K27me3 promoter neighborhood K-means clusters
  148. ![(Insert figure legend)](graphics/CD4-csaw/ChIP-seq/H3K27me3-neighborhood-clusters-CROP.png)
  149. ## H3K27me3 promoter neighborhood cluster PCA
  150. ![(Insert figure legend)](graphics/CD4-csaw/ChIP-seq/H3K27me3-neighborhood-PCA-CROP.png)
  151. ## H3K27me3 promoter neighborhood cluster expression
  152. ![(Insert figure legend)](graphics/CD4-csaw/ChIP-seq/H3K27me3-neighborhood-expression-CROP.png)
  153. ## What have we learned?
  154. ### H3K4me2 & H3K4me3
  155. * Peak closer to promoter $\Rightarrow$ more likely gene is highly
  156. expressed
  157. * Slightly asymmetric in favor of peaks downstream of TSS
  158. . . .
  159. ### H3K27me3
  160. * Depletion of H3K27me3 at TSS associated with elevated gene
  161. expression
  162. * Enrichment of H3K27me3 upstream of TSS even more strongly associated
  163. with elevated expression
  164. * Other coverage profiles not associated with elevated expression
  165. ## Differential modification disappears by Day 14
  166. ![Differential modification between naïve and memory samples at each time point](graphics/presentation/RCT-thesis-table2.4-SVG-CROP.png)
  167. ## Convergence at Day 14 H3K4me2
  168. ![(Insert figure legend)](graphics/CD4-csaw/ChIP-seq/H3K4me2-promoter-PCA-group-CROP.png)
  169. ## Convergence at Day 14 H3K4me3
  170. ![(Insert figure legend)](graphics/CD4-csaw/ChIP-seq/H3K4me3-promoter-PCA-group-CROP.png)
  171. ## Convergence at Day 14 H3K27me3
  172. ![(Insert figure legend)](graphics/CD4-csaw/ChIP-seq/H3K27me3-promoter-PCA-group-CROP.png)
  173. ## Convergence at Day 14 RNA-seq (PC 2 & 3)
  174. ![(Insert figure legend)](graphics/CD4-csaw/RNA-seq/PCA-final-23-CROP.png)
  175. ## MOFA identifies shared variation across all 4 data sets
  176. ![(Insert figure legend)](graphics/CD4-csaw/MOFA-varExplaiend-matrix-CROP.png)
  177. ## MOFA shared variation captures convergence pattern
  178. ![(Insert figure legend)](graphics/CD4-csaw/MOFA-LF-scatter-small.png)
  179. ## What have we learned?
  180. * Almost no differential modification observed between naïve and
  181. memory at Day 14, despite plenty of differential modification at
  182. earlier time points.
  183. * RNA-seq data and all 3 histone marks' ChIP-seq data all show
  184. "convergence" between naïve and memory by Day 14 in the first 2 or 3
  185. principal coordinates.
  186. * MOFA captures this convergence pattern in one of the latent factors,
  187. indicating that this is a shared pattern across all 4 data sets.
  188. <!-- ## Slide -->
  189. <!-- ![(Insert figure legend)](graphics/CD4-csaw/LaMere2016_fig8-RASTER.png) -->
  190. ## Takeaway 1: Each histone mark has an "effective promoter radius"
  191. * H3K4me2, H3K4me3, and H3K27me3 ChIP-seq reads are enriched in broad
  192. regions across the genome, representing areas where the histone
  193. modification is present
  194. * These enriched regions occur more commonly within a certain radius
  195. of gene promoters
  196. * This "effective promoter radius" is specific to each histone mark
  197. * Presence or absence of a peak within this radius is correlated with
  198. gene expression
  199. ## Takeaway 2: Peak position within the promoter is important
  200. * H3K4me2 and H3K4me3 peaks are more strongly associated with elevated
  201. gene expression the closer they are to the TSS, with a slight bias
  202. toward downstream peaks.
  203. * H3K27me3 depletion at the TSS and enrichement upstream are both
  204. associated with elevated expression, while other patterns are not.
  205. * In all histone marks, position of modification within promoter
  206. appears to be an important factor in association with gene
  207. expression
  208. ## Takeaway 3: Expression & epigenetic state both converge at Day 14
  209. * At Day 14, almost no differential modification observed between
  210. naïve and memory cells
  211. * Naïve and memory converge visually in PCoA plots
  212. * Convergence is a shared pattern of variation across all 3 histone
  213. marks and gene expression
  214. * This is consistent with the hypothesis that the naïve cells have
  215. differentiated into a more memory-like phenotype by day 14.