EPIGENETICS -- -ncRNA
EDITED
BY
C. David Allis The Rockefeller University, New York
Thomas Jenuwein Research Institute of Molecular Pathology (IMP), Vienna
Danny Reinberg HHMIIRobert Wood Johnson Medical School University of Medicine and Dentistry ofNew Jersey
Marie-Laure Caparros Associate Editor, London
COLD SPRING HARBOR LABORATORY PRESS Cold Spring Harbor, New York
•
http://www.cshlpress.com
Epigenetic Mechanisms That Operate in Different Model Organisms
s. cerevisiae s. pombe
N. crassa
C. elegans
Drosophila
Mammals
A. thaliana
14 Mb
40 Mb
100 Mb
180 Mb
3,400 Mb
150 Mb
GENOMIC FEATURES
Genome size
12 Mb 6,000
5,000
10,000
20,000
14,000
-25,000
25,000
1.45 kb
1.45 kb
1.7 kb
2 kb
5 kb
35-46 kb
2 kb
Average number of introns/gene
,,;1
2
2
5
3
6-8
4-5
% Genome as protein coding
70
60
44
25
13
1-1.5 (Hs)
26
Number of genes Average size of genes
EPIGENETIC FEATURES
ON
Histone acetylation
+
+
+
+
+
+
+
ON
H3K4 methylation
+
+
+
+
+
+
+
ON
H3K36 methylation
+
+
+
+
+
+
+
ON
H3K79 methylation
+
+
+
+
+
+
+
ON
H3.3 histone variant
+
+
+
+
+
+
+
ON/OFF
SWI/SNF ATPase complexes
+
+
+
+
+ (+)'
+
CHD1 ATPase family
+ (+)'
+
ON
(+)'
+
+
ON
SWR1 ATPase
+
(+)'
(+)'
(+)'
+
+
+ (+)'
ON/OFF
ISWI ATPase
+
+
+
+
+
+
+
ON/OFF
IN080 ATPase
+
+
+
+
MI-2 ATPase
+ (+)'
+
OFF
+ (+)'
+
+
+
+
OFF
CENP-A centromeric histone variant
+
+
+
+
+
+
OFF
H3K9 methylation b
+
+
+
+
+
+
OFF
HP1-like proteins
+
+
+
+
+
+
OFF
RNA interference
+
+
+
+
+
+
OFF
H4K20 methylation'
+
+
+
+
+
+
OFF
H3K27 methylation
+
+
+
+
+
+
+ (+)" +9
+
+
+
+
+
+
+h
+
+
OFF
Polycomb repressive complexes
OFF
DNA methylation
OFF
DNA methylation binding proteins
OFF
Imprinting
+
+ +'
+
+'
Abbreviation: (Hs) hom*o sopiens. , Epigenetic feature considered to be present based on sequence hom*ology but no functional data. b There is evidence that H3K9 methylation is found at active chromatin regions; however, the functional significance of this is unknown. , H4K20 tri-methylation is not present in S. cerevisiae, whereas all three H4K20 methylation states are present in multicellular organisms. d Drosophila possess very low levels of DNA methylation. , Mutated Dnmt2. Dnmt2 (Pp) and MBD-domain proteins (Ce, Cb, Pp). Dnmt2 and MBD-domain proteins (Dm). hChromosome- or genome-wide rather than gene-specific.
f
9
EPIGENETICS All rights reserved. © 2007 by Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York Printed in the United States of America Publisher Acquisition Editors Development Director Project Coordinator Permissions Coordinator Production Editor Desktop Editor Production Manager Cover Designer
John Inglis Alexander Gann and David Crotty Jan Argentine Inez Sialiano Carol Brown Pat Barker Lauren Heller Denise Weiss Lewis Agrell
Front cover artwork: Depicted is a schematic representation of the chromatin template. Epigenetic regulation affects and modulates this template through noncoding RNAs (ncRNA) that associate with it, covalent modification of histone tails (mod), methylation of DNA (Me), remodeling factors (blue oval), and nucleosomes that contain standard as well as variant histone proteins (the yellow nucleosome). In the background is a representation of several model organisms in which epigenetic control has been studied. From top left: Pair of mouse chromosomes that may differ in their genomic imprint; a S. cerevisiae colony, showing epigenetically inherited variegation of gene expression; anatomy of C. elegans; illustration of T. thermophila, showing the large "active" macronucleus and the smaller "silent" micronucleus; D. melanogaster; maize section with kernel color variegation; Arabidopsis flower. Library of Congress Cataloging-in-Publication Data Epigenetics / edited by C. David Allis, Thomas Jenuwein, Danny Reinberg ; Marie-Laure Caparros, associate editor. p.cm. Includes bibliographical references and index. ISBN-13: 978-0-87969-724-2 (hardcover: alk. paper) 1. Genetic regulation. 1. Allis, C. David. II. Jenuwein, Thomas. III. Reinberg, Danny. [DNLM: 1. Epigenesis, Genetic. 2. Gene Expression Regulation. QU 475 E64 2006] 1. Title. QH450.E655 2006 572.8'65--dc22 2006028894
10 9 8 7 6 5 4 3 2 Authorization to photocopy items for internal or personal use, or the internal or personal use of specific clients, is granted by Cold Spring Harbor Laboratory Press, provided that the appropriate fee is paid directly to the Copyright Clearance Center (CCC). Write or call CCC at 222 Rosewood Drive, Danvers, MA 01923 (978-750-8400) for information about fees and regulations. Prior to photocopying items for educational classroom use, contact CCC at the above address. Additional information on CCC can be obtained at CCC Online at http://www.copyright.com/. All Cold Spring Harbor Laboratory Press publications may be ordered directly from Cold Spring Harbor Laboratory Press, 500 Sunnyside Blvd., Woodbury, New York 11797-2924. Phone: 1-800-843-4388 in Continental U.S. and Canada. All other locations: (516) 422-4100. FAX: (516) 422-4097. E-mail: [emailprotected]. For a complete catalog of all Cold Spring Harbor Laboratory Press publications, visit our World Wide Web Site http://www.cshlpress.com/.
Long before epigenetics changed from little more than a diverse collection of bizarre phenomena to a well-respected field covered by its own textbook, a talented group of foresighted molecular biologists laid a rich foundation upon which the modern era of chromatin biology and epigenetics is based. This group includes Vince Allfrey, Wolfram Harz, Hal Weintraub, Alan Wolffe, and Abe Worcel. This book is dedicated to their collective memory. Their passion and commitment to the study of chromatin biology inspired all of us who followed their work, and we now profit from their many insights.
Contents
Preface, ix
14
Epigenetics: From Phenomenon to Field, 1
Epigenetic Regulation of Chromosome Inheritance, 265 Gary H. Karpen and R. Scott Hawley
Daniel E. Gottschling
A Brief History of Epigenetics, 15
Epigenetic Regulation of the X Chromosomes in C. elegam, 291
Gary Felsenfeld
Susan Strome and William G. Kelly
15
2
3
Overview and Concepts, 23
16
C. David Allis, Thomas Jenuwein, and Danny Reinberg 4
Epigenetics in Saccharomyces cerevisiae, 63
17
Michael Grunstein and Susan M. Gasser
5
Position-Effect Variegation, Heterochromatin Formation, and Gene Silencing in Drosophila, 81 Sarah c.R. Elgin and Gunter Reuter
18
DNA Methylation in Mammals, 341 En Li and Adrian Bird
Fungal Models for Epigenetic Research: Schizosaccharomyces pombe and Neurospora crassa, 101 Robin C. Allshire and Eric U. Selker
Dosage Compensation in Mammals, 321 Neil Brockdorff and Bryan M. Turner
19 6
Dosage Compensation in Drosophila, 307 John C. Lucchesi and Mitzi I. Kuroda
Genomic Imprinting in Mammals, 357 Denise P. Barlow and Marisa S. Bartolomei
20
Germ Line and Pluripotent Stem Cells, 377 M. Azim Surani and Wolf Reik
7
Epigenetics of Ciliates, 127 Eric Meyer and Douglas 1. Chalker
21
Epigenetic Control of Lymphopoiesis, 397 Meinrad Busslinger and Alexander Tarakhovsky
8
RNAi and Heterochromatin Assembly, 151 Robert Martienssen and Danesh Moazed
9
22
Epigenetic Regulation in Plants, 167
Nuclear Transplantation and the Reprogramming of the Genome, 415 RudolfJaenisch and John Gurdon
Marjori Matzke and Ortrun Mittelsten Scheid
23 10
Chromatin Modifications and Their Mechanism of Action, 191 Tony Kouzarides and Shelley 1. Berger
Epigenetics and Human Disease, 435 Huda Y. Zoghbi and Arthur 1. Beaudet
24
Epigenetic Determinants of Cancer, 457 Stephen B. Baylin and Peter A. Jones
11
Transcriptional Silencing by Polycomb Group Proteins, 211 Ueli Grossniklaus and Renato Paro
12
Transcriptional Regulation by Trithorax Group Proteins, 231 Robert E. Kingston and John ltv. Tamkun
13
Appendices WWW Resources, 477
2
Histone Modifications and References, 479
Histone Variants and Epigenetics, 249 Steven Henikoff and M. Mitchell Smith
Index, 491 vii
Preface
T
his advanced textbook on "Epigenetics" is truly a reflection of many talented colleagues and individuals, all of whom made this book possible and a rewarding experience. However, without hesitation, the editors want to thank Marie-Laure Caparros (London), without whom this project would have never materialized. Early in the process, it became evident that the editorial team needed help in coordinating such a large project, particularly for keeping the dialogue and editorial feedback with the >40 colleagues who agreed to provide outstanding chapter contributions, only to realize that we wanted more than their expert reviews and attention to detail. Marie-Laure has been instrumental in keeping the momentum moving forward, has bravely exchanged critical comments when needed, has informed all of us on the many deadlines, and has provided necessary coherence to make embryonic chapters come to life. Without her, this book would not have been possible. We are also grateful to our individual assistants, who forever kept us on our toes: Elizabeth Conley (David Allis), Christopher Robinson (Thomas Jenuwein), and Shelli Altman (Danny Reinberg). All of them are the unsung heroes of this book. We thank all of them for their innumerable contributions, large and small, and their unending patience with each of us and our quirky styles and shortcomings as editors. Discussions for such a book took initial form on the coattails of the outstanding 69th Cold Spring Harbor Symposium on Epigenetics in the summer of 2004, but were seeded in early 2003 and formally commissioned by CSHL Press through Alex Gann and other colleagues. This was followed by formulating an editorial team between David Allis, Thomas Jenuwein, and Danny Reinberg. The first concrete outline for this project, including the brainstorming of various chapters and potential contributing authors, was done on the picnic bench at the FASEB meeting on Chromatin and Transcription in Snow-
mass, Colorado, July 2003. We were then very fortunate to confirm the lineup of contributing colleagues who are the leaders in their field. In the early planning stages, a vision crystallized for a different concept. Ideally, we sought to ask not for a compilation of expert reviews which might soon be outdated. Rather, we wanted to compile a set of conceptual chapters, from pairs of experts, that highlight important discoveries for students in chromatin biology and for colleagues outside the epigenetics field. In keeping to a conceptual outline, we hoped to have a more long-lasting impact. Also, by including many diagrams and illuminating figures, and appendices, we hoped to list most of the systems and epigenetic marks currently known. The General Summaries were aimed as a stand-alone precis of the topics covered in each chapter, preceded by "teaser" images to entice the reader to investigate. The figures have been another important hallmark for this book; particularly, the examples provided in the Overview and Concepts chapter. Here, Stefan Kubicek, a Ph.D. student from the Jenuwein lab at the IMP (Vienna), and Marie-Laure Caparros have been the masters of the diagrams. They honed draft upon draft of figures (sometimes only from sketches) for the chapters, such that we could gain a more coherent presentation. Several postdocs and Ph.D. students (Gabriella Farkas, Fatima Santos, Heike Wohrmann, and others) in the labs of several authors also kindly contributed to the excellent illustrations in this book. However, we were unable to convert all of the contributions, and some figures have remained as submitted. We are also particularly grateful to Monika Lachner, Mario Richter, Roopsha Sengupta, Patrick Trojer, and other Ph.D. students and Postdocs in the Allis, Jenuwein, and Reinberg laboratories for amending, proofreading, and finalizing the tables and summaries that are displayed in the appendices. Here, Dr. Steven Gray (St. James Hospital, Dublin) has been
ix
X
n
PREFACE
particularly instrumental in validating and providing additional information for the table that lists all the currently known histone modifications. Where appropriate, submitted chapters were sent out for comments from other colleagues who provided important input for streamlining and clarifying some of the complex concepts. Not all of this input could be converted into the revised and final versions, but the comments and suggestions helped to shape many of the chapters and the overall framework of the book. Here, we are indebted to G. Almouzni, P. Becker, H. Cedar, V. Chandler, W. Dean, R. Feil, A. Ferguson-Smith, M. Gartenberg, S. Grewal, M. Hampsey, E. Heard, R. Metzenberg, V. Pirrotta, F. Santos, T. Schedl, D. Solter, R. Sternglanz, S. Tilghman, and others. Finally, we acknowledge the intellectual and, in some cases, emotional contributions made by all of our colleagues in the field who provided the chapters to make this book what it is. Their contributions, by way of writ-
ten chapters and drawings, stand by themselves. But what may not be obvious is the feedback and cross-fertilization that all of them had with the editorial team to help shape and guide the book as it took form. The Overview and Concepts chapter itself reflects their feedback, as in early drafts, we put too much of our own colors and bias into the sentences. For their wisdom and for bringing us a deeper perspective and balance, we thank them, and we admit that any deficiencies and mistakes there are ours. Financial support for this book has come from CSHL Press (New York), the Epigenome FP6 NoE (European Union), IMP (Vienna), the Rockefeller University (New York), and the Howard Hughes Medical SchoolRobert Wood Johnson Medical School (Piscataway, N~w Jersey). Critical contributions were also made by Upstate Serologicals (Lake Placid, New York) and AbCam (Cambridge, UK), leading suppliers of epigenetic-based reagents and tools. CDA, TJ,DR
c
H'
APT
E
1
R
Epigenetics: From Phenomenon to Field Daniel E. Gottschling Fred Hutchinson Cancer Research Center, Seattle, Washington 98109
CONTENTS 1. Introduction, 2
3.4
Prions, 9
2. A History Of:1P1 netics at Cold Spring Harbor Symposia, 2
3.5
New Phenomenon, 70
4. Closing Thoughts, 10
3. The 69th Sympo . m, 8 3.7 .The Histone Code Hypothesis, 8
Acknowledgments, 11
3.2
Dynamic Silent Chromatin, 8
References, 11
3.3
Nuclear Organization, 9
1.
2 •
CHAPTER
1
1 Introduction
In the summer of 2004, the 69th Cold Spring Harbor Symposium on Quantitative Biology covered the topic of "Epigenetics;' and many of the authors of this book were in attendance. As an observer at this Symposium, I knew this was going to be an interesting meeting. It started simply enough by trying to define epigenetics. After a week of querying participants about this, it became clear that such a request was akin to asking someone to define "family values"-everyone knew what it meant, but it had a different meaning for each person. Part of the reason for the range of opinions may be understood from the etymology of "epigenetics" as explained by David Haig: The word had two distinct origins in the biological literature in the past century, and the meaning has continued to evolve. Waddington first coined the term for the study of "causal mechanisms" by which "the genes of the genotype bring about phenotypic effects" (see Haig 2004). Later, Nanney used it to explain his realization that cells with the same genotype could have different phenotypes that persisted for many generations. I define an epigenetic phenomenon as a change in phenotype that is heritable but does not involve DNA mutation. Furthermore, the change in phenotype must be switch-like, "ON" or "OFF;' rather than a graded response, and it must be heritable even if the initial conditions that caused the switch disappear. Thus, I consider epigenetic phenomena to include the lambda bacteriophage switch between lysis and lysogeny (Ptashne 2004), pili switching in uropathogenic Escherichia coli (Hernday et al. 2003), position-effect variegation in Drosophila (Henikoff 1990), heritable changes in cortical patterning of Tetrahymena (Frankel 1990), prion diseases (Wickner et al. 2004a), and X-chromosome inactivation (Lyon 1993). The 69th Symposium came on the 100th anniversary of genetics as a field of study at Cold Spring Harbor Laboratory, making it very timely to consider epigenetics. Given this historical context, I thought it appropriate to provide an examination of epigenetics through the portal of previous Cold Spring Harbor Symposia. Although the 69th Symposium was the first dedicated to the topic, epigenetic phenomena and their study have been presented throughout the history of this distinguished series. The history I present is narrowed further by my limitations and likings. For a more complete and scholarly portrayal, I can recommend the more than 1000 reviews on epigenetics that have been written in the past five years. In presenting this chronological account, I hope to convey a sense of how a collection of apparently disparate
phenomena coalesced into a field of study that affects all areas of biology, and that th~tudy of epigenetics is founded upon trying to explain the unexpected-perhaps more than any other field of biological research. 2 A History of Epigenetics at Cold Spring Harbor Symposia
In 1941 during the 9th Symposium, the great Drosophila geneticist H.I. Muller described developments on his original "eversporting displacement," in which gross chromosomal rearrangements resulted in the mutant mosaic expression of genes near the breakpoint (Muller 1941). By the time of this meeting, he referred to it as "position effect variegation." It was well established that the affected genes had been transferred "into the neighborhood of a heterochromatic region;' that the transferred euchromatic regions had been "partly, but variably, transformed into a heterochromatic condition-'heterochromatized'," and that addition of extra copies of heterochromatic chromosomes "allowed the affected gene to become more normal in its functioning." This latter observation was an unexpected quandary at the time, which we now know to be the result of a titration of limiting heterochromatin components. At the 16th Symposium (1951), a detailed understanding of the gene was of high priority. This may explain why little progress had been made on understanding position-effect variegation (PEV), although more examples were being discovered. However, the opening speaker noted that PEV would be an exciting area for future research (Goldschmidt 1951). Barbara McClintock noted that chromosomal position effects were the basis of differencesfn'~ableloci" of maize, and she speculated that the variatiof of mutability she observed likely had its roots in the same mechanisms underlying PEV in Drosophila (McClintock 1951). By the time of the 21st Symposium, McClintock's ideas about "controlling elements" had developed (McClintock 1956). Two were particularly poignant with regard to epigenetics. In the Spm controlling element system, she had uncovered variants that allowed her to distinguish between trans-acting factors that could "suppress" a gene (reduce or eliminate its phenotypic expression) rather than mutate it. She also noted that some controlling elements could suppress gene action not only at the locus where it had inserted, but also at loci that were located some distance on either side of it. Others were discovering this "spreading effect" as well. J. Schultz presented a biochemical and physical characterization of whole Drosophila that contained
EPIGENETICS:
different amounts of heterochromatin (Schultz 1956). Although the work was quite primitive and the conclusions drawn were limited, the work represented early attempts to dissect the structure of heterochromatin and demonstrated just how difficult the problem would be. Two talks at the 23rd Symposium were landmarks with respect to our present-day Symposium. First, R.A. Brink described his stunning observations of "paramutation" at the R locus in maize. If two alleles (R sl and R') with distinct phenotypes as hom*ozygotes are combined to form a heterozygote, and this RsI/R' plant is in turn crossed again, the resulting progeny that contain the Rr allele will always have an Rsl phenotype, even though the Rsl is no longer present (Brink 1958). However, this phenotype is metastable-in subsequent crosses the phenotype reverts to the normal R' phenotype. He meant for the word paramutation "to be applied in this context in its literal sense, as referring to a phenomenon distinct from, but not wholly unlike, mutation." Second, D.L. Nanney went to great lengths to articulate "conceptual and operational distinctions between genetic and epigenetic systems" (Nanney 1958). In essence, he defined epigenetics differently from how it had been originally intended by Waddington (for details, see Haig 2004). He found it necessary to do so in order to describe phenomena he observed in Tetrahymena. He found evidence that the cytoplasmic history of conjugating parental cells influenced the mating-type determination of resulting progeny. His definition encompassed observations made by others as well, including Brink's work on the R locus and McClintock's work noted in the 21st Symposium. Mary Lyon's recently proposed hypothesis of X~chro mosome inactivation in female mammals (Lyon 1961) was of considerable interest at the 29th Symposium. S. Gartler, E. Beutler, and W.E. Nance presented further experimental evidence in support of it (Beutler 1964; Gartler and Linder 1964; Nance 1964). Beutler reviewed multiple examples of mosaic expression of X-linked genes in women, supporting the random nature of X inactivation. From careful quantitative analysis of an X-linked gene product, Nance deduced that X inactivation occurred before the 32-cell stage of the embryo. The 38th Symposium on "Chromosome Structure and Function" represented a return to examining eukaryotic chromosomes-significant progress had been made studying prokaryotic and phage systems, and consequently, bacterial gene expression had dominated much of the thinking in the burgeoning field of molecular biology. However, an appreciation for chromatin (DNA with histones and nonhistone proteins) in eukaryotes was building, but it was unclear whether it played a role in chromosome structure
FROM
PHENOMENON
TO
FIELD.
3
or function, or both (Swift 1974). Nevertheless, several groups began to speculate that posttranslational modification of chromatin proteins, including histones, was associated with gene transcription or overall chromosome structure (Allfrey et al. 1974; Louie et al. 1974; Weintraub 1974). There was only a hint of epigenetic phenomena in the air. It had been hypothesized that repetitive DNA regulated most genes in eukaryotes, partly based on the fact that McClintock's controlling elements were repeated in the genome. However, it was reported that most repeated DNA sequences were unlinked to genes (Peaco*ck et al. 1974; Rudkin and Tartof 1974). From these observations, the idea that repeated elements regulated gene expression lost significant support from those in attendance. More importantly, however, these same studies discovered that most of the repetitive DNA was located in heterochromatin. The 42nd Symposium demonstrated that in four years, an amazing number of technical and intellectual advances had transformed the study of eukaryotic chromosomes (Chambon 1978). This included the use of DNA restriction enzymes, development of recombinant DNA technology, routine separation of proteins and nucleic acids, the ability to perform Southern and northern analysis, rapid DNA and RNA sequencing, and immunofluorescence on chromosomes. The nucleosome hypothesis had been introduced, and mRNA splicing had been discovered. Biochemical and cytological differences in chromatin structure, especially between actively transcribed and inactive genes, comprised the primary interest at this meeting. However, most relevant to epigenetics, Hal Weintraub and colleagues presented ideas about how chromatin could impart variegated gene expression in an organism (Weintraub et al. 1978). The 45th Symposium was a celebration of Barbara McClintock's discoveries-~~le genetic elements (Yarmolinsky 1981). Mechanistic) studies of bacterial transposition had made enormous progress and justifiably represented about half the presentations, whereas others presented evidence that transposition and regulated genomic reorganization occurred not only in maize, but also in other eukaryotes-including flies, snapdragons, Trypanosomes, Ascobolus, and budding yeast. In the context of this meeting, all observed variegated expression events were ascribed to transposition. Moreover, there was a reticence to seriously consider that controlling elements were responsible for most gene regulation (Campbell 1981), which led some to suggest that "the sole function of these elements is to promote genetic variability." In essence, the idea that heterochromatin was responsible for the regulated expression in position-effect
4 • CHAPTER
7
variegation was called into question. With respect to future epigenetic studies, perhaps the most noteworthy discussion was the firm establishment of "silent mating cassettes" in Saccharomyces cerevisiae (Haber et al. 1981; Klar et al. 1981; Nasmyth et al. 1981; Rine et al. 1981). Leading up to the 47th Symposium, a general correlation had been established in vertebrate systems that the overall level of cytosine methylation in CpG DNA sequences was lower for genes that were transcribed than for those that were not. However, there were exceptions to this generalization, and more detailed analysis was presented that methylation of a specific area of a gene's promoter was most important (Cedar et al. 1983; Doerfler et al. 1983; La Volpe et al. 1983). On the basis of restriction/modification systems of bacteria, it was thought that DNA methylation prevented binding of key regulatory proteins. Furthermore, it had been shown that DNA methylation patterns could be mitotically inherited in vertebrates, which led to the hypothesis that DNA methylation could serve as a means of transcriptional "memory" as cells divided through development (Shapiro and Mohandas 1983). Another major epigenetic-related finding was the identification of DNA sequences on either side of the "silent mating cassettes" in budding yeast that were responsible for transcriptional repression of genes within the cassettes-these defined the first DNA sequences required for chromosomal position effects (Abraham et al. 1983). "The Molecular Biology of Development" was the topic for the 50th Symposium, and it too encompassed a number of important advances. Perhaps one of the most exciting developments was the overall awareness that fundamental molecular properties were conserved throughout evolution-e.g., human RAS functioned in budding yeast, homeo box proteins were conserved between flies and humans (Rubin 1985). New efforts to understand chromosome imprinting began with the development of nuclear transfer in mice (Solter et al. 1985). These studies revealed that parent-of-origin information was stored within the paternal and maternal genomes of a new zygote; it was not just the DNA that was important, but the chromosomes contained additional information about which parent they had passed through, and the information was required for successful development of an embryo. Part of the answer was thought to lie in the fact that differential gene expression was dependent' on the parental origin of a chromosome (Cattanach and Kirk 1985). There were a number of studies aimed at understanding the complex regulation of the bithorax complex, but
notably, E.B. Lewis made special mention of the curious nature of known trans regulators of the locus; nearly all were repressors of the locus (Lewis 1985). Thus, maintaining a gene in a silenced state for many cell doublings was imperative for normal development. This contrasted with much of the thinking at the time-that gene activation/induction was where the critical regulatory decisions of development would be. DNA transformation and insertional mutagenesis techniques had recently been achieved for a number of organisms. One particularly creative and epigeneticrelated use of this technology came in Drosophila. A P-element transposon with the white eye-color gene on it was created and "hopped" throughout the genome (Rubin et al. 1985). This provided a means to map sites throughout the Drosophila genome where PEV could occur. This meeting also highlighted the first genetic approaches to dissecting sex determination and sex chromosome dosage compensation in Drosophila (Belote et al. 1985; Maine et al. 1985) and Caenorhabditis elegans (Hodgkin et al. 1985; Wood et al. 1985). The 58th Symposium highlighted the celebration of the 40th anniversary of Watson and Crick's discovery. Part of the celebration was a coming-out party for epigenetic phenomena: There was identification of new phenomena, beginnings of molecular analysis of other phenomena, and sufficient progress had been made in a number of systems to propose hypotheses and to test them. In trypanosomes, the family of Variable Surface antigen Genes (VSG) located near telomeres are largely silenced, with only one VSG expressed at a time. Although this organism does not appear to contain methylated DNA, it was re~orted that the silenced VSG genes contained a novel minor base: ~-D-glucosylhydroxymethyl uracil (Borst et al.)1993). This base appeared to be in place of thymidine-irrthe DNA. Parallels between this base and cytosine methylation in other organisms were easy to draw-the modifications were important for maintaining a silenced gene. But how the base was introduced into the DNA, or how it imparted such a function, was unclear. Progress had also been made in vertebrate epigenetic phenomena, including chromosomal imprinting and X inactivation (Ariel et al. 1993; Li et al. 1993; Tilghman et al. 1993; Willard et al. 1993). It had become clear by this time that numerous loci were subject to imprinting in mammals; only one allele was expressed in diploid cells, and expression was dependent on parental origin. The Igf2-H19 locus was of particular interest, primarily because it contained two nearby genes that were regulated in opposing fashion. Igf2 is expressed from the paternal
EPIGENETICS:
chromosome while the maternal copy is repressed, whereas the paternal allele of H19 is repressed and its maternal allele is expressed. Interestingly, methylated CpG was observed just upstream of both genes on the paternal chromosome. It was proposed that the differential methylation regulated access of the two genes to a nearby enhancer element-the enhancer was closer to, and just downstream of, H19 (Tilghman et al. 1993). A mutually exclusive competition between the two genes for the enhancer was envisioned; when the H19 gene was methylated, the enhancer was free to activate the more distant Igf2 gene. Support for the idea that DNA methylation played a regulatory role in this process came from mouse studies. Mutation of the first vertebrate gene encoding a S-methyl-cytosine DNA methyltransferase in ES cells showed that as embryos developed, the paternal copy of H19 became hypomethylated and the gene became transcriptionally active (Li et al. 1993). An important step in the way in which sMeCpG mediated its effects came from the purification of the first sMeCpG DNA-binding complex (MeCP1) (Bird 1993). Not only did it bind DNA, but when tethered upstream of a reporter gene, MeCP1 caused the gene to be repressed. Although this did not explain regulation at the Igf2-H19 locus, it did provide a potential mechanism to explain the general correlation between DNA methylation and gene repression. Genetic mapping over a number of years had identified a portion of the human X chromosome as being critical for imparting X inactivation. Molecular cloning studies of this X-inactivation center led to the discovery of the Xist gene (Willard et al. 1993), an -17-kb noncoding RNA that was expressed only on the inactive X chromosome. The mouse version of Xist was surprisingly hom*ologous in st~~re and sequence and held the promise of being an excelllnt model system to dissect the way in which this RtlAJunctioned to repress most of the X chromosome. Two notable findings were described in Neurospora (Selker et al. 1993). First, it was shown that cytosine DNA methylation was not limited to epG dinucleotides but could occur in seemingly any DNA context. Second was the amazing description of the phenomenon of repeatinduced point mutation (RIP). Sequences become "RIP'd" when there isa sequence duplication (linked or unlinked) in a haploid genome and the genome is put through the sexual cycle via conjugation. Two events occur: Both copies of the duplicated DNA pick up G:C ----7 A:T mutations, and DNA within a few hundred base pairs of the RIP'd sequences becomes methylated. This double attack on the genome is quite efficient-SO% of unlinked
FROM
PHENOMENON
TO
FIELD
5
loci succumb to RIP, whereas tightly linked loci approach lOO%-and readily abolishes gene function. The brown gene in Drosophila, when translocated near heterochromatin, displays dominant PEV; the translocated copy can cause repression of the wild-type copy. In searching for enhancers and suppressors of this transinactivation phenomenon, Henikoff discovered that duplication of the gene located near heterochromatin increased the level of repression on the normal copy (Martin-Morris et al. 1993). Although the mechanism underlying this event remained mysterious, it was postulated that the phenomenon might be similar to RIP in Neurospora, although it had to occur in the absence of DNA methylation, which does not occur in Drosophila. Paul Schedl elucidated the concept of chromosomal "boundary elements" (Vazquez et al. 1993). The first were located on either side of the "puff" region at a heat shock locus in Drosophila and were defined by their unusual chromatin structure-an -300-bp nuclease-resistant core bordered by nuclease hypersensitive sites. It was postulated that such elements separated chromatin domains along the chromosome. Two in vivo assays supported this hypothesis: (1) When bordering either side of a reported gene, boundary elements effectively eliminated chromosomal position effects when the construct was inserted randomly throughout the genome. (2) The boundary element was also defined by its ability to block enhancer function. When inserted between a gene promoter and its enhancer, the boundary element blocked the gene's expression. Although not as well defined, the concept of boundary elements was also developing in other organisms, especially at the globin locus in mammals (Clark et al. 1993). Budding yeast shone the light on a mechanistic inroad to chromatin-related epigenetic phenomena. It had already been established that the silencers at the silent mating-type loci were sites for several DNA-binding proteins. Their binding appeared to be context-dependent, as exemplified by the Rap1 protein, which not only was important in silencing, but also bound upstream of a number of genes to activate transcription (for review, see Laurenson and Rine 1992). Over the years, numerous links had been made between DNA replication and transcriptionally quiescent regions of the genome. The inactive X chromosome, heterochromatin, and silenced imprinted loci had all been reported to replicate late in S phase relative to transcriptionally active regions of the genome. In addition, it had been shown that the establishment of silencing at the silent mating-type loci required passage through S phase, suggesting that silent.chromatin had to be built on newly repli-
6 •
CHAPTER
1
cated DNA. Thus, there was great interest when one of the Furthermore, overexpression for Sir3 caused it to silencers was found to be an origin of DNA replication, and "spread" inward along the chromatin fiber from the its origin activity could not be separated from silencing telomere, suggesting that it was a limiting component of function (Fox et al. 1993). Furthermore, mutants in the silent chromatin and could "polymerize" along the chrorecently identified origin recognition complex (ORC) were matin (Renauld et al. 1993). Taken together, there found to cripple silencing (Bell et al. 1993; Fox et al. 1993). appeared to be a large interaction network important for The discovery that telomeres in Saccharomyces ceresilencing-the Sir proteins initiated assembly at telomeric visiae exerted PEV, just like that seen in Drosophila, DNA, due to their interaction with Rap1, and then polybrought another entree into dissecting heterochromatic merized from the telomere along the chromatin fiber, structure and its influence on gene expression. Reporter presumably by binding to the tails of histones H3 and H4. genes inserted near telomeres give variegated expression Switching between transcriptional states in variegated in a colony. The repressed state is dependent on many of telomeric expression appeared to be the result of a compethe same genes (SIR2, SIR3, SIR4) as those required for tition between silent and active gene expression (Aparicio silencing at the silent mating-type loci. Several key aspects and Gottschling 1994; described in Weintraub 1993). If the about the silent chromatin structure and the regulation of transcriptional activator for a telomeric gene was deleted, the variegated expression were described. It is worth notthe gene's basal transcriptional machinery was insufficient ing that heterochromatin is defined cytologically as confor expression and the gene was constitutively silenced. Conversely, overexpression of the activator caused the densed chromatin, but silent chromatin in S. cerevisiae has never been visualized in this way. Nevertheless, telomeric gene to be expressed continuously-the gene was because of similarities to PEV in Drosophila, there was never silenced. In the absence of SIR3 (or SIR2 or SIR4), enthusiasm to consider silent chromatin in yeast to be a basal gene expression was sufficient, whereas increased functional equivalent of heterochromatin (described in dosage of SIR3 increased the fraction of cells that were Weintraub 1993). silenced. Although a transcriptional activator could overFrom the yeast studies, a number of fundamental concome silencing throughout the cell cycle, it was most effeccepts began to come to light. First, the importance of histive when cells were arrested in S phase, presumably when tone H3 and H4 became evident. In particular, the chromatin was being replicated and, hence, most susceptiamino-terminal tail of histones H3 and H4 appeared to ble to competition. Somewhat surprisingly, cells arrested in be directly involved in the formation of silent heterochroG/M also could be easily switched, suggesting that silent matin (Thompson et al. 1993). Specific mutants in the chromatin had not yet been fully assembled by this time. tails of these histones alleviated or crippled silencing and Silent chromatin in yeast was shown to be recalciled to the notion that both the net charge of the residues trant to nucleases and DNA modification enzymes, sugon the tails and specific residues within the tails congesting that the underlying DNA was much less tributed to silencing. In addition, these early days of chroaccessible relative to most of the genome (described in matin immunoprecipitation (ChIP) demonstrated that Thompson et al. 1993). the lysines in the amino-terminal tail of histone H4 were It also appeared that there was a hierarchy of silencing hypoacetylated in regions of silent chromatin relative to within the yeast genome: The telomeres were the most the rest of the genome. Moreover, one of the histone sensitive to perturbation, HML was next, and HMR was mutants identified histone H4 K16, which could be acety-~e least sensitive. In fact, when the SIR] gene was lated, as critical for forming silent chromatin. mutated, the normally completely silenced HM loci disTelomeres appeared to provide the simplest system in played variegated expression (Pillus and Rine 1989). which to develop an understanding of how Sir proteins Finally, Sir3 and Sir4 were localized to the nuclear mediated silencing. The concept of recruiting silencing periphery, as were the telomeres. It was proposed that proteins was being developed. Briefly, the telomeric DNAthe nucleus was organized such that the nuclear envebinding protein, Rap1, was found to interact with Sir3 lope provided a special environment for silencing (Paland Sir4 by two-hybrid methods (described in Palladino ladino et al. 1993). et al. 1993). Thus, Rap1 could "recruit" these Sir proteins Schizosaccharomyces pombe also has silent mating casto the telomeric region of the genome. There was evisettes that were suspected to behave similarly to those in dence that Sir3 and Sir4 could bind to one another, and S. cerevisiae. However, in S. pombe, there was an added most importantly, Sir3 and perhaps Sir4 interacted with twist to the story of mating-type switching. In an elegant the tails of histones H3 and H4 (Thompson et al. 1993). set of experiments, Amar Klar proposed how a "mark" is
EPIGENETICS:
imprinted on one strand of DNA in a cell (Klar and Bonaduce 1993). The mark is manifested, after two cell divisions in one of the four granddaughter cells, as a double-stranded break that facilitates mating-type switching. This yeast does not have any known DNA modifications (methylation, etc.), hence, a different type of mark was postulated to be left on the DNA strand. The topic of the 59th Symposium was "The Molecular Genetics of Cancer." The concept of epigenetic regulation in oncogenesis had begun to develop after the idea of tumor suppressor genes became established. There had been a couple of studies supporting such a notion, but an interesting twist to the story came in studies of Beckwith-Wiedemann syndrome and Wilms' tumor patients. Mutations in both types of patients had been mapped to a locus that included the imprinted H19IGF2 genes. Feinberg et al. (1994) discovered "loss of imprinting" (LOI) for these genes in affected patientsthe maternal locus lost its imprint, H19 was repressed, and IGF2 was expressed. Thus LOI, which in principle could occur elsewhere in the genome, could cause either biallelic expression and/or extinction of genes critical in oncogenesis. In the couple of years leading up to the 63rd Symposium on "Mechanisms of Transcription," several important developments occurred that would affect the molecular understanding of several epigenetic phenomena. Histone-modifying enzymes were identifiedspecifically, histone acetylases and deacetylases. Some of these enzymes played critical roles in regulating gene expression and provided an entry into gene products that directly affected PEV and silencing. The tip of this iceberg was presented at the Symposium (see Losick 1998). Molecular dissection of the Sir3 and Sir4 silencing proteins in yeast revealed the polyvalent nature of their interactions and revealed how the network of interactions between all the Sir proteins, the histones, and various DNA-binding factors set up silent chromatin. In addition, the molecu ar details of how various loci (telomeres, the rDNA, HM loci, and double-stranded breaks) could compete for the limited supply of Sir proteins were shown. By crippling the ability of a specific locus to recruit silencing factors, Sir protein levels were increased at the other loci (co*ckell et al. 1998). This provided direct evidence that principles of mass action were at work and that silencing at one locus could affect the epigenetic silencing at other locian idea originally put forth in studies on PEV in Drosophila, but not yet tested (Locke et al. 1988). Another finding explained how DNA methylation could regulate gene expression through chromatin. This
FROM
PHENOMENON
TO
FIELD.
7
came with the identification of protein complexes composed of MeCP2, which bind both methylated DNA and histone deacetylases (Wade et al. 1998). Methylated DNA could serve as a point of recruiting deacetylases to a locus and thus facilitate silencing of nearby genes. The concept of boundary elements was extended from Drosophila to mammals, with clear evidence provided at the ~-globin locus, thus indicating that chromatin' boundaries were indeed likely conserved in metazoans and perhaps all eukaryotes (Bell et al. 1998). The 64th Symposium on "Signaling and Gene Expression in the Immune System" provided evidence about how monoallelic expression arose, and that it might be more widespread than previously thought. Monoallelic expression at the immunoglobulin loci had been obvious in lymphocytes for some time-it guaranteed the production of a single receptor type per lymphoid cell (Mostoslavsky et al. 1999). The allele to be expressed was chosen early in development, apparently at random: Both alleles began in a repressed state, but over time one became demethylated. It was unclear how a single allele was chosen, but the phenomenon appeared at other loci, too, where the necessity of monoallelism was not obvious. For instance, only one allele of genes encoding the cytokines IL- 2 and IL-4 was expressed (Pannetier et al. 1999). The most significant epigenetics-related talk at the 65th Symposium concerned the discovery that the Sir2 protein was a histone deacetylase (Imai et al. 2000). This was the only Sir protein that had clear hom*ologs in all other eukaryotes and that regulated PEY. It seemed to be the enzyme primarily responsible for removing acetyl moieties from histones in silent chromatin. Furthermore, because it was an NAD-dependent enzyme, it linked the regulation of silencing (heterochromatin) to cellular physiology. The 68th Symposium on "The Genome of hom*o sapiens" was an important landmark in genetics, and although there is still much genetic work to be done, the complete sequencing of this and other genomes signified that it was time to move "above genetics"-a literal meaning of epigenetics. This historical account highlights several themes shared with many other areas of research. First, it demonstrates the episodic nature of advances in epigenetics. Second, as molecular mechanisms underlying epigenetic phenomena began to be understood, it made it easier to connect epigenetics to biological regulation in general. Third, it showed that people whom we now consider to be scientific luminaries had made these connections early on-it just took a while for most others to "see" the obvious.
8 • CHAPTER
7
3 The 69th Symposium
A few general principles have been identified over the years that are common to all epigenetic phenomena, and they serve to guide experimental approaches in the search for a detailed understanding. First, the differences between the two phenotypic states ("OFF" and "ON") always have a corresponding difference in structure at a key regulatory point-form translates into function. Hence, identifying the two distinct structures, the components that compose them, and the compositional differences between them have been the primary tasks. Second, the distinct structures must have the ability to be maintained and perpetuated in a milieu of competing factors and entropic forces. Thus, each structure requires selfreinforcement or positive feedback loops which ensure that it is maintained and propagated over many cellular divisions; in some cases, such as X-chromosome inactivation, this appears to be on the order of a lifetime. Many of the mechanistic principles defined in the earlier symposia continued to be refined in the 69th Symposium, but there were also new developments. To put these new developments in context, it is important to note that two other discoveries had a major impact on epigenetics. One was the discovery of RNA interference and related RNA-based mechanisms of regulation. The other was the discovery of mechanisms underlying the prion hypothesis. Both of these fields have advanced rapidly in the past decade, with some of the studies contributing to knowledge about chromatin-based epigenetics and others providing new perspectives about heritable transmission of phenotypes. Many of the accomplishments reported at the Symposium are detailed in the chapters of this book, so I eschew discussing these topics here. However, I will touch upon a few advances that caught my fancy and are not covered within these pages. At the end, I will try to distill the most important concepts I took away from the meeting. 3.1 The Histone Code Hypothesis
In considering histone modifications and their potential information content, there were many discussions about the "histone code hypothesis" (Jenuwein and Allis 2001). Most of those I participated in, or overheard, were informal and rather lively. The proponents of the "code" cite examples such as tri-methylation of histone H3 at K9 and its greater affinity for the HP1 class of heterochromatin proteins (Jenuwein and Allis 2001). Those on the other side cite biochemical and genetic evidence that the net charge on the amino-terminal tail of histone H4, irre-
spective of which position the charge is at, has dramatic effects on DNA binding or phenotype (Megee et al. 1995; Zheng and Hayes 2003). Grunstein presented data that included genome-wide analysis of histone acetylation modifications and chromatin-associated proteins using specific antibodies and ChIP-Chip in S. cerevisiae (Millar et al. 2004). His focus was on the epigenetic switch associated with H4K16 acetylation for binding, or not binding, particular chromatin proteins-thus supporting the histone code hypothesis. Although not discussed, some of his data appeared to support reports from others that for much of the genome, there is no correlation between specific histone modifications and gene expression (i.e., all active genes have the same marks, and these marks are not present on inactive genes) (Schubeler et al. 2004; Dion et al. 2005). Taking all the results together, I suspect that both specific modifications and general net charge effects will be used as mechanisms for regulating chromatin structure and gene expression. 3.2 Dynamic Silent Chromatin
I must confess that, on the basis of static images of heterochromatin and the refractory nature of silent chromatin, I was convinced that once established, a heterochromatic state was as solid as granite. Only when it was time for DNA replication would the impervious structure become relaxed. In thinking this way, I foolishly ignored principles of equilibrium dynamics I had learned in undergraduate chemistry. However, these lessons were brought home again by studies of silent chromatin and heterochromatin, where it was shown that silencing proteins of yeast (Sir3), and heterochromatin proteins in mammalian cells (HP1), were in a dynamic equilibrium-proteins were rapidly exchanged between heterochromatin and the soluble compartment-even when the chromatin was in its most impervious state (Cheng a~artenberg 2000; Cheutin et al. 2003). The realization of its dynamic qualities forced a different view of no.wAn epigenetic chromatin state is maintained and propagated. It suggests that in some systems the epigenetic state can be reversed at any time, not just during DNA replication. Hence, we can infer that mechanisms of reinforcement and propagation for silenced chromatin must function constantly. Methylation of histones was widely held to be the modification that would indeed impart a "permanent" mark on the chromatin (for review, see Kubicek and Jenuwein 2004). In contrast to all other histone modification (e.g.,
EPIGENETfCS:
phosphorylation, acetylation, ubiquitination), there were no enzymes known that could reversibly remove a methyl group from the amine of lysine or arginine. Furthermore, removing the methyl group under physiological conditions by simple hydrolysis was considered thermodynamically disfavored and thus unlikely to occur spontaneously. Those thinking that methylation marks were permanent had their belief system shaken a bit by several reports. First, it was shown that a nuclear peptidylarginine deiminase (PAD4) could eliminate monomethylation from histone H3 at arginine (R) residues (Cuthbert et al. 2004; Wang et al. 2004). Although this methyl removal process results in the arginine residue being converted into citrulline, and hence is not a true reversal of the modification, it nevertheless provided a mechanism for eliminating a permanent methyl mark. Robin Allshire provided a tantalizing genetic argument that the tis2 gene from S. pombe reversed dimethylation on histone H3 at K9 (R. Allshire, pers. comm.). He may have been on the right track, because a few months after the meeting, the unrelated LSD 1 enzyme from mammals was shown to specifically demethylate di- and monomethyl on histone H3 at K4 (Shi et al. 2004), reversing an "active" chromatin mark. Quite interestingly, LSD1 did not work on trimethylated H3K4-thus, methylation could be reversed during the marking process, but reversal was not possible once the mark was fully matured. However, Steve Henikoff presented a way by which a permanent trimethyllysine mark could be eliminated. He showed that the variant histone H3.3 could replace canonical histone H3 in a replication-independent transcriptioncoupled manner (Henikoff et al. 2004). In essence, a histone that contained methyl marks for silencing could be removed and replaced with one that was more conducive to transcription. When total chromatin was isolated, histone H3.3 had many more active chromatin methylation marks (e.g., K79me) on it than canonical histone H3 did. In con~idering all these result~, it s~ems t.h~ the:e may . not be a sImple molecular modIficatIOn Wl~hl hlstones that serves as a memory mark for propagatin the silent chromatin state through cell division. Rath , there must be a more tenuous set of interactions that increase the probability that a silent state will be maintained, although they do not guarantee it. 3.3 Nuclear Organization
Correlations between nuclear location and gene expression have been made for many years (Mirkovitch et al. 1987). These observations began to drive the notion that
FROM
PHENOMENON
TO
FIELD.
9
there were special compartments within the cell where gene expression or silencing was restricted. It was argued that this organization was necessary to keep the complexity of the genome and its regulation in a workable order. This idea was supported by studies in S. cerevisiae, where telomeres are preferentially located at the nuclear periphery, as are key components of the silencing complex, such as Sir4 (Palladino et al. 1993). Mutations that released the telomeres, or Sir4, from the nuclear periphery resulted in a loss of telomeric silencing (Laroche et al. 1998; Andrulis et al. 2002). Furthermore, artificially tethering a partially silenced gene to the periphery caused it to become fully silenced (Andrulis et al. 1998). In an insightful experiment, Gasser showed that if the teloineres and the silencing complex were both released from the periphery, and free to move throughout the nucleus, telomeric silencing was readily established (Gasser et al. 2004). Thus, there does not appear to be a special need for localizing loci to a compartment. This is more consistent with the findings that rapid movement of chromatin proteins on and off chromosomes can still mediate effective regulation such as silencing. Perhaps some of the localization is necessary to keep high local concentrations of relevant factors under special (stressful?) conditions. Alternatively, this may represent a combination of domains put together through evolution that worked long ago, but had no ultimate purpose. 3.4 Prions
Wickner provided an overview and criteria for defining prions, and from his description it is clear that they are part of the epigenetic landscape (Wickner et al. 2004a,b). In the simplest molecular sense, prions are proteins that can cause heritable phenotypic changes, by acting upon and altering their cognate gene product. No DNA sequence changes occur; rather, the prion typically confers a structural change in its substrate. The beststudied and understood class of prions causes soluble forms of a protein to change into amyloid fibers. In many cases, the amyloid form reduces or abolishes normal activity of the protein, thus producing a change in phenotype. Wickner defined another class of prions that do not form amyloid filaments. These are enzymes that require activation by their own enzymatic activity. If a cell should have only inactive forms of the enzyme, then an external source of the active enzyme is required to start what would then become a self-propagating trait, as long as at least one active molecule was passed on to each cell. He provided two examples and the expectation
10 •
CHAPTER
1
that this class of proteins will define a new set of epigenetic mechanisms to pursue. Si presented preliminary evidence that a prion model may explain learned memory in Aplysia (Si et al. 2004). Protein translation of a number of stored mRNAs in neuronal cells is important for the maintenance of shortterm memory in this snail. He found that a regulator of protein translation, CPEB, can exist in two forms, and that the activated form of CPEB acts dominantly to perpetuate itself. Testing of this idea is still in its early days, but it offers an exciting new way of considering the issue of how we remember. 3.5 New Phenomenon
The description of a new and unexpected phenomenon always holds our imagination. One presentation in particular held my thoughts for weeks after the Symposium. Standard genetic analysis of mutant alleles of the HOTHEAD gene, which regulates organ fusion in Arabidopsis, revealed that normal rules of Mendelian genetics were not being followed (Lolle et al. 2005). It was discovered that if heterozygous HOTHEAD/hothead plants self-fertilized and produced a hom*ozygous hothead/hothead plant, and then this hom*ozygous hothead/hothead plant was allowed to self-fertilize, the progeny from this hom*ozygous parent reverted to a HOTHEAD/hothead genotype at a frequency of up to 15%. This stunning level of wild-type reversion produced an exact duplicate, at the nucleotide level, of the wild-type gene seen in the earlier generations. This reversion was not limited to the HOTHEAD locus-several other loci had similar frequencies of reversion to wildtype alleles. However, all the reversions required that the parent be hom*ozygous hothead/hothead. The gene product of HOTHEAD did not offer an obvious explanation as to how this could occur, but discussions certainly suggested that an archival copy of the wild-type gene was transmitted, perhaps via RNA, through successive generations. Although it could be argued that this phenomenon is outside the purview of "epigenetics"-due to the change in DNA sequence-the heritable transmission of the putative archived copy does not follow normal genetic rules. Nevertheless, this phenomenon has enormous implications for genetics, especially in evolutionary thinking.
4 Closing Thoughts
So, what more needs to be done to understand epigenetic mechanisms? For the most part, we are still collecting (discovering) the components. Just as the full sequence of
a genome has greatly facilitated progress in genetics, a clearer understanding for epigenetics will likely come when all the parts are known. It is encouraging to see the great strides that have been made in the last decade. I confess that I cannot discern whether we are close to, or far away from, having an accurate mechanistic understanding about how epigenetic states are maintained and propagated. The prion-based phenomenon may be the first to be understood, but those that are chromatin-based seem the farthest off. The polyvalent nature of interactions that seem to be required to establish a silenced state on a chromosome increases the complexity of the problem. This is further compounded by the dynamic nature of silent chromatin. The ability to know more about movement of components in and out of chromatin structures requires application of enhanced or new methods for an eventual understanding. Whereas chromatin immunoprecipitation has been important in establishing which components reside in a structure, it has temporarily blinded us to the dynamics. I suspect that, given the complexity, simply measuring binding and equilibrium constants between all the components and trying to derive a set of differential equations to simulate epigenetic switches may not be an effective use of resources, nor will it necessarily result in better comprehension. Rather, I speculate that a n'ew type of mathematical approach will need to be developed and combined with new experimental measuring methods, in order to eventually understand epigenetic events. Part of this may require development of in vitro systems, that faithfully recapitulate an epigenetic switch between states. The idea of competition between two states in most epigenetic phenomena likely reflects an "arms race" that is happening at many levels in the cell, followed by attempts to rectify "collateral damage." For instance, silencing proteins may have evolved to protect the genome from transposons. However, because silencing proteins work through the ubiquitous nucleosomes, some critical genes become repressed. To overcome this, histone modifications (e.g., methylation of H3K4 and H3K79) and variant replacement histones (H2A.Z) evolved to prevent silencing proteins from binding to critical genes. Depending on subsequent events, these changes may be co-opted for other processes-e.g., repression of some of the genes by the silencing proteins may have become useful (silent mating loci). The silencing mechanisms may have been co-opted for other functions as well, such as promoting chromosome segregation. And so it goes... I look forward to having the genomes of more organisms sequenced, because this might lead us to understand
EPIGENETICS:
an order of events through evolution that set up the epigenetic processes we see today. For instance, S. cerevisiae does not have RNAi machinery, but many other fungi do. By filling in some of the phylogenetic gap~ between species, we may discover what events led to S. cerevisiae no longer "needing" this system. Perhaps more than any other field of biological research, the study of epigenetics is founded on trying to understand unexpected observations, ranging from H.}. Muller's position-effect variegation, to polar overdominance in the callipyge phenotype (Georges et al. 2004). The hope of understanding something unusual serves as the bait to draw us in, but we soon become entranced by the cleverness of the mechanisms employed. This may explain why this field has drawn more than its share of light-hearted and clever minds. I suspect it will continue to do so, as we develop a deeper understanding of the cleverness, and as new and unexpected epigenetic phenomena are discovered.
Acknowledgments
I thank my colleagues at the University of Chicago and the Fred Hutchinson Cancer Research Center for making my own studies on epigenetics so enjoyable, and I thank the National Institutes of Health for financial support.
References Abraham J., Feldman J., Nasmyth K.A., Strathern J.N., Klar A.J., Broach J.R., and Hicks J.B. 1983. Sites required for position-effect regulation of mating-type information in yeast. Cold Spring Harbor Symp. Quant. BioI. 47: 989-998. AlIfrey v.G., Inoue A., Karn J., Johnson E.M., and Vidali G. 1974. Phosphorylation of DNA-binding nuclear acidic proteins and gene activation in the HeLa cell cycle. Cold Spring Harbor Symp. Quant. BioI. 38: 785-801. Andrulis E.D., Neiman A.M., Zappulla D.C., and Sternglanz R. 1998. Perinuclear localization of chromatin facilitates transcriptional silencing. Nature 394: 592-595. Andrulis E.D., Zappulla D.C., Ansari A., Perrod S., Laiosa c.v., Gartenberg M.R., and Sternglanz R. 2002. Escl, a nuclear periphery protein required for Sir4-based plasmid anchoring and partitioning. Mol. Cell. BioI. 22: 8292-8301. Aparicio O.M. and Gottschling D.E. 1994. Overcoming telomeric silencing: A trans-activator competes to establish gene expression in a cell cycle-dependent way. Genes Dev. 8: 1133-1146. Ariel M., Selig S., Brandeis M'-, Kitsberg D., Kafri T, Weiss A., Keshet 1., Razin A., and Cedar H. 1993. Allele-specific structures in the mouse Igf2-H19 domain. Cold Spring Harbor Symp. Quant. BioI. 58: 307-313. Bell A., Boyes J., Chung J., Pikaart M., Prioleau M.N., Recillas E, Saitoh N., and Felsenfeld G. 1998. The establishment of active chromatin domains. Cold Spring Harbor Symp. Quant. BioI. 63: 509-514. Bell S.P., Marahrens Y, Rao H., and Stillman B. 1993. The replicon
FROM
PHENOMENON
TO
FIELD.
11
model and eukaryotic chromosomes. Cold Spring Harbor Symp. Quant. BioI. 58: 435-442. Belote J.M., McKeown M.B., Andrew D.J., Scott TN., Wolfner M.E, and Baker B.S. 1985. Control of sexual differentiation in Drosophila melanogaster. Cold Spring Harbor Symp. Quant. BioI. 50: 605-614. Beutler E. 1964. Gene inactivation: The distribution of gene products among populations of cells in heterozygous humans. Cold Spring Harbor Symp. Quant. BioI. 29: 261-271. Bird A.P. 1993. Functions for DNA methylation in vertebrates. Cold Spring Harbor Symp. Quant. Bioi. 58: 281-285. Borst P., Gommers-Ampt J.H., Ligtenberg M.J., Rudenko G., Kieft R., Taylor M.C., Blundell P.A., and van Leeuwen E 1993. Control of antigenic variation in African trypanosomes. Cold Spring Harbor Symp. Quant. BioI. 58: 105-114. Brink R.A. 1958. Paramutation at the R locus in maize. Cold Spring Harbor Symp. Quant. BioI. 23: 379-391. Campbell A. 1981. Some general questions about movable elements and their implications. Cold Spring Harbor Symp. Quant. BioI. 45: 1-9. Cattanach B.M. and Kirk M. 1985. Differential activity of maternally and paternally derived chromosome regions in mice. Nature 315: 496-498. Cedar H., Stein R., Gruenbaum Y., Naveh-Many T, Sciaky-Gallili N., and Razin A. 1983. Effect of DNA methylation on gene expression. Cold Spring Harbor Symp. Quant. BioI. 47: 605-609. Cham bon P. 1978. Summary: The molecular biology of the eukaryotic genome is coming of age. Cold Spring Harbor Symp. Quant. BioI. 42: 1209-1234. Cheng TH. and Gartenberg M.R. 2000. Yeast heterochromatin is a dynamic structure that requires silencers continuously. Genes Dev. 14: 452-463. Cheutin T, McNairn A.J., Jenuwein T, Gilbert D.M., Singh P.B., and Misteli T 2003. Maintenance of stable heterochromatin domains by dynamic HPI binding. Science 299: 721-725. Clark D., Reitman M., Studitsky V., Chung J., Westphal H., Lee E., and Felsenfeld G. 1993. Chromatin structure of transcriptionally active genes. Cold Spring Harbor Symp. Quant. BioI. 58: 1-6. co*ckell M., Gotta M., Palladino E, Martin S.G., and Gasser S.M. 1998. Targeting Sir proteins to sites of action: A general mechanism for regulated repression. Cold Spring Harbor Symp. Quant. BioI. 63: 401-412. Cuthbert G.L., Daujat S., Snowden A.W., Erdjument-Bromage H., Hagiwara T, Yamada M., Schneider R., Gregory P.D., Tempst E, Bannister A.J., and Kouzarides T 2004. Histone deimination antagonizes arginine methylation. Cell 118: 545-553. Dion M.E, Altschuler S.J., Wu L.E, and Rando O.J. 2005. Genomic characterization reveals a simple histone H4 acetylation code. Froc. Natl. Acad. Sci. 102: 5308-5309. Doerfler W., Kruczek 1., Eick D., Vardimon L., and Kron B. 1983. DNA methylation and gene activity: The adenovirus system as a model. Cold Spring Harbor Symp. Quant. BioI. 47: 593-603. Feinberg A.E, Kalikin L.M., Johnson L.A., and Thompson;fS.1994. Loss of imprinting in human cancer. Cold Spring Harbor symp) Quant. Bioi. 59: 357-364. Fox C.A., Loo S., Rivier D.H., Foss M.A., and Rine J. 1993. A transcriptional silencer as a specialized origin of replication that establishes functional domains of chromatin. Cold Spring Harbor Symp. Quant. BioI. 58: 443-455. Frankel J. 1990. Positional order and cellular handedness.]. Cell Sci. 97:205-211. Gartler S.M., and Linder D. 1964. Selection In mammalian mosaic cell
12 •
CHAPTER
populations. Cold Spring Harbor Symp. Quant. BioI. 29: 253-260. Gasser S.M., Hediger P., Taddei A., Neumann P.R., and Gartenberg M.R. 2004. The function of telomere clustering in yeast: The circe effect. Cold Spring Harbor Symp. Quant. BioI. 69: 327-337. Georges M., Charlier c., Smit M., Davis E., Shay T., Tordoir X., Takeda H., Caiment P., and co*ckett N. 2004. Toward molecular understanding of polar overdominance at the ovine callipyge locus. Cold Spring Harbor Symp. Quant. BioI. 69: 477-483. Goldschmidt R.B. 1951. The theory of the gene: Chromosomes and genes. Cold Spring Harbor Symp. Quant. Bioi. 16: 1-11. Haber J.E., Weiffenbach B., Rogers D.T., McCusker J., and Rowe L.B. 1981. Chromosomal rearrangements accompanying yeast matingtype switching: Evidence for a gene-conversion model. Cold Spring Harbor Symp. Quant. BioI. 45: 991-1002. Haig D. 2004. The (dual) origin of epigenetics. Cold Spring Harbor Symp. Quant. BioI. 69: 67. Henikoff S. 1990. Position-effect variegation after 60 years. Trends Genet. 6: 422-426. Henikoff S., McKittrick E., and Ahmad K. 2004. Epigenetics, histone H3 variants, and the inheritance of chromatin states. Cold Spring Harbor Symp. Quant. Bioi. 69: 235-243. Hernday A.D., Braaten B.A., and Low D.A. 2003. The mechanism by which DNA adenine methylase and PapI activate the pap epigenetic switch. Mol. Cel/12: 947-957. Hodgkin J., Doniach T., and Shen M. 1985. The sex determination pathway in the nematode Caenorhabditis elegans: Variations on a theme. Cold Spring Harbor Symp. Quant. BioI. 50: 585-593. Imai S., Johnson P.B., Marciniak R.A., McVey M., Park P.D., and Guarente L. 2000. Sir2: An NAD-dependent histone deacetylase that connects chromatin silencing, metabolism, and aging. Cold Spring Harbor Symp. Quant. Bioi. 65: 297-302. Jenuwein T. and Allis C.D. 2001. Translating the histone code. Science 293: 1074-1080. Klar A.J., and Bonaduce M.J. 1993. The mechanism of fission yeast mating-type interconversion: Evidence for two types of epigenetically inherited chromosomal imprinted events. Cold Spring Harbor Symp. Quant. BioI. 58: 457-465. Klar A.J., Hicks J.B., and Strathern J.N. 1981. Irregular transpositions of mating-type genes in yeast. Cold Spring Harbor Symp. Quant. BioI. 45: 983-990. Kubicek S. and Jenuwein T. 2004. A crack in histone lysine methylation. Cel/1l9: 903-906. La Volpe A., Taggart M., Macleod D., and Bird A. 1983. Coupled demethylation of sites in a conserved sequence of Xenopus ribosomal DNA. Cold Spring Harbor Symp. Quant. Bioi. 47: 585-592. Laroche T., Martin S.G., Gotta M., Gorham H.C., Pryde P.E., Louis E.J., and Gasser S.M. 1998. Mutation of yeast Ku genes disrupts the subnuclear organization of telomeres. Curro BioI. 8: 653-656. Laurenson P. and Rine J. 1992. Silencers, silencing, and heritable transcriptional states. Microbiol. Rev. 56: 543-560. Lewis E.B. 1985. Regulation of the genes of the bithorax complex in Drosophila. Cold Spring Harbor Symp. Quant. BioI. 50: 155-164. Li E., Beard c., Forster A.C., Bestor T.H., and Jaenisch R. 1993. DNA methylation, genomic imprinting, and mammalian development. Cold Spring Harbor Symp. Quant. BioI. 58: 297-305. Locke J., Kotarski M.A., and Tartof K.D. 1988. Dosage-dependent modifiers of position effect variegation in Drosophila and a mass action model that explains their effect. Genetics 120: 181-198. Lolle S.J., Victor J.L., Young J.M., and Pruitt R.E. 2005. Genome-wide non-mendelian inheritance of extra-genomic information in Arabidopsis. Nature 434: 505-509. Losick R. 1998. Summary: Three decades after sigma. Cold Spring Har-
bor Symp. Quant. BioI. 63: 653-666. Louie A.J., Candido E.P., and Dixon G.H. 1974. Enzymatic modifications and their possible roles in regulating the binding of basic proteins to DNA and in controlling chromosomal structure. Cold Spring Harbor Symp. Quant. BioI. 38: 803-819. Lyon M.P. 1961. Gene action in the X-chromosome of the mouse (Mus musculus L.). Nature 190: 372-373. - - - . 1993. Epigenetic inheritance in mammals. Trends Genet. 9: 123-128. Maine E.M., Salz H.K., Schedl P., and Cline T.W. 1985. Sex-lethal, a link between sex determination and sexual differentiation in Drosophila melanogaster. Cold Spring Harbor Symp. Quant. BioI. 50: 595-604. Martin-Morris L.E., Loughney K., Kershisnik E.O., Poortinga G., and Henikoff S. 1993. Characterization of sequences responsible for trans-inactivation of the Drosophila brown gene. Cold Spring Harbor Symp. Quant. BioI. 58: 577-584. McClintock B. 1951. Chromosome organization and genic expression. Cold Spring Harbor Symp. Quant. BioI. 16: 13-47. - - - . 1956. Controlling elements and the gene. Cold Spring Harbor Symp. Quant. BioI. 21: 197-216. Megee P.c., Morgan B.A., and Smith M.M. 1995. Histone H4 and the maintenance of genome integrity. Genes Dev. 9: 1716-1727. Millar C.B., Kurdistani S.K., and Grunstein M. 2004. Acetylation of yeast histone H4 lysine 16: A switch for protein interactions in heterochromatin and euchromatin. Cold Spring Harbor Symp. Quant. BioI. 69: 193-200. Mirkovitch J., Gasser S.M., and Laemmli U.K. 1987. Relation of chromosome structure and gene expression. Philos. Trans. R. Soc. Lond. B BioI. Sci. 317: 563-574. Mostoslavsky R., Kirillov A., Ji Y.H., Goldmit M., Holzmann M., Wirth T., Cedar H., and Bergman Y. 1999. Demethylation and the establishment of K allelic exclusion. Cold Spring Harbor Symp. Quant. BioI. 64: 197-206. Muller H.J. 1941. Induced mutations in Drosophila. Cold Spring Harbor Symp. Quant. BioI. 9: 151-167. Nance W.E. 1964. Genetic tests with a sex-linked marker: Glucose-6phosphate dehydrogenase. Cold Spring Harbor Symp. Quant. Bioi. 29: 415-425. Nanney D.L. 1958. Epigenetic factors affecting mating type expression in certain ciliates. Cold Spring Harbor Symp. Quant. BioI. 23: 327-335. Nasmyth K.A., Tatchell K., Hall B.D., Astell c., and Smith M. 1981. Physical analysis of mating-type loci in Saccharomyces cerevisiae. Cold Spring Harbor Symp. Quant. BioI. 45: 961-981. Palladino P., Laroche T., Gilson E., Pillus L., and Gasser S.M. 1993. The positioning of yeast telorneres depends on SIR3, SIR4, and the integrity of the nuclear membrane. Cold Spring Harbor Symp. Quant. BioI. 58: 733-746. Pannetier c., Hu-Li J., and Paul W.E. 1999. Bias in the expression of IL-4 alleles: The use of T cells from a GFP knock-in mouse. Cold Spring Harbor Symp. Quant. BioI. 64: 599-602. Peaco*ck w.J., Brutlag D., Goldring E., Appels R., Hinton c.w., and Lindsley D.L. 1974. The organization of highly repeated DNA sequences in Drosophila rpe/aiWgaster chromosomes. Cold Spring Harbor Symp. Quant. Bioi. 38: 405'+416. Pillus L. and Rine J. 1989. EPigenet;Jinheritance of transcriptional states in S. cerevisiae. Cel/59: 637-647. Ptashne M. 2004. A genetic switch: Phage lambda revisited, 3rd edition. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York. Renauld H., Aparicio O.M., Zierath ED., Billington B.L., Chhablani
EP/GENETICS:
S.K., and Gottschling D.E. 1993. Silent domains are assembled continuously from the telomere and are defined by promoter distance and strength, and by SIR3 dosage. Genes Dev. 7: 1133-1145. Rine J., Jensen R., Hagen D., Blair 1., and Herskowitz 1. 1981. Pattern of switching and fate of the replaced cassette in yeast mating-type interconversion. Cold Spring Harbor Symp. Quant. BioI. 45: 951-960. Rubin G.M. 1985. Summary. Cold Spring Harbor Symp. Quant. BioI. 50: 905-908. Rubin G.M., Hazelrigg T., Karess R.E., Laski EA., Laverty T., Levis R., Rio D.C., Spencer EA., and Zuker C.S. 1985. Germ line specificity of P-element transposition and some novel patterns of expression of transduced copies of the white gene .. Cold Spring Harbor Symp. Quant. BioI. 50: 329-335. Rudkin G.T. and Tartof K.D. 1974. Repetitive DNA in polytene chromosomes of Drosophila melanogaster. Cold Spring Harbor Symp. Quant. Bioi. 38: 397-403. Schubeler D., MacAlpine D.M., Scalzo D., Wirbelauer c., Kooperberg c., van Leeuwen E, Gottschling D.E., O'Neill L.P., Turner B.M., Delrow J., et al. 2004. The histone modification pattern of active genes revealed through genome-wide chromatin analysis of a higher eukaryote. Genes Dev. 18: 1263-1271. Schultz J. 1956. The relation of the heterochromatic chromosome regions to the nucleic acids of the cell. Cold Spring Harbor Symp. Quant. BioI. 21: 307-328. Selker E.U., Richardson G.A., Garrett-Engele P.W., Singer M.J., and Miao V. 1993. Dissection of the signal for DNA methylation in the 1;-11 region of Neurospora. Cold Spring Harbor Symp. Quant. Bioi. 58: 323-329. Shapiro L.J. and Mohandas T. 1983. DNA methylation and the control of gene expression on the human X chromosome. Cold Spring HarborSymp. Quant. BioI. 47: 631-637. Shi Y, Lan E, Matson C., Mulligan P., Whetstine J.R., Cole P.A., and Casero R.A. 2004. Histone demethylation mediated by the nuclear amine oxidase hom*olog LSD 1. Cell 119: 941-953. Si K., Lindquist S., and Kandel E. 2004. A possible epigenetic mechanism for the persistence of memory. Cold Spring Harbor Symp. Quant. BioI. 69: 497-498. Solter D., Aronson J., Gilbert S.E, and McGrath J. 1985. Nuclear transfer in mouse embryos: Activation of the embryonic genome. Cold Spring Harbor Symp. Quant. BioI. 50: 45-50. Swift H. 1974. The organization of genetic material in eukaryotes: Progress and prospects. Cold Spring Harbor Symp. Quant. BioI. 38: 963-979. Thompson J.S., Hecht A., and Grunstein M. 1993. Histones and the
FROM
PHENOMENON
TO
FIELD.
13
regulation of heterochromatin in yeast. Cold Spring Harbor Symp. Quant. BioI. 58: 247-256. Tilghman S.M., Bartolomei M.S., Webber A.L., Brunkow M.E., Saam J., Leighton P.A., Pfeifer K., and Zemel S. 1993. Parental imprinting of the H19 and Igf2 genes in the mouse. Cold Spring Harbor Symp. Quant. BioI. 58: 287-295. Vazquez J., Farkas G., Gaszner M., Ddvardy A., Muller M., Hagstrom K., Gyurkovics H., Sipos L., Gausz J., Galloni M., et al. 1993. Genetic and molecular analysis of chromatin domains. Cold Spring Harbor Symp. Quant. BioI. 58: 45-54. Wade P.A., Jones P.L., Vermaak D., Veenstra G.J., Imhof A., Sera T., Tse c., Ge H., Shi Y.B., Hansen J.c., and Wolffe A.P. 1998. Histone deacetylase directs the dominant silencing of transcription in chromatin: Association with MeCP2 and the Mi-2 chromodomain SWI/SNF ATPase. Cold Spring Harbor Symp. Quant. Bioi. 63: 435-445. Wang Y., Wysocka J., Perlin J.R., Leonelli L., Allis CD., and Coonrod S.A. 2004. Linking covalent histone modifications to epigenetics: The rigidity and plasticity of the marks. Cold Spring Harbor Symp. Quant. Bioi. 69: 161-169. Weintraub H. 1974. The assembly of newly replicated DNA into chromatin. Cold Spring Harbor Symp. Quant. BioI. 38: 247-256. - - - . 1993. Summary: Genetic tinkering local problems, local solutions. Cold Spring Harbor Symp. Quant. Bioi. 58: 819-836. Weintraub H., Flint S.J., Leffak 1.M., Groudine M., and Grainger R.M. 1978. The generation and propagation of variegated chromosome structures. Cold Spring Harbor Symp. Quant. Bioi. 42: 401-407. Wickner R.B., Edskes H.K., Ross E.D., Pierce M.M., Baxa D., Brachmann A., and Shewmaker E 2004a. Prion genetics: New rules for a new kind of gene. Annu. Rev. Genet. 38: 681-707. Wickner R.B., Edskes H.K., Ross E.D., Pierce M.M., Shewmaker E, Baxa D., and Brachmann A. 2004b. Prions of yeast are genes made of protein: Amyloids and enzymes. Cold Spring Harbor Symp. Quant. BioI. 69: 489-496. Willard H.E, Brown CJ., Carrel L., Hendrich B., and Miller A.P. 1993. Epigenetic and chromosomal control of gene expression: Molecular and genetic analysis of X chromosome inactivation. Cold Spring Harbor Symp. Quant. BioI. 58: 315-322. Wood W.B., Meneely P., Schedin P., and Donahue L. 1985. Aspects of dosage compensation and sex determination in Caenorhabditis elegans. Cold Spring Harbor Symp. Quant. BioI. 50: 575-583. Yarmolinsky M.B. 1981. Summary. Cold Spring Harbor Symp. Quant. BioI. 45: 1009-1015. Zheng C, and Hayes p. 2003. Structures and interactions of the core histone tail domains. Biopolymers 68: 539-546.
C
HAP
T
E
R
.2
A Brief History of Epigenetics Gary Felsenfeld National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892-0540
CONTENTS 1. Introduction, 16
5. The Role of Chromatin, 18
2. Clues from Genetics and Development, 16
6. All Mechanisms Are Interrelated, 19
3. DNA Is the Same in All Somatic Cells of an Organism, 17
References, 21
4. The Role of DNA Methylation, 17
15
16 •
CHAPTER
2
1 Introduction
The history of epigenetics is linked with the study of evolution and development. But during the past 50 years, the meaning of the term "epigenetics" has itself undergone an evolution that parallels our dramatically increased understanding of the molecular mechanisms underlying regulation of gene expression in eukaryotes. Our present working definition is "the study of mitotically and/or meiotically heritable changes in gene function that cannot be explained by changes in DNA sequence" (Riggs et al. 1996). Until the 1950s, however, the word epigenetics was used in an entirely different way to categorize all of the developmental events leading from the fertilized zygote to the mature organism-that is, all of the regulated processes that, beginning with the genetic material, shape the final product (Waddington 1953). This concept had its origins in the much earlier studies in cell biology and embryology, beginning in the late 19th century, that laid the groundwork for our present understanding of the relationship between genes and development. There was a long debate among embryologists about the nature and location of the components responsible for carrying out the developmental plan of the organism. In trying to make sense of a large number of ingenious but ultimately confusing experiments involving the manipulation of cells and embryos, embryologists divided into two schools: those who thought that each cell contained preformed elements that enlarged during development, and those who thought the process involved chemical reactions among soluble components that executed a complex developmental plan. These views focused on the relative importance of the nucleus and cytoplasm in the developmental process. Following Flemming's discovery of the existence of chromosomes in 1879, experiments by many . investigators, including Wilson and Boveri, provided strong evidence that the developmental program'feSicled in the chromosomes. Thomas Hunt Morgan (1911) ultimately provided the most persuasive proof of this idea through his demonstration of the genetic linkage of several Drosophila genes to the X chromosome. From that point onward, rapid progress was made in creating linear chromosome maps in which individual genes were assigned to specific sites on the Drosophila chromosomes (Sturtevant 1913). Of course, the questions of classic "epigenesis" remained: What molecules within the chromosomes carried the genetic information, how did they direct the developmental program, and how was the information transmitted during cell division? It was understood that both nucleic acid and proteins were pres-
ent in chromosomes, but their relative contributions were not obvious; certainly, no one believed that the nucleic acid alone could carryall of the developmental information. Furthermore, earlier questions persisted about the possible contribution of the cytoplasm to developmental events. Evidence from Drosophila genetics (see below) suggested that heritable changes in phenotype could occur without corresponding changes in the "genes." This debate was dramatically altered by the identification of DNA as the primary carrier of genetic information. Ultimately, it became useful to redefine epigenetics so as to distinguish heritable changes that arise from sequence changes in DNA from those that do not. 2 Clues from Genetics and Development
Whatever the vagaries of the definition, the ideas and scientific data that underlie the present concept of epigenetics had been accumulating steadily since the early part of the 20th century. In 1930, H.]. Muller (Muller 1930) described a class of Drosophila mutations he called "eversporting displacements" ("eversporting" denoting the high rate of phenotypic change). These mutants involved chromosome translocations (displacements), but "even when all parts of the chromatin appeared to be represented in the right dosage-though abnormally arranged-the phenotypic result was not always normal." In some of these cases, Muller observed flies that had mottled eyes. He thought that this was probably due to a "genetic diversity of the different eye-forming cells;' but further genetic analysis led him to connect the unusual properties with chromosomal rearrangement, and to conclude that "chromosome regions, affecting various characters at once, are somehow concerned, rather than individual genes or suppositious 'gene elements.''' Over the next 10 to 20 years, strong evidence provided by many laboratories (see Hannah 1951) confirmed that this variegation arose when rearrangements juxtaposed the white gene with heterochromatic regions. During that period, chromosomal rearrangements of all kinds were the object of a great deal of attention. It was apparent that genes were not completely independent entities; their function could be affected by their location within the genome-as amply demonstrated by the many Drosophila mutants that led to variegation, as well as by other mutants involving translocation to euchromatic regions, in which more general (non-variegating) position effects could be observed. The role of transposable elements in plant genetics also became clear, largely through the work of McClintock (1965).
A
A second line of reasoning came from the study of developmental processes. It was evident that during development there was a divergence of phenotypes among differentiating cells and tissues, and it appeared that such distinguishing features, once established, could be clonally inherited by the dividing cells. Although it was understood at this point that cell-specific programming existed, and that it could be transmitted to daughter cells, how this was done was less clear. A number of mechanisms could be imagined, and were considered. Particularly for those with a biochemical point of view, a cell was defined by the multiple interdependent biochemical reactions that maintained its identity. For example, it was suggested in 1949 by Delbruck (quoted in Jablonka and Lamb 1995) that a simple pair of biochemical pathways, each of which produced as an intermediate an inhibitor of the other pathway, could establish a system that could switch between one of two stable states. Actual examples of such systems were found somewhat later in the lac operon of Escherichia coli (Novick and Weiner 1957) and in the phage switch between lysogenic and lytic states (Ptashne 1992). Functionally equivalent models could be envisioned in eukaryotes. The extent to which nucleus and cytoplasm each contributed to the transmission of a differentiated state in the developing embryo was of course a matter of intense interest and debate; a self-stabilizing biochemical pathway would presumably have to be maintained through cell division. A second kind of epigenetic transmission was clearly demonstrated in Paramecia and other ciliates, in which the ciliary patterns may vary among individuals and are inherited clonally (Beisson and Sonneborn 1965). Altering the cortical pattern by microsurgery results in transmission of a new pattern to succeeding generations. It has been argued that related mechanisms are at work in metazoans, in which the organization of cellular components is influenced by localized cytoplasmic determinants in a way that can be transmitted during cell division (Grimes and Aufderheide 1991). 3 DNA Is the Same in All Somatic Cells of an Organism
Although chromosome morphology indicated that all somatic cells possessed all of the chromosomes, it could not have been obvious that all somatic cells retained the full complement of DNA present in the fertilized egg. Nor until the work of Avery, MacLeod, and McCarty in 1944, and that of Hershey and Chase (1952), was it even clear that a protein-free DNA molecule could carry
BRIEF
HISTORY
OF
EPIGENETICS
•
17
genetic information, a conclusion strongly reinforced by Watson and Crick's solution of the structure of DNA in 1953. Work by Briggs and King (1952) in Rana pipiens and by Laskey and Gurdon (1970) in Xenopus had demonstrated that introduction of a nucleus from early embryonic cells into enucleated oocytes could result in development of an embryo. But as late as 1970, Laskey and Gurdon could state that "It has yet to be proved that somatic cells of an adult animal possess genes other than those necessary for their own growth and differentiation." In the paper containing this statement, they went on to show that to a first approximation, the DNA of a somatic cell nucleus was competent to direct embryogenesis when introduced into an enucleated egg. It was now clear that the program of development, and the specialization of the repertoire of expression seen in somatic cells, must involve signals that are not the result of some deletion or mutation in the germ-line DNA sequence when it is transmitted to somatic cells. Of course, there are ways in which the DNA of somatic cells can come to differ from that of the germ line, with consequences for the cellular phenotype: For example, transposable elements can alter the pattern of expression in somatic cells, as demonstrated by the work of Barbara McClintock and other plant geneticists. Similarly, the generation of antibody diversity involves DNA rearrangement in a somatic cell lineage. This rearrangement (or more precisely its consequences) can be considered a kind of epigenetic event, consistent with the early observations of position-effect variegation described by Muller. -However, much of the work on epigenetics in recent years has focused on systems in which no DNA rearrangements have occurred, and the emphasis has therefore been on modifications to the bases, and to the proteins that are complexed with DNA within the nucleus. 4 The Role of DNA Methylation
X-chromosome inactivation provided an early model of this kind of epigenetic mechanism (Ohno et al. 1959; Lyon 1961); the silenced X chromosome was clearly chosen at random in somatic cells, and there was no evidence of changes in the DNA sequence itself. In part to account for this kind of inactivation, Riggs (1975) and Holliday and Pugh (1975) proposed that DNA methylation could act as an epigenetic mark. The key elements in this model were the ideas that sites of methylation were palindromic, and that distinct enzymes were responsible for methylation of unmodified DNA and DNA already methylated on one strand. It was postulated that the first methylation
18 •
CHAPTER
2
event would be much more difficult than the second; once the first strand was modified, however, the complementary strand would quickly be modified at the same palindromic site. A methylation mark present on a parental strand would be copied on the daughter strand following replication, resulting in faithful transmission of the methylated state to the next generation. Shortly thereafter, Bird took advantage of the fact that the principal target of methylation in animals is the sequence CpG (Doskocil and Sorm 1962) to introduce the use of methylation-sensitive restriction enzymes as a way of detecting the methylation state. Subsequent studies (Bird 1978; Bird and Southern 1978) then showed that endogenous CpG sites were either completely unmethylated or completely methylated. The predictions of the model were thus confirmed, establishing a mechanism for epigenetic transmission of the methylation mark through semiconservative propagation of the methylation pattern. In the years following these discoveries, a great deal of attention has been focused on endogenous patterns of DNA methylation, on the possible transmission of these patterns through the germ line, on the role of DNA methylation in silencing gene expression, on possible mechanisms for initiation or inhibition of methylation at a fully unmethylated site, and on the identification of the enzymes responsible for de novo methylation and for maintenance of methylation on already methylated sites. Although much of the DNA methylation seen in vertebrates is associated with repetitive and retroviral sequences and may serve to maintain these sequences in a permanently silent state, there can be no question that in many cases this modification provides the basis for epigenetic transmission of the state of gene activity. This is most clearly demonstrated at imprinted loci (Cattanach and Kirk 1985) such as the mouse or human Igf2/H19 locus, where one allele is marked by DNA methylation, which in turn contro~pressionfrom both genes (Bell and Felsenfeld 2006; Hark et al. 2000). At the same time, it was clear that this could riot be the only mechanism for epigenetic transmission of information. For example, as noted above, position-effect variegation had been observed many years earlier in Drosophila, an organism that has extremely low levels of DNA methylation. Furthermore, in subsequent years, Drosophila geneticists had identified the Polycomb and Trithorax groups of genes, which appeared to be involved in permanently "locking in" the state of activity, either off or on, respectively, of clusters of genes during development. The fact that these states were stably transmitted during cell division suggested an underlying epigenetic mechanism.
5 The Role of Chromatin It had been recognized for many years that the proteins
bound to DNA in the eukaryotic nucleus, especially the histones, might be involved in modifying the properties of DNA. Well before most of the work on DNA methylation began, Stedman and Stedman (1950) proposed that the histones could act as general repressors of gene expression. They argued that since all somatic cells of an organism had the same number of chromosomes, they had the same genetic complement (although this was not demonstrated until some years later, as noted above). Understanding the subtlety of histone modifications was far in the future, so the Stedmans operated on the assumption that different kinds of cells in an organism must have different kinds of histones in order to generate the observed differences in phenotype. Histones can indeed reduce levels of transcript far below those commonly observed for inactive genes in prokaryotes. Subsequent work addressed the capacity of chromatin to serve as a template for transcription, and asked whether that capacity was restricted in a cell-type-specific manner. In a 1963 paper, Bonner (Bonner et al. 1963) prepared chromatin from a globulin-producing tissue of the pea plant, and showed that when E. coli RNA polymerase was added, and the resulting transcript translated in an in vitro system, globulin could be detected. The result was specific to this tissue. With the advent of hybridization methods, the transcript populations from such in vitro experiments could be examined (Paul and Gilmour 1968) and shown to be specific for the particular tissue from which the chromatin was derived. Other results suggested that this specificity reflected a restriction in access to transcription initiation sites (Cedar and Felsenfeld 1973). Nonetheless, there was a period in which it was commonly believed that the histones were suppressor proteins that passively silenced gene expression. In this view, activating a gene simply meant stripping off the histones; once that was done, it was thought, transcription would proceed pretty much as it did in prokaryotes. There was, however, some evidence that extended regions of open DNA did not exist in eukaryotic cells (Clark and Felsenfeld 1971). Furthermore, even if the naked DNA model was correct, it was not clear how the decision would be made as to which histone-covered regions should be cleared. The resolution of this problem began as early as 1964, when Allfrey (Allfrey et al. 1964) had speculated that histone acetylation might be correlated with gene activation, and that "active" chromatin might not necessarily be stripped of histones. In the ensuing decade, there was
A
great interest in examining the relationship between histone modifications and gene expression. Modifications other than acetylation (methylation and phosphorylation) were identified, but their functional significance was unclear. It became much easier to address this problem after the discovery by Kornberg and Thomas (1974) of the structure of the nucleosome, the fundamental chromatin subunit. The determination of the crystal structure of the nucleosome, first at 7 A and then at 2.8 A resolution, also provided important structural information, particularly evidence for the extension of the histone amino-terminal tails beyond the DNA-protein octamer core, making evident their accessibility to modification (Richmond et al. 1984; Luger et al. 1997). Beginning in 1980 and extending over some years, Grunstein and his collaborators (Wallis et al. 1980; Durrin et al. 1991), applying yeast genetic analysis, were able to show that the histone amino-terminal tails were essential for regulation of gene expression, and for the establishment of silent chromatin domains. The ultimate connection to detailed mechanisms began with the critical demonstration by Allis (Brownell et al. 1996) that a histone acetyltransferase from Tetrahymena was hom*ologous to yeast transcriptional regulatory protein GcnS, providing direct evidence that histone acetylation was connected to control of gene expression. Since then, of course, there has been an explosion of discovery of histone modifications, as well as a reevaluation of the roles of those that were known previously. This still did not answer the question of how the sites for modification were chosen in vivo. It had been shown, for example (Pazin et al. 1994), that Ga14-VP16 could activate transcription from a reconstituted chromatin template in an ATP-dependent manner. Activation was accompanied by repositioning of nucleosomes, and it was suggested that this was the critical event in making the promoter accessible. A fuller understanding of the significance of these findings required the identification of ATP-dependent nucleosome remodeling complexes such as SWI/SNF and NURF (Peterson and Herskowitz 1992; Tsukiyama and Wu 1995), and the realization that both histone modification and nucleosome remodeling were involved in preparing the chromatin template for transcription. It was not clear how information about the state of activity could, employing these mechanisms, be transmitted through cell division; their role in epigenetic transmission of information was thus unclear. The next important step came from the realization that modified histones recruited, in a modification-specific way, proteins that could affect the local structural and functional
BRIEF
HISTORY
OF
EPIGENETICS
19
states of chromatin. It was found, for example, that methylation of histone H3 lysine 9 resulted in the recruitment of the heterochromatin protein HP1 (Bannister et al. 2001; Lachner et al. 2001; Nakayama et al. 2001). Furthermore, HP1 could recruit the enzyme (Suv39 h 1) that is responsible for that methylation. This led to a model for propagation of the silenced chromatin state along the region through a processive mechanism (Fig. 1a). Equally important, it provided a reasonable explanation of how that state could be transmitted and survive through the replication cycle (Fig. 1b). Analogous mechanisms for propagation of an active state have been proposed that involve methylation of histone H3 lysine 4 and the recruitment of Trithorax group proteins (Wysocka et al. 2005). Different kinds of propagation mechanisms have been suggested that depend on variant histones rather than modified histones (Ahmad and Henikoff 2002; McKittrick et al. 2004). Histone H3 is incorporated into chromatin only during DNA replication. In contrast, the histone variant H3.3, which differs from H3 by four amino acids, is incorporated into nucleosomes in a replication-independent manner, and it tends to accumulate in active chromatin, where it is enriched in the "active" histone modifications (McKittrick et al. 2004). It has been proposed that the presence of H3.3 is sufficient to maintain the active state, and that after replication, although it would be diluted twofold, enough H3.3 would remain to maintain the active state. The consequent transcription would result in replacement of H3 containing nucleosomes with H3.3, thus perpetuating the active state in the next generation.
6 All Mechanisms Are Interrelated These models finally begin to complete the connection between modified or variant histones, specific gene activation, and epigenetics, although of course there is much more to be done. Whereas these mechanisms give us some ideas about how the heterochromatic state may be maintained, they do not explain how silencing chromatin structures are first established. It has only recently become clear that this involves the production of RNA transcripts, particularly from repeated sequences, which are processed into small RNAs through the action of proteins such as Dicer, Argonaute, and RNA-dependent RNA polymerase. These RNAs are subsequently recruited to the hom*ologous DNA sites as part of complexes that include components of the Polycomb group of proteins, thus initiating the formation of heterochromatin. There is now also evi-
20 • C HAP
T ER 2
a
b
/)(\~Jn/)(\9n ,
strand A strand B '!
!
"''6'''V
tI strandA~
"''6
"'V
DNA
replication
~strandB
tI
maintenance DNA methylation
Figure 1. Mechanisms for Maintaining a Pattern of DNA Methylation and a Histone Modification during DNA Replication (0) A mechanism for maintaining a pattern of DNA methylation during DNA replication. During replication, the individual DNA strands, with a specific methylation pattern at CpG or CpXpG residues, become paired with a strand of newly synthesized, un methylated DNA. CpG on one strand has a corresponding CpG on the other. The maintenance DNA methyltransferase recognizes a hemimethylated site, and methylates the cytosine on the new strand, so that the pattern of methylation is undisturbed. (b) A general mechanism for maintaining a histone modification during replication. The modified histone tail (m) interacts with a protein binder (pb) that has a binding site specific for that modification. pb, in turn, has a specific site for the enzyme (e) which carries out that histone modification. e, in turn, can then modify an adjacent nucleosome. During replication, the newly deposited histones which are interspersed with parental histones can thus acquire the parental modification. A similar mechanism would allow propagation of histone modifications from a modified region into an unmodified one at any stage of the cell cycle.
dence that the same mechanisms are required for maintenance of at least some heterochromatic regions. In a way, these stable cyclic reaction pathways are reminiscent of Delbruck's 50-year-old model, of a stable biochemical cycle that maintain~ate of the organism. We now knm/ of countless examples of epigenetic mechanisms at work in the organism. In addition to imprinting at many loci, and the allele-specific and random X-chromosome inactivation described above, there are epigenetic phenomena involved in antibody expression, where the rearrangement of the immunoglobulin genes on one chromosome is selectively inhibited, and in the selection for expression of single odorant receptor genes in olfactory neurons (Chess et al. 1994; Shykind et al. 2004). In Drosophila, the Polycomb group genes are responsible for establishing a silenced chromatin domain that is maintained through all subsequent cell divisions.
Epigenetic changes are also responsible for paramutation in plants, in which one allele can cause a heritable change in expression of the hom*ologous allele (Stam et al. 2002). This is an example of an epigenetic state that is inherited meiotically as well as mitotically, a phenomenon documented in plants but only rarely in animals (Jorgensen 1993). Much of the evidence for the mechanisms described above has come from work on the silencing of mating-type locus and centromeric sequences in Schizosaccharomyces pombe (Hall et al. 2002). In addition, the condensed chromatin structure characteristic of centromeres in organisms as diverse as flies and humans has been shown to be transmissible through centromereassociated proteins rather than DNA sequence. In all of these cases, the DNA sequence remains intact, but its capacity for expression is suppressed. This is likely in all cases to be mediated by DNA methylation, histone mod-
A
ification, or both; in some cases, we already know that to be true. Finally, the epigenetic transmission of "patterns;' described above for Paramecia, now extends to the prion proteins, which maintain and propagate their alternatively folded state to daughter cells. Although this has been presented as a sequential story, it should more properly be viewed as a series of parallel and overlapping attempts to define and explain epigenetic phenomena. The definition of the term epigenetics has changed, but the questions about mechanisms of development raised by earlier generations of scientists have not. Contemporary epigenetics still addresses those central questions. Seventy years have passed since Muller described what is now called position-effect variegation. It is gratifying to trace the slow progress from observation of phenotypes, through elegant genetic studies, to the recent analysis and resolution at the molecular level. With this knowledge has come the understanding that epigenetic mechanisms may in fact be responsible for a considerable part of the phenotype of complex organisms. As is often the case, an observation that at first seemed interesting but perhaps marginal to the main issues turns out to be central, although it may take a long time to come to that realization. References Ahmad K and Henikoff S. 2002. The histone variant H3.3 marks active chromatin by replication-independent nucleosome assembly. Mol. Cell 9: 1191-1200. Allfrey V.G., Faulkner R., and Mirsky A.E. 1964. Acetylation and methylation of histones and their possible role in the regulation of RNA synthesis. Proc. Natl. Acad. Sci. 51: 786-794. Avery O.T, MacLeod CM., and McCarty M. 1944. Studies on the chemical nature of the substance inducing transformation of pneumococcal types. f. Exp. Med. 79: 137-158. Bannister A., Zegerman P., Partridge J., Miska E., Thomas J., Allshire R., and Kouzarides T 2001. Selective recognition of methylated lysine 9 on histone H3 by the HPI chromo domain. Nature 410: 120-124. Beisson J. and Sonneborn TM. 1965. Cytoplasmic inheritance of the organization of the cell cortex in Paramecium aurelia. Proc. Natl. Acad. Sci. 53: 275-282. Bell A.C. and Felsenfeld G. 2000. Methylation of a CTCF-dependent boundary controls imprinted expression of the Igf2 gene. Nature 405: 482-485. Bird A.P. 1978. Use of restriction enzymes to study eukaryotic DNA methylation. II. The symmetry of methylated sites supports semiconservative copying of the methylation pattern. f. Mol. Bioi. 118: 49-60. Bird A.P. and Southern E.M. 1978. Use of restriction enzymes to study eukaryotic DNA methylation. I. The methylation pattern in ribosomal DNA from Xenopus laevis. f. Mol. BioI. 118: 27-47. Bonner J., Huang R.C, and Gilden R.Y. 1963. Chromosomally directed protein synthesis. Proc. Natl. Acad. Sci. 50: 893-900. Briggs R. and King n. 1952. Transplantation of living nuclei from blas-
B R f E F H f 5 TOR Y
aF
E P f G ENE TIC 5
•
21
tula cells into enucleated frogs' eggs. Proc. Natl. Acad. Sci. 38: 455-463. Brownell J.E., Zhou J., Ranalli T., Kobayashi R., Edmondson D.G., Roth S.Y., and Allis CD. 1996. Tetrahymena histone acetyltransferase A: A hom*olog to yeast Gcn5p linking histone acetylation to gene activation. Cell 84: 843-851. Cattanach B.M. and Kirk M. 1985. Differential activity of maternally and paternally derived chromosome regions in mice. Nature 315: 496-498. Cedar H. and Felsenfeld G. 1973. Transcription of chromatin in vitro. f. Mol. BioI. 77: 237-254. Chess A., Simon I., Cedar H., and Axel R. 1994. Allelic inactivation regulates olfactory receptor gene expression. Cell 78: 823-834. Clark R,J. and Felsenfeld G. 1971. Structure of chromatin. Nat. New BioI. 229: 101-106. Doskocil J. and Sorm P. 1962. Distribution of 5-methylcytosine in pyrimidine sequences of deoxyribonucleic acids. Biochim. Biophys. Acta 55: 953-959. Durrin L.K, Mann R.K., Kayne P.S., and Grunstein M. 1991. Yeast histone H4 N-terminal sequence is required for promoter activation in vivo. Cell 65: 1023-1031. Grimes G.W. and Aufderheide K.J. 1991. Cellular aspects of pattern formation: The problem of assembly. Monogr. Dev. BioI. 22: 1-94. Hall I.M., Shankaranarayana G.D., Noma K., Ayoub N., Cohen A., and qrewal S.I. 2002. Establishment and maintenance of a heterochromatin domain. Science 297: 2215-2218. Hannah A. 1951. Localization and function of heterochromatin in Drosophila melanogaster. Adv. Genet. 4: 87-125. Hark A.T., Schoenherr CJ., Katz D.J., Ingram R.S., Levorse J.M., and Tilghman S.M. 2000. CTCF mediates methylation-sensitive enhancer-blocking activity at the H19/lgf2 locus. Nature 405: 486-489. Hershey A.D. and Chase M. 1952. Independent functions of viral protein and nucleic acid in growth of bacteriophage. f. Gen. Physiol. 36: 39-56. Holliday R. and Pugh J.E. 1975. DNA modification mechanisms and gene activity during development. Science 187: 226-232.. Jablonka E. and Lamb M.J. 1995. Epigenetic inheritance and evolution: The Lamarckian dimension. Oxford University Press, New York, p.82. Jorgensen R. 1993. The germinal inheritance of epigenetic information in plants. Philos. Trans. R. Soc. Land. B BioI. Sci. 339: 173-181. Kornberg R.D. and Thomas J.O. 1974. Chromatin structure; oligomers of the histones. Science 184: 865-868. Lachner M., O'Carroll D., Rea S., Mechtler K., and Jenuwein T. 2001. Methylation of histone H3 lysine 9 creates a binding site for HPI proteins. Nature 410: 116-120. Laskey R.A. and Gurdon J.B. 1970. Genetic content of adult somatic cells tested by nuclear transplantation from cultured cells. Nature 228: 1332-1334. Luger K., Mader A.W., Richmond R.K, Sargent D.P., and Richmond T.J. 1997. Crystal structure ofthe nucleosome core particle at 2.8 A resolution. Nature 389: 251-260. Lyon M.P. 1961. Gene action in the X-chromosome of the mouse. Nature 190: 372-373. McClintock B. 1965. The control of gene action in maize. Brookhaven Symp. BioI. 18: 162-184. McKittrick E., Gafken P.R., Ahmad K, and Henikoff S. 2004. Histone H3.3 is enriched in covalent modifications associated with active chromatin. Proc. Natl. Acad. Sci. 101: 1525-1530. Morgan T. 1911. An attempt to analyze the constitution of the chromo-
22 • C HAP
T ER 2
somes on the basis of sex-linked inheritance in Drosophila. f. Exp. Zool. 11: 365-414. Muller H.J. 1930. Types of visible variations induced by X-rays in Drosophila. f. Genet. 22: 299-334. Nakayama J., Rice J.c., Strahl B.D., Allis C.D., and Grewal S.l. 2001. Role of histone H3 lysine 9 methylation in epigenetic control of heterochromatin assembly. Science 292: 110-113. Novick A. and Weiner M. 1957. Enzyme induction as an all-or-none phenomenon. Proc. Natl. Acad. Sci. 43: 553-566. Ohno S., Kaplan W.D., and Kinosita R. 1959. Formation of the sex chromatin by a single X-chromosome in liver cells of Rattus norvegicus. Exp. Cell Res. 18: 415-418. Paul J. and Gilmour R.S. 1968. Organ-specific restriction of transcription in mammalian chromatin. f. Mol. BioI. 34: 305-316. Pazin M,J., Kamakaka R.T., and Kadonaga J.T. 1994. ATP-dependent nucleosome reconfiguration and transcriptional activation from preassembled chromatin templates. Science 266: 2007-2011. Peterson c.L. and Herskowitz 1. 1992. Characterization of the yeast SWIl, SWI2, and SWI3 genes, which encode a global activator of transcription. Cell 68: 573-583. Ptashne M. 1992. A genetic switch: Phage A and higher organisms, 2nd edition. Blackwell Science, Malden, Massachusetts and Cell Press, Cambridge, Massachusetts. Richmond T.J., Finch J.T., Rushton B., Rhodes D., and Klug A. 1984. Structure of the nucleosome core particle at 7 A resolution. Nature 311: 532-537. Riggs A.D. 1975. X inactivation, differentiation, and DNA methylation. Cytogenet. Cell Genet. 14: 9-25. Riggs A.D. and Porter T.N. 1996. Overview of epigenetic mechanisms.
In Epigenetic mechanisms ofgene regulation (ed. Y.E.A. Russo et al.), pp. 29-45. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York. Riggs A.D., Martienssen R.A., and Russo Y.E.A. 1996. Introduction. In Epigenetic mechanisms of gene regulation (ed. Y.E.A. Russo et al.), pp. 1-4. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York. Shykind B.M., Rohani S.c., O'Donnell S., Nemes A., Mendelsohn M., Sun Y., Axel R., and Barnea G. 2004. Gene switching and the stability of odorant receptor gene choice. Cell 117: 801-815. Starn M., Belele c., Dorweiler J., and Chandler Y. 2002. Differential chromatin structure with a tandem array 100 kb upstream of the maize bl locus is associated with paramutation. Genes Dev. 16: 1906-1918. Stedman E. and Stedman E. 1950. Cell specificity of histones. Nature 166: 780-781. Sturtevant A. 1913. The linear arrangement of six sex-linked factors in Drosophila, as shown by their mode of association. f. Exp. Zool. 14: 43-59. Tsukiyama T. and Wu C. 1995. Purification and properties of an ATPdependent nucleosome remodeling factor. Cell 83: 1011-1020. Waddington C.H. 1953. Epigenetics and evolution. Symp. Soc. Exp. BioI. 7: 186-199. Wallis J.W., Hereford L., and Grunstein M. 1980. Histone H2B genes of yeast encode two different proteins. Cell 22: 799-805. Wysocka J., Swigut T., Milne T. Dou Y., Zhang X., Burlingame A., Roeder R., Brivanlou A., and Allis C.D. 2005. WDR5 associates with histone H3 methylated at K4 and is essential for H3 K4 methylation and vertebrate development. Cell 121: 859-872.
c
HAP
T
E
R
3
Overview and Concepts C. David Allis,l Thomas Jenuwein, 2 and Danny Reinberg 3 lThe Rockefeller University, New York, New York; 2Research Institute of Molecular Pathology, Vienna, Austria; 3UMDNJ-Robert Wood Johnson Medical School, Piscataway, New Jersey
CONTENTS 1. Genetics Versus Epigenetics, 25
10. RNAi and RNA-directed Gene Silencing, 42
2. Model Systems for the Study of Epigenetics, 26
11. From Unicellular to Multicellular Systems, 44
3. Defining Epigenetics, 28
12. Polycomb and Trithorax, 45
4. The Chromatin Template, 29
13. X Inactivation and Facultative Heterochromatin, 47
5. Higher-Order Chromatin Organization, 31
14. Reprogramming of Cell Fates, 49
6. The Distinction between Euchromatin and Heterochromatin, 34
15. Cancer, 50
7. Histone Modifications and the Histone Code, 36 8. Chromatin-remodeling Complexes and Histone Variants, 39
16. What Does Epigenetic Control Actually D07,52 17. Big Questions in Epigenetic Research, 55 References, 56
9. DNA Methylation, 41
23
GENERAL SUMMARY The DNA sequencing of the human genome and the genomes of many model organisms has generated considerable excitement within the biomedical community and the general public over the past several years. These genetic "blueprints" that exhibit the well-accepted rules of Mendelian inheritance are now readily available for close inspection, opening the door to improved understanding of human biology and disease. This knowledge is also generating renewed hope for novel therapeutic strategies and treatments. Many fundamental questions nonetheless remain. For example, how does normal development proceed, given that every cell has the same genetic information, yet follows a different developmental pathway, realized with exact temporal and spatial precision? How does a cell decide when to divide and differentiate, or when to retain an unchanged cellular identity, responding and expressing according to its normal developmental program? Mistakes made in the above processes can lead to the generation of disease states such as cancer. Are these mistakes encoded in faulty genetic blueprints that we inherited from one or both of our parents, or are there other layers of regulatory information that are not being properly read and decoded? In humans, the genetic information (DNA) is organized into 23 chromosome pairs consisting of approximately 25,000 genes. These chromosomes can be compared to libraries with different sets of books that together instruct the development of a complete human being. The DNA sequence of our genome is composed of about 3 x 109 bases, abbreviated by the four letters (or bases) A, C, G, and T within its sequence, giving rise to well-defined words (genes), sentences, chapters, and books. However, what dictates when the different books are read, and in what order, remains far from clear. Meeting this extraordinary challenge is likely to reveal insights into how cellular events are coordinated during normal and abnormal development. When summed across all chromosomes, the DNA molecule in higher eukaryotes is about 2 meters long and therefore needs to be maximally condensed about 1O,OOO-foid to fit into a cell's nucleus, the compartment of a cell that stores our genetic material. The wrapping of DNA around "spools" of proteins, so-called histone proteins, provides an elegant solution to this packaging problem, giving rise to a repeating protein:DNA polymer known as chromatin. However, in packaging DNA to better fit into a confined space, a problem develops, much as
when one packs too many books onto library shelves: It becomes harder to find and read the book of choice, and thus, an indexing system is needed. Chromatin, as a genome-organizing platform, provides this indexing. Chromatin is not uniform in structure; it comes in different packaging designs from a highly condensed chromatin fiber (known as heterochromatin) to a less compacted type where genes are typically expressed (known as euchromatin). Variation can enter into the basic chromatin polymer through the introduction of unusual histone proteins (known as histone variants), altered chromatin structures (known as chromatin remodeling), and the addition of chemical flags to the histone proteins themselves (known as covalent modifications). Moreover, addition of a methyl group directly to a cytosine (C) base in the DNA template (known as DNA methylation) can provide docking sites for proteins to alter the chromatin state or affect the covalent modification of resident histones. Recent evidence suggests that noncoding RNAs can "guide" specialized regions of the genome into more compacted chromatin states. Thus, chromatin should be viewed as a dynamic polymer that can index the genome and potentiate signals from the environment, ultimately determining which genes are expressed and which are not. Together, these regulatory options provide chromatin with an organizing principle for genomes known as "epigenetics," the subject of this book. In some cases, epigenetic indexing patterns appear to be inherited through cell divisions, providing cellular "memory" that may extend the heritable information potential of the genetic (DNA) code. Epigenetics can thus be narrowly defined as changes in gene transcription through modulation of chromatin, which is not brought about by changes in the DNA sequence. In this overview, we explain the basic concepts of chromatin and epigenetics, and we discuss how epigenetic control may give us the clues to solve some long-standing mysteries, such as cellular identity, tumorigenesis, stem cell plasticity, regeneration, and aging. As readers comb through the chapters that follow, we encourage them to note the wide range of biological phenomena uncovered in a diverse range of experimental models that seem to have an epigenetic (non-DNA) basis. Understanding how epigenetics operates in mechanistic terms will likely have important and far-reaching implications for human biology and human disease in this "post-genomic" era.
a v E R V lEW 1 Genetics Versus Epigenetics
Determining the structural details of the DNA double helix stands as one of the landmark discoveries in all of biology. DNA is the prime macromolecule that stores genetic information (Avery et a1. 1944), and it propagates this stored information to the next generation through the germ line. From this and other findings, the "central dogma" of modern biology emerged. This dogma encapsulates the processes involved in maintaining and translating the genetic template required for life. The essential stages are (1) the self-propagation of DNA by semiconservative replication; (2) transcription in a unidirectional 5' to 3' direction, templated by the genetic code (DNA), generation of an intermediary messenger RNA (mRNA); (3) translation of mRNA to produce polypeptides consisting of linear amino to carboxyl strings of amino acids that are colinear with the 5' to 3' order of DNA. In simple terms: DNA H RNA ~ protein. The central dogma accommodates feedback from RNA to DNA by the process of reverse transcription, followed by integration into existing DNA (as demonstrated by retroviruses and retrotransposons). However, this dogma disavows feedback from protein to DNA, although a new twist to the genetic dogma is that rare proteins, known as prions, can be inherited in the absence of a DNA or RNA template. Thus, these specialized self-aggregating proteins have properties that resemble some properties of DNA itself, including a mechanism for replication and information storage (Cohen and Prusiner 1998; Shorter and Lindquist 2005). Additionally, emerging evidence suggests that a remarkably large fraction of our genome is transcribed into "noncoding" RNAs. The function of these noncoding RNAs (i.e., non-protein-encoding except tRNAs, rRNAs, snoRNAs) is under active investigation and is only beginning to become clear in a limited number of cases. The origin of epigenetics stems from long-standing studies of seemingly anomalous (i.e., non-Mendelian) and disparate patterns of inheritance in many organisms (see Chapters 1 and 2 for a historical overview). Classic Mendelian inheritance of phenotypic traits (e.g., pea color, number of digits, or hemoglobin insufficiency) results from allelic differences caused by mutations of the DNA sequence. Collectively, mutations underlie the definition of phenotypic traits, which contributes to the determination of species boundaries. These boundaries are then shaped by the pressures of natural selection, as explained by Darwin's theory of evolution. Such concepts place mutations at the heart of classic genetics. In contrast, non-Mendelian inheritance (e.g., variation of embryonic growth, mosaic skin coloring, random X inac-
AND
CON C E P T 5
25
tivation, plant paramutation) (Fig. 1) can manifest, to take one example, from the expression of only one (of two) alleles within the same nuclear environment. Importantly, in these circ*mstances, the DNA sequence is not altered. This is distinct from another commonly referred to non-Mendelian inheritance pattern that arises from the maternal inheritance of mitochondria (Birky 2001). The challenge for epigenetic research is captured by the selective regulation of one allele within a nucleus. What distinguishes two identical alleles, and how is this distinction mechanistically established and maintained through successive cell generations? What underlies differences observed in monozygotic ("identical") twins that make them not totally identical? Epigenetics is sometimes cited as one explanation for the differences in outward traits, by translating the influence of the environment, diet, and potentially other external sources to the expression of the genome (Klar 2004; see Chapters 23 and 24). Determining what components are affected at a molecular level, and how alterations in these components affect human biology and human disease, is a major challenge for future studies. Another key question in the field is, How important is the contribution of epigenetic information for normal development? How do normal pathways become dysfunctional, leading to abnormal development and neoplastic transformation (i.e., cancer)? As mentioned above, "identical" twins share the same DNA sequence, and as such, their phenotypic identity is often used to underscore the defining power of genetics. However, even twins such as these can exhibit outward phenotypic differences, likely imparted by epigenetic modifications that occur over the lifetime of the individuals (Fraga et a1. 2005). Thus, the extent to which epigenetics is important in defining cell fate, identity, and phenotype remains to be fully understood. In the case of tissue regeneration and aging, it remains unclear whether these processes are dictated by alterations in the genetic program of cells or by epigenetic modifications. The intensity of research on a global scale testifies to the recognition that the field of epigenetics is a critical new frontier in this post-genomic era. In the words of others, "We are more than the sum of our genes" (Klar 1998), or "You can inherit something beyond the DNA sequence. That's where the real excitement in genetics is now" (Watson 2003). The overriding motivation for deciding to edit this book was the general belief that we and all the contributors to this volume could transmit this excitement to future generations of students, scientists, and physicians, most of whom were taught genetic, but not epigenetic, principles governing inheritance and chromosome segregation.
26 •
C HAP T E R 3
Figure 1. Biological Examples of Epigenetic Phenotypes
Barr body polytene chromosomes
twins
epigenetic biology yeast mating types
cloned cat
mutant plant
~ lEU]
blood smear
tumor tissue
2 Model Systems for the Study of Epigenetics
The study of epigenetics necessarily requires good experimental models, and as often is the case, these models seem at first sight far removed from studies using human (or mammalian) cells. Collectively, however, results from many systems have yielded a wealth of knowledge. The historical overviews (Chapters 1 and 2) make reference to several important landmark discoveries that have emerged from early cytology, the growth of genetics, the birth of molecular biology, and relatively new advances in chromatinmediated gene regulation. Different model organisms (Fig. 2) have been pivotal in addressing and solving the various questions raised by epigenetic research. Indeed, seemingly disparate epigenetic discoveries made in various model organisms have served to unite the research community. The purpose of this section is to highlight some of these major findings, which are discussed in more detail in the following chapters of this book. As readers note these discoveries, they should focus on the fundamental principles that investigations using these model systems have exposed; their collective contributions point more often to common concepts than to diverging details. Unicellular and "lower" eukaryotic organisms-Saccharomyces cerevisiae, Schizosaccharomyces pombe, and
Epigenetic phenotypes in a range of organisms and cell types, all attributable to non-genetic differences. Twins: Slight variations partially attributable to epigenetics (© Randy Harris, New York). Barr body: The epigenetically silenced X chromosome in female mammalian cells, visible cytologically as condensed heterochromatin. Polytene chromosomes: Giant chromosomes in Drosophila salivary glands, ideally suited for correlating genes with epigenetic marks (reprinted from Schotta et al. 2003 [©SpringerD. Yeast mating type: Sex is determined by the active MAT locus, while copies of both mating-type genes are epigenetically silenced (©Alan Wheals, University of Bath). Blood smear: Heterogeneous cells of the same genotype, but epigenetically determined to serve different functions (courtesy Prof. Christian Sillaber). Tumor tissue: Metastatic cells (left) showing elevated levels of epigenetic marks in the tissue section (reprinted, with permission, from Seligson et al. 2005 [©Macmillan]). Mutant plant: Arabidopsis flower epiphenotypes, genetically identical, with epigenetically caused mutations (reprinted, with permission, from Jackson et al. 2002 [©MacmillanD. Cloned cat: Genetically identical, but with varying coat-color phenotype (reprinted, with permission, from Shin et al. 2002 [©Macmillan]).
Neurospora crassa-permit powerful genetic analyses, in part facilitated by a short life cycle. Mating-type (MAT) switching that occurs in S. cerevisiae (Chapter 3) and S. pombe (Chapter 6) has provided remarkably instructive examples, demonstrating the importance of chromatinmediated gene control. In the budding yeast S. cerevisiae, the unique silent information regulator (SIR) proteins were shown to engage specific modified histones. This was preceded by elegant experiments using genetics to document the active participation of histone proteins in gene regulation (Clark-Adams et al. 1988; Kayne et al. 1988). In the fission yeast S. pombe, the patterns of histone modification operating as activating and repressing signals are remarkably similar to those in metazoan organisms. This has opened the door for powerful genetic screens being employed to look for gene products that suppress or enhance the silencing of genes. Most recently, a wealth of mechanistic insights linking the RNA interference (RNAi) machinery to the induction of histone modifications acting to repress gene expression was discovered in fission yeast (Hall et al. 2002; Volpe et al. 2002). Shortly afterward, the RNAi machinery was also implicated in transcriptional gene silencing in the plant Arabidopsis thaliana, underscoring the potential importance of this regulation in a wide range of organisms (see Section 10).
a v E R V lEW
AND
CON C E P T S
27
S.cerevisiae
ospomb' epigenetic model organisms
c. elegans
Tetrahymena
maize
Arabidopsis
Other "off-beat" organisms have also made disproportionate contributions toward unraveling epigenetic pathways that at first seemed peculiar. The fungal species, N. crassa, revealed the unusual non-Mendelian phenomenon of repeat-induced point mutation (RIP) as a model for studying epigenetic control (Chapter 6). Later, this organism was used to demonstrate the first functional connection between histone modifications and DNA methylation (Tamaru and Selker 2001), a finding later extended to "higher" organisms (Jackson et al. 2002). Ciliated protozoa, such as Tetrahymena and Paramecium, commonly used in biology laboratories as convenient microscopy specimens, facilitated important epigenetic discoveries because of their unique nuclear dimorphism. Each cell carries two nuclei: a somatic macronucleus that is transcriptionally active, and a germ-line micronucleus that is transcriptionally inactive. Using macronuclei as an enriched starting source of "active" chromatin, the biochemical purification of the first nuclear histone-modifying enzyme-a histone acetyltransferase or HAT-was made (Brownell et al. 1996). Ciliates are also well known for their peculiar phenomenon of programmed DNA elimination during their sexual life cycle, triggered by small noncoding RNAs and histone modifications (Chapter 7). In multicellular organisms, genome size and organismal complexity generally increase from invertebrate (Caenorhabditis elegans, Drosophila melanogaster) or
Figure 2. Model Organisms Used in Epigenetic Research Schematic representation of model organisms used in epigenetic research. S. cerevisiae: Mating-type switching to study epigenetic chromatin control. S. pombe: Variegated gene silencing manifests as colony sectoring. Neurospora crassa: Epigenetic genome defense systems include repeat-induced point mutation, quelling, and meiotic silencing of unpaired DNA, revealing an interplay between RNAi pathways, DNA and histone methylation. Tetrahymena: Chromatin in somatic and germ-line nuclei are distinguished byepigenetically regulated mechanisms. Arabidopsis: Model for repression by DNA, histone, and RNA-guided silencing mechanisms. Maize: Model for imprinting, para mutation, and transposon-induced gene silencing. C. elegans: Epigenetic regulation in the germ line. Drosophila: Position-effect variegation (PEV) manifest by clonal patches of expression and silencing of the white gene in the eye. Mammals: X-chromosome inactivation.
plant (A. thaliana) species to "higher," and to some, "more relevant," vertebrate organisms (mammals). Plants, however, have been pivotal to the field of epigenetics, providing a particularly rich source of epigenetic discoveries (Chapter 9) ranging from transposable elements and paramutation (McClintock 1951) to the first description of noncoding RNAs involved in transcriptional silencing (Ratcliff et al. 1997). Crucial links between DNA methylation, histone modification, and components of the RNAi machinery came through plant studies. The discovery of plant epialleles, with comic names such as SUPERMAN and KRYPTONITE (e.g., Jackson et al. 2002), and several vernalizing genes (Bastow et al. 2004; Sung and Amasino 2004) have further provided the research field with insights into understanding the developmental role of epigenetics and cellular memory. Plant meristem cells have also offered the opportunity to study crucial questions such as somatic regeneration and stem cell plasticity (see Chapters 9 and 11). For understanding animal development, Drosophila has been an early and continuous genetic powerhouse. Based on the pioneering work of Muller (1930), many developmental mutations were generated, including the homeotic transformations and position-effect variegation (PEV) mutants explained below (also see Chapter 5). The homeotic transformation mutants led to the idea that there could be regulatory mechanisms for establishing and
28
C HAP T E R 3
maintammg cellular identity/memory which was later shown to be regulated by the Polycomb and trithorax systems (see Chapters 11 and 12). For PEV, gene activity is dictated by the surrounding chromatin structure and not by primary DNA sequence. This system has been a particularly informative source for dissecting factors involved in epigenetic control (Chapter 5). Over 100 suppressors of variegation [Su(var)] genes are believed to encode components of heterochromatin. Without the foundation established by these landmark studies, the discovery of the first histone lysine methyltransferases (HKMTs) (Rea et al. 2000) and the resultant advances in histone lysine methylation would not have been possible. As is often the case in biology, comparable screens have been carried out in fission yeast and in plants, identifying silencing mutants with functional conservation with the Drosophila Su(var) genes. The use of reverse genetics via RNAi libraries in the nematode worm C. elegans has contributed to our understanding of epigenetic regulation in metazoan development. There, comprehensive cell-fate tracking studies, detailing all the developmental pathways of each cell, have highlighted the fact that Polycomb and trithorax systems probably arose with the emergence of multicellularity (see Sections 12 and 13). In particular, these mechanisms of epigenetic control are essential for gene regulation in the germ line (see Chapter 15). The role of epigenetics in mammalian development has mostly been elucidated in the mouse, although a number of studies have been translated to diverse human cell lines and primary cell cultures. The advent of gene "knock-out" and "knock-in" technologies has been instrumental for the functional dissection of key epigenetic regulators. For instance, the Dnmtl DNA methyltransferase mutant mouse provided functional insight for the role of DNA methylation in mammals (Li et al. 1992). It is embryonic-lethal and shows impaired imprinting (see Chapter 18). Disruption of DNA methylation has also been shown to cause genomic instability and reanimation of transposon activity, particularly in germ cells (Walsh et al. 1998; Bourc'his and Bestor 2004). There are approximately 100 characterized chromatin-regulating factors (i.e., histone and DNA-modifying enzymes, components of nucleosome remodeling complexes and of the RNAi machinery) that have been disrupted in the mouse. The mutant phenotypes affect cell proliferation, lineage commitment, stem cell plasticity, genomic stability, DNA repair, and chromosome segregation processes, in both somatic and germ cell lineages. Not surprisingly, most of these mutants are also involved in disease development and cancer. Thus, many of the key advances in epigenetic
control took advantage of unique biological features exhibited by many, if not all, of the above-mentioned model organisms. Without these biological processes and the functional analyses (genetic and biochemical) that delved into them, many of the recent advances in epigenetic control would have remained elusive. 3 Defining Epigenetics
The above discussion begs the question, What is the common thread that allows diverse eukaryotic organisms to be connected with respect to fundamental epigenetic principles? Different epigenetic phenomena are linked largely by the fact that DNA is not "naked" in all organisms that maintain a true nucleus (eukaryotes). Instead, the DNA exists as an intimate complex with specialized proteins, which together comprise chromatin. In its simplest form, chromatin-i.e., DNA spooled around nucleosomal units consisting of small histone proteins (Kornberg 1974)-was initially regarded as a passive packaging molecule to wrap and organize the DNA. Distinctive forms of chromatin arise, however, through an array of covalent and non-covalent mechanisms that are being uncovered at a rapid pace (see Section 6). This includes a plethora of posttranslational histone modifications, energy-dependent chromatin-remodeling steps that mobilize or alter nucleosome structures, the dynamic shuffling of new histones (variants) in and out of nucleosomes, and the targeting role of small noncoding RNAs. DNA itself can also be modified covalently in many higher eukaryotes, by methylation at the cytosine residue, usually but not always, of CpG dinucleotides. Together, these mechanisms provide a set of interrelated pathways that all create variation in the chromatin polymer (Fig. 3). Many, but not all, of these modifications and chromatin changes are reversible and, therefore, are unlikely to be propagated through the germ line. Transitory marks are attractive because they impose changes to the chromatin template in response to intrinsic and external stimuli (Jaenisch and Bird 2003), and in so doing, regulate the access and/or processivity of the transcriptional machinery, needed to "read" the underlying DNA template (Sims et al. 2004; Chapter 10). Some histone modifications (like lysine methylation), methylated DNA regions, and altered nucleosome structures can, however, be stable through several cell divisions. This establishes "epigenetic states" or means of achieving cellular memory, which remain poorly appreciated or understood. From this perspective, chromatin "signatures" can be viewed as a higWy organized system of information storage that can index distinct regions
o
GENETICS
EPIGENETICS
!
muta';o", mod
remodeler
ncRNAs
inherited
stable?
germ line
soma
species
variability
Figure 3. Genetics Versus Epigenetics GENETICS: Mutations (red stars) of the DNA template (green helix) are heritable somatically and through the germ line. EPIGENETlCS: Variations in chromatin structure modulate the use of the genome by (1) histone modifications (mod), (2) chromatin remodeling (remodeler), (3) histone variant composition (yellow nuc/eosome), (4) DNA methylation (Me), and (5) noncoding RNAs. Marks on the chromatin template may be heritable through cell division and collectively contribute to determining cellular phenotype.
of the genome and accommodate a response to environmental signals that dictate gene expression programs. The significance of having a chromatin template that can potentiate the genetic information is that it provides multidimensional layers to the readout of DNA. This is perhaps a necessity, given the vast size and complexity of the eukaryotic genome, particularly for multicellular organisms (see Section 11 for further details). In such organisms, a fertilized egg progresses through development, starting with a single genome that becomes epigenetically programmed to generate a multitude of distinct "epigenomes" in more than 200 different types of cells (Fig. 4). This programmed variation has been proposed to constitute an "epigenetic code" that significantly extends the information potential of the genetic code (StraW and Allis 2000; Turner 2000; Jenuwein and Allis 2001). Although this is an attractive hypothesis, we stress that more work is needed to test this and related provocative theories. Other alternative viewpoints are being advanced which argue that clear combinatorial "codes;' lilee the triplet genetic code, are not lil 25,000 GENES
> 14,000 GENES
> 25,000 GENES
> 25,000 GENES
16 chromosomes
3 chromosomes
5 chromosomes
4 chromosomes
20 chromosomes
23 chromosomes
gene (2 kb)
.•
p
gene (5 kb)....
gene (2 kb)···
p
gene (50 kb) ...
p repeats
simple vs. complex gene organization
Figure 15. Pie Charts of Organismal Genome Organization Genome sizes are indicated for the major model organisms used in epigenetic research at the top of each pie chart. The increase in genome size correlates with the vast expansion of noncoding (i.e., intronic, intergenic, and interspersed repeat sequences) and repeat DNA (e.g., satellite, LINE, SINE DNA) sequences in more complex multicellular organisms. This expansion is accompanied by an increase in the number of epigenetic mechanisms (particularly repressive) that regulate the genome. Expansion of the genome also correlates with an increase in size and complexity of transcription units, with the exception of plants; they have evolved mechanisms that are intolerant to insertions or duplications within the transcription unit. P = Promoter DNA element.
maintains totipotency of its epigenome and what mechanisms are involved in erasing, establishing, and maintaining cell fate (cell memory). Because one germ cell can give rise to another germ cell, it essentially has an infinite proliferative potential, as do unicellular "immortal" organisms. However, to fulfill this role, germ cells are for the most part "resting" and unresponsive to external stimuli, so that integrity of their epigenome can be protected. Indeed, mammalian oocytes can be retained in a resting state for more than 40 years. Similarly, adult stem cells (multipotent) are largely a dormant cell population, proliferating (and self-renewing) only when activated by mitogenic stimuli to enter a restricted number of cell divisions. Thus, the makeup of the epigenome is challenged by many intrinsic (e.g., transcription, DNA replication, chromosome segregation) and external (e.g., cytokines, hormones, DNA damage, or general stress responses) signals, particularly if somatic differentiation has forced cells to leave the protective germ-cell and stem-cell environment. 12 Polycomb and Trithorax
Among some of the main effectors that can transduce signals to the chromatin template and participate in maintaining cellular identity (i.e., provide cellular memory) are members of the PcG and trxG groups of genes (Ringrose
and Paro 2004). These genes were discovered in Drosophila by virtue of their role in the developmental regulation of the Hox gene cluster and homeotic gene regulation. PcG and trxG have since been shown to be key regulators for cell proliferation and cellular identity in multicellular eukaryotes. In addition, these groups of genes are involved in several signaling cascades that respond to mitogens and morphogens; regulate stem cell identity and proliferation, vernalization in plants, homeotic transformations and transdetermination, lineage commitment during B- and T-cell differentiation, and many other aspects of metazoan development (see Chapters 11 and 12). We now briefly address what is known about how the PcG and trxG families of genes convert developmental cues into an "epigenetic memory" through chromatin structure. The PcG and trxG groups of proteins function for the most part antagonistically: The PcG family of proteins establish a silenced chromatin state and the trxG family of proteins in general propagate gene activity. The molecular identification of the Pc gene known to stabilize patterns of gene repression over several cell generations provided the first evidence for a molecular mechanism for cellular or epigenetic memory. As well, PC provided an example of a chromodomain-containing protein with a high degree of similarity to the chromodomain of the heterochromatin-associated protein HP1 (Paro and Hog-
46 •
C HAP T E R
3
ness 1991). As mentioned above, chromodomains are well documented to be specific histone methyl-lysine binding modules (illustrated in Fig. 10). Approximately 20 PcG genes and at least 15 distinct trxG genes have been identified in Drosophila. Functional analyses have shown that these groups of genes constitute a spectrum of diverse proteins yet are higWy conserved between eukaryotes. PcG genes encode products that include DNA-binding proteins (e.g., TIl), histone-modifying enzymes (e.g., Ezh2), and other repressive chromatin-associated factors that contain a chromodomain with affinity for H3K27me3 (e.g., PC). trxG genes encode transcription factors (e.g., GAGA or Zeste), ATP-dependent chromatin-remodeling enzymes (e.g., Brahma), and HKMTs such as Ash1 and Trx (or its mammalian hom*ologs MLL, Setl, and the MLL family). In most instances, the trxG and PcG families of proteins function as components of diverse complexes to establish stable chromatin structures that facilitate the expression or silencing of developmentally regulated genes (see Chapters 11 and 12). Despite recent advances, the mechanism by which PcG- or trxG-containing complexes are targeted to developmentally regulated chromatin regions is not well understood. In Drosophila, heritable gene repression requires the recruitment of PcG protein complexes to DNA elements called polycomb response elements (PREs). Equivalent sequences in mammals have remained elusive. It is unclear how PcG protein complexes cause long-range silencing in a PRE-dependent manner, because PREs are usually located kilobases from the transcription start site of target genes. It can be postulated that repulsion or recruitment of PcG complexes may be discriminated by changes in transcriptional activity, or differences in productive versus nonproductive mRNA processing (Pirrotta 1998; Dellino et a1. 2004; Schmitt et a1. 2005). Current models support PcG binding through interaction with DNA-binding proteins and the affinity of the chromodomain within the PC protein for H3K27me3-modified histones (Cao et a1. 2002). However, PcG complexes can also associate in vitro with nucleosomes that lack histone tails (Francis et al. 2004), and furthermore, PRE elements have reduced nucleosome density (Schwartz et al. 2005). The most logical explanation for some of these disparate observations is that PcG binding in vivo would initially require interaction with DNA-bound factors that is then stabilized by association with nucleosomes and modified H3K27me3 in the adjacent chromatin region. Clearly, more research is needed to link existing evidence of how PcG complexes are targeted to regions of chromatin and how they medi-
ate repression. This is likely to be organism-dependent, because there is great heterogeneity in the PcG complexes (see Chapter 11). Trithorax group proteins maintain in general an active state of gene expression at target genes and overcome (or prevent) PcG-mediated silencing. This transition is even less well understood, but recent evidence suggests that an RNA-based mechanism could provide the trigger for the recruitment of Ash 1 to target promoters (Sanchez-Elsner et a1. 2006). A number of transient and stable changes in chromatin structure are thought to ensue, perhaps facilitated by intergenic transcription that can establish an open chromatin domain and mediate active histone replacement. Documented chromatin changes include the incorporation of "active" histonelysine methylation marks by trxG HKMTs such as Trx and Ash1, and the reading of these marks (e.g., the WDR5 recognition of H3K4me; Wysocka et a1. 2005). The action of trxG ATP-dependent chromatin-remodeling factors such as Brahma is also required, although how these mechanisms interrelate has yet to be fully determined (for more detail, see Chapter 12). Many PcG and trxG proteins cooperate to maintain a tightly controlled level of repressed heterochromatin versus active euchromatin in a normal cell. In mammalian somatic interphase nuclei, the nuclear morphology reveals that constitutive domains of pericentromeric heterochromatin are grouped into 15-20 foci (see Fig. 16). Deregulation of cell fate and proliferation control, which leads to developmental abnormalities and cancer, frequently displays abnormal nuclear morphologies. For example, the nuclear organization in PML-Ieukemia (related to mixed lymphocyte leukemia [MLL]) cells shows an absence of pericentromeric foci (Di Croce 2005). In contrast, senescent (nonproliferating) cells display a nuclear morphology with large ectopic heterochromatin clusters (Narita et a1. 2003; Scaffidi et al. 2005). Thus, nuclear morphology appears to be a good marker for distinguishing between normal and aberrant cell states, indicating that nuclear architecture may yet playa regulatory role in maintaining specialized domains of chromatin. The study of histone modification levels is another indicator of cell normality or abnormality. Many of these changes are attributed to the deregulation of PcG (e.g., Ezh2) or trxG (e.g., MLL) HKMTs, contributing to the progression and even metastatic potential of a tumor (see Section 15). Indeed, the increase in overall levels of either of the above-mentioned proteins is associated with increased risk of prostate cancer, breast cancer, multiple myeloma, or leukemia (Lund and van Lohuizen 2004;
a v E R V lEW Valk-Lingbeek et al. 2004). In other cases of neoplastic transformation, there is a manifest decrease in repressive histone marks and increase in overall acetylation states (Seligson et al. 2005) causing elevated levels of gene transcription and genomic instability. Clearly, changes in the global control of chromatin, possibly through perturbation of histone-modifying enzymes, affects the functionality of the genome and disrupts the proper gene expression profile of a normal cell. In the case of cellular senescence, an increase in repressive histone marks is also an indicator of cellular dysfunction. This, concomitant with reduced definition of histone acetylation, can reinforce and even increase the levels of silent chromatin, blocking cellular plasticity and driving cells into an antiproliferative state (Scaffidi et al. 2005). This is largely an age-related effect, although the disease state, progeria, can prematurely advance aging. Conversely, when repressive pericentromeric methyl marks are decreased in mutants lacking the transducing enzyme (Suv39h), cells display increased rates of immortalization, no longer senesce, and show greater rates of genomic instability (Braig et al. 2005). These examples illustrate that chromatin deregulation, demonstrated by the levels of characteristic histone marks, often trans-
AND
CON C E P T S
~
oocyte
r·········· "- germ cells
L
~
13 X Inactivation and Facultative Heterochromatin
PeG-mediated gene silencing and X-chromosome inactivation are prime examples for developmentally regulated transitions between active and inactive chromatin states (see Fig. 17), often referred to as facultative heterochromatin. This is in contrast to constitutive heterochromatin (at, e.g., pericentromeric domains), which may by default be induced at noncoding and highly repetitive regions. Facultative heterochromatin occurs at coding regions of the genome, where gene silencing is dependent on, and sometimes reversible by, developmental decisions specifying distinct cell fates. One of the best-studied examples for facultative heterochromatin formation is the inactivation of one of the two X chromosomes in female mammals to equalize the dosage of X-linked gene expression with males that possess only one X (and a heteromorphic Y) chromosome (Chapter 17). Here, chromosome-wide gene silencing of
_
-~-
somatic cells ............ llstress ll
CELLULAR IDENTITY
Polycomb and Trithorax
47
duced by PeG and trxG enzymes, and nuclear morphology, is proving to be an important indicator of disease progression.
"protected"
~fertilized.
•
......... ·stress"
···········stress ll
Figure 16. Cellular Identity by PcG and trxG Proteins Two cell compartments are established during embryogenesis, distinguished by their differentiation potency: They are germ cells (totipotent) and somatic cells (including stem cells) with restricted differentiation potentials. The plasticity of a germ or stem cell's genome expression potential is reflected in reduced levels of repressive histone marks which are no longer visible at pericentromeric foci. Normal proliferating cells typically have a nuclear morphology showing 15-20 heterochromatic foci. Polycomb- and Trithorax-containing complexes operate in specifying the epigenetic and, hence, cellular identity of different lineages. They also function in response to external "stress" stimuli, promoting cellular proliferation and appropriate gene expression. Loss of genome plasticity and proliferation potential occurs in senescent (aging) cells, reflected by abnormally large heterochromatic foci and an overall increased level of repressive histone marks. Highly proliferating tumor cells, however, exhibit changes in the balance of repressive and activating histone marks through the deregulation of PcG and trxG histone-modifying enzymes. This is accompanied by perturbed nuclear morphology.
48
C HAP T E R 3
the inactive X chromosome (Xi) induces a high degree of Xi compaction that is visible as the Barr body, localized in the nuclear periphery of female mammalian cells. How the two alleles of the X chromosomes are counted and how one particular X chromosome is chosen for inactivation are challenging questions in today's epigenetic research. X inactivation involves a large (~17 kb) noncoding RNA, Xist, which appears to act as the primary trigger for chromatin remodeling at the Xi. Although there is the potential to form dsRNA between Xist and the antisense transcript Tsix (expressed only before the onset of X inactivation), no compelling evidence exists for RNAidependent mechanisms being involved in the initiation of X inactivation. The X-inactivation center (XIC) and likely DNA "entry" or "docking" sites (postulated to be specialized repetitive DNA elements that are enriched on the X chromosomes) playa role for Xist RNA to associate and function as a scaffolding molecule, decorating the Xi in cis. Xist promotes the recruitment and action of both PRCl (polycomb respressive complex) and PRC2 complexes, involved in establishing a stable inactive X chromosome. PRC2 components include, for example, the HKMT chromatin-modifying enzyme, EZH2, which catalyzes H3K27me3. PRC1 complex binding may be promoted by both H3K27me3 and histone-modification-independent means, whereas other components of
euchromatic gene repression (e.g. Polycomb)
facultative heterochromatin
the complex, such as the Ring1 proteins, ubiquitinate H2A. Such is the heterogeneity of PcG complexes that different components can act independently of other complex components. The chromatin modifications, PcG complex binding, the subsequent incorporation of the histone variant macroH2A along the Xi, and extensive DNA methylation all contribute to generating a facultative heterochromatin structure along the entire Xi chromosome. Once a stable heterochromatic structure is established, Xist RNA is no longer required for its maintenance (Avner and Heard 2001; Heard 2005). A similar form of monoallelic silencing is genomic imprinting, which also uses a noncoding or antisense RNA to silence one allelic copy in a parent-of-origin-specific manner (Chapter 19). It is currently not clear whether and how Dicer-mutant mouse ES cells would affect the processes of X inactivation or genomic imprinting. The general paradigm of dosage compensation, a classic epigenetically controlled mechanism, has also been addressed in other model organisms, notably C. elegans (Meyer et al. 2004; Chapter 15) and Drosophila (Gilfillan et al. 2004; Chapter 16). It is not yet clear whether dosage compensation occurs in birds, despite the fact that they are heterogametic organisms. In Drosophila, dosage compensation between the sexes occurs not by X inactivation in the female, but by a twofold up-regulation from the single X chromosome in the male. Intriguingly, two non-
constitutive heterochromatin
- - -..~.........t ---aberrant RNA 111
dsRNAs=
!
!
Dicer
mis-processed RNA ???
dispersed nuclear distribution
Barr body female cells
Figure 17. RNA Directed Induction of Repressed Chromatin States
heterochromatic foci
Different forms of silent chromatin have different primary signals, but many are likely to be RNA transcript-related (from aberrant transcripts, to Xist RNA, to dsRNAs), depending on the nature of the underlying DNA sequence. This triggers the establishment of a collection of chromatin changes, including a combination of histone modifications (H3K9, H3K27, and H4K20 methylation), the binding of repressive proteins or complexes (e.g., PC or HP1) to the chromatin, DNA methylation, and the presence of histone variants (e.g., macroH2A on the inactive X chromosome). Facultative or constitutive heterochromatin shows visible clustering in the nucleus. Euchromatic repression cannot be determined by nuclear morphology patterns.
a v E R V lEW coding RNAs, roXl and roX2, are known to be essential components, and their expression is male-specific. Although similar mechanistic details probably exist between flies and mammals, it is clear that activating chromatin remodeling and histone modifications, notably MOF-dependent H4K16 acetylation on the male X chromosome, plays a key role in Drosophila dosage compensation. Exactly how histone-modifying activities, such as the MOF histone acetyltransferase, are targeted to the male X chromosome remains a challenge for future studies. Furthermore, ATP-dependent chromatin-remodeling activities, such as nucleosome-remodeling factor (NURF), are thought to antagonize the activities of the dosage compensation complex (DCC). Together, this section and Sections 10 and 11 have described mechanisms for RNA-directed chromatin modifications, as they occur for constitutive heterochromatin, the Xi chromosome, and, possibly, also PcG-mediated gene silencing. On the basis of the intriguing parallels, one might postulate that an RNA moiety(s) or unpaired DNA would provide an attractive primary trigger for stabilizing PcG complexes at PREs or compromised promoter function, where they may "sense" the quality of transcriptional processing. Aberrant or stalled elongation and/or splicing errors could spur the interaction between PRE-bound PcG and a promoter, resulting in transcriptional shutdown. Thus, initiation of PcG silencing would be induced by the transition from productive to nonproductive transcription. The extent to which trxG complexes may utilize RNA quality control and/or processing of primary RNA transcripts as part of maintaining transcriptional "ON" states is beginning to be unraveled (Sanchez-Elsner et al. 2006). 14 Reprogramming of Cell Fates
The question of how cell fate can be altered or reversed has long intrigued scientists. The germ cell and early embryonic cells distinguish themselves from other cell compartments as the "ultimate" stem cell by their innate totipotency. Although cell-fate specification in mammals allows for around 200 different cell types, there are, in principle, two major differentiation transitions: from a stem (pluripotent) cell to a fully differentiated cell, and between a resting (quiescent or Go) and a proliferating cell. These represent the extreme endpoints among many intermediates, consistent with a multitude of different makeups of the epigenome in mammalian development. During embryogenesis, a dynamic increase of epigenetic modifications is detected in the transition from the fertil-
AND
CON C E P T S
•
49
ized oocyte to the blastocyst stage, and then at implantation, gastrulation, organ development, and fetal growth. Most of these modifications or imprints may be erased via transfer of a differentiated cell nucleus to the cytoplasm of an enucleated oocyte. However, some marks may persist, thereby restricting normal development of cloned embryos, and a few could even be inherited as germ-line modifications (g-mod) (see Fig. 18), which, in mammals, are likely to include DNA methylation. Liver regeneration and muscle cell repair are exceptions of mammalian tissues that can regenerate in response to damage or injury, although most other tissues are unable to be reprogrammed. In other organisms, such as plants and Axolotl, certain somatic cells can actually reprogram their epigenome and reenter the cell cycle to regenerate lost or damaged tissue (Tanaka 2003). In general, however, reprogramming of somatic cells is not possible unless they are engineered to recapitulate early development upon nuclear transfer (NT) into an enucleated oocyte. This was first demonstrated in cloned frogs (Xenopus), and more recently by the generation of Dolly, the first cloned mammal (Campbell et al. 1996; see Chapter 22). Three major obstacles to efficient somatic reprogramming in mammals have been identified. First, certain somatic epigenetic marks (e.g., repressive H3K9me3) are stably transmitted through somatic cell divisions and resist reprogramming in the oocyte. Second, a somatic cell nucleus is unable to recapitulate the asymmetry of reprogramming that occurs in the fertilized embryo as a consequence of the differential epigenetic marks inherited by the male and female haploid genomes (see Mayer et al. 2000; van der Heijden et al. 2005; Chapter 20). Third, transmission of imprinted loci that are particularly important in fetal and placental development is not faithfully maintained upon NT (Morgan et al. 2005). Most cloned embryos abort, suggesting that perturbed epigenetic imprints represent a major bottleneck for normal development and could be the cause for the poor efficiencies of assisted reproductive technologies (ART) and the reduced vigor of cloned animals. The use of embryonic stem cells versus somatic cells shows greatly enhanced reprogramming potential. The demonstration that quiescent cells (a frequent characteristic of stem cells) have a reduction in global H3K9me3 and H4K20me3 states could be a factor indicating enhanced plasticity of the epigenome (Baxter et al. 2004). This is also consistent with the fact that "immortal" unicellular organisms (e.g., yeast) with a largely open and active genome lack several repressive epigenetic mechanisms.
50 • C HAP
T ER
3
embryonic g-modmodmod mod mod-
differentiated
normal
mod
NT? mod modtumor
Figure 18. Reprogramming by Nuclear Transfer During the lifetime of an individual, epigenetic modifications (mod) are acquired in different cell lineages (left). Nuclear transfer (NT) of a somatic cell reverses the process of terminal differentiation, eradicating the majority of epigenetic marks (mod); however, some modification that would also be present in the germ line (g-mod) cannot be removed. During neoplastic transformation (from a normal to tumor cell), caused by a series of genetic mutations (red stars), epigenetic lesions accumulate. The epigenetic lesions (mod), but not the mutations, can be erased through reprogramming upon NT. This approach evaluates the interplay between genetic and epigenetic contributions to tumorigenesis. (Figure adapted from R. jaenisch.)
Another feature of normal epigenetic reprogramming in mammals, postfertilization, is its distinct asymmetry. This can first be attributed to different programs of epigenetic specification in the male and female germ cells (Chapters 19 and 20). The sperm genome is largely made up of protamines, although there is a residual but significant level of CENP-A (an H3 histone variant) and other putative epigenetic imprints (Kimmins and Sassone-Corsi 2005), whereas the oocyte is made up of regular nucleosome-containing chromatin. Once fertilized, the sperm and oocyte haploid genomes have another cycle of reprogramming involving DNA demethylation and exchange of histone variants. The modifications can either enhance or balance epigenetic differences of the two parental genomes before nuclear fusion, in the first cell cycle. During differentiation of embryonic (i.e., inner cell mass [ICM]) and extraembryonic (i.e., trophectoderm [TE] and placenta) tissues, different DNAmethylation and histone-modification profiles are established between lineages (Morgan et al. 2005). Somatic cloning cannot faithfully recapitulate these patterns of reprogramming, showing rapid but less extensive demethylation in the first cell cycle, and perturbed DNA methylation and histone lysine methylation between ICM and TE cells. A closely related concern in somatic cell reprogramming is the fate of imprinted gene loci. For normal embryonic development to proceed, correct allelic expression at imprinted loci is required (Chapter 19).
This was demonstrated by the seminal experiments that generated uniparental embryos (Barton et al. 1984; McGrath and Solter 1984; Surani et al. 1984). Androgenetic embryos (both genomes are of male origin) exhibited retarded embryonic development but hyperproliferation of extraembryonic tissues (e.g., placenta). In gyno- or parthenogenetic embryos (both genomes are of female origin), the placenta is underdeveloped. A parent-specific imprint must therefore be established in the germ cell following erasure of preexisting marks (Chapter 20). It is believed that this occurs for approximately 100 or more imprinted genes, largely involved in systems of resource provision for embryonic and placental development (e.g., Igf2 growth factor). Intriguingly, there is evidence that imprinting may be perturbed during in vitro culture of embryos produced by ART or nuclear transfer (Maher 2005). 15 Cancer
There is a delicate balance between self-renewal and differentiation. Neoplastic transformation (also similarly referred to as tumorigenesis) is regarded as the process whereby cells undergo a change involving uncontrolled cell proliferation, a loss of checkpoint control tolerating the accumulation of chromosomal aberrations and genomic aneuploidies, and mis-regulated differentiation (Lengauer et al. 1998). It is commonly thought to be caused by at least one genetic lesion, such as a point mutation, a deletion, or a translocation, disrupting either a tumor suppressor gene or an oncogene (Hanahan and Weinberg 2000). Tumor suppressor genes become silenced in tumor cells. Oncogenes are activated through dominant mutations or overexpression of a normal gene (proto-oncogene). Importantly, an accumulation of aberrant epigenetic modifications is also associated with tumor cells (see Chapter 24). The epigenetic changes involve altered DNA methylation patterns, histone modifications, and chromatin structure (see Fig. 19). Thus, neoplastic transformation is a complex multistep process involving the random activation of oncogenes and/or the silencing of tumor suppressor genes, through genetic or epigenetic events, and is referred to as the "Knudson twohit" theory (Feinberg 2004; Feinberg and Tyko 2004). To illustrate, silencing of the retinoblastoma (Rb) gene, a tumor suppressor, causes loss of checkpoint control, which not only provides a proliferative advantage, but also promotes a "second hit" by affecting downstream functions related to chromatin structure which maintain genome integrity (Gonzalo and Blasco 2005). Inappro-
OVERVIEW
priate activation of an oncogenic product such as the myc gene can have a similar effect (Knoepfler et al 2006). One question raised by current research is, To what extent do aberrant epigenetic changes contribute to the incidence and overall behavior of a tumor? This was addressed by NT experiments using a melanoma cell nucleus as the donor (Hochedlinger et a1. 2004). Any genetic lesions of the donor cell remain; however, NT erases the epigenetic makeup. The tumor incidence of cloned mouse fetuses was then studied, indicating that the spectrum of tumors that arose de novo varied greatly, consistent with different contributions of epigenetic modifications in different tissues that trigger neoplastic progression. DNA hypomethylation (as opposed to hypermethylation) can occur at discrete loci or over Widespread chromosomal regions. DNA hypomethylation was, in fact, the first type of epigenetic transition to be associated with cancer (Feinberg and Vogelstein 1983). This has turned out to be a widespread phenotype of cancer cells. At the individual gene level, DNA hypomethylation can be neoplastic due to the activation of proto-oncogenes, the derepression of genes that cause aberrant cell function, or the biallelic expression of imprinted genes (also termed loss of imprinting or LO!) (see Chapters 23 and 24). On a more global genomic scale, broad DNA hypomethylation, particularly at regions of constitutive heterochromatin, predisposes cells to chromosomal translocations and aneuploidies that contribute to cancer progression. This effect is recapitulated in Dnmtl mutants (Chen et a1. 1998). The genomic instability that ensues when there is DNA hypomethylation is due likely
AND
CONCEPTS
51
to the mutagenic effect of transposon reactivation. With attention turning to the essential role that repressive histone modifications play in maintaining heterochromatin at centromeres and telomeres, evidence has emerged that if these marks are lost, genome instability also results, contributing to cancer progression (Gonzalo and Blasco 2005). Conversely, DNA hypermethylation is concentrated at the promoter regions of CpG islands in many cancers. Silencing of tumor suppressor genes through such aberrant DNA hypermethylation is particularly critical in cancer progression. Recent studies have revealed that there is considerable cross talk between chromatin modifications and DNA methylation, demonstrating that more than one epigenetic mechanism can be involved in the silencing of a tumor suppressor gene. As an illustration, it is known that the tumor suppressor genes, p 16 and hMLH1, are silenced by both DNA methylation and repressive histone lysine methylation in cancer (McGarvey et a1. 2006). The deregulation of chromatin modifiers is implicated in many forms of cancer. Certain histone-modifying enzymes become oncogenic, such as the PcG protein EZH2 and the trxG protein MLL, and exert their effect through perturbing a cell's epigenetic identity, which consequently either transcriptionally silences or activates inappropriate genes (Schneider et a1. 2002; Valk-Lingbeek et a1. 2004). It is clear that the epigenetic identity is crucial to cellular function. In fact, the pattern of global acetyl and methyl histone marks is proving to be a hallmark for the progression of certain cancers, as demonstrated by a study in prostate tumor progression (Seligson et a1. 2005).
Figure 19. Epigenetic Modifications in Cancer
a
normal
tumor (0) Aberrant epigenetic marks at cancer-
OFF
oncogene
tumor suppressor ON
oncogene
ON
tumor suppressor
OFF
...
b 5-aza zebularine
...
SAHA ~
Dnmt inhibitors tumor suppressor
OFF
HDAC inhibitors tumor suppressor ON
causing loci typically involve the derepression of oncogenes or silencing of tumor suppressor genes. Epigenetic marks known to alter a normal cell include DNA methylation, repressive histone methylation, and histone deacetylation. (b) The use of epigenetic therapeutic agents for the treatment of cancer has consequences on the chromatin template, illustrated for a tumor suppressor locus. Exposure to Dnmt inhibitors results in a loss of DNA methylation, and exposure to HDAC inhibitors results in the acquisition of histone acetyl marks and subsequent downstream modifications, including active histone methyl marks and the incorporation of histone variants. The cumulative chromatin changes lead to gene re-expression.
52 •
C HAP T E R 3
The development of drug targets inhibiting the function of the chromatin-modifying effector enzymes has opened up a new horizon for cancer therapeutics (see Fig. 19). The use of DNMT and HDAC inhibitors is in the most advanced stages of clinical trials in this new generation of cancer therapeutics. Zebularine and SAHA are, respectively, two such inhibitors. They are particularly beneficial for cancer cells that have repressed tumor suppressor genes (J.e. Cheng et al. 2004; Garcia-Manero and Issa 2005; Marks and Jiang 2005), because treatment leads to transcriptional stimulation. A major proportion of repressive histone lysine methylation is lost during treatment, most probably due to transcription-coupled histone exchange and nucleosome replacement; however, these inhibitors do not significantly alter H3K9me3 at target promoter regions (McGarvey et al. 2006). It remains to be resolved whether repressive marks that persist could induce subsequent re-silencing of tumor suppressor genes when treatment is paused, thereby counteracting the benefit of "epigenetic therapy." It is possible that a dual epigenetic therapy strategy, using DNMT and HDAC inhibitors, may promise a better prognosis in clinical trials. Identification of inhibitors to other classes of histone modifiers, namely HKMTs and PRMTs, is currently in the development phase. There are approximately 50 SET domain HKMTs alone in the mammalian genome. Most of the well-characterized enzymes, such as SUV39H, EZH2, MLL, and RIZ, have already been implicated in tumor development (Schneider et al. 2002). Thus, highthroughput screens (HTS) are being employed in efforts to identify small-molecule inhibitors that could be used in exploratory research and, eventually, cancer therapy. All the classes of histone-modifying enzymes are suited for such an approach, as their specific substrate-binding sites (i.e., to histone peptides), in contrast to generic cofactor (e.g., acetyl-CoA and SAM) binding sites, would allow more selective drug development. HTS have been successful for HDACs (Su et al. 2000), PRMTs (D. Cheng et al. 2004), and HKMTs (Greiner et al. 2005). For the transfer of knowledge to occur from basic to applied research, both hypothesis-driven and empirical approaches are required to ultimately define the efficacy and usefulness of any histone-modifying enzyme inhibitor. For instance, selective HKMT inhibitors against MLL or EZH2 may be valuable therapeutic agents for leukemia or prostate cancer. Alternatively, the use of a SUV39H HKMT inhibitor, which would seem counterintuitive because of the necessity of this enzyme in maintaining constitutive heterochromatin and genome
stability, may still preferentially sensitize tumor cells. In addition, analysis of the HDAC inhibitor SAHA has revealed that it may operate through additional pathways that are distinct from transcriptional reactivation (Marks and Jiang 2005). For example, HDAC inhibitors can also sensitize chromatin lesions, inhibiting efficient DNA repair and permitting genomic instabilities that can trigger apoptosis in tumor cells. These observations will have to be monitored when assessing the efficacy of dual combination therapies. Judging from the results to date, however, it is conceivable that combination therapy using HDAC and HKMT inhibitors may be more selective in killing pro-neoplastic cells by driving them into information overflow and chromatin catastrophe. It is hoped that continued research will identify the viable candidates for efficient epigenetic cancer therapy. 16 What Does Epigenetic Control Actually Do?
Approximately 10% of the protein pool encoded by the mammalian genome plays a role in transcription or chromatin regulation (Swiss-Prot database). Given that the mammalian genome consists of 3 X 10 9 bp, it must accommodate ~ 1 X 10 7 nucleosomes. This gives rise to an overwhelming array of possible regulatory messages, including DNA-binding interactions, histone modifications, histone variants, nucleosome remodeling, DNA methylation, and noncoding RNAs. Yet, the process of transcriptional regulation alone is quite intricate, often requiring the assembly of large multiprotein complexes (> 100 proteins) to ensure initiation, elongation, and correct processing of messenger RNA from a single selected promoter. If DNA sequence-specific regulation is so elaborate, one would expect the lower-affinity associations along the dynamic DNA-histone polymer to be even more so. On the basis of these considerations, rarely will there will be one modification that correlates with one epigenetic state. More likely, and as experimental evidence suggests, it is the combination or cumulative effect of several (probably many) signals over an extended chromatin region that stabilizes and propagates epigenetic states (Fischle et al. 2003b; Lachner et al. 2003; Henikoff 2005). For the most part, transcription factor binding is transient and lost in successive cell divisions. For persistent gene expression patterns, transcription factors are required at each subsequent cell division. As such, epigenetic control can potentiate a primary signal (e.g., promoter stimulation, gene silencing, centromere definition) to successive (but not indefinite) cell generations by the
a v E R V lEW
heritable transmission of information through the chromatin template (Fig. 20). Interestingly, in S. pombe, Swi6-dependent epigenetic variegation can be suppressed for many cell divisions during both mitosis and meiosis (Grewal and Klar 1996) by histone modifications (most probably H3K9me2). Analogous studies were performed in Drosophila using a pulse of an activating transcription factor to transmit cellular memory for Hox gene expression during the female germ line (Cavalli and Paro 1999). In both of these examples, epigenetic memory is mediated by chromatin alterations that comprise distinct histone modifications and, most likely, also the incorporation of histone variants. If histone modifications function together, an imprint may be left on the chromatin template that will help to mark nucleosomes, particularly if a signal is reestablished after DNA replication (Fig. 20). For even more stable inheritance, collaboration between histone modifications, histone variant incorporation, and chromatin remodeling will convert an extended chromatin region into persistent structural alterations that can then be propagated over many cell divisions. Although explained for the inheritance of transcriptional "ON" states, a similar synergy between repressive epigenetic mechanisms will more stably lock silenced chromatin regions, which is further reinforced by additional DNA methylation. The DNA double helix can be viewed then as a selforganizing polymer which, through its ordering into chromatin, can respond to epigenetic control and amplify a primary signal into a more long-term "memory." In addition, many histone modifications probably evolved in response to intrinsic and external stimuli. In keeping with this, chromatin-modifying enzymes require cofactors, such as ATP (kinases), acetyl-CoA (HATs), and SAM (HKMTs), whose levels are dictated by environmental changes (e.g., diet). Thus, the altered conditions can be translated into a more dynamic or stable DNA-histone polymer. An excellent example is the NAD-dependent HDAC, Sir2, which acts as "sensor" for nutrients and life span/aged cells (Guarente and Picard 2005; Rine 2005). Understanding how these environmental cues are cast into biologically relevant epigenetic signatures, and how they are read, translated, and inherited, lies at the heart of current epigenetic research. It is, however, important to stress that epigenetic control requires an intricate balance between many factors and that functional interaction is not always faithfully reestablished after each cell division. This is a functional contrast with genetics, which involves alteration of the DNA sequence, which is always stably propagated
AND
CON C E P T 5
53
through mitosis and meiosis, if the mutation occurs in the germ line. An important question arising from the above considerations is how the information contained in the chromatin is maintained from mother to daughter cells. If a cell loses its identity, through disease, misregulation, or reprogramming, is this identity loss accompanied by changes in chromatin structure? Bulk synthesis of most core histones is highly regulated during the cell cycle. Transcription of the core histone genes generally occurs during the S phase, the stage when DNA is replicated (replication coupled). This "coordination" assures that as the amount of DNA is doubled in the cell, there are sufficient core histones to be deposited onto the newly replicated DNA, and thus, the packaging of the DNA occurs simultaneously with DNA replication. As presented above, various regions of chromatin may have distinct differences in histone modifications that program the region to be either transcribed or not. How do domains of the newly synthesized daughter chromatin retain this crucial information for appropriate gene expression? How is the program faithfully templated from one cell generation to the next, or through meiosis and germ-cell formation (sperm and egg)? These central questions await future investigation. Although initial studies indicated a semiconservative process, wherein a new H3/H4 tetramer is deposited, followed by the incorporation of two new H2A/H2B dimers, recent data have challenged this hypothesis. In this recent model, the "new" H3 and H4 polypeptides, which may already carry several posttranslational modifications, are incorporated as newly synthesized H3/H4 histone dimers together with the "old" H3/H4 dimers segregating between the mother and daughter DNA. If this is the case, then the modified, parental H3/H4 dimers would now also be present with the newly synthesized dimers on the same DNA. Their co-presence may then dictate that appropriate modifications are placed on the newly added dimers (Tagami et al. 2004). This model is attractive and might help explain the inheritance of histone modifications, and thus, the propagation of epigenetic information through DNA replication and cell division. However, more evidence is needed to support the validity of this or other intriguing models to explain the transmission of chromatin marks through cell division. In closing this chapter, we ask, Does epigenetic control differ in a fundamental way from basic genetic principles? Although we may wish to view Waddington's epigenetic landscape as being demarcated patches of activating versus repressive histone modifications along
54 • C HAP
T ER
3
ON
OFF
OFF
transient signal primary signal
OFF
ON
marked
ON
OFF
ON
ON
ON
recurring signal
+
epigenetic pro g ram
primary signal epigenetic control
~
\.-.. •
J -r-.r..v-rT "
))/j~~))~!,.
..
\.~
partial loss
Figure 20. Epigenetic Potentiation of a Primary Signal (Memory/Inheritance) Classic genetics predicts that gene expression is dependent on the availability and binding of the appropriate panel of transcription factors (TF). Removal of such factors (i.e., a primary signal) results in the loss of gene expression, and thus constitutes a transient activating signal (top). Chromatin structure contributes to gene expression, where some conformations are repressive and others active. The activation of a locus may therefore occur through a primary signal and result in the downstream change in chromatin structure, involving active covalent histone marks (mod) and the replacement of core histones with variants (e.g., H3.3). Through cell division, this chromatin structure may only be reestablished in the presence of an activating signal (denoted "recurring signal"). Epigenetic memory results in the maintenance of a chromatin state through cell division, even in the absence of the primary activating signal. Such a memory system is not absolute, but involves multiple levels of epigenetic regulation for remodeling chromatin structure. The dynamic nature of chromatin means that although a chromatin state may be mitotically stable, it is nonetheless prone to change, hence affecting the longevity of epigenetic memory.
the continuum of the chromatin polymer, this notion could easily be overinterpreted. It is only in recent years that we have learned about the major enzymatic systems through which histone modifications might be propagated. This has shaped our current thinking about the stability, and hence the inheritance, of certain histone marks. In addition, it is underscored by the recent discoveries showing that mutations in chromatin-modifying activities, such as nucleosome remodelers (Cho et al. 2004; Mohrmann and Verrijzer 2005), DNMTs (Robertson 2005), HDACs or HMKTs (Schneider et al. 2002), as they are frequently found in abnormal development and neoplasia, are telling examples of the ultimate power of genetic control. As such, tumor incidence in these mutant mice is generally regarded as a genetic disease. In contrast, alterations in nudeosome structure, DNA methylation, and histone modification profiles-that are not caused by a mutated gene-would classify as "true"
epigenetic aberrations. Excellent examples of these more plastic systems are stochastic decisions in early embryonic development, reprogramming by nuclear transfer, transcriptional memory, genomic imprinting, mosaic X inactivation, centromere identity, and tumor progression. Genetics and epigenetics are thus closely related phenomena, and inherent to both is their propagation through cell division, which, for genetic control, also comprises the germ line, if mutations occur in germ cells. In the case of other-often too easily categorized-epigenetic modifications, we do not know whether they only reflect a minor and transient response to changes in the external environment or significantly contribute to phenotypic differences that can then be maintained over many, but not indefinite, somatic cell divisions, and occasionally affect the germ line. Even with our greatly improved knowledge of epigenetic mechanisms today, there is little, or no, novel support for Lamarckism.
a v E R V lEW 17 Big Questions in Epigenetic Research
This book discusses the fundamental concepts and general principles that explain how epigenetic phenomena occur, as puzzling as they may seem. Our ultimate goal is to expose the reader to the current understanding of mechanisms that guide and shape these concepts, drawing upon the rich biology from which they emerge. In just a few years, epigenetic research has prompted exciting and remarkable insights and breakthrough discoveries, yet many long-standing questions remain unanswered (see Fig. 21). Although it is tempting to draw broad-brush conclusions and to propound general rules from this progress, we caution against this tendency, suspecting that there will be many exceptions that break the rules. For example, it is clear that striking organismal differences occur. Notably, from unicellular to multicellular organisms, the extent and type of histone modifications, histone variants, DNA methylation, and use of the RNAi machinery does vary. There are, however, plenty of reasons for renewed energy in research programs designed to gain molecular insights into epigenetic phenomena. Elegant biochemical and genetic studies have already successfully dissected many of the functional aspects of these pathways, in an unprecedented manner. It could therefore be predicted that careful analysis of epigenetic transitions in different cell types (e.g., stem versus differentiated; resting versus proliferating) will uncover hallmarks of pluripotency (Bernstein et aI. 2006; Boyer et al. 2006; Lee et al. 2006). This will most likely be valuable in diagnosing which chromatin alter-
AND
CON C E P T 5
55
ations are significant during normal differentiation as compared with disease states and tumorigenesis. For example, using large-scale mapping approaches with normal, tumor, or ES cells-"epigenetic landscaping" along entire chromosomes (Brachen et al. 2006b; Squazzo et al. 2006; Epigenomics AG, ENCODE, GEN-AU, EPIGENOME NoE)-it is anticipated that the knowledge generated could be harnessed for novel therapeutic intervention approaches and work toward promoting a worldwide consortium to map the entire human epigenome (Jones and Martienssen 2005). It is conceivable that differences in the relative abundance between distinct histone modifications, such as the apparent underrepresentation of repressive histone lysine tri-methylation in S. pombe and A. thaliana, may reflect the greater proliferative and regenerative potential in these organisms as compared to the more restricted developmental programs of metazoan systems. In addition, the functional links between the RNAi machinery, histone lysine methylation, and DNA methylation will continue to provide exciting surprises into the complex mechanisms required for cell-fate determination during development. Similarly, an enhanced understanding of the dynamics and specificity of nucleosome-remodeling machines will contribute to this end. We predict that more "exotic" enzymatic activities will be uncovered, catalyzing epigenetic transitions through modifications of histone and non-histone substrates. It would appear that chromatin alterations, as induced by the above mechanisms, act largely as a response filter to the environment. Thus, it is hoped that this knowledge can ultimately be applied to enhanced therapeutic strategies for resetting
ENVIRONMENT
! ! ! epigenetic code?
C E L
L
F A r I:
regeneration?
epigenetic inheritance? mod
remodeler
nature of cellular memory? germ line imprint?
stem cells?
cell type identity?
ncRNAs
non-coding RNAs?
aging? epigenetic dysfunction?
1 1 1 ENVIRONMENT
Figure 21. Big Questions in Epigenetic Research The many experimental systems used in epigenetic research have unveiled numerous pathways and novel insights into the mechanisms of epigenetic control. Many questions, as shown in the figure, still remain and require further elucidation or substantiation in new and existing model systems and methods.
56
C HAP T E R 3
some of an individual's epigenetic response that contribute to aging, disease, and cancer. This includes tissue regeneration, therapeutic cloning (using ES cells and their derivatives), and adult stem cell therapy strategies. It is believed such strategies will extend cellular life span, modulate stress responses to external stimuli, reverse disease progression, and improve assisted reproductive technologies. We predict that understanding the "chromatin basis" of pluripotency and totipotency will lie at the heart of understanding stem cell biology and its potential for therapeutic intervention. Many fundamental epigenetic questions remain. For example, What distinguishes one chromatin strand from the other allele when both contain the same DNA sequence in the same nuclear environment? What defines the mechanisms conferring inheritance and propagation of epigenetic information? What is the molecular nature of cellular memory? Are there epigenetic imprints in the germ line that serve to keep this genome in a totipotent state? If so, how are these marks erased during development? Alternatively, or in addition, are new imprints added during development that serve to "lock in" differentiated states? We look forward to the next generation of studies (and students) bold enough to tackle these questions with the heart and passion of previous generations of genetic and epigenetic researchers. In summary, the genetic principles described by Mendel likely govern the vast majority of our development and our outward phenotypes. However, exceptions to the rule can sometimes reveal new principles and new mechanisms leading to inheritance that have been underestimated, and in some cases, poorly understood previously. This book hopes to expose its readers to the newly appreciated basis of phenotypic variation-one that lies outside of DNA alteration. It is our hope that the systems and concepts described in this book will provide a useful foundation for future generations of students and researchers alike who become intrigued by the curiosities of epigenetic phenomena.
References Ahmad K. and Henikoff S. 2002. The histone variant H3.3 marks active chromatin by replication-independent nucleosome assembly. Mol. Cell 9: 1191-1200. Allfrey Y.G., Faulkner R., and Mirsky A.E. 1964. Acetylation and methylation of histones and their possible role in the regulation of RNA synthesis. Proc. Natl. Acad. Sci. 51: 786-794. Almeida R. and Allshire R.C 2005. RNA silencing and genome regulation. Trends Cell BioI. 15: 251-258. Aparicio O.M., Billington B.L., and Gottschling D.E. 1991. Modifiers of position effect are shared between telomeric and silent mating-
type loci in S. cerevisiae. Cell 66: 1279-1287. Avery O.T., Macleod CM., and McCarty M. 1944. Studies on the chemical nature of the substance inducing transformation of pneumococcal types. Induction of transformation by a desoxyribonucleic acid fraction isolated from pneumococcus Type III.]. Exp. Med. 79: 137-158. Avner P. and Heard E. 2001. X-chromosome inactivation: Counting, choice and initiation. Nat. Rev. Genet. 2: 59-67. Bannister A.J. and Kouzarides T. 2005. Reversing histone methylation. Nature 436: 1103-1106. Bannister A.J., Zegerman P., Partridge J.F., Miska E.A., Thomas rO., Allshire R.C, and Kouzarides T. 2001. Selective recognition of methylated lysine 9 on histone H3 by the HPI chromo domain. Nature 410: 120-124. Barton S.C, Surani M.A., and orris M.L. 1984. Role of paternal and maternal genomes in mouse development. Nature 311: 374-376. Bastow R., Mylne J.S., Lister C, Lippman Z., Martienssen R.A., and Dean C 2004. Vernalization requires epigenetic silencing of FLC by histone methylation. Nature 427: 164-167. Baxter J., Sauer 5., Peters A., John R., Williams R., Caparros M.L., Arney K., Otte A., Jenuwein T., Merkenschlager M., and Fisher A.G. 2004. Histone hypomethylation is an indicator of epigenetic plasticity in quiescent lymphocytes. EMBO ]. 23: 4462--4472. Berger S.L. 2002. Histone modifications in transcriptional regulation. Curro Opin. Genet. Dev. 12: 142-148. Bernard P., Maure J.F., Partridge J.F., Genier 5., Javerzat J.P., and Allshire R.C 2001. Requirement of heterochromatin for cohesion at centromeres. Science 294: 2539-2542. Bernstein B.E., Kamal M., Lindblad-Toh K., Bekiranov 5., Bailey D.K., Huebert D.J., McMahon 5., Karlsson E.K., Kulbokas E.J., III, Gingeras T.R., et al. 2005. Genomic maps and comparative analysis of histone modifications in human and mouse. Cell 120: 169-181. Bernstein RE., Mikkelsen T.S., Xie X., Kamal M., Huebert D.J., Cuff J., Fry B., Meissner A., Wernig M., Plath K., et al. 2006. A bivalent chromatin structure marks key developmental genes in embryonic stem cells. Cell 125: 315-326. Bernstein E. and Allis CD. 2005. RNA meets chromatin. Genes Dev. 19: 1635-1655. Bird A.P. 1986. CpG-rich islands and the function of DNA methylation. Nature 321: 209-213. Birky CW., Jr. 2001. The inheritance of genes in mitochondria and chloroplasts: Laws, mechanisms, and models. Annu. Rev. Genet. 35: 125-148. Bolland DJ, Wood A.L., Johnston CM., Bunting S.F., Morgan G., Chakalova L., Fraser P.J., and Corcoran A.E. 2004. Antisense intergenic transcription in V(D)J recombination. Nat. Immunol. 5: 630-637. Bourc'his D. and Bestor T.H. 2004. Meiotic catastrophe and retrotransposon reactivation in male germ cells lacking Dnmt3L. Nature 431: 96-99. Boyer L.A., Plath K., Zeitlinger J., Brambrink T., Medeiros L.A., Lee T.I., Levine 5.5., Wernig M., Tajonar A., Ray M.K., et al. 2006. Polycomb complexes repress developmental regulators in murine embryonic stem cells. Nature 441: 349-353. Bracken A.P., Dietrich N., Pasini D., Hansen K.H., and Helin K. 2006. Genome-wide mapping of Polycomb target genes unravels their roles in cell fate transitions. Genes Dev. 20: 1123-1136. Braig M., Lee 5., Loddenkemper C, Rudolph C, Peters A.H., Schlegelberger B., Stein H., Dorken B., Jenuwein T., and Schmitt CA. 2005. Oncogene-induced senescence as an initial barrier in lymphoma development. Nature 436: 660-665. Brownell J.E., Zhou J., Ranalli T., Kobayashi R., Edmondson D.G.,
o Roth S.Y., and Allis e.D. 1996. Tetrahymena histone acetyltransferase A: A hom*olog to yeast Gcn5p linking histone acetylation to gene activation. Cell 84: 843-851. Cairns B.R. 2005. Chromatin remodeling complexes: Strength in diversity, precision through specialization. Curro Opin. Genet. Dev. 15: 185-190. Campbell K.H., McWhir J., Ritchie WA., and Wilmut 1. 1996. Sheep cloned by nuclear transfer from a cultured cell line. Nature 380: 64--66. Cao R., Wang L., Wang H., Xia L., Erdjument-Bromage H., Tempst P., Jones R.S., and Zhang Y. 2002. Role of histone H3 lysine 27 methylation in Polycomb-group silencing. Science 298: 1039-1043. Cavalli G. and Paro R. 1999. Epigenetic inheritance of active chromatin after removal of the main transactivator. Science 286: 955-958. Chakalova L., Debrand E., Mitchell J.A., Osborne e.S., and Fraser P. 2005. Replication and transcription: Shaping the landscape of the genome. Nat. Rev. Genet. 6: 669-677. Chalker D.L. and Yao M.e. 2001. Nongenic, bidirectional transcription precedes and may promote developmental DNA deletion in Tetrahymena thermophila. Genes Dev. 15: 1287-1298. Chan S.w., Zilberman D., Xie Z., Johansen L.K., Carrington J.e., and Jacobsen S.E.2004. RNA silencing genes control de novo DNA methylation. Science 303: 1336. Chen R.Z., Pettersson U., Beard e., Jackson-Grusby L., and Jaenisch R. 1998. D A hypomethylation leads to elevated mutation rates. Nature 395: 89-93. Cheng D., Yadav ., King R.W., Swanson M.S., Weinstein E.J., and Bedford M.T. 2004. Small molecule regulators of protein arginine methyltransferases.]. BioI. Chem. 279: 23892-23899. Cheng J.e., Yoo e.B., Weisenberger D.J., Chuang J., Wozniak e., Liang G., Marquez Y.E., Greer S., Orntoft TF., Thykjaer T, and Jones P.A. 2004. Preferential response of cancer cells to zebularine. Cancer Cell 6: 151-158. Cho K.S., Elizondo L.1., and Boerkoel e.F. 2004. Advances in chromatin remodeling and human disease. Curro Opin. Genet. Dev. 14: 308-315. Chuikov S., Kurash J.K., Wilson J.R., Xiao B., Justin N., Ivanov G.S., McKinney K., Tempst P., Prives e., Gamblin S.J., et al. Regulation of p53 activity through lysine methylation. Nature 432: 353-360. Clark-Adams e.D., Norris D., Osley M.A., Fassler J.S., and Winston F. 1988. Changes in histone gene dosage alter transcription in yeast. Genes Dev. 2: 150-159. Cohen EE. and Prusiner S.B. 1998. Pathologic conformations of prion proteins. Annu. Rev. Biochem. 67: 793-819. Cosgrove M.S., Boeke J.D., and Wolberger e. 2004. Regulated nucleosome mobility and the histone code. Nat. Struct. Mol. Bioi. 11: 1037-1043. Cremer T. and Cremer e. 2001. Chromosome territories, nuclear architecture and gene regulation in mammalian cells. Nat. Rev. Genet. 2: 292-301. Oaujat S., Zeissler 0., Waldmann T., Happel N., and Schneider R. 2005. HPI binds specifically to Lys26-methylated histone H1.4, whereas simultaneous Ser27 phosphorylation blocks HPI binding. f. BioI. Chem. 280: 38090-38095. Oellino G./., Schwartz Y.B., Farkas G., McCabe D., Elgin S.e., and Pirrotta Y. 2004. Polycomb silencing blocks transcription initiation. Mol. Cell 13: 887-893. Ohalluin C., Carlson J.E., Zeng L., He e., Aggarwal A.K., and Zhou M.M.1999. Structure and ligand of a histone acetyl transferase bromodomain. Nature 399: 491-496. Oi Croce L. 2005. Chromatin modifying activity of leukaemia associ-
V E R V lEW
AND
CON C E P T 5
57
ated fusion proteins. Hum. Mol. Genet. 14 Spec. No.1: R77-R84. Dou Y. and Gorovsky M.A. 2000. Phosphorylation of linker histone HI regulates gene expression in vivo by creating a charge patch. Mol. Cell 6: 225-231. Fan Y., Nikitina T, Zhao J., Fleury TJ., Bhattacharyya R., Bouhassira E.E., Stein A., Woodco*ck e.L., and Skoultchi A.!. 2005. Histone HI depletion in mammals alters global chromatin structure but causes specific changes in gene regulation. Cell 123: 1199-1212. Feinberg A.P. 2004. The epigenetics of cancer etiology. Semin. Cancer Bioi. 14: 427-432. Feinberg A.P. and Tycko B. 2004. The history of cancer epigenetics. Nat. Rev. Cancer 4: 143-153. Feinberg A.P. and Vogelstein B. 1983. Hypomethylation distinguishes genes of some human cancers from their normal counterparts. Nature 301: 89-92. Felsenfeld G. and Groudine M. 2003. Controlling the double helix. Nature 421: 448-453. Fischle W., Wang Y., and Allis e.D. 2003a. Binary switches and modification cassettes in histone biology and beyond. Nature 425: 475-479. - - - . 2003b. Histone and chromatin cross-talk. Curro Opin. Cell BioI. 15: 172-183. Fischle W, Tseng B.S., Dormann H.L., Ueberheide B.M., Garcia B.A., Shabanowitz J., Hunt D.E, Funabiki H., and Allis e.D. 2005. Regulation of HPI-chromatin binding by histone H3 methylation and phosphorylation. Nature 438: 1116-1122. Fisher A.G. and Merkenschlager M. 2002. Gene silencing, cell fate and nuclear organisation. Curro Opin. Genet. Dev. 12: 193-197. Fodor B.D., Kubicek S., Yonezawa M., O'Sullivan R.J., Sengupta R., Perez-Burgos L., Opravil S., Mechtler K., Schotta G., and Jenuwein T 2006. Jmjd2b antagonizes H3K9 trimethylation at pericentric heterochromatin in mammalian cells. Genes Dev. 20: 1557-1562. Fraga M.E, Ballestar E., Paz M.F., Ropero S., Setien F., Ballestar M.L., Heine-Suner D., Cigudosa J.e., Urioste M., Benitez J., et al. 2005. Epigenetic differences arise during the lifetime of monozygotic twins. Proc. Natl. Acad. Sci. 102: 10604--10609. Francis N.J., Kingston R.E., and Woodco*ck e.L. 2004. Chromatin compaction by a polycomb group protein complex. Science 306: 1574-1577. f*ckagawa T, Nogami M., Yoshikawa M., Ikeno M., Okazaki T, Takami Y., Nakayama T, and Oshimura M. 2004. Dicer is essential for formation of the heterochromatin structure in vertebrate cells. Nat. Cell BioI. 6: 784-791. Garcia-Manero G. and Issa J.P. 2005. Histone deacetylase inhibitors: A review of their clinical status as antineoplastic agents. Cancer Invest. 23: 635-642. Gilbert N., Boyle S., Fiegler H., Wood fine K., Carter N.P., and Bickmore W.A. 2004. Chromatin architecture of the human genome: Generich domains are enriched in open chromatin fibers. Cell 118: 555-566. Gilfillan G.D., Dahlsveen 1.K., and Becker P.B. 2004. Lifting a chromosome: Dosage compensation in Drosophila melanogaster. FEBS Lett. 567: 8-14. Goll M.G., Kirpekar F., Maggert K.A., Yoder J.A., Hsieh e.L., Zhang X., Golic K.G., Jacobsen S.E., and Bestor TH. 2006. Methylation of tR AAsp by the D A methyltransferase hom*olog Dnmt2. Science 311: 395-398. Gonzalo S. and Blasco M.A. 2005. Role of Rb family in the epigenetic definition of chromatin. Cell Cycle 4: 752-755. Gottschling D.E., Aparicio O.M., Billington B.L., and Zakian Y.A. 1990. Position effect at S. cerevisiae telomeres: Reversible repression of Pol II transcription. Cell 63: 751-762.
58 • C HAP
T ER
3
Grant P.A., Duggan L., Cote J., Roberts S.M., Brownell J.E., Candau R., Ohba R., Owen-Hughes T, Allis C.D., Winston P., et al. 1997. Yeast Gcn5 functions in two multisubunit complexes to acetylate nucleosomal histones: Characterization of an Ada complex and the SAGA (Spt/Ada) complex. Genes Dev. 11: 1640-1650. Greiner D., Bonaldi T., Eskeland R., Roemer E., and Imhof A. 2005. Identification of a specific inhibitor of the histone methyltransferase SU(VAR)3-9. Nat. Chem. BioI. 1: 143-145. Grewal S.l. and Klar A.I. 1996. Chromosomal inheritance of epigenetic states in fission yeast during mitosis and meiosis. Cell 86: 95-101. Grozinger C.M. and Schreiber S.L. 2002. Deacetylase enzymes: Biological functions and the use of small-molecule inhibitors. Chem. BioI. 9: 3-16. Guarente L. and Picard P. 2005. Calorie restriction-The SIR2 connection. Cell 120: 473-482. Hall l.M., Shankaranarayana G.D., Noma K., Ayoub N., Cohen A., and Grewal S.l. 2002. Establishment and maintenance of a heterochromatin domain. Science 297: 2232-2237. Hanahan D. and Weinberg R.A. 2000. The hallmarks of cancer. Cell 100: 57-70. Harvey A.C. and Downs I.A. 2004. What functions do linker histones provide? Mol. Microbiol. 53: 771-775. Hassan A.H., Prochasson P., Neely K.E., Galasinski S.c., Chandy M., Carrozza M.J., and Workman J.L. 2002. Function and selectivity of bromodomains in anchoring chromatin-modifying complexes to promoter nucleosomes. Cell Ill: 369-379. Heard E. 2005. Delving into the diversity of facultative heterochromatin: The epigenetics of the inactive X chromosome. Curro Opin. Genet. Dev. 15: 482-489. Henikoff S. 2005. Histone modifications: Combinatorial complexity or cumulative simplicity? Proc. Natl. Acad. Sci. 102: 5308-5309. Henikoff S. and Ahmad K. 2005. Assembly of variant histones into chromatin. Annu. Rev. Cell Dev. BioI. 21: 133-153. Herr A.J., Jensen M.B., Dalmay T, and Baulcombe D.C. 2005. RNA polymerase IV directs silencing of endogenous DNA. Science 308: 118-120. Hirota T, Lipp J.J., Toh B.H., and Peters J.M. 2005. Histone H3 serine 10 phosphorylation by Aurora B causes HPI dissociation from heterochromatin. Nature 438: 1176-1180. Hochedlinger K., Blelloch R., Brennan c., Yamada Y., Kim M., Chin L., and Jaenisch R. 2004. Reprogramming of a melanoma genome by nuclear transplantation. Genes Dev. 18: 1875-1885. Holbert M.A. and Marmorstein R. 2005. Structure and activity of enzymes that remove histone modifications. Curro Opin. Struct. BioI. 15: 673-680. Holliday R. 1994. Epigenetics: An overview. Dev. Genet. 15: 453-457. Ishii K.J. and Akira S. 2005. TLR ignores methylated RNA? Immunity 23: 111-113. Jackson J.P., Lindroth A.M., Cao X., and Jacobsen S.E. 2002. Control of CpNpG DNA methylation by the KRYPTONITE histone H3 methyltransferase. Nature 416: 556-560. Jacobson R.H., Ladurner A.G., King D.S. and Tjian R. 2000. Structure and function of a human TAFII250 double bromodomain module. Science. 288: 1422-1425. Jaenisch R. and Bird A. 2003. Epigenetic regulation of gene expression: How the genome integrates intrinsic and environmental signals. Nat. Genet. (suppl.) 33: 245-254. Janicki S.M., Tsukamoto T, Salghetti S.E., Tansey W.P., Sachidanandam R., Prasanth KV., Ried T, Shav-Tal Y., Bertrand E., Singer R.H., and Spector D.L. 2004. From silencing to gene expression: Real-time analysis in single cells. Cell 116: 683-698. Jeddeloh J.A., Stokes TL., and Richards E.I. 1999. Maintenance of
genomic methylation requires a SWI2/SNF2-like protein. Nat. Genet. 22: 94-97. Jenuwein T and Allis C.D. 2001. Translating the histone code. Science 293: 1074-1080. Jones P.A. and Martienssen R. 2005. A blueprint for a Human Epigenome Project: The AACR Human Epigenome Workshop. Cancer Res. 65: 11241-11246. Kanellopoulou c., Muljo S.A., Kung A.L., Ganesan S., Drapkin R., Jenuwein T, Livingston D.M., and Rajewsky K. 2005. Dicer-deficient mouse embryonic stem cells are defective in differentiation and centromeric silencing. Genes Dev. 19: 489-501. Kayne P'S., Kim U.J., Han M., Mullen J.R., Yoshizaki P., and Grunstein M. 1988. Extremely conserved histone H4 N terminus is dispensable for growth but essential for repressing the silent mating loci in yeast. Cell 55: 27-39. Khochbin S. 2001. Histone HI diversity: Bridging regulatory signals to linker histone function. Gene 271: 1-12. Khorasanizadeh S. 2004. The nucleosome: From genomic organization to genomic regulation. Cell 116: 259-272. Kim J., Daniel J., Espejo A., Lake A., Krishna M., Xia L., Zhang Y., and Bedford M.T 2006. Tudor, MBT and chromo domains gauge the degree of lysine methylation. EMBO Rep. 4: 397-403. Kimmins S. and Sassone-Corsi P. 2005. Chromatin remodeling and epigenetic features of germ cells. Nature 434: 583-589. Klar A.I. 1998. Propagating epigenetic states through meiosis: Where Mendel's gene is more than a DNA moiety. Trends Genet. 14: 299-301. - - - . 2004. An epigenetic hypothesis for human brain laterality, handedness, and psychosis development. Cold Spring Harbor Symp. Quant. BioI. 69: 499-506. Knoepfler P.S., Zhang X.-Y., Cheng P.P., Gafken P.R., McMahon S.B., and Eisenman R.N. 2006. Myc influences global chromatin structure. EMBO f. 25: 2723-2734. Kornberg R.D. 1974. Chromatin structure: A repeating unit of histones and DNA. Science 184: 868-871. Lachner M., O'Sullivan R.J., and Jenuwein T. 2003. An epigenetic road map for histone lysine methylation. f. Cell Sci. 116: 2117-2124. Lachner M., Sengupta R., Schotta G., and Jenuwein T 2004. Trilogies of histone lysine methylation as epigenetic landmarks of the eukaryotic genome. Cold Spring Harbor Symp. Quant. BioI. 69: 209-218. Lachner M., O'Carroll D., Rea S., Mechtler K., and Jenuwein T 2001. Methylation of histone H3 lysine 9 creates a binding site for HP I proteins. Nature 410: 116-120. Langst G. and Becker P.B. 2004. Nucleosome remodeling: One mechanism, many phenomena? Biochim. Biophys. Acta 1677: 58-63. Lee D.Y., Teyssier c., Strahl B.D., and Stallcup M.R. 2005. Role of protein methylation in regulation of transcription. Endocr. Rev. 26: 147-170. Lee Tl., Jenner R.G., Boyer L.A., Guenther M.G., Levine S.S., Kumar R.M., Chevalier B., Johnstone S.E., Cole M.P., Isono K., et al. 2006. Control of developmental regulators by Polycomb in human embryonic stem cells. Cell 125: 301-313. Lengauer c., Kinzler KW., and Vogelstein B. 1998. Genetic instabilities in human cancers. Nature 396: 643-649. Li E., Bestor TH., and Jaenisch R. 1992. Targeted mutation of the DNA methyltransferase gene results in embryonic lethality. Cell 69: 915-926. Lippman Z., Gendre! A.V., Black M., Vaughn M.W., Dedhia N., McCombie W.R., Lavine K, Mittal v., May B., Kasschau K.D., et al. 2004. Role of transposable elements in heterochromatin and epigenetic control. Nature 430: 471-476.
o V E R V lEW Litt M.D., Simpson M., Gaszner M., Allis c.D., and Felsenfeld G. 2001. Correlation between histone lysine methylation and developmental changes at the chicken beta-globin locus. Science 293: 2453-2455. Luger K., Mader A.W, Richmond R.K., Sargent D.E, and Richmond T.j. 1997. Crystal structure of the nucleosome core particle at 2.8 A resolution. Nature 389: 251-260. Lund A.H. and van Lohuizen M. 2004. Polycomb complexes and silencing mechanisms. Curro Opin. Cell BioI. 16: 239-246. Lyko E 2001. DNA methylation learns to fly. Trends Genet. 17: 169-172. Maher E.R. 2005. Imprinting and assisted reproductive technology. Hum. Mol. Genet. 14 Spec. No.1: RI33-RI38. Maison c., Bailly D., Peters A.H., Quivy j.P., Roche D., Taddei A., Lachner M., jenuwein T., and Almouzni G. 2002. Higher-order structure in pericentric heterochromatin involves a distinct pattern of histone modification and an RNA component. Nat. Genet. 30: 329-334. Marks P.A. and Jiang X. 2005. Histone deacetylase inhibitors in programmed cell death and cancer therapy. Cell Cycle 4: 549-551. Martens j.H., O'Sullivan R.j., Braunschweig U., Opravil S., Radolf M., Steinlein P., and jenuwein T. 2005. The prome of repeat-associated histone lysine methylation states in the mouse epigenome. EMBO ]. 24: 800-812. Maurer-Stroh S., Dickens N.j., Hughes-Davies L., Kouzarides T., Eisenhaber E, and Ponting c.P. 2003. The Tudor domain 'Royal Family': Tudor, plant Agenet, Chromo, PWWP and MBT domains. Trends Biochem. Sci. 28: 69-74. Mayer w., Niveleau A., Walter j., Fundele R., and Haaf T. 2000. Demethylation of the zygotic paternal genome. Nature 403: 501-502. McClintock B. 1951. Chromosome organization and genic expression. Cold Spring Harbor Symp. Quant. BioI. 16: 13-47. McGarvey K., Fahrner j., Green E., Martens j., jenuwein T., and Baylin S.B. 2006. Silenced tumor suppressor genes reactivated by DNA demethylation do not return to a fully euchromatic chromatin state. Cancer Res. 66: 3541-3549. McGrath j. and Solter D. 1984. Completion of mouse embryogenesis requires both the maternal and paternal genomes. Cell 37: 179-183. Metzger E., Wissmann M., Yin N., Muller j.M., Schneider R., Peters A.H., Gunther T., Buettner R., and Schule R. 2005. LSDI demethylates repressive histone marks to promote androgen-receptordependent transcription. Nature 437: 436-439. Meyer B.j., McDonei P., Csankovszki G., and Ralston E. 2004. Sex and X-chromosome-wide repression in Caenorhabditis elegans. Cold Spring Harbor Symp. Quant. BioI. 69: 71-79. Misteli T. 2004. Spatial positioning; a new dimension in genome function. CelllI9: 153-156. Mito Y., Henikoff j.G., and Henikoff S. 2005. Genome-scale proming of histone H3.3 replacement patterns. Nat. Genet. 37: 1090-1097. Mizuguchi G., Shen X., Landry j., Wu WH., Sen S., and Wu C. 2004. ATP-driven exchange of histone H2AZ variant catalyzed by SWR1 chromatin remodeling complex. Science 303: 343-348. Mochizuki K., Fine N.A., Fujisawa T., and Gorovsky M.A. 2002. Analysis of a piwi-related gene implicates small RNAs in genome rearrangement in tetrahymena. Cell lID: 689-699. Mohrmann L. and Verrijzer c.P. 2005. Composition and functional specificity of SWI2/SNF2 class chromatin remodeling complexes. Biochim. Biophys. Acta 1681: 59-73. Morgan H.D., Santos E, Green K., Dean W., and Reik W. 2005. Epigenetic reprogramming in mammals. Hum. Mol. Genet. 14 Spec. No.1: R47-R58.
AND
CON C E P T 5
59
Motamedi M.R., Verdel A., Colmenares S.U., Gerber S.A., Gygi S.P., and Moazed D. 2004. Two RNAi complexes, RITS and RDRC, physically interact and localize to noncoding centromeric RNAs. CelllI9: 789-802. Muller H.j. 1930. Types of visible variations induced by X-rays in Drosophila. f. Genet. 22: 299-334. Nakayama j., Rice j.c., Strahl B.D., Allis C.D., and Grewal S.l. 200l. Role of histone H3 lysine 9 methylation in epigenetic control of heterochromatin assembly. Science 292: 110-113. Narita M., Nunez S., Heard E., Narita M., Lin A.W., Hearn S.A., Spector D.L., Hannon G.j., and Lowe S.W. 2003. Rb-mediated heterochromatin formation and silencing of E2F target genes during cellular senescence. Cell1l3: 703-716. Narlikar G.j., Fan H.Y., and Kingston R.E. 2002. Cooperation between complexes that regulate chromatin structure and transcription. Cell 108: 475-487. Nowak S.j. and Corces Y.G. 2004. Phosphorylation of histone H3: A balancing act between chromosome condensation and transcriptional activation. Trends Genet. 20: 214-220. Orphanides G., LeRoy G., Chang C.H., Luse D.S., and Reinberg D. 1998. FACT, a factor that facilitates transcript elongation through nucleosomes. Cell 92: 105-116. Pal-Bhadra M., Leibovitch B.A., Gandhi S.G., Rao M., Bhadra U., Birchler j.A., and Elgin S.c. 2004. Heterochromatic silencing and HP1 localization in Drosophila are dependent on the RNAi machinery. Science 303: 669-672. Paro R. and Hogness D.S. 1991. The Polycomb protein shares a hom*ologous domain with a heterochromatin-associated protein of Drosophila. Proc. Natl. Acad. Sci. 88: 263-267. Petersen-Mahrt S. 2005. DNA deamination in immunity. Immunol. Rev. 203: 80-97. Pirrotta Y. 1998. Polycombing the genome: PcG, trxG, and chromatin silencing. Cell 93: 333-336. Pontier D., Yahubyan G., Vega D., Bulski A., Saez-Vasquez j., Hakimi M.A., Lerbs-Mache S., Colot Y., and Lagrange T. 2005. Reinforcement of silencing at transposons and highly repeated sequences requires the concerted action of two distinct R . A polymerases IV in Arabidopsis. Genes Dev. 19: 2030-2040. Ratcliff E, Harrison B.D., and Baulcombe D.C. 1997. A similarity between viral defense and gene silencing in plants. Science 276: 1558-1560. Razin A. and Riggs A.D. 1980. DNA methylation and gene function. Science 210: 604-610. Rea S., Eisenhaber E, O'Carroll D., Strahl B.D., Sun Z.W, Schmid M., Opravil S., Mechtler K., Ponting c.P., Allis C.D., and jenuwein T. 2000. Regulation of chromatin structure by site-specific histone H3 methyltransferases. Nature 406: 593-599. Reinberg D., Chuikov S., Farnham P., Karachentsev D., Kirmizis A., Kuzmichev A., Margueron R., Nishioka K., Preissner T.S., Sarma K., et al. 2004. Steps toward understanding the inheritance of repressive methyl-lysine marks in histones. Cold Spring Harbor Symp. Quant. BioI. 69: 171-182. Reinhart B.j. and Bartel D.P. 2002. Small RNAs correspond to centromere heterochromatic repeats. Science 297: 183l. Rine j. 2005. Cell biology. Twists in the tale of the aging yeast. Science 310: 1124-1125. Ringrose L. and Paro R. 2004. Epigenetic regulation of cellular memory by the Polycomb and Trithorax group proteins. Annu. Rev. Genet. 38: 413-443. Ringrose L., Ehret H., and Paro R. 2004. Distinct contributions of histone H3 lysine 9 and 27 methylation to locus-specific stability of polycomb complexes. Mol. Cell 16: 641-653.
60 • C HAP
T ER
3
Robertson K.D. 2005. DNA methylation and human disease. Nat. Rev. Genet. 6: 597-610. RoloffT.C. and Nuber U.A. 2005. Chromatin, epigenetics and stem cells. Eur.]. Cell BioI. 84: 123-135. Roth S.Y., Denu J.M., and Allis C.D. 2001. Histone acetyltransferases. Annu. Rev. Biochem. 70: 81-120. Sanchez-Elsner T., Gou D., Kremmer E., and Sauer F. 2006. Noncoding RNAs of trithorax response elements recruit Drosophila Ashl to ultrabithorax. Science 311: 1118-1123. Santos-Rosa H., Schneider R., Bannister A.J., Sherriff J., Bernstein B.E., Emre N.C., Schreiber S.L., Mellor T., and Kouzarides T. 2002. Active genes are tri-methylated at K4 of histone H3. Nature 419: 407-411. Sarma K. and Reinberg D. 2005. Histone variants meet their match. Nat. Rev. Mol. Cell BioI. 6: 139-149. Scaffidi P., Gordon L., and Misteli T. 2005. The cell nucleus and aging: Tantalizing clues and hopeful promises. PLoS. BioI. 3: e395. Schalch T., Duda S., Sargent D.E, and Richmond T.T. 2005. X-ray structure of a tetranucleosome and its implications for the chromatin fibre. Nature 436: 138-141. Schmitt S., Prestel M., and Paro R. 2005. lntergenic transcription through a polycomb group response element counteracts silencing. Genes Dev. 19: 697-708. Schneider R., Bannister A.T., and Kouzarides T. 2002. Unsafe SETs: Histone lysine methyltransferases and cancer. Trends Biochem. Sci. 27: 396-402. Schreiber S.L. and Bernstein B.E. 2002. Signaling network model of chromatin. Cell 111: 771-778. Schwartz B.E. and Ahmad K. 2005. Transcriptional activation triggers deposition and removal of the histone variant H3.3. Genes Dev. 19: 804-814. Schwartz Y.B., Kahn T.G., and Pirrotta Y. 2005. Characteristic low density and shear sensitivity of cross-linked chromatin containing polycomb complexes. Mol. Cell BioI. 25: 432-439. Sekinger E.A., Moqtaderi Z., and Struhl K. 2005. Intrinsic histone-DNA interactions and low nucleosome density are important for preferential accessibility of promoter regions in yeast. Mol. Cell 18: 735-748. Seligson D.B., Horvath S., Shi T., Yu H., Tze S., Grunstein M., and Kurdistani S.K. 2005. Global histone modification patterns predict risk of prostate cancer recurrence. Nature 435: 1262-1266. Shen X., Mizuguchi G., Hamiche A., and Wu C. 2000. A chromatin remodeling complex involved in transcription and DNA processing. Nature 406: 541-544. Shi Y., Lan E, Matson c., Mulligan P., Whetstine T.R., Cole P.A., Casero R.A., and Shi Y. 2004. Histone demethylation mediated by the nuclear amine oxidase hom*olog LSD 1. Cell 119: 941-953. Shorter J. and Lindquist S. 2005. Prions as adaptive conduits of memory and inheritance. Nat. Rev. Genet. 6: 435-450. Sims R.T., Ill, Belotserkovskaya R., and Reinberg D. 2004. Elongation by RNA polymerase II: The short and long of it. Genes Dev. 18: 2437-2468. Smith c.L. and Peterson c.L. 2005. ATP-dependent chromatin remodeling. Curro Top. Dev. BioI. 65: 115-148. Squazzo S.L., O'Geen H., Komashko Y.M., Krig S.R., Tin V.x., Jang S.w., Margueron R., Reinberg D., Green R., and Farnham P.T. 2006. Suz12 binds to silenced regions on the genome in a cell-type-specific manner. Genome Res. 16: 890-900. Sterner D.E. and Berger S.L. 2000. Acetylation of histones and transcription-related factors. Microbiol. Mol. BioI. Rev. 64: 435-459. Strahl B.D. and Allis C.D. 2000. The language of covalent histone modifications. Nature 403: 41-45. Strahl B.D., Ohba R., Cook R.G., and Allis C.D. 1999. Methylation of
histone H3 at lysine 4 is highly conserved and correlates with transcriptionally active nuclei in Tetrahymena. Proc. Natl. Acad. Sci. 96: 14967-14972. Su G.H., Sohn T.A., Ryu B., and Kern S.E. 2000. A novel histone deacetylase inhibitor identified by high-throughput transcriptional screening of a compound library. Cancer Res. 60: 3137-3142. Sung S. and Amasino R.M. 2004. Vernalization in Arabidopsis thaliana is mediated by the PHD finger protein VIN3. Nature 427: 159-164. Surani M.A., Barton S.c., and Norris M.L. 1984. Development of reconstituted mouse eggs suggests imprinting of the genome during gametogenesis. Nature 308: 548-550. Tagami H., Ray-Gallet D., Almouzni G., and Nakatani Y. 2004. Histone H3.1 and H3.3 complexes mediate nucleosome assembly pathways dependent or independent of DNA synthesis. Cell 116: 51-61. Tamaru H. and Selker E.U. 2001. A histone H3 methyltransferase controls DNA methylation in Neurospora crassa. Nature 414: 277-283. Tanaka E.M. 2003. Regeneration: If they can do it, why can't we? Cell 113: 559-562. Thomas T.O. 1999. Histone HI: Location and role. Curro Opin. Cell BioI. 11: 312-317. Tsukada Y., Fang T., Erdjument-Bromage H., Warren M.E., Borchers C.H., Tempst E, and Zhang Y. 2006. Histone demethylation by a family of TmjC domain-containing proteins. Nature 439: 811-816. Tsukiyama T., Daniel c., Tamkun T., and Wu C. 1995. ISWI, a member of the SWI2/SNF2 ATPase family, encodes the 140 kDa subunit of the nucleosome remodeling factor. Cell 83: 1021-1026. Turner B.M. 2000. Histone acetylation and an epigenetic code. BioEssays 22: 836-845. VaJk-Lingbeek M.E., Bruggeman S.W., and van Lohuizen M. 2004. Stem cells and cancer; the polycomb connection. Cell 118: 409-418. van Attikum H. and Gasser S.M. 2005. The histone code at DNA breaks: A guide to repair? Nat. Rev. Mol. Cell BioI. 6: 757-765. van der Heijden G.w., Dieker J.w., Derijck A.A., Muller S., Berden T.H., Braat D.D., van der Vlag J., and de Boer P. 2005. Asymmetry in histone H3 variants and lysine methylation between paternal and maternal chromatin of the early mouse zygote. Mech. Dev. 122: 1008-1022. Vaquero A., Loyola A., and Reinberg D. 2003. The constantly changing face of chromatin. Sci. Aging Knowledge Environ. 2003: RE4. Varga-Weisz ED., Wilm M., Bonte E., Dumas K., Mann M., and Becker P.B. 1997. Chromatin-remodeling factor CHRAC contains the ATPases ISWI and topoisomerase II. Nature 388: 598-602. Verdel A., Tia S., Gerber S., Sugiyama T., Gygi S., Grewal S.I., and Moazed D. 2004. RNAl-mediated targeting of heterochromatin by the RITS complex. Science 303: 672-676. Vidanes G.M., Bonilla c.Y., and Toczyski D.E 2005. Complicated tails: Histone modifications and the DNA damage response. Cell 121: 973-976. Vignali M., Hassan A.H., Neely K.E., and Workman J.L.. 2000. ATPdependent chromatin-remodeling complexes. Mol. Cell. BioI. 20: 1899-1910. Vire E., Brenner c., Deplus R., Blanchon L., Fraga M., Didelot c., Morey L., Van E.A., Bernard D., Vanderwinden J.M., et al. 2005. The Polycomb group protein EZH2 directly controls DNA methylation. Nature 439: 861-874. Volpe T.A., Kidner c., HaJI I.M., Teng G., Grewal S.I., and Martienssen R.A. 2002. Regulation of heterochromatic silencing and histone H3 lysine-9 methylation by RNAl. Science 297: 1833-1837. Waddington C.H. 1957. The strategy of the genes. MacMillan, New York. Wade EA., Gegonne A., Tones P.L., BaJlestar E., Aubry E, and Wolffe A.P. 1999. Mi-2 complex couples DNA methylation to chromatin remodeling and histone deacetylation. Nat. Genet. 23: 62-66.
o Walsh c.P., Chaillet J.R., and Bestor TH. 1998. Transcription of lAP endogenous retroviruses is constrained by cytosine methylation. Nat. Genet. 20: 116-117. Watanabe Y., Yokobayashi S., Yamamoto M., and Nurse P. 2001. Pre-meiotic S phase is linked to reductional chromosome segregation and recombination. Nature 409: 359-363. Watson J.D. 2003. Celebrating the genetic jubilee: A conversation with lames D. Watson. Interviewed by John Rennie. Sci. Am. 288: 66-69. Wei Y., Yu 1., Bowen J., Gorovsky M.A., and Allis CD. 1999. Phosphorylation of histone H3 is required for proper chromosome condensation and segregation. Cel/97: 99-109. Whetstine l.R., Nottke A., Lan E, Huarte M., Smolikov S., Chen Z., Spooner E., Li E., Zhang G., Colaiacovo M., and Shi Y. 2006. Reversal of histone lysine trimethylation by the JMJD2 family of histone demethylases. Cel/125: 467-481. Wolffe A.P. and Matzke M.A. 1999. Epigenetics: Regulation through repression. Science 286: 481-486.
V E R V lEW
AND
CON C E P T S
•
61
Wysocka J., Swigut T, Milne TA., Dou Y., Zhang X., Burlingame A.L., Roeder R.G., Brivanlou A.H., and Allis CD. 2005. WDR5 associates with histone H3 methylated at K4 and is essential for H3 K4 methylation and vertebrate development. Cel/121: 859-872. Yan Q., Huang J., Fan T., Zhu H., and Muegge K. 2003. Lsh, a modulator of CpG methylation, is crucial for normal histone methylation. EMBO f. 22: 5154-5162. Yu B., Yang Z., Li J., Minakhina S., Yang M., Padgett R.W, Steward R., and Chen X. 2005. Methylation as a crucial step in plant microRNA biogenesis. Science 307: 932-935. Zhang Y. and Reinberg D. 2001. Transcription regulation by histone methylation: Interplay between different covalent modifications of the core histone tails. Genes Dev. 15: 2343-2360. Zhang Y., LeRoy G., Seelig H.P., Lane WS., and Reinberg D. 1998. The dermatomyositis-specific autoantigen Mi2 is a component of a complex containing histone deacetylase and nucleosome remodeling activities. Cel/95: 279-289.
c
A
H
p
E
T
R
4
Epigenetics in Saccharomyces •
•
cerevlslae Michael Grunstein' and Susan M. Gasser2 I University of California, Los Angeles, California 90095-1570 2Priedrich Miescher Institute for Biomedical Research, 4058 Basel, Switzerland
CONTENTS 1. The Genetic and Molecular Tools of Yeast, 65 2. The Life Cycle of Yeast, 66 3. Yeast Heterochromatin Is Present at the Silent HM Mating Loci and at Telomeres, 67 4. Heterochromatin Is Distinguished by a Repressive Structure That Spreads through the Entire Silent Domain, 69
5. Distinct Steps in Heterochromatin Assembly, 70 5.1
HM Heterochromatin, 70
5.2
Telomeric Heterochromatin, 71
6. Histone Deacetylation by Sir2 Provides Binding Sites for the Spread of SIR Complexes, 71
8. Histone Acetylation in Euchromatin Restricts SIR Complex Spreading, 73
9. Telomere Looping, 73 10. Discontinuity of Repression at Natural Subtelomeric Elements by Telomere Looping, 74 11. Trans-interaction of Telomeres, and Perinuclear Attachment of Heterochromatin, 74 12. Inheritance of Epigenetic States, 75 13. Aging and Sir2: Linked by rONA Repeat Instability, 76 14. Summary, 77 References, 78
7. Sir2 Deacetylates Histone H4 at Lysine 16, 72
63
GENERAL SUMMARY The fraction of chromatin in a eukaryotic nucleus that bears its active genes is termed euchromatin. This chromatin condenses in mitosis to allow chromosomal segregation and decondenses in interphase of the cell cycle to allow transcription to occur. However, some chromosomal domains were observed by cytological criteria to remain condensed in interphase, and this constitutively compacted chromatin was called heterochromatin. With the development of new techniques, molecular rather than cytological features have been used to define this portion of the genome, and the constitutively compacted chromatin found at centromeres and telomeres was shown to contain many thousands of simple repeat sequences. Such heterochromatin tends to replicate late in S phase of the cell cycle and is found clustered at the nuclear periphery or near the nucleolus. Importantly, its characteristic nuclease-resistant chromatin structure can spread and repress nearby genes in a stochastic manner. In the case of the fly locus white, a gene that determines red eye color, epigenetic repression yields a red and white sectored eye due to a phenomenon called position-effect variegation (PEV). Mechanistically, PEV reflects the recognition of methylated histone H3K9 by heterochromatin protein 1 (HP1) and the spreading of this mark along the chromosomal arm. In Saccharomyces cerevisiae, also known as budding yeast, a distinct mechanism of heterochromatin formation has evolved, yet it achieves a very similar result. S. cerevisiae is a microorganism commonly used in making beer and baking bread. However, unlike bacteria, it is a eukaryote. The chromosomes of budding yeast, like those of more complex eukaryotes, are complexed with
histones, enclosed in a nucleus, and replicated from multiple origins during S phase of the cell cycle. Still, the yeast genome is tiny, with only 14 megabase pairs of genomic DNA divided among 16 chromosomes, some not much larger thi;ln certain bacteriophage genomes. There are approximately 6000 genes in the yeast genome, closely packed along chromosomal arms with generally less than 2 kb spacing between them. The vast majority of yeast genes are in an open chromatin state, meaning that they are either actively transcribed or can be very rapidly induced. This, coupled with a very limited amount of simple repeat DNA, makes the detection of heterochromatin by cytological techniques virtually impossible in budding yeast. Nonetheless, using molecular tools, it has been determined that yeast has distinct heterochromatin-like regions adjacent to the telomeres on all 16 chromosomes and at two silent mating loci on chromosome III. Transcriptional repression of these latter two loci is essential for maintaining a mating-competent haploid state. Both the subtelomeric regions and the silent mating-type loci repress integrated reporter genes in a position-dependent, epigenetic manner; they replicate late in S phase and are present at the nuclear periphery. Thus, these loci bear many of the characteristic features of heterochromatin, other than the cytologically visible condensation in interphase. Indeed, for the scientist studying heterochromatin, yeast combines the advantages of a small genome and the genetic and biochemical tools available in microorganisms with important aspects of higher eukaryotic chromosomes.
E PIG ENE TIC SIN
1 The Genetic and Molecular Tools of Yeast
Yeast provides a flexible and rapid genetic system for studying cellular events. With an approximate generation time of 90 minutes, colonies containing millions of cells are produced after just 2 days of growth. In addition, yeast can propagate in both haploid and diploid formsgreatly facilitating genetic analyses. Like bacteria, haploid yeast cells can be mutated to produce specific nutritional requirements or auxotrophic genetic phenotypes, and recessive lethal mutations can be maintained either in haploids bearing conditional lethal alleles (e.g., temperature-sensitive mutants) or in heterozygous diploids (bearing both wild-type and mutant alleles). The highly efficient system of hom*ologous recombination in yeast allows the alteration of any chosen chromosomal sequence at will. In addition, portions of chromosomes can be manipulated by recombinant means on plasmids that can be stably maintained in dividing yeast cells by including short sequences that provide centromere and origin of DNA replication function. Even linear plasmids, or minichromosomes, which carry telomeric repeats to cap their ends, propagate stably in yeast. PEV using the fly white gene as a reporter has been important in defining epigenetic gene regulation and the genes that affect this unique form of gene repression (see Chapter 5 for more detail). The discovery and characterization of a similar phenomenon near yeast telomeres, called telomere position effect (TPE), has been analogously aided by the use of Ura3 and Ade2 reporter genes (Fig. 1). In the presence of 5-fluoroorotic acid (5FOA), the Ura3 protein converts 5-FOA to 5-fluorouracil (5-FU), an inhibitor of DNA synthesis that causes cell death. However, when Ura3 is integrated into regions of heterochromatin, the Ura3 gene is repressed in some, but not all, cells, and only the cells that silence Ura3 are able to grow in the presence of 5-FOA. Thus, by scoring the efficiency of growth on 5-FOA with a serial dilution drop assay (Fig. 1a), one can quantify the repression of this reporter gene over a very large range (e.g., 1O-106 -fold). Moreover, mutations that disrupt TPE can be readily identified by monitoring for increased sensitivity to 5-FOA. Similarly, when the Ade2 gene is targeted for integration into a region of heterochromatin, the gene is repressed and a precursor in adenine biosynthesis accumulates in the cell, turning it a reddish color. Importantly, the epigenetic nature of Ade2 repression is visible within a single colony of genetically identical cells: The gene can be "on" in some cells and "off" in others, pro-
a
SAC C H A ROM Y C ESC ERE V I S I A E
65
TPE of URA3 expression in S.cerevisiae Telomere
URA3 Chr VIIL :: URA3-Tel No. of cells: sir2
10 6
10 5 10 4 10 3 10 2 10
.... ....
~
wt
~
YPD
. . . . . .liliiii
u .....
..
yku70 . . . . . sir2 wt
yku70
b
'~ff
• • • 1$ , -
+ 5-FOA
TPE of AOE2 expression in S.cerevisiae Telomere
ADE2
.,.\YNINi ade2red and white sectors
ADE2
ade2-; ADE2-TeIVR
variegated repression
Figure 1. Silencing and TPE in Yeast (a) The Ura3 gene, inserted near the telomeric simple TG-rich repeat at the left arm of chromosome VII, is silenced by telomeric heterochromatin in this yeast strain. In normal rich medium (YPD), no growth difference can be detected between wild-type (wt) cells that repress the subtelomeric Ura3 gene and silencing mutants that lose telomeric heterochromatin and express Ura3. In media containing 5-FOA (lower panel), on the other hand, cells that repress Ura3 (e.g., wt cells) can grow, whereas cells that express it (sir2 and ykulO mutants) cannot. This is because the Ura3 gene product converts 5-FOA to the toxic intermediate 5-fluorouracil. The serial dilution/drop assay allows detection of silencing in as few as 1 in 10· cells. (b) Cells containing the wt Ade2 gene produce a colony that is "white," whereas those containing mutant ade2 appear red, due to the accumulation of a reddish intermediate in adenine biosynthesis. When the Ade2 gene is inserted near the telomere at the right arm of chromosome V, it is silenced in an epigenetic manner. The silent Ade2 state and the active Ade2 state in genetically identical cells are both inherited, creating red and white sectors in a colony (much like PEV).
66
C HAP T E R
4
ducing red sectors in a white colony background or vice versa (Fig. 1b). Unlike the Ura3 assay, there is no selection against cells that fail to repress Ade2, and therefore, the phenotype of the Ade2 reporter inserted in subtelomeric heterochromatin demonstrates the switching rate as well as the heritability of the epigenetic state. The Ade2 color assay provides a striking illustration of the semi-stable nature of both repressed and derepressed states. Combined with these genetic approaches, biochemical techniques are readily applied to protease-deficient strains grown either synchronously or asynchronously in large cultures. Recently, the battery of tools available has broadened to include sophisticated microarray and protein network techniques that easily accommodate the small genome of yeast. These methods have enabled genome-wide analyses of transcription, transcription factor binding, histone modifications, and protein-protein interactions. This broad range of sophisticated tools has allowed scientists to explore the mechanisms that regulate both the establishment of heterochromatin and its physiological roles in budding yeast. However, before describing these discoveries further, it is necessary to review the life cycle of yeast in more detail.
a
C0\
((§) Mitotis (haploid)
Conjugation
Mitotis (diploid)
@> ~
f-@ @4
~
CQt.LJUGATION
@>@ C9Q)
ha~c:@
F
~@:l
@?~l
~:T~ 0+
Meiosis
SPORULATION
GERMINATION
0G¥>
GERMINATION
b
2 The Life Cycle of Yeast
S. cerevisiae multiplies through mitotic division in either
a haploid or a diploid state, by producing a bud that enlarges and eventually separates from the mother cell (Fig. 2a). Haploid yeast cells can mate with each other (i.e., conjugate), since they exist in one of two mating types, termed a or a, reminiscent of the two sexes in mammals. Yeast cells of each mating type produce a distinct pheromone that attracts the cells of the opposite mating type: a cells produce a peptide of 12 amino acids called a factor, which binds to a membrane-spanning afactor receptor on the surface of an a cell. Conversely, a cells produce a 13 aa peptide that binds to the a-factor receptor on the surface of a cells. These interactions result in the arrest of the cells in mid-to-Iate G I phase of the cell cycle. The arrested cells assume "shmoo"-like shapes (named after the pear-shaped Al Capp cartoon character; Fig. 2b), and the shmoos of opposite mating type fuse at their tips, producing an a/a diploid. The mating response is repressed in diploid cells, which propagate vegetatively (i.e., by mitotic division) just like haploid cells. On the other hand, exposure to starvation conditions will induce a meiotic program that results in the formation of an ascus containing four spores, two of
Figure 2. The life Cycle of Budding Yeast (0) Yeast cells divide mitotically in both haploid and diploid forms. Sporulation is induced in a diploid by starvation, whereas mating occurs spontaneously when haploids of opposite mating type are in the vicinity of each other. This occurs by pheromone secretion, which arrests the cell cycle in G, of a cell of the opposite mating type, and after sufficient exposure to pheromone, the mating pathway is induced. The diploid state represses the mating pathway. (b) In response to pheromone, haploid cells distort toward cells of the opposite mating type. These are called shmoos. The nuclear envelope is visible as green fluorescence.
each mating type. Given sufficient nutrients, the haploid spores grow into cells that are again capable of mating, starting the life cycle over again. Although haploid yeast cells in the laboratory are usually designated as one mating type or the other, in the wild, yeast switch their mating type nearly each cell cycle (Fig. 3a). Mating-type switching is provoked by an endonuclease activity (HO) that induces a site-specific double-strand break at the MAT locus. A gene conversion event then transposes the opposite mating-type
E PIG ENE TIC SIN
a
5 Ace H A ROM Y C ESC ERE V I 5 I A E
67
Yeast life cycle
@
!! haploid
/~
Mating Type Switching
p 0 0 ~ /t
!! haploid
l'-..
0000 ~
a factor
afacto~
@>@ CW)
@ b
Figure 3. Mating Type Switching in Yeast
Conjugation
ala diploid
Chromosome III
HO endonuclease
p
HMLa
MATa
RE
HMRa
85%~~%
~
€}-
.....-e;7!-II-------~.t----{c::::J}-------- -----~~
c
MATa
RE
HMLa
HMRa
Transcriptionally silent domains and silencer elements HMLa
MATa
HMRa
.,...M~ •
i
MATn
2.5kb
is 46
R = Rap1 binding site
Y
a1 al
:
112 kbl
IRIRIRI
x
A
= Abf1
binding site
f
63~'
= ORC consensus
D
IRIRIRI
silenced chromatin region
information from a constitutively silent donor locus, HMLa or HMRa, to the active MAT locus. Such strains are called hom*othallic. This means that a vegetatively growing MATa cell will rapidly produce MATa progeny,
and vice versa. Because in the laboratory it is desirable to have cells with stable mating types, laboratory strains are usually constructed to contain a mutant HO endonuclease gene, which eliminates cleavage at the MAT locus. The loss of HO endonuclease activity prevents mating-type switching, producing a heterothallic strain. These strains contain silent HM loci and an active MAT locus whose mating type information is stably either a or a. Two silent mating loci (Fig. 3b), one for each "sex," are maintained constitutively silent in an epigenetic manner and have become a classic system for the study of heterochromatin.
(0) hom*othallic yeast strains are able to switch mating type after one division cycle. The switch occurs before DNA replication so that both mother and daughter cells assume the new mating type. (b) The position of the silent and expressed mating-type loci on chromosome III are shown here. The active MAT locus is able to switch through gene conversion roughly once per cell cycle, due to a double-strand break induced by the HO endonuclease. The percentages indicated show the frequency with which the gene conversion event replaced the MAT locus with the opposite mating-type information. The directionality of switching is guaranteed by the recombination enhancer (RE) on the left arm of chromosome III. (c) Repression at the silent mating-type loci HMR and HML is mediated by two silencer DNA elements that flank the silent genes. These silencers are termed E (for essential) or I (for important) (Brand et al. 1997) and provide binding sites for Rap1 (R), Abf1 (A), and ORC (0). Artificial silencers can be created using various combinations of the redundant binding sites, although their efficiency is less than that of the native silencers. HMLa and HMRa are 12 kb and 23 kb, respectively, from the telomeres of chromosome III. Telomeric heterochromatin domains at chromosome III are silenced independently from the HM loci in a process that is initiated at the telomeres through multiple binding sites for Rap1 (R).
3 Yeast Heterochromatin Is Present at the Silent HM Mating Loci and at Telomeres
The three mating-type loci, HMLa, MAT, and HMRa are located on chromosome III and contain the information that determines a or a mating type in yeast. HMLa (~11 kb from the left telomere) and HMRa (~23 kb from the right telomere; Fig. 3b,c) are situated between short DNA elements called E and I silencers. Only when either of the silent cassettes is copied and integrated into the active MAT locus is it capable of transcription in a normal cell. The transfer of HMLa information into MAT results in an a mating type (MATa) cell, whereas the transfer of HMRa information into MAT results in the a mating type (MATa)(Fig. 3b). This shows that the genes and promoters at the HM loci are completely intact, although they remain
68 • C HAP
T ER 4
stably repressed when they are positioned at HMR and HML. This is essential for the maintenance of mating potential, because the combined expression of a and ex transcripts in the same cell results in a non-mating sterile state. The scoring of sterility as a phenotype proved very useful for identifying mutations that impair silencing at the HM loci. In this manner, the silent information regulatory proteins, SIR], SIR2, SIR3, and SIR4, were identified as being essential for the full repression of silent HM loci (for review, see Rusche et al. 2003). Mutations in sir2, sirJ, or sir4 caused a complete loss of silencing, whereas in sir] mutants, only a fraction of MATa cells were unable to mate due to a loss of HM repression. Taking advantage of the partial phenotype of sirl-deficient cells, it could be shown that the two alternative states (mating and non-mating) are heritable through successive cell divisions in genetically identical cells (Pillus and Rine 1989). This provided a clear demonstration that mating-type repression displays the hallmark characteristic of epigenetically controlled repression. In addition, it was shown from other studies that the amino termini of histones H3 and H4, repressor activator protein 1 (Rapl), and the origin recognition complex (ORC) are also involved as structural components ofheterochromatin (for review, see Rusche et al. 2003). Heterochromatin is also present immediately adjacent to the yeast telomeric repeat DNA (C\_3A/TG\). As men-
tioned above, when reporter genes such as Ura3 or Ade2 were integrated adjacent to these telomeric repeats, they were repressed in a variegated and epigenetic manner (Gottschling et al. 1990). This TPE shared the HM requirement for Rap1, Sir2, Sir3, Sir4, and the histone amino termini (Kayne et al. 1988; Aparicio et al. 1991). Genetics argued strongly that with the exception of Sirl, similar mechanisms silence genes at the HM mating loci and at telomere-adjacent sites. Moreover, given that the subtelomeric reporters could switch at detectable rates between silent and expressed states, the gene repression appeared to be very similar to fly PEY. In yeast, the four Sir proteins that mediate repression share no extensive hom*ology among themselves, and the Sid, Sid, and Sir4 proteins appear to be conserved only in S. cerevisiae and closely related budding yeasts. Sir2, on the other hand, is the founding member of a large family of NAD-dependent histone deacetylases, which is conserved from bacteria to man (Fig. 4). A role for Sir2-like histone deacetylases in transcriptional repression is observed even in organisms such as fission yeast and flies, which lack the other Sir proteins. The Schizosaccharomyces pombe Sir2 activity is required for transcriptional silencing near telomeres, and Drosophila Sir2 affects the stability of PEV (for review, see Chopra and Mishra 2005). The coupling of NAD hydrolysis with deacetyla-
IV SirT6
II
Figure 4. Sir2 Family of Deacetylases
P.hor
Hst1
P.aby
la
'
Sir2
nuclear
.....
_--_ ...
Sir2 is the founding member of a large family of NAD-dependent deacetylases. The Sir2 family of proteins is unusually conserved and is found in organisms that range from bacteria to humans, and contains both nuclear and cytoplasmic branches of the evolutionary tree. This phylogenetic unrooted tree of Sir2 hom*ologs was generated using CLUSTAL ~ and TREEVIE~ programs to compare the core domain sequences of hom*ologs identified in eDNA and unique libraries. The six subclasses and unlinked group (U) are described in Frye (2000). The mammalian hom*ologs are labeled SirTl-7 and are in bold, and the budding yeast proteins are underlined. Other species are indicated by the species name. (Modified, with permission, from Frye 2000 [© Elsevier].)
E P f G ENE T f C SIN
tion by Sir2 produces O-acetyl ADP ribose, an intermediate that may have a function of its own (Tanner et al. 2000; also see Section 13). It is important to note that the Sir2 family of enzymes modifies many substrates other than histones, with a large branch of the Sir2 family actually being cytoplasmic enzymes (Fig. 4). The diversity of Sir2 functions is illustrated by the fact that mammalian Sir2 deacetylates the transcription factors FOXO and p53 in response to stress and DNA damage, altering their interaction. In budding yeast, Sir2 has an important role in addition to gene silencing, which is to suppress nonreciprocal recombination in the highly repetitive genes of the rDNA locus that is found within the nucleolus (Gottlieb and Esposito 1989).
5 A C C H A ROM Y C ESC ERE V I 5 I A E
•
69
matin-immunopreClpltation techniques, which showed that Sir2, Sir3, and Sir4 proteins interact physically with chromatin throughout the subtelomeric domain of silent chromatin (Hecht et al. 1996; Strahl-Bolsinger et al. 1997). Evidence that this induces a repressive, less accessible chromatin structure comes from other approaches. For instance, it was shown that the DNA of silenced chromatin was not methylated efficiently in yeast cells that express a bacterial dam methylase, although the enzyme readily methylated sequences outside the silent region. This suggested that heterochromatin can restrict access to macromolecules like dam methyltransferase (Gottschling 1992). Similarly, the approximately 3-kb HMR locus in isolated nuclei is preferentially resistant to certain restriction endonucleases (Loo and Rine 1994), and nucleosomes were shown to be tightly positioned between two silencer elements, creating nuclease-resistant domains at silent, but not active, HM loci (Weiss and Simpson 1998). Thus, yeast heterochromatin clearly assumes a distinct chromatin structure. The extent to which either yeast or metazoan heterochromatin is hyper-condensed, and condensation stericallY hinders access to transcription factors, is less certain. Surprisingly, the repressive complex formed by the interaction of Sir proteins and histones appears to be dynamic, because Sir proteins can be incorporated into HM silent chromatin even when cells are arrested at a stage in the cell cycle when heterochromatin assembly generally does not occur (Cheng and Gartenberg 2000).
4 Heterochromatin Is Distinguished by a Repressive Structure That Spreads through the Entire Silent Domain Repression of gene activity in euchromatin can occur due to the presence of a repressive protein or complex that recognizes a specific sequence in the promoter of a gene, thus preventing movement or engagement of the transcription machinery. Heterochromatic repression occurs through a different mechanism that is not promoter-specific: Repression initiates at specific sites, yet spreads continuously throughout the domain, silencing any and all promoters in the region (Fig. 5) (Renauld et al. 1993). This was most clearly demonstrated by the use of chro-
Telomeric heterochromatin TG'-3 repeats
ther workers, revealed numerous links between their senetic behavior and epigenetic regulation (for review, ,ee Fedoroff and Chandler 1994). Indeed, extant transposons and their degenerate remains provide the foundation for establishing epigenetic modifications throughout plant genomes (Section 3.4). More recently, when transgenic technology became routine in the late 1980s for plants such as tobacco, petunia, and Arabidopsis, a major advance in epigenetic research arose from the unexpected results obtained in the course of introducing marker genes (for review, see rorgensen 2003; Matzke and Matzke 2004). The concept e>fhom*ology-dependent gene silencing was formulated as it became evident that silencing was often correlated with multiple copies of linked or unlinked transgenes. Differ~nt cases of hom*ology-dependent gene silencing were due :0 either enhanced turnover of mRNA (posttranscripjonal gene silencing, PTGS) or repression of transcripjon (transcriptional gene silencing, TGS), both of which iVere correlated with increased cytosine methylation of ;ilenced genes. A striking example of PTGS in transgenic Jetunia was initially termed "cosuppression": Attempts to modify floral coloration by overexpression of chalcone iynthase (CHS) genes that condition purple petals often Jroduced variegated or even completely white flowers. rhe lack of pigmentation was shown to result from coorfinate gene silencing of both the CHS transgene and the ~ndogenous CHS gene (Jorgensen 2003). PTGS is now :onsidered the plant equivalent of RNA interference
E PIG ENE TIC
REG U LA T ION
I N
P LAN T 5
•
175
(RNAi) later described in Caenorhabditis elegans and other organisms (see Section 3.2). By the mid-1990s, links between PTGS and virus resistance had been forged. PTGS was shown to naturally protect plants from unchecked replication of viruses, which can be both inducers and targets of PTGS. This principle was exploited in plants to experimentally down-regulate plant genes by constructing viral vectors containing sequences hom*ologous to a target gene, resulting in virusinduced gene silencing (VIGS; for review, see Burch-Smith et al. 2004). In addition, RNA-directed DNA methylation (RdDM) was discovered in viroid-infected plants, providing the first demonstration that RNA could feed back on DNA to elicit epigenetic modifications (Wassenegger et al. 1994). This principle has been successfully used to transcriptionally silence and methylate promoters by intentionally generating hom*ologous double-stranded RNA (see Section 3.4, RNA-directed DNA methylation). 2 Molecular Components of Chromatin in Plants
A number of molecular components of epigenetic regulation in plants were identified by the mutational approaches in Arabidopsis mentioned above (Table 2). However, mutant screens have probably not yet revealed a complete list of epigenetic modifiers because of either functional redundancy in large gene families or the lethal consequences of losing essential components. 2.1 Regulators of DNA Methylation in Plants
Methylation of carbon 5 of cytosines in DNA is a hallmark of epigenetic inactivation and heterochromatin in both plants and mammals (Table 1) (Chapter 18). In plants, however, DNA methylation has a number of unique features with respect to the pattern of methylation, proteins of the methylation machinery, and the possibility to reverse methylation in nondividing cells (for review, see Chan et al. 2005). In this section, we discuss the proteins required to establish, maintain, interpret, and erase DNA methylation. Special components needed for the process of RNA-directed DNA methylation are presented in Section 3.4. DNA
METHYLTRANSFERASES
DNA methylation can be divided into two steps: de novo methylation and maintenance methylation. De novo methylation refers to the modification of a previously unmethylated DNA sequence (Fig. 1) (Chapter 18). In plants, de novo methylation can alter CpG, CpNpG, and
176
C HAP T E R
9
CpNpN nucleotide groups (where N is A, T, or C). In contrast, methylation in mammals is largely restricted to CpG dinucleotides, and there is no evidence for extensive methylation in asymmetric CpNpN nucleotide groups. Although the signals that trigger de novo methylation are largely unknown, double-stranded RNA can fulfill this role in plants (Section 3.4). Maintenance methylation perpetuates methylation patterns during DNA replication and occurs most efficiently at CpG and CpNpG nucleotide groups with their palindromic symmetry. Maintenance of methylation occurs on a hemimethylated substrate after replication or repair, guided by the modification still present on the parental DNA strand. Although it is usually assumed that distinct DNA cytosine methyltransferase enzymes contribute to either de novo or maintenance methylation, an emerging view in plants is that enzymes with different site specificities (CpG or non-CpG) frequently cooperate to catalyze both steps. The three conserved families of DNA methyltransferase are all present in plants. Members of the methyltransferase (METl) family, which are hom*ologs of the mammalian Dnmtl type (see Chapter 18), are considered CpG maintenance methyltransferases, although one has also been assigned a role in CpG de novo methylation in the RdDM pathway (Section 3.4). The Dnmt2 class, of which one member is encoded in the Arabidopsis genome, comprises the most widespread and highly conserved DNA methyltransferase family (Table 2), but its function remains obscure. The plant Domainsrearranged methyltransferases (DRM) and their mammalian hom*ologs, the Dnmt3 group, are usually considered de novo methyltransferases. The DRM enzymes catalyze methylation of cytosines in all sequence contexts and are prominent in the RdDM pathway (Section 4.4). As their name implies, the DRM proteins have rearranged domains (VI-X, followed by I-V) compared to Dnmt3 (I-X). This might give them the ability to methylate asymmetric CpNpN nucleotide groups, which are not methylated in mammalian cells. The plant-specific chromomethylase CMT3 modifies CpNpG trinucleotides. Similarly to METl, CMT3 has been implicated in both de novo and maintenance methylation. The exact function of CMT3 is not entirely clear, although loss-of-function mutants reactivate certain silent transposons (for review, see Chan et al. 2005). In contrast to mammals, where dnmtl and dnmt3 mutants die during embryonic development or shortly after birth, metl, emt3, and drm mutants are viable and usually fertile. The nonlethality of DNA methyltransferase mutations in plants has permitted more extensive
analyses of deficiency mutants during development and sexual reproduction than is possible in mammals (for review, see Chan et al. 2005).
ACTIVE (pG DEMETHYLATION AND
DNA
GlYCOSYLASES
Epigenetic regulation implies that marks corresponding to active or inactive genetic states are potentially reversible. DNA methylation permits such reversibility, because it can be lost through passive or active means. Passive loss occurs when methylation fails to be maintained during multiple rounds of DNA replication. In contrast, active demethylation can occur in nondividing cells and requires enzymatic activities. Early reports from animal systems suggested that active demethylation can result from the action of DNA glycosylases, which are normally involved in base excision repair (for review, see Kress et al. 2001). Interest in this idea has been rekindled by the discovery in Arabidopsis of Demeter (DME) and Repressor of silencing (ROSl), which are large proteins containing DNA glycosylase domains. The ROSl gene was identified in a screen for epigenetic down-regulation and hypermethylation of a stably expressed reporter gene (Gong et al. 2002). The ROSI protein displays nicking activity on methylated but not unmethylated DNA, which is consistent with a role in removing methylated cytosines from DNA in a pathway related to base excision repair. ROSI is expressed constitutively and hence could potentially contribute to loss of DNA methylation in nondividing cells at all stages of development (Kapoor et al. 2005a). In contrast, DME activity is restricted to the female gametophyte, where it activates the imprinting factor Medea (MEA) in a manner that is dependent on a functional DNA glycosylase domain (Choi et al. 2002). The CG methyltransferase METl acts antagonistically to DME, suggesting that DME is indeed required for demethylation of CG dinucleotides (Hsieh and Fisher 2005). In Arabidopsis, there are two additional uncharacterized members of the DME/ROSl family that are unique to plants. The expansion of this gene family suggests that reversible gene silencing by active demethylation is important for plant physiology, development, or adaptation to the environment.
METHYl-DNA-BINDING PROTEINS
Methyl-CG-Binding Domain (MBD) proteins are thought to provide a means to transduce DNA methylation patterns into altered transcriptional activity. In mammals, MBD proteins bind methylated DNA and per-
EPIGENETIC
form various functions, such as recrmtmg histone deacetylases, to reinforce transcriptional silencing. Arabidopsis has 12 MBD-containing genes, compared to 11 in mammals, 5 in Drosophila, 2 in C. elegans, and none in sequenced fungal genomes (Hung and Shen 2003). Little is known about the functions of Arabidopsis MBD proteins, although RNAi-knockdown of one, AtMBDll, was associated with pleiotropic effects on development (Springer and Kaeppler 2005). None of the Arabidopsis MBD proteins has been identified in forward genetic screens, perhaps because of functional redundancy. In addition, despite the amino acid conservation of DNA methyltransferases among plants and mammals, the MBD-containing proteins in the two kingdoms diverge completely outside of the methyl-CG-binding domain. Thus, even though plants and mammals establish and maintain DNA methylation patterns using related enzymes, they might have evolved different ways of interpreting these patterns by means of distinct MBD proteins (Springer and Kaeppler 2005).
REGULATION
IN
PLANTS.
177
histone hypo acetylation and CpG methylation, the latter of which can potentially be actively removed by DNA glycosylases (Section 2.1, Active CG demethylation and DNA glycosylases). Arabidopsis has 18 putative HDACs and 12 putative HATs (Pandey et al. 2002), which is around the same number found in mammals, but more than in other non-plant eukaryotes. The putative HDACs are generally conserved in all eukaryotes, but there is one plant-specific family, HD2, whose function remains obscure. Genetic screens have identified only two members of a conserved family: HDAl and HDA6 (Table 2). HDA6 has roles in maintaining CpG methylation induced by RNA and in repeated sequences, but contributes minimally to development, as indicated by the normal phenotype of deficiency mutants. In contrast, reduced expression of HDAI results in pleiotropic effects on development. None of the Arabidopsis HATs has been identified in forward genetic screens, which might reflect functional redundancy or the direction of most screens toward activation of silent genes.
COMPONENTS OF THE METHYL GROUP DONOR SYNTHESIS
Methylating enzymes require an activated methyl group, usually in the form of S-adenosyl-methionine. Therefore, it is surprising that the biochemical pathways providing this cofactor were not linked with epigenetic regulation earlier. Only recently, however, has a mutation (hogl) in the Arabidopsis gene encoding S-adenosyl-L-hom*ocysteine hydrolase been found to be responsible for epigenetic defects (Rocha et al. 2005). 2.2 Histone-modifying Enzymes
Like other organisms (Table 1), plants contain enzymes that posttranslationally modify the amino-terminal tails of histones, thus establishing a putative histone code (for review, see Loid! 2004). In plants, histone-modifying enzymes are often encoded by comparatively large gene families. Functional information about most family members is still limited. The two most common modifications are histone acetylation/deacetylation and histone methylation.
HISTONE DEACETYLASES AND HISTONE ACETYLTRANSFERASES
The opposing functions of histone acetyltransferases (HATs) and deacetylases (HDACs) ensure reversibility of this epigenetic mark. The potential for reversibility is reinforced by the frequent coexistence at silent genes of
HISTONE METHYLTRANSFERASES
Proteins that are able to methylate lysine residues in histones (referred to in this book as histone lysine methyltransferases or HKMTs) and other proteins contain a common SET domain (SU(VAR)/E(Z)/TRX). Through their ability to methylate histone H3 or H4 at various lysine residues, different complexes containing SET domain proteins play roles in promoting or inhibiting the transcription of specific genes and in forming heterochromatin. Some SET domain proteins are members of the Polycomb group (PcG) or trithorax group (trxG), which maintain transcriptionally repressed or active states, respectively, of homeotic genes during plant and animal development (see Chapters 11 and 12). Other SET domain proteins, such as SU(VAR)3-9, participate in maintaining condensed heterochromatin, often in repetitive regions, by methylating H3 at lysine 9 (H3K9). The Arabidopsis genome encodes 32 SET domain proteins, 30 of which are expressed. They can be grouped into four conserved families: E(Z), TRX, ASH1, and SU(VAR)3-9, as well as a small fifth family present only in yeast and plants (Baumbusch et al. 2001; Springer et al. 2003). The number of expressed SET domain proteins in Arabidopsis is relatively high compared to the 14 in Drosophila and 4 in fission yeast, although there are 50 SET domain proteins in mice. In addition to expansion of the SET domain protein family by polyploidy, retro-
178 •
C HAP T E R 9
transposition has also played a role in the amplification of SU(VAR)3-9 members in Arabidopsis. Outside of the SET domain, the plant and animal proteins are not always well conserved. The divergent regions are predicted to mediate protein-protein interactions, suggesting that plant SET domain proteins might act in complexes distinct from those in animals. Although incomplete, the functional information available for Arabidopsis SET domain proteins implicates them in chromatin regulation and epigenetic inheritance. The first two SET domain proteins to be identified in genetic screens were Curly leaf (CLF) and Medea/Fertilization independent seed formation (MEA/FISl), which are negative regulators related to Drosophila E(Z). In addition to being SET domain proteins, MEA, CLF, and E(Z) are also PcG proteins (Section 2.3, Other polycomb proteins). Mutations in eLF result in altered leaf morphology and homeotic changes in flower development. MEA/PIS 1 regulates gametophyte-specific gene expression and is an imprinting factor that inhibits endosperm development in the absence of fertilization (for review, see Schubert et al. 2005). In contrast, the TRX family member Arabidopsis trithoraxl (ATXl) acts as an activator of floral homeotic genes, presumably by means of its ability to catalyze histone H3 lysine 4 (H3K4) methylation, a mark often associated with transcriptionally active chromatin (for review, see Hsieh and Fischer 2005). Kryptonite/Suppressor of variegation 3-9 hom*olog 4 (KYP/SUVH4) was identified in screens for suppressors of epigenetic silencing at two endogenous genes (Jackson et al. 2002; Malagnac et al. 2002). KYP/SUVH4 catalyzes mono- and dimethylation of H3 at lysine 9 (H3K9me2/me3) and acts together with CMT3 to maintain CpNpG methylation of a subset of sequences in Arabidopsis. KYP/SUVH4 appears to play only a minor role in heterochromatin formation (Chan et al. 2005). In contrast, Suppressor of Variegation 3-9 hom*olog 2 (SUVH2), identified in a screen for reactivation of a silent transgene, appears to be the major activity responsible for methylation ofH3 at lysines 9 (H3K9) and 27 (H3K27) in heterochromatin in Arabidopsis (Naumann et al. 2005). Lysines in histones H3 and H4 can be mono-, di-, or trimethylated, which increases the combinatorial complexity of these modifications. Specific states define heterochromatin in different organisms. For example, H3K9me3 is a prominent feature of heterochromatin in animals and fungi, whereas this epigenetic mark is associated with euchromatin in Arabidopsis. Conversely, H3K9mel and H3K9me2 are the predominant marks for
silenced heterochromatin in Arabidopsis, whereas they are euchromatic modifications in mammals. The origins of these differences and how they relate to the postulated histone code remain to be determined. In addition, the intricate relationships between specific histone modifications and DNA methylation patterns in plants remain to be fully elucidated (Tariq and Paszkowski 2004). In contrast to histone acetylation, which can be dynamically regulated by the opposing activities of HDACs and HATs, histone methylation was thought until recently to be a more permanent epigenetic mark. Recent work in mammals, however, has identified a lysine demethylase, LSDl, that can remove H3K4mel and H3K4me2 but not H3K4me3 (see Chapter 10). Four putative LSD hom*ologs are encoded in the Arabidopsis genome, suggesting that at least some histone methylation is reversible in plants. 2.3 Other Chromatin Proteins OTHER POLYCOMB PROTEINS
PcG proteins were initially identified in Drosophila as factors required to maintain repression of homeotic genes (see Chapter 11). In animals, structurally disparate PcG proteins act together in multiprotein complexes to repress gene expression. The PRCl complex is absent in plants and C. elegans but present in Drosophila and mammals. The PRC2 complex, however, is found in plants and animals, where it has been shown to methylate predominantly H3 at lysine 27 (H3K27) through the histone methyltransferase activity of the SET domain and PcG protein E(Z). Arabidopsis hom*ologs of the core components of PRC2 have been identified in mutant screens designed to dissect various developmental pathways. In Drosophila, PRC2 components are encoded by single-copy genes. In contrast, genes encoding these proteins in Arabidopsis show functional diversification of at least three PRC2 complexes-PIS (fertilization independent seeds), EMF (embryonic flower), and VRN (vernalization)-that differ in their target gene specificity (Schubert et al. 2005; see also Fig. 2 in Chapter 11). PIS genes were identified in screens for mutants showing partial seed development in the absence of fertilization. A major target is the MADS-box transcription factor PHERES (Kohler et al. 2005). Components of the EMF complex were identified by their common role in repressing floral homeotic genes, such as Agamous and Apetala3. A member of the VRN complex, VRN2, was identified on the basis of its contribution to epigenetic memory of vernalization, which is defined as the break of seed dor-
E PIG ENE TIC
mancy by cold treatment. Plants have to program their reproduction to occur during the proper season, and they do this in temperate climates by flowering only after extended periods of cold temperatures. The epigenetic memory of winter requires VRN2, which maintains coldinduced transcriptional repression of the gene encoding the flowering inhibitor FLC during later periods of growth at warmer temperatures. H3K27me2 is lost from FIC in vrn2 mutants, which is consistent with a role for PRC2 complexes in facilitating histone methylation (Schubert et al. 2005).
COMPONENTS OF IMPRINTING
Flowering plants and mammals are the only groups of organisms that have parental imprinting (Table 1), an epigenetic phenomenon in which a gene is differentially expressed depending on the parent from which it was inherited. In view of the parental conflict theory for the evolution of imprinting (for further discussion, see Chapter 19), the occurrence of parental imprinting in flowering plants and mammals likely reflects the fact that both taxa have a special maternal tissue that provides a nutrient source for the developing embryo. In mammals, this tissue is the placenta, and in plants it is the triploid endosperm, a terminally differentiated tissue that contains one paternal and two maternal genomes (Fig. 1). Indeed, the first example of parental imprinting of a single gene in any organism was observed in maize endosperm (for review, see Alleman and Doctor 2000). In Arabidopsis, two genes expressed in the endosperm, MEA and FWA (a flowering time control gene), are imprinted. In these cases, the two maternal copies are activated, presumably by DME-catalyzed active demethylation of CpGs in the female gametophyte (see Section 2.1, Active CpG demethylation and DNA glycosylases), whereas the paternal copy remains silent (for review, see Autran et al. 2005). Intriguingly, even though imprinting evolved independently in plants and mammals, DNA methylation and PcG proteins are required in both cases (Kohler et al. 2005).
CHROMATIN-REMODELING PROTEINS
Switch2/Sucrose Non-Fermentable2 (SWI2/SNF2) chromatin-remodeling factors constitute a conserved family of ATP-dependent chromatin remodelers that are able to displace nucleosomes or loosen histone/DNA contacts. Genetic screens have provided functional information for only a handful of the approximately 40 SWI2/SNF2 hom*o logs encoded in the Arabidopsis
REG U L A T ION
I N
P LAN T S
•
1 79
genome (Plant Chromatin Database). So far, only twoDecreased DNA methylation 1 (DDMl, Jeddeloh et al. 1999) and Defective in RNA-directed DNA methylation 1 (DRD1, Kanno et al. 2004)-have been implicated in regulating DNA methylation. Deficiency mutants of DDMl, which undergo genome-wide reduction of DNA methylation and transcriptionally reactivate a number of silent transposons and repeats, display severe developmental and morphological defects. These appear only after several generations of inbreeding hom*ozygous ddml plants and appear to be due to the accumulation of epimutations and to insertional mutagenesis by transposons that are reactivated in the mutant. DDMl has an ortholog in mammals, Lymphoid-Specific Helicase (ISH), which is likewise important for global CpG methylation and embryonic development. In contrast, DRDl is unique to the plant kingdom and probably has a specialized role in RdDM (Section 3.4). No phenotypic alteration other than a release of certain repetitive targets from silencing is caused by mutations of Morpheus' Molecule (MOM, Amedeo et al. 2000), a plant-specific gene with an incomplete ATPdependent helicase motif. MOM acts synergistically with, but independently of, the DDMl/DNA methylation pathway, indicating multiple layers of transcriptional regulation in plants (Tariq and Paszkowski 2004). Three more proteins with putative chromatin-remodeling function, Splayed (SPD), Photoperiod-independent early flowering (PIE), and Pickle (PKL), which were each identified by developmental effects in deficiency mutants, have not yet been implicated in specific chromatin modifications (Wagner 2003).
CHROMATIN ASSEMBLY FACTORS
Whereas the SWI2/SNF2 proteins probably act on assembled chromatin, other components are required to reestablish chromatin after replication and repair-associated DNA synthesis. The Chromatin Assembly Factor (CAF) complex, composed of three subunits, helps to bring semi-assembled nucleosomes to the replication fork. Mutations in genes of the two larger CAF subunits in Arabidopsis (lasl, fas2) cause characteristic morphological anomalies (fasciation, Fig. 2F), deficiencies in DNA repair, and derepression of repetitive targets (Takeda et al. 2004). This suggests that correct nucleosome deposition is essential for development and epigenetic control. Whereas the lack of CAF subunits does not interfere with maintenance of DNA methylation, it could lead to the erasure of other epigenetic marks, such as histone modifications. Reduced
180 •
C HAP T E R 9
levels of the third CAF unit MSIl do not reiterate fasciation but lead to distorted seed development and several morphological changes (for review, see Hennig et al. 2005). A mutation in the BRU gene that is unrelated to any known chromatin assembly protein, but results in a phenotype very similar to that of the fas mutants, makes it likely that additional factors are involved in maintaining the epigenetic information and genetic integrity during postreplicative chromatin assembly (Takeda et al. 2004). Finally, lack of RPA2, a subunit of the Replication Protein A complex, results in DNA damage sensitivity and release of transcriptional silencing, changing histone modification marks but not DNA methylation patterns (Elmayan et al. 2005; Kapoor et al. 2005b). HETEROCHROMATIN-LIKE PROTEINS
HP1 (heterochromatin protein 1) in Drosophila and mammals, and their hom*ologs in fungi, are important components of silenced heterochromatin. The binding of HP1 through its chromodomain to methylated histone H3 at lysine 9 (H3K9me) promotes spreading of the silenced state to establish heterochromatic domains. The Arabidopsis genome encodes a single protein with hom*ology to Drosophila HPl. Mutations in this gene, termed Like heterochromatin protein (LHP 1) (Gaudin et al. 2001) or Terminal flower 2 (TFL2) (Kotake et al. 2003), result in changes in plant architecture, altered leaf development, and early onset of flowering. Although this mutant phenotype suggests an important role in regulating plant gene expression, it is unlikely that LHP1 acts through the formation of repressive chromatin complexes similarly to HP1 in other organisms. Instead, LHP 1 in Arabidopsis regulates loci in euchromatin that are not targets of DNA methylation (Kotake et al. 2003; Tariq and Paszkowski 2004). Thus, LHP1 in plants and HP1 in other organisms appear to have evolved different modes of action.
3 Molecular Components of RNAi-mediated Gene Silencing Pathways
Modern epigenetics research has traditionally focused on DNA methylation and histone modifications. During the past several years, it has become evident that these alterations can be targeted to specific regions of the genome by the RNA interference pathway. Indeed, it is impossible nowadays to consider epigenetic regulation in many eukaryotes without integrating components of the RNAi machinery (Matzke and Birchler 2005). This is particu-
larly true for plants, where the proliferation of RNAimediated gene-silencing pathways exceeds that present in any other type of organism.
3.1 Elaboration of RNAi-mediated 5i1encing in Plants
RNAi and related types of gene silencing represent cellular responses to double-stranded RNA (dsRNA). In these pathways, the dsRNA is processed by the RNase III-like endonuclease, Dicer, to produce small RNAs which determine the specificity of silencing by base-pairing to complementary target nucleic acids. Small RNAs incorporate into multiprotein silencing effector complexes to direct mRNA degradation, repress translation (PTGS), or guide chromatin modifications (TGS) in a sequence-specific manner. A key component of silencing effector complexes is an Argonaute protein, which binds small RNAs through its PAZ domain. Individual members of the Argonaute protein family, which comprises the largest group of proteins important for RNAi-mediated silencing, confer functional specificity to different silencing effector complexes (for review, see Carmell et al. 2002). In addition to participating in viral defense and transposon control, RNAi-mediated gene silencing plays essential roles in plant and animal development. The elaboration of RNAi-mediated silencing in plants reflects in part their co-evolution with pathogens that generate dsRNA during replication, such as RNA viruses and viroids. Indeed, together with transgenes-another type of "foreign" nucleic acid-these RNA pathogens have been invaluable for detecting and studying various forms of RNAi-mediated gene silencing in plants. The proliferation of RNAi-mediated gene-silencing pathways in plants is illustrated by 1. the expansion and functional diversification of gene families encoding core components of RNAi: the Arabidopsis genome encodes four DICER-LIKE (DCL) proteins and ten Argonaute (AGO) proteins 2. the heterogeneity in length and functional diversity of small RNAs, including the 21-nucleotide short interfering RNAs (siRNA) derived from transgenes and viruses, and several types of endogenous small RNAs, such as 21- to 24-nucleotide microRNAs; 21-nucleotide trans-acting siRNAs, and 24- to 26nucleotide heterochromatic siRNAs 3. the various modes of gene silencing elicited by different small RNAs: PTGS involves mRNA degradation or repression of translation, and TGS is
E PIG ENE T / eRE G U LA T ION
associated with epigenetic modifications such as DNA cytosine methylation and histone methylation 4. the importance of PTGS in antiviral defense, which can be countered by a variety of plant viral proteins that repress silencing at different steps of the pathway 5. the existence of processes, such as non-cellautonomous silencing and transitivity (see Section 3.2, Non-ceil-autonomous silencing and transitivity), that rely on RNA-dependent RNA polymerases, six of which are encoded in the Arabidopsis genome These aspects will be discussed in the framework of three major pathways of RNAi-mediated gene silencing in plants (Fig. 3a-c). However, it should be kept in mind
a
RNAi-mediated gene silencing induced by transgenes and viruses appears to function primarily as a host defense to foreign or invasive nucleic acids, including viruses, transposons, and transgenes. ORIGIN AND PROCESSING OF DsRNA
Transgene constructs can be introduced into plant genomes in sense or antisense orientations or as inverted DNA repeats. Viruses can have single-stranded
c
I miRNA gene I
ITAS gene I
1
1
==~
TGS/RdDM/heterochromatin
'--,-__--' Itarget gene/repeats I
1
POL IVa
l
pri_miRNAl SGS3 RDR6j
DCLl
SDE3
HYLl
WEX HENl
RD:2
...
1
DCL3
vRdRP
HEN1
pre-miRN.FO
1
== ==
DCLl
24-26 nl
DCL4
HENl
miR/miR· ....
SDE3
RDR6 amplification transitivity mobile signals
I
== ~==
21 nt
== I==
24-26 nt
~ 11 ~mRNA ~
mRNA cleavage
1
RdDM, mobile signals
181
3.2 Pathway 1: Transgene-related Posttranscriptional and Virus-induced Gene 5i1encing (PTG5N/G5)
microRNAItrans-acting siRNA
aRNA
P LAN T 5
that the pathways feed into each other at various points. Components with assigned functions are listed in Table 2.
b transgene PTGSIVGS
I N
==
miRNA 21 nl
_ _ lasiRNA
1-----'1 -
~
-
21nl
===:::::::::::_= ~~~31 ~~:~ DDMl
METl
RdDM
11
1
DNA
histone modifications
- _.......;,,;;;;iii. . . mRNA
/ mRNA cleavage or translational block
RdDM mRNA cleavage
Figure 3. RNA-mediated Silencing Pathways in Plants Although there are some overlaps and shared components, three major pathways can be distinguished by the source of dsRNA, class of small RNA, nature of the target sequence, and the mode of silencing evoked. Silencing effector complexes containing an Argonaute protein are shown as light gray spheres. Yellow boxes mark processes known to occur within the nucleus. See text for details and Table 2 for the names of regulatory components. Plant-specific proteins are labeled in green. (PTGS) Posttranscriptional gene silencing, (VIGS) virus-induced gene silencing, (TGS) transcriptional gene silencing, (RdDM) RNA-directed DNA methylation, (IR) inverted repeats, (AS) antisense, (vRdRP) virally encoded RNA-dependent RNA polymerase, (a RNA) aberrant RNA, (siRNA) short interfering RNA, (RISe) RNA-induced silencing complex. (Modified from Meins et al. 2005.)
182
•
C HAP T E R 9
or double-stranded DNA or RNA genomes. Therefore, in this pathway, dsRNA can be produced by a variety of routes. In principle, antisense transcripts can base-pair directly to target mRNAs to form dsRNA. Transcription through inverted DNA repeats can produce hairpin RNAs. RNA viruses, which encode their own RNAdependent RNA polymerase (vRdRP) and replicate via dsRNA intermediates, enter the pathway directly at the level of dsRNA. In contrast, sense transgenes and DNA viruses, such as geminiviruses, require the cellular RNAdependent RNA polymerase RDR6 for dsRNA synthesis as well as several other factors identified genetically (SDE3, SGS3, and WEX; Table 2). To render them substrates for RDR6, transcripts of sense transgenes and DNA viruses are presumed to be aberrant in some way; for example, by lacking a 5' cap or a polyadenylated tail (for review, see Meins et al. 2005). The DCL activity required to process dsRNA into siRNAs in the PTGS pathway has not yet been identified (DCLX). Tests of dell partial loss-of-function mutants indicated that DCLl is unlikely to be involved in this processing step. The plant-specific protein HENl adds a methyl group to the 3'-most nucleotide of small RNAs, thus protecting them from uridylation and subsequent degradation (Li et al. 2005). DCL2 has been implicated in generating siRNAs from some, but not all, RNA viruses (Xie et al. 2004). PTGS and VIGS result in the production of two distinct size classes of siRNA, 21-22 nucleotides and 24-26 nucleotides, that have been implicated in diverse functions (Baulcombe 2004). In general, the 2l-nucleotide siRNAs are thought to guide mRNA cleavage, whereas the 24- to 26-nucleotide size class, termed heterochromatic siRNA, directs epigenetic modifications to hom*ologous DNA sequences (i.e., TGS; see Section 3.4). Following DCL processing, the siRNA duplex is unwound and the antisense strand associates with a member of the Argonaute protein family, as part of the assembly into the RNA-induced silencing complex (RISC). The siRNA-programmed RISC can then direct endonucleolytic cleavage of target mRNAs at a single site near the center of siRNA-mRNA complementarity. For the mammalian equivalent, cleavage is catalyzed by the Ago2 "slicer" activity (see Chapter 8). The Arabidopsis protein carrying out this function in the transgene PTGS pathway is AGOl (Baumberger and Baulcombe 2005). Following endonucleolytic cleavage, the severed 3' segment of the mRNA is degraded in the 5' to 3' direction by the exonuclease AtXRN4 (Souret et al. 2004); the 5' portion is probably degraded by the exosome in a 3' to 5' direction.
NON-CELL-AUTONOMOUS SILENCING AND TRANSITIVITY
PTGS in plants has two special properties that rely on the activity of the RNA-dependent RNA polymerase RDR6: non-ceIl-autonomous silencing and transitivity (Fig. 3a). In the former, RNA signals that induce PTGS move from the cell of origin into neighboring cells through plasmodesmata or-as originally shown in grafting experiments-through the vascular system to induce sequence-specific gene silencing at distant sites (for review, see Voinnet 2005). Mobile small RNAs, providing a systemic silencing signal, thus might play the dual function of influencing plant development by facilitating communication between cells, and coordinating activities in remote parts of the plant. This proposal is supported by the finding of microRNAs (miRNAs; important for development, Section 3.3) and a small RNA-binding protein in phloem sap, which is the main transporter of metabolites through the plant vascular system (Yoo et al. 2004). Transitivity refers to the generation of secondary siRNAs corresponding to sequences located outside the primarily targeted regions. To make these, RNAdependent RNA polymerase catalyzes synthesis of secondary dsRNAs from transgene or viral template RNAs using primary siRNAs as primers. Dicer processing yields secondary siRNAs, which amplify the silencing reaction and, when viral RNAs are involved, strengthen virus resistance (Voinnet 2005). The only other organism in which both non-cellautonomous silencing and transitivity have been observed is C. elegans (see Chapter 8), which has putative RNA-dependent RNA polymerase activities that are absent in mammals and Drosophila. VIRAL SUPPRESSORS OF SILENCING
Plant viruses are not only inducers and targets of silencing; they also encode proteins that can suppress silencing (for review, see Voinnet 2005). This reinforces the idea that PTGS is a natural defense to viruses, since these suppressor proteins constitute a counter-defense "strategy" of the pathogen. Most plant viruses encode at least one silencing suppressor protein that acts at a distinct step of the PTGS pathway, typically downstream of dsRNA processing. Suppression of PTGS by a virus is strikingly revealed in mottled soybeans, where the dark color is the result of reversal of natural PTGS (i.e., reactivation) of a pigment gene (Fig. 2e) (Senda et al. 2004). Viral suppressors of RNAi have recently also been found in an insect virus and a mammalian retrovirus (Lecellier et al. 2005; Voinnet 2005).
E PIG ENE TIC
3.3 Pathway 2: Regulation of Plant Development by miRNAs and Trans-acting siRNAs
The discovery of endogenous populations of miRNAs in plants and animals opened a new era in research of developmental biology and RNAi-mediated gene silencing (for review, see Bartel 2004). miRNAs silence gene expression by base-pairing to target messenger RNAs (mRNAs) and inducing either mRNA cleavage or translation repression. The importance of miRNAs in plant development is illustrated by the fact that many genes needed for miRNA biogenesis and silencing-including DeLl, AG01, HEN1, HYLl, and HST-were identified in screens for developmental mutants and only later shown to be important for miRNA accumulation. The phenotypes of mutants defective in these proteins suggest diverse roles for miRNAs in meristem function, organ polarity, vascular development, floral patterning, and stress/hormone responses (for review, see Kidner and Martienssen 2005). miRNAs have recently been implicated in the biogenesis of a new type of small RNA, the trans-acting siRNAs. ROLES AND BIOGENESIS OF MIRNAs
miRNAs were initially recovered by cloning size-fractionated small RNAs ranging from about 18 to 28 nucleotides in length. Their high degree of complementarity to target mRNAs in plants facilitated identification of additional miRNAs by computational approaches. So far, 92 loci in Arabidopsis that encode 27 distinct miRNAs have been discovered, and there are a similar number in rice. The expression of many miRNA genes is developmentally or environmentally regulated. About 50% of their known targets in Arabidopsis are transcription factors, many of which were known modulators of meristem formation and identity, prior to their identification as miRNA targets. In contrast, animal miRNAs do not preferentially target transcription factors but regulate diverse genes that operate at many levels in the cell. Two essential proteins of the miRNA pathway in Arabidopsis, DCLl and AGO 1, are themselves regulated by miRNAs, providing a means for negative modulation by feedback control (Kidner and Martienssen 2005). Many miRNAs are evolutionarily conserved among eukaryotes (Axtell and Bartel 2005), in some cases over extended periods of time. Remarkably, in flowering plants, gymnosperms, and more primitive plants, mRNAs of a group of transcription factors that regulate meristem formation and lateral organ asymmetry have maintained perfect complementarity to the cognate miRNA. This
REG U L A T ION
I N
P LAN T S
•
183
indicates conservation of function for at least 400 million years (Floyd and Bowman 2004). miRNAs are encoded in regions between protein-coding genes or in introns. They originate from imperfect RNA hairpin precursors, ranging from 70 bp to more than 300 bp in length, that are transcribed by DNA-dependent RNA polymerase II. Processing of plant miRNA precursors occurs in multiple steps in the nucleus. First, the ends of the pri-miRNA are removed by nuclear DCLI. This step requires the dsRNA-binding protein HYLl, originally identified by the hormone response defects of its mutant phenotype (Han et al. 2004; Vasquez et al. 2004a). The second step involves release of the miRNA duplex (miRlmiR*, Fig. 3b), again by DCLl, and 3'-end methylation by HENI (see Section 3.2, Origin and processing of dsRNA). Transport of the miR/miR* duplex from the nucleus to the cytoplasm requires HASTY (HST), a hom*olog of mammalian Exportin 5 (Park et al. 2005). Mature miRNAs are also found in nuclear fractions, suggesting that some may function in the nucleus to direct epigenetic modifications. Indeed, a miRNA that is complementary to the spliced, nascent transcript of a transcription factor induces cytosine methylation of DNA sequences downstream of the target gene, by an unknown mechanism (Schubert et al. 2005). miRNA biogenesis differs somewhat in mammals, which have a single Dicer that is located in the cytoplasm and a second RNase III-type activity, Drosha, in the nucleus. Drosha, together with the dsRNA-binding protein Pasha-neither of which has a hom*olog in plants-cleaves the ends of the pri-miRNA. The resulting pre-miRNA is then transported to the cytoplasm by an Exportin5-mediated pathway to undergo final processing to mature miRNAs by Dicer (Du and Zamore 2005; Kim 2005).
PLANT MIRNAs GUIDE MRNA CLEAVAGE
In general, animal miRNAs show imperfect complementarity to target mRNAs and repress translation by binding to multiple sites in 3'UTRs. In contrast, the nearly perfect complementarity of plant miRNAs to the coding regions of target mRNAs favors mRNA cleavage, presumably in a manner similar to siRNAs. However, there are increasing exceptions to both of these "rules." For example, plant miRI72 is able to block translation, and certain mammalian miRNAs may direct cleavage of target mRNAs (for review, see Du and Zamore 2005). AGOI is the founding member of the Argonaute family of proteins and the mRNA "slicer" component of
184 •
C HAP T E R 9
miRNA-programmed RISC in Arabidopsis (Baumberger and Baulcombe 2005). AG01 was identified prior to the discovery of miRNAs in a screen for Arabidopsis mutants defective in leaf development (for review, see Carmell et al. 2002). The name Argonaute was inspired by the phenotype of agol mutants, which resemble a small squid because of their narrow, filamentous leaves. Agol mutants display shoot apical meristem defects similar to mutants deficient in PNH/ZLL/AG010 (Table 2), which is similar to AGO 1 but not yet shown to be needed for PTGS (Vaucheret et al. 2004). The essential function of AGO proteins in plant meristems is consistent with a conserved function of these proteins in stem cell maintenance (Carmell et al. 2002; Kidner and Martienssen 2005).
3.4 Pathway 3: Transgene-related Transcriptional Silencing, RNA-directed DNA Methylation, and Heterochromatin Formation
Current concepts of RNAi-mediated transcriptional gene silencing grew out of early plant work on hom*ologydependent gene silencing triggered by multiple copies of promoter regions and on RNA-directed DNA methylation (for review, see Matzke and Matzke 2004). More recent studies on RNAi-mediated heterochromatin formation in fission yeast (see Chapters 6 and 8) and on siRNA-mediated TGS in mammalian cells have expanded the phylogenetic scope of this process and confirmed mechanistic overlaps to RNAi. RNA-DIRECTED DNA METHYLATION
TRANS-ACTING SIRNAs
Endogenous trans-acting siRNAs (ta-siRNAs) are a new type of small RNA that have been discovered recently in Arabidopsis. The ta-siRNAs, which elicit cleavage of their target mRNAs, share features with both siRNAs and miRNAs. Similarly to siRNAs, the synthesis of the dsRNA precursor of ta-siRNAs depends on RDR6 and SGS3. Similarly to miRNAs, ta-siRNAs originate from genomic regions that have little overall resemblance to their target mRNA. To ensure formation of the correct ta-siRNA with complementarity to the target mRNA, a miRNA sets the phased cleavage of the dsRNA precursor by DCL4 (Fig. 3b) (Allen et al. 2005; Gasciolli et al. 2005). ta-siRNAs have been assigned a role in developmental timing. During development, the shoot of flowering plants undergoes two phase changes: the vegetative phase change, comprising the juvenile-to-adult transition, and the reproductive phase change, which results in growth of flower-containing branches instead of vegetative shoots (see Fig. 1). Although genetic analysis of floral induction is well advanced, less is known about the vegetative phase change. An Argonaute protein, ZIPPY/AGO?, however, has a specialized role in this transition. A screen for mutants undergoing precocious vegetative phase change similar to zip/ago7 mutants identified RDR6 and SGS3, two genes important for PTGS (Fig. 3b) (Peregrine et al. 2004). Further analysis showed that several genes that are up-regulated in rdr6 and sgs3 mutants are silenced posttranscriptionally by ta-siRNAs (Vazquez et al. 2004b). These findings imply that components of the PTGS machinery are important not only for viral defense and transgene silencing, but also for temporal control of developmental switches. It is not yet known whether ta-siRNAs have counterparts in animals.
RdDM was first observed in tobacco plants infected with viroids (Wassenegger et al. 1994). Viroids are minute plant pathogens consisting solely of a non-protein-coding, circular RNA several hundred bases in length. In the original experiments, replicating viroids were found to trigger de novo methylation of viroid cDNAs integrated as transgenes into the tobacco genome. Transgene systems were subsequently used to establish that RdDM requires a dsRNA that is processed to small RNAs, a hallmark of RNAi. RNA viruses that replicate exclusively in the cytoplasm were shown to elicit methylation of hom*ologous nuclear DNA, indicating that small RNAs produced in the cytoplasm as a consequence of PTGS are able to enter the nucleus and induce epigenetic changes. dsRNAs containing promoter sequences can direct DNA methylation and transcriptional silencing of cognate target promoters (for review, see Mathieu and Bender 2004; Matzke et al. 2005). In plants, RNA induces a distinctive pattern of de novo methylation that is typified by the modification of cytosines in all sequence contexts, largely within the region of RNA-DNA sequence identity. This characteristic pattern hints that RNA-DNA base-pairing provides a substrate for de novo methylation, but this remains to be experimentally verified. Whereas asymmetrical CpNpN methylation is not efficiently retained after withdrawing the trigger RNA, symmetrical CpG and CpNpG methylation can be maintained to varying extents without RNA at different promoters. Differences in the efficiency of maintenance methylation might reflect differences in sequence composition or patterns of histone modifications. Combined data from genetic screens using transgene and endogenous gene systems are revealing the molecular components needed for RdDM and TGS. Transgene sys-
E PIG ENE TIC
terns rely on transcribed inverted repeats or viruses to produce dsRNA that is hom*ologous to target DNA loci. Endogenous genes that have been informative in forward genetic screens include the phosphoribosyl anthranilate isomerase (PA!) gene family (Mathieu and Bender 2004) and the SUPERMAN gene (Chan et al. 2005). These genes have features that render them targets or inducers of RdDM and TGS. For example, the PAl gene family contains four members, two of which are arranged as an inverted repeat. Transcription through the inverted repeats from an unrelated upstream promoter produces a dsRNA that targets the singlet copies of the PAl gene for methylation and silencing. PLANT-SPECIFIC MACHINERY FOR
RDDM
For the most part, conserved DNA methyltransferases and histone-modifying enzymes are required for RdDM (Sections 2.1 and 2.2). De novo methylation of cytosines in all sequence contexts is catalyzed by the conserved DRM class of DNA methyltransferase. The conserved METl and plant-specific CMT3 function primarily to maintain methylation of CpG and CpNpG nucleotide groups, respectively, although minor contributions to de novo methylation have been reported. The conserved histone deacetylase HDA6 and the SWI2/SNF2 protein DDMI help to maintain CpG methylation at some loci. The histone methyltransferase KYP/SUVH4 is involved in locus-specific maintenance of CpNpG methylation induced by RNA (for review, see Chan et al. 2005). A recent, surprising finding is that RdDM requires a plant-specific RNA polymerase, termed pol IV. In all eukaryotes examined so far, there are three DNAdependent RNA polymerases-pol I, pol II, and pol IIIthat contain multiple subunits encoded by distinct genes. The first hint of the existence of pol IV came from analyzing the Arabidopsis genome sequence, which revealed genes encoding the largest and second-largest subunits of an atypical RNA polymerase unique to plants. There appear to be two functionally diversified pol IV complexes that are specified by unique largest subunits that each act with a common second-largest subunit. pol IVa is needed to generate siRNAs, presumably by initially transcribing target genes (Herr et al. 2005; Onodera et al. 2005). The initial transcript is used as a template by RDR2 to synthesize dsRNA, which is processed by DCL3, a nuclear activity that is specialized for producing 24nucleotide heterochromatic siRNAs from transposons and repeats (Xie et al. 2004). Downstream of siRNA production, pol IVb acts together with the plant-specific
REG U L A T ION
I N
P LAN T 5
•
185
SWI2/SNF2-like protein DRDI to signal DNA methylation (Kanno et al. 2005), probably in cooperation with AG04 (Fig. 3c) (Chan et al. 2005). Whether pol IVb actually transcribes RNA is not known, but its net effect is to create a chromatin structure that permits DNA methyltransferases to catalyze de novo cytosine methylation at the siRNA-targeted site. Even though other eukaryotes do not contain pol IV subunits, two subunits of pol II, which transcribes mRNA precursors, are required for RNAi-mediated heterochromatin formation in fission yeast (see Chapters 6 and 8). Although promoter-directed siRNAs can induce TGS in human cells, there are conflicting reports about whether this is accompanied by detectable DNA methylation (Kawasaki et al. 2005; Ting et al. 2005). Many proteins required for RdDM in plants are found only in that kingdom (Fig. 3c). Thus, if RdDM occurs regularly in mammals, the mechanism or protein machinery differs from those in plants. SILENCING OF ENDOGENOUS GENES BY RNAI-MEDIATED
TGS
Many transposons and repetitive DNA sequences, such as the 5S rDNA arrays and the transposon-rich heterochromatic knob on Arabidopsis chromosome 4, are transcriptionally silenced and methylated by an RNAi-mediated mechanism (Lippman and Martienssen 2004; Chan et al. 2005). The endogenous DNA targets reflect the natural roles of RNAi-mediated TGS in repressing transposition and in packaging repeats into heterochromatin. However, plant genes containing transposon insertions can themselves become targets of RNAi-mediated silencing and methylation. For example, transposon-derived repeats in the promoter of the Arabidopsis floral gene FWA are targeted for methylation by cognate siRNAs (Lippman and Martienssen 2004), thus silencing the gene in vegetative tissues where it is not required. In some Arabidopsis accessions, a Mu element in an intron of the FIC gene, a repressor of flowering, renders the gene susceptible to repressive chromatin modifications directed by siRNAs originating from dispersed copies of Mu (Liu et al. 2004). The resulting lowered expression of FIC can accelerate flowering time, which might have adaptive significance in certain environments. Since many plant genes have transposon insertions in the vicinity of promoters or in introns, this mode of regulation might be common in the plant kingdom. Indeed, as more is learned about epigenetic regulation, McClintock's idea of transposons acting as elements that control host genes and development is gaining increasing support (McClintock 1956).
186 •
C HAP T E R 9
4 Epigenetic Regulation without RNA Involvement
Despite the specificity provided by small RNAs, they probably do not induce all epigenetic modifications in plants. For example, MOM, a protein with a partial SNF2 domain, has not yet been implicated in RNAi-mediated TGS. There is also no evidence that PcG proteins in plants are directed to their target genes by small RNAs. Other types of signal, such as hom*ologous pairing of non-transcribed repetitive sequences or special sequence compositions, might nucleate heterochromatin formation or attract DNA methyltransferases. The RNAi machinery, for instance, is dispensable for DNA methylation and histone methylation in Neurospora, where TArich segments are preferentially targeted for modification (see Chapter 6). Moreover, there are pathways for heterochromatin formation in fission yeast that are independent of RNAi (see Chapters 6 and 8). An unusual epigenetic phenomenon in plants that has not yet been shown to involve RNAi is paramutation. Paramutation occurs when certain alleles, termed paramutagenic, impose an epigenetic imprint on susceptible (paramutable) alleles. The epigenetic imprint is inherited through meiosis and persists even after the two interacting alleles segregate in progeny. Paramutation represents a violation of Mendel's law, which stipulates that alleles segregate unchanged from a heterozygote. Paramutation was first observed decades ago in maize and tomato, but the mechanism(s) has remained enigmatic. Paramutation-like phenomena have been observed recently in mammals, suggesting that it is not limited to the plant kingdom (for review, see Chandler and Starn 2004). The B locus in maize, one of the most intensively studied cases of paramutation, contains a series of direct repeats almost 100 kb from the transcription start site that mediate paramutation in an unknown manner. Although RNA-based silencing has not been fully ruled out, alternate mechanisms relying on pairing of alleles are still under consideration (for review, see Starn and Mittelsten Scheid 2005). 5 Outlook
In this chapter, we have discussed what is known about basic epigenetic principles in plants and their relationship to epigenetic regulation in other organisms. Plants clearly share a number of features of epigenetic control with other organisms, yet they have also evolved a number of plant-specific variations and innovations. These likely underpin the unique aspects of plant development
and their extraordinary ability to survive and reproduce successfully in unpredictable environments. Prominent among the plant-specific innovations is a built-in system for reversible epigenetic modifications, which likely makes a key contribution to plant developmental plasticity and adaptability. The capacity to induce or erase repressive modifications in nondividing cells-the former through RdDM and histone modifications, and the latter through the activity of DNA glycosylases such as DME and ROS1-allows epigenetic reprogramming without intervening cycles of DNA replication. The facile erasure of epigenetic marks from plant genomes probably accounts for the relative ease of cloning whole plants from single somatic cells. Nevertheless, induction as well as removal of epigenetic marks is likely neither perfect nor uniform throughout an individual, which creates epigenetic variability in populations of supposedly genetically identical cloned plants. Such somaclonal variation can be exploited in plant breeding programs. Similarly, the differential inheritance of epigenetic marks during sexual reproduction can lead to epigenetic variation in natural populations. Selection can act on this variability by fixing specific epialleles that might have adaptive significance. As we have described for the process of vernalization, environmental cues can trigger epigenetic modifications in plants and alter physiological responses. Thus, plants can "learn" if environmentally or stress-induced epigenetic modifications in shoot meristem cells enter the germ line and are adaptive. Defining the full range of conditions under which epigenetic changes are likely to occur spontaneously or are programmed will reveal more about the biological functions of these modifications. Likewise, unraveling the mechanisms of meiotic inheritance of epigenetic marks in plants could eventually permit scientists to manipulate this feature for improvements in horticulture and agriculture. In addition to responding appropriately to environmental stimuli, plants have confronted a variety of genomic challenges during their evolutionary and breeding histories. Polyploidization or hybridization can have a significant impact on epigenetic modifications owing to the still ill-defined process of genome shock, a response to an unusual stress leading to widespread mobilization of transposons (McClintock 1983). The origin of heterosis, the superior performance of hybrids compared to that of inbred parent lines, is still unknown, but it is likely to involve epigenetic alterations triggered by combining two related but distinct
E PIG ENE TIC
genomes. Similarly, polyploidization combines and/or multiplies whole genomes, with innumerable possibilities for epigenetic changes. Learning the epigenetic consequences of polyploidization in plants would also help to understand our own evolutionary history, which is increasingly thought to involve two whole-genome duplications (Furlong and Holland 2004). Clearly, even at this scale of inquiry, plant epigenetics can be informative for human biology, justifying their reputation as "masters of epigenetic regulation." References Adams K.L. and Wendel J.E 2005. Polyploidy and genome evolution in plants. Curro Opin. Plant BioI. 8: 135-141. Alleman M. and Doctor J. 2000. Genomic imprinting in plants: Observations and evolutionary implications. Plant Mol. BioI. 43: 147-161. Allen E., Xie Z., Gustafson A.M., and Carrington J.c. 2005. microRNAdirected phasing during trans-acting siRNA biogenesis in plants. Cell 121: 207-221. Amedeo P., Habu Y, Afsar K., Mittelsten Scheid 0., and Paszkowski J. 2000. Disruption of the plant gene MOM releases transcriptional silencing of methylated genes. Nature 405: 203-206. Autran D., Huanca-Mamani W., and Vielle-Calzada J.P. 2005. Genomic imprinting in plants: The epigenetic version of an Oedipus complex. Curro Opin. Plant BioI. 8: 19-25. Axtell M.J. and Bartel D.P. 2005. Antiquity of microRNAs and their targets in land plants. Plant Cell 17: 1658-1673. Bartel D.P. 2004. MicroRNAs: Genomics, biogenesis, mechanism, and function. Cell 116: 281-297. Baulcombe D.C. 2004. RNA silencing in plants. Nature 431: 356-363. Baumberger N. and Baulcombe D.C. 2005. Arabidopsis ARGONAUTEI is an RNA slicer that selectively recruits microRNAs and short interfering RNAs. Proc. Natl. Acad. Sci. 102: 11928-11933. Baumbusch L.O., Thorstensen 1., Krauss V, Fischer A., Naumann K., Assalkhou R., Schulz 1., Reuter G., and Aalen R.B. 2001. The Arabidopsis thaliana genome contains at least 29 active genes encoding SET domain proteins that can be assigned to four evolutionarily conserved classes. Nucleic Acids Res. 29: 4319-4333. Burch-Smith 1.M., Anderson J.c., Martin G.B., and Dinesh-Kumar S.P. 2004. Applications and advantages of virus-induced gene silencing for gene function studies in plants. Plant f. 39: 734-746. Carmell M.A., Xuan Z., Zhang M.Q., and Hannon G.J. 2002. The Argonaute family: Tentacles that reach into RNAi, developmental control, stem cell maintenance, and tumorigenesis. Genes Dev. 16: 2733-2742. Chan S.W.-L., Henderson l.R., and Jacobsen S.E. 2005. Gardening the genome: DNA methylation in Arabidopsis thaliana. Nat. Rev. Genet. 6: 351-360. Chandler VL., and Stam M. 2004. Chromatin conversations: Mechanisms and implications of paramutation. Nat. Rev. Genet. 5: 532-544. Chandler V.L" Eggleston W.B., and Dorweiler J.E. 2000. Paramutation in maize. Plant Mol. BioI. 43: 121-145. Choi Y, Gehring M., Johnson L., Hannon M., Harada J.J., Goldberg R.B., Jacobsen S.E., and Fischer R.L. 2002. DEMETER, a DNA glycosylase domain protein, is required for endosperm gene imprinting and seed viability in Arabidopsis. Cell 110: 33-42.
REG U L A T ION
I N
P LAN T S
•
187
Cubas P., Vincent c., and Coen E. 1999. An epigenetic mutation responsible for natural variation in floral symmetry. Nature 401: 157-161. Du 1. and Zamore P.D. 2005. microRNAPrimer: The biogenesis and function of microRNA. Development 132: 4645-4652. Elmayan 1., Proux E, and Vaucheret H. 2005. Arabidopsis RPA2: A genetic link among transcriptional gene silencing, DNA repair, and DNA replication. Curro BioI. 15: 1919-1925. FedoroffN.V and Chandler V 1994. Inactivation of maize transposable elements. In hom*ologous recombination and gene silencing in plants (ed J. Paszkowski), pp. 349-385. KJuwer Academic Publishers, Dordrecht, The Netherlands. Floyd S.K. and Bowman J.L. 2004. Gene regulation: Ancient microRNA target sequences in plants. Nature 428: 485-486. Furlong R.E and Holland P.W. 2004. Polyploidy in vertebrate ancestry: Ohno and beyond. BioI. ]. Linnean Soc. 82: 425-430. Gasciolli V, Mallory A.C., Bartel D.P., and Vaucheret H. 2005. Partially redundant functions of Arabidopsis DICER-like enzymes and a role for DCL4 in producing trans-acting siRNAs. Curro BioI. 15: 1494-1500. Gaudin V, Libault M., Pouteau S., Juul T., Zhao G., Lefebvre D., and Grandjean O. 2001. Mutations in LIKE HETEROCHROMATIN PROTEIN 1 affect flowering time and plant architecture in Arabidopsis. Development 128: 4847-4858. Gong Z., Morales-Ruiz 1., Ariza R.R., Roldan-Arjona 1., David L., and Zhu J.K. 2002. ROSl, a repressor of transcriptional gene silencing in Arabidopsis, encodes a DNA glycosylase/lyase. Cell Ill: 803-814. Han M.H., Goud S., Song L., and Fedoroff N. 2004. The Arabidopsis double-stranded RNA-binding protein HYLl plays a role in microRNA-mediated gene regulation. Proc. Natl. Acad. Sci. 101: 1093-1098. Heitz E. 1928. Das Heterochromatin der Moose. 1. ]ahrb. Wiss. Bot. 69: 762-818. Hennig L., Bouveret R., and Gruissem W. 2005. MSIl-like proteins: An escort service for chromatin assembly and remodeling complexes. Trends Cell BioI. 15: 295-302. Herr A.J., Jensen M.B., Dalmay 1., and Baulcombe D.C. 2005. RNA polymerase IV directs silencing of endogenous DNA. Science 308: 118-120. Hsieh 1.-E and Fischer R.L. 2005. Biology of chromatin dynamics. Annu. Rev. Plant BioI. 56: 327-351. Hung M.-S. and Shen C.-K. 2003. Eukaryotic methyl-CpG-binding domain proteins and chromatin modification. Eukaryot. Cell 2: 841-846. Jackson J.P., Lindroth A.M., Cao X., and Jacobsen S.E. 2002. Control of CpNpG DNA methylation by the KRYPTONITE histone H3 methyltransferase. Nature 416: 556-556. Jeddeloh J.A., Stokes 1.L., and Richards E.]. 1999. Maintenance of genomic methylation requires a SWI2/SNFs-like protein. Nat. Genet. 22: 94-97. Jorgensen R.A. 2003. Sense cosuppression in plants: Past, present, and future. In RNAi: A guide to gene silencing (ed. G.]. Hannon), pp. 5-22. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York. Kaeppler S.M., Kaeppler H.E, and Rhee Y 2000. Epigenetic aspects of somacionaI variation in plants. Plant Mol. BioI. 43: 59-68. Kanno 1., Mette M.E, Kreil D.P., Aufsatz W., Matzke M., and Matzke A.J. 2004. Involvement of putative SNF2 chromatin remodeling protein DRD1 in RNA-directed DNA methylation. Curro BioI. 14: 801-805. Kanno 1., Huettel B., Mette M.E, Aufsatz W., ]aligot E., Daxinger L., Kreil D.P., Matzke M., and Matzke A.J.M. 2005. Atypical RNA poly-
188 •
C HAP T E R
9
me rase subunits required for RNA-directed DNA methylation. Nat. Genet. 37: 761-765. Kapoor A., Agius E, and Zhu J.K. 2005a. Preventing transcriptional gene silencing by active DNA demethylation. FEBS Lett. 579: 5889-5898. Kapoor A., Agarwal M., Andreucci A., Zheng X., Gong Z., Hasegawa P.M., Bressan R.A., and Zhu J.K. 2005b. Mutations in a conserved replication protein suppress transcriptional gene silencing in a DNA-methylation-independent manner in Arabidopsis. Curro BioI. 15: 1912-1918. Kawasaki H., Taira K., and Morris K.V. 2005. siRNA induced transcriptional gene silencing in mammalian cells. Cell Cycle 4: 442-448. Kidner C.A. and Martienssen R.A. 2005. The developmental role of microRNA in plants. Curro Opin. Plant Bioi. 8: 1-7. Kim V.N. 2005. MicroRNA biogenesis: Coordinated cropping and dicing. Nat. Rev. Genet. 6: 376-385. Kohler c., Page D.R., Gagliardini v., and Grossniklaus U. 2005. The Arabidopsis thaliana MEDEA polycomb group protein controls expression of PHERESI by parental imprinting. Nat. Genet. 37: 28-30. Kotake T., Takada S., Nakahigashi K., Ohto M., and Goto K. 2003. Arabidopsis TERMINAL FLOWER 2 gene encodes a heterochromatin protein 1 hom*olog and represses both FLOWERING LOCUS T to regulate flowering time and several floral homeotic genes. Plant Cell Physiol. 44: 555-564. Kress c., Thomassin H., and Grange T. 2001. Local DNA demethylation in vertebrates: How could it be performed and targeted? FEBS Lett. 494: 135-140. Lecellier C.-H., Dunoyer P., Arar K., Lehmann-Che J., Eyquem S., Himber c., Saib A., and Voinnet O. 2005. A cellular microRNA mediates antiviral defense in human cells. Science 308: 557-560. Li J., Yang Z., Yu B., Liu J., and Chen X. 2005. Methylation protects miRNAs and siRNAs from 3' -end uridylation activity in Arabidopsis. Curro BioI. 15: 1501-1507. Lippman Z. and Martienssen R. 2004. The role of RNA interference in heterochromatic silencing. Nature 431: 364-370. Liu J., He Y., Amasino R., and Chen X. 2004. siRNAs targeting an intronic transposon in the regulation of natural flowering behavior in Arabidopsis. Genes Dev. 18: 2873-2878. Loidl P. 2004. A plant dialect of the histone language. Trends Plant Sci. 9: 84-90. Malagnac E, Bartee L., and Bender J. 2002. An Arabidopsis SET domain protein required for maintenance but not establishment of DNA methylation. EMBO f. 21: 6842-6852. Mathieu O. and Bender J. 2004. RNA-directed DNA methylation. f. Cell Sci. 117: 4881-4888. Matzke M.A. and Birchler J.A. 2005. RNAi-mediated pathways in the nucleus. Nat. Rev. Genet. 6: 24-35. Matzke M. and Matzke A.J.M. 2004. Planting the seeds of a new paradigm. PLoS BioI. 2: 528-586. Matzke M., Kanno T., Huettel B., Jaligot E., Mette M.E, Kreil D.P., Daxinger L., Rovina P., Aufsatz W., and Matzke A.J.M. 2005. RNA-directed DNA methylation. In Plant epigenetics (ed. P. Meyer), pp. 69-96. Blackwell Publishing, Oxford, United Kingdom. McClintock B. 1956. Intranuclear systems controlling gene action and mutation. Brookhaven Symp. BioI. 8: 58-74. - - - . 1983. The significance of responses of the genome to challenge. Nobel lecture. http://nobelprize.org/medicine/laureates/ 1983/mcclintock-lecture.pdf Meins E, Si-Ammour A., and Blevins T. 2005. RNA silencing systems
and their relevance to plant development. Annu. Rev. Cell Dev. BioI. 21: 297-318. Naumann K., Fischer A., Hofmann 1., Krauss v., Phalke S., Irmler K., Hause G., Aurich A.-C., Dorn R., Jenuwein T., and Reuter G. 2005. Pivotal role of AtSUVH2 in heterochromatic histone methylation and gene silencing in Arabidopsis. EMBO f. 24: 1418-1429. Onodera Y., Haag J.R., Ream T., Nunes P.c., Pontes 0., and Pikaard C. 2005. Plant nuclear RNA polymerase IV mediates siRNA and DNA methylation-dependent heterochromatin formation. Cell 120: 613-622. Pandey R., Muller A., Napoli C.A., Selinger D.A., Pikaard C.S., Richards E.J., Bender J., Mount D.W., and Jorgensen R.A. 2002. Analysis of histone acetyltransferase and histone deacetylase families of Arabidopsis thaliana suggests functional diversification of chromatin modification among multicellular eukaryotes. Nucl. Acids Res. 30: 5036-5055. Park M.Y., Wu G., Gonzalez-Sulser A., Vaucheret H., and Poethig R.S. 2005. Nuclear processing and export of microRNAs in Arabidopsis. Proc. Natl. Acad. Sci. 102: 3691-3696. Peregrine A., Yoshikawa M., Wu G., Albrecht H.L., and Poethig R.S. 2004. SGS3 and SGS2/SDElIRDR6 are required for juvenile development and the production of trans-acting siRNAs in Arabidopsis. Genes. Dev. 18: 2368-2379. Pikaard C.S. 2000. The epigenetics of nucleolar dominance. Trends Genet. 16: 495-500. Rocha P.S., Sheikh M., Melchiorre R., fa*gard M., Boutet S., Loach R., Moffatt B., Wagner c., Vaucheret H., and Furner 1. 2005. The Arabidopsis hom*oLOGY-DEPENDENT GENE SILENCINGl gene codes for an S-adenosyl-L-hom*ocysteine hydrolase required for DNA methylation-dependent silencing. Plant Cell 17: 404-417. Schubert D., Clarenz 0., and Goodrich J. 2005. Epigenetic control of plant development by Polycomb-group proteins. Curro Opin. Plant BioI. 8: 553-561. Senda M., Masuta c., Ohnishi S., Goto K., Kasai A., Sano T., Hong J.S., and MacFarlane S. 2004. Patterning of virus-infected Glycine max seed coat is associated with suppression of endogenous silencing of chalcone synthase genes. Plant Cell 16: 807-818. Souret EE, Kastenmayer J.P., and Green P.J. 2004. AtXRN4 degrades mRNA in Arabidopsis and its substrates include selected miRNA targets. Mol. Cell 15: 173-183. Springer N.M. and Kaeppler S.M. 2005. Evolutionary divergence of monocot and dicot methyl-CpG-binding domain proteins. Plant Physiol. 138: 92-104. Springer N.M., Napoli C.A., Selinger D.A., Pandey R., Cone K.C., Chandler v.L., Kaeppler H.E, and Kaeppler S.M. 2003. Comparative analysis of SET domain proteins in maize and Arabidopsis reveals multiple duplications preceding the divergence of monocots and dicots. Plant Physiol. 132: 907-925. Starn M. and Mittelsten Scheid O. 2005. Paramutation: An encounter leaving a lasting impression. Trends Plant Sci. 10: 283-290. Takeda S., Tadele Z., Hofmann I., Probst A.V., Angelis K.J., Kaya H., Araki T., Mengiste T., Mittelsten Scheid 0., Shibahara K., et al. 2004. BRUl, a novel link between responses to DNA damage and epigenetic gene silencing in Arabidopsis. Genes Dev. 18: 782-793. Tariq M. and Paszkowski J. 2004. DNA and histone methylation in plants. Trends Genet. 20: 244-251. Ting A.H., Schuebel K.E., Herman J.G., and Baylin S.B. 2005. Short double-stranded RNA induces transcriptional gene silencing in human cancer cells in the absence of DNA methylation. Nat. Genet. 37: 906-910. Vaucheret H., Vazquez E, Crete P., and Bartel D.P. 2004. The action of
E P ! G ENE TIC
ARGONAUTEl in the miRNA pathway and its regulation by the miRNA pathway are crucial for plant development. Genes Dev. 18: 1187-1197. Vazquez E, Gasciolli V., Crete P., and Vaucheret H. 2004a. The nuclear dsRNA binding protein HYLl is required for microRNA accumulation and plant development, but not posttranscriptional transgene silencing. Curro BioI. 14: 346-345. Vazquez E, Vaucheret H., Rajagopalan R., Lepers e., Gasciolli v., Mallory A.C., Hilbert J.-L., Bartel D.P., and Crete P. 2004b. Endogenous trans-acting siRNAs regulate the accum ulation of Arabidopsis mRNAs. Mol. Cell 16: 69-79. Voinnet O. 2005. Induction and suppression of RNA silencing: Insights from viral infections. Nat. Rev. Genet. 6: 206-220. Wagner D. 2003. Chromatin regulation of plant development. Curro Opin. Plant BioI. 6: 20-28. Wang X.J., Gaasterland T., and Chua N.H. 2005. Genome-wide prediction and identification of cis-natural antisense transcripts in Arabidopsis thaliana. Genome BioI. 6: R30. Wassenegger M., Heimes S., Riedel 1., and Sanger H.L. 1994. RNA-
REG U L A T ION
I N
P LAN T 5
•
189
directed de novo methylation of genomic sequences in plants. Cell 76: 567-576. Xie Z., Johansen L.K., Gustafson A.M., Kasschau K.D., Lellis A.D., Zilberman D., Jacobsen S.E., and Carrington J.e. 2004. Genetic and functional diversification of small RNA pathways in plants. PLoS BioI. 2: EI04. Yoo B.-e., Kragler E, Varkonyi-Gasic E., Haywood V., Archer-Evans S., Lee Y.M., Lough T.J., and Lucas W.J. 2004. A systemic small RNA signalling system in plants. Plant Cell 16: 1979-2000.
WWW Resources http://asrp.cgrb.oregonstate.edu. Arabidopsis thaliana small RNA project http://mpss.dbi.udel.edu. MPSS (Massively parallel signature sequencing) http://www.arabidopsis.org/abrc. Arabidopsis Biological Resource Center Stocks http://www.chromdb.org. Plant Chromatin Database
c
HAP
ON
T
E
10
R
OFF
Chromatin Modifications and Their Mechanism of Action Tony Kouzarides' and Shelley L. Berger I
The Gurdon Institute, University of Cambridge, Cambridge, United Kingdom 'The Wistar Institute, Philadelphia, Pennsylvania 19104
CONTENTS 1. Histones and Acetylation Are Regulatory to Transcription, 193
6. Ubiquitylation/Deubiquitylation and Sumoylation, 203
2. Acetylation and Deacetylation, 194
7. Themes in Modifications, 204
3. Phosphorylation, 196 4. Methylation, 197 4.7
Methylation of Lysines, 797
4.2
Demethylation of Lysines, 207
4.3
Methylation of Arginines, 207
5. Deimination, 202
7.7
Histone Code, 204
7.2
Modification Patterns, 204
7.3
Changes in Chromatin Structure Associated with Transcription Activation and Elongation, 205
8. Concluding Remarks, 206 References, 206
191
GENERAL SUMMARY Histones are the building blocks of nucleosomes, making an octameric structure that packages DNA in eukaryotes forming a structure known as chromatin. Chromatin is not a uniform structure, however, and in recent years, an explosion in our knowledge of the variations in chromatin structure has occurred. This, in turn, has enhanced our understanding of the mechanisms that regulate genome templated processes, the posttranslational modifications of histone proteins (a central feature of this genomic regulation). There are, in fact, a large number of histone posttranslational modifications (HPTMs), and they divide into two groups. First, there are the small chemical groups, including acetylation, phosphorylation, and methylation. Second, there are the much larger peptides, including ubiquitylation and sumoylation. How are HPTMs thought to affect genome regulation and function? Three mechanisms are commonly considered, and it is helpful to keep these mechanisms in mind as the wealth of information and history of HPTMs is presented in this chapter. First, HPTMs may somehow affect the structure of chromatin; for example, by preventing crucial contacts that facilitate certain chromatin conformations or higher-order structures (which can be considered as cis-modifying effects). In contrast, two other mechanisms are considered to operate in trans. HPTMs
may disrupt the binding of proteins that associate with chromatin or histones. Alternatively, HPTMs may provide altered binding surfaces that attract certain effector proteins. This third mechanism has been characterized in the most detail, and such recruitment of proteins is defining with regard to the functional consequence: That is, it may have an activating or repressive outcome on transcription. The large number of HPTMs that have been discovered and their various combinations have led to the idea that HPTMs regulate via combinatorial patterns, in temporal sequences, and can be established over short- and long-range distances. These varied mechanisms establish different functional outcomes-some transient, others stable and epigenetically heritable. It was during the 1960s that Vincent Allfrey identified acetylation, methylation, and phosphorylation of histones purified from many eukaryotes. Histones were also the first recognized ubiquitylated protein substrates. However, although Allfrey observed certain correlations between modifications and transcriptionally active chromatin sources, genetic and functional evidence to support a role for HPTMs in gene regulation did not emerge until much later. In fact, many scientists studying the biochemistry and genetics of gene regulation during the 1980s and 1990s were skeptical that HPTMs had a causal role in gene regulation.
C H ROM A TIN
MOD I Fie A T ION 5
1 Histones and Acetylation Are Regulatory to Transcription As shown in this chapter, histones are subject to many different posttranslational modifications (PTMs), and time will undoubtedly reveal new, as-yet-unknown HPTMs. The known modifications can be categorized as either small chemical groups discussed in Sections 2-4 of this chapter, or as larger peptide changes to histones as discussed in Section 5 (see Table 1). The mechanism by which HPTMs affect the chromatin template and related processes such as gene transcription or repression are considered in the context of three conceptual models illustrated in Figure 1. Model 1 proposes that posttranslationally modified histones may, in some way, alter chromatin structure. In Model 2, an HPTM may inhibit the binding of a factor to the chromatin template, whereas Model 3 proposes that an HPTM creates a binding site for a particular protein (see also Section 5 of Chapter 3). From a historical perspective, what changed the mainstream view that chromatin was largely inert packing material for DNA? Early evidence that HPTMs regulated transcriptional activation and silencing came from experiments in Saccharomyces cerevisiae during the late 1980s. This budding yeast provides an efficient organism to carry out genetic experiments (both forward and reverse genetics) to examine the importance of histones. The reason is that, unlike higher eukaryotes where there are multiple copies of each histone gene, the single-copy yeast
Table 1. Types of covalent histone posttranslational modifications Role in transcription
Histone-modified sites
Acetylation
activation
H3 (K9,K14,K18,K56) H4 (K5,K8K12,K16) H2A H2B (K6,K7,K16,K17)
Phosphorylation
activation
H3 (510)
Methylation
activation
H3 (K4,K36,K79)
repression
H3 (K9,K27) H4 (K20)
activation
H2B (K123)
repression
H2A (Kl19)
repression
H3 (7) H4 (K5,K8,K12,K16) H2A (K126) H2B (K6,K7,K16,K17)
GROUP 1
GROUP 2 Ubiquityiation
5umoylation
HPTMs are categorized into two groups: Group 1 represents small chemical group modifications, whereas Group 2 includes larger chemical modifications.
AND
THE I R
M E C HAN ISM
0 F ACT ION
,.
193
Model 1: Chromatin structural change
.• ~~~
---
Model 2: Inhibit binding of negative-acting factor
C
Model 3: Recruit positive-acting factor
CF
mod
binder mod
~
Figure 1. Models Showing How Histone Posttranslational Modifications Affect the Chromatin Template
Modell proposes that changes to chromatin structure are mediated by the cis effects of covalent histone modifications, such as histone acetylation or phosphorylation. Model 2 illustrates the inhibitory effect of an HPTM for the binding of a chromatin-associated factor (CF), as exemplified by H351 0 phosphorylation occluding HP1 binding at methylated H3K9. In Model 3, an HPTM may provide binding specificity for a chromatin-associated factor. A classic example is HP1 binding through its chromodomain to methylated H3K9.
histone genes can easily be genetically manipulated. For instance, in a background where all the histone genes have been deleted, a copy of each gene can be introduced, encoded on an episome that carries a selectable marker, such as the URA3 gene, to maintain the episome. A second copy of the histones can be introduced on a second episome, carrying a different selectable marker. This second copy can be mutated by site-directed mutagenesis, and then the first, wild-type copy can be selectively lost from the cell using the 5-FOA (5-fluoroorotic acid) drug, which causes the URA3 gene product to be toxic to the cell. The end result is that the only copy present in the cell is the altered second episomal copy, which contains any number of mutations to be tested. In S. cerevisiae, the histone genes are arranged in pairs of H3/H4 and H2A/H2B, and their transcription is highly coordinated to coincide with S phase. Each nucleosome is assembled from an H3/H4 tetramer and two dimers of H2A/H2B; when one pair of either duo is under- or overtranscribed, nucleosomes are depleted. This alteration of histone dosage by genetic means provided some of the initial evidence that chromatin structure is crucial for regulating expression. One such
194
C HAP T E R 7 0
approach utilized "forward" genetics, where random mutations were selected that result in gene activation of a marker (Clark-Adams et al. 1988). These mutations were found to alter the amount of histone pairs. A second approach used "reverse" genetics, where directed depletion of histone genes provided clear evidence that histones regulate gene transcription (Han and Grunstein 1988). The next step was to direct deletion of only the histone amino-terminal tails (the sites of many HPTM) or to carry out substitution mutations of acetylation sites in histones. These more surgical changes also caused decreases in gene activation, suggesting that acetylation is required for gene transcription (Durrin et al. 1991). Other approaches investigated whether nucleosomes are naturally altered during gene activation. Biochemical experiments had shown that nucleosomes were repressive to transcription on DNA templates in vitro (Workman and Roeder 1987), but whether this was true in vivo was under debate. Some promoters have naturally positioned nucleosomes upstream of transcriptional start sites, and these positioned nucleosomes became altered when the gene was activated (Svaren et al. 1994; Shim et al. 1998). In the case of PROS, nucleosome alteration required an activator, showing that without transcription the nucleosomes were not changed. However, it was unclear whether this alteration was a cause or an effect of transcription. To address this, the TATA box was deleted, which abrogated transcription in yeast. Nucleosome position nevertheless changed, strongly suggesting that the alteration of nucleosomes preceded transcription. Taken together, these experiments began to provide strong evidence that both nucleosome repositioning and acetylation of specific residues within the histone tails may be required for transcriptional activation. 2 Acetylation and Deacetylation
Additional experimental approaches continued to provide evidence that acetylation (versus the absence of acetylation) correlates with transcription. Regions that are transcriptionally active, or are poised for transcription, tend to have an "open" chromatin configuration and therefore are accessible to enzymes such as DNase and MNase, which, when added to isolated but intact nuclei, can digest DNA. In the early 1990s, researchers began to use chromatin immunoprecipitation (ChIP), a powerful technique for analyzing what proteins are bound to particular DNA sequences in vivo. This involves cross-linking proteins that are bound to DNA using a cell-permeable chemical such as formaldehyde, followed by sonication to break up the
DNA:protein complexes into smaller fragments. The DNA:protein complexes of interest are then immunoprecipitated using a specific antibody as a probe. The crosslinks are then reversed in order to isolate and identify the DNA sequences that associated with the antibody-bound protein, by analysis using either radioactively labeled DNA probes or PCR. One group used this method to investigate the correlation between DNA sites around the active globin genes that are hypersensitive to DNase digestion and associate with acetylated histones in chicken erythrocytes; the correlation was remarkably close (Hebbes et al. 1994). In S. cerevisiae, similar approaches were employed within transcriptionally silenced regions of the genome, and they showed very low levels of histone acetylation (Braunstein et al. 1993). Conversely, genetic disruption of silencing correlated with increased acetylation. All of these experiments were slowly revealing that histones and, in particular, sites of reversible acetylation play a role in gene regulation. However, it was not until the mid-1990s that the first nuclear histone acetylation and deacetylation enzymes were identified, and these provided the "smoking gun"-the most direct evidence that these enzymes playa role during transcription. The first nuclear histone acetyltransferase (HAT) was isolated from the socalled macronucleus (the very large transcriptionally active nucleus, as distinct from the meiotic micronucleus) from Tetrahymena, which has high transcription rates (Brownell et al. 1996). The key approach was the "in-gel" assay to detect HAT activity: A complex mixture of proteins from cell extracts was separated on a histone-permeated SDS gel, the peptides were then subjected to renaturation, and proteins with HAT enzymatic activity labeled the gel by the transfer of radiolabeled cofactor, acetyl coenzyme A, onto localized histones. This allowed further biochemical fractionation and purification of the polypeptide. The HAT enzyme that was identified was hom*ologous to a previously isolated transcriptional coactivator in S. cerevisiae, called GenS, known to interact with transcriptional activators. Contemporaneously, the first histone deacetylase (HDAC) enzyme was isolated via biochemical purification (Taunton et al. 1996). In this case, the enzyme was purified from cell extracts using an inhibitor bound to an insoluble matrix, which physically bound to the catalytic site of the enzyme. The enzyme was hom*ologous to a previously isolated gene, which has a cofactor role in gene repression. These remarkable parallel findings for the first enzymes found to metabolize acetyl groups on histones led to a model that is now the paradigm for gene-specific histone PTMs: DNA-bound activators recruit HATs to acetylate nucleosomal histones, while
C H ROM A TIN
MOD I Fie A T ION 5
repressors recruit HDACs to deacetylate histones. These changes lead to alterations of the nucleosome and up- or down-regulation of the gene, respectively (Fig. 2). Many other well-known coactivators and corepressors were shown to possess HAT or HDAC activity, or to associate with such enzymes (Sterner and Berger 2000; Roth et al. 2001). Moreover, the enzymatic activities of the HATs and HDACs are critical for their role in gene activation and repression. The enzymes are often components of large complexes that are modular in structure and function; histone-modifying enzymatic activity is just one function, and others include, for instance, the recruitment of the TATA-binding protein (TBP) (Grant et al. 1998). Interestingly, certain nuclear hormone receptors function both as DNA-binding transcriptional repressors (when not bound to hormone ligand) and as transcriptional activators (when bound to hormone ligand); the receptors do this partly through the PTM of target chromatin regions, by recruiting HDACs when unliganded and HATs when liganded (Baek and Rosenfeld 2004). HAT proteins can acetylate lysine residues on all four core histones, but different enzymes possess distinct specificities in their substrate of choice (Fig. 3; Table 1), although each enzyme rarely targets just a single site. One major HAT family-GNAT (for GcnS related acetyltrans-
Gene activator recruits histone acetyltransferase
Gene repressor recruits histone deacetylase
Figure 2. Histone-modifying Enzymes Are Recruited to Promoters by DNA-binding Transcription Factors Histone acetyltransferases (HAT) are recruited by activators that bind to specific upstream activating sequences (UAS). This enzyme catalyzes the acetylation of local histones, known to contribute to transcriptional activation. Histone deacetylases (HDAC) are recruited by repressors of transcription that bind to upstream repressive sequences (URS) and deacetylate local histones. This contributes to transcriptional repression.
AND
THE I R
ME C HAN ISM
0 F ACT ION
195
ferase)-targets histone H3 as its main substrate. A second major HAT family, the MYST family, targets histone H4 as its main substrate. A third major familyCBP/p300-targets both H3 and H4, and is the most promiscuous. Structural analyses have been carried out for the catalytic domains of the first two major families (GNAT and MYST), and they are distinct; the structure of the CBP Ip300 family has not yet been solved. Incidentally, each of these acetyltransferase families is also able to acetylate non-histone substrates (Glozak et al. 2005). As discussed above, there are three models for the role of HPTMs in regulating chromatin structure (Fig. 1). The first model considers structural changes to chromatin induced by the direct effects of HPTMs, such as changes in charge. In this case, the neutralization of positively charged lysine by acetylation reduces the strength of binding of the strongly basic histones or histone tails to negatively charged DNA, and thus opens DNA-binding sites (VetteseDadey et al. 1996). Still focusing on the first model, there is also evidence that acetylation can decompact nucleosome arrays, consistent with a role in opening chromatin for gene activation (Shogren-Knaak et al. 2006). The third model proposes that HPTMs provide a binding surface for proteins to associate with chromatin and regulate DNAtemplated processes; this was first shown for acetylation. A specialized protein domain called a bromodomain, commonly found in chromatin-associated proteins, specifically binds to acetylated lysines (Fig. 3) (Dhalluin et al. 1999). Bromodomains are present in many HATs, such as GenS and CBP/p300. Proteins with this motif, when part-oflarge chromatin-associating/altering complexes such as the ATPdependent remodeling complex, SwilSnf, promote its binding to chromatin (Hassan et al. 2002). Other examples of proteins containing bromodomains with binding specificity to acetylated histone include Tafl and Bdfl in the TFIID complex, Rsc4 in the Rsc remodeling complex, and Brd2 in a large family of bromodomain proteins. There are numerous HDAC enzymes that remove acetyl groups (Kurdistani and Grunstein 2003; Yang and Seto 2003). They fall into three catalytic groups, which are conserved through evolution from S. cerevisiae to mammals, and referred to as Type I, Type II, and Type III or Sir2-related enzymes. Type I and Type II have a related mechanism of deacetylation, which does not involve a cofactor, whereas the Sir2-related enzymes require the cofactor NAD as part of their catalytic mechanism. The structures of representatives for all three families have been solved. Many of the HDACs are found within large multisubunit complexes, components of which serve to target the enzymes to genes, leading to transcriptional
196 •
C HAP T E R
70
." Swil
\ Histone
Sot
~H~3~~~....J....-1
Histone H4
Figure 3. Characterized Sites of Histone Acetylation Histones are mostly acetylated at lysine residues located in the amino termini of H3 and H4, with the exception of H3K56 localized in the globular domain. The proteins that express binding specificity to acetylated histones are shown.
repression. For example, Rpd3 is part of a large complex including the HDAC Sin3, which interacts with DNAbound repressors (Kurdistani and Grunstein 2003; Yang and Seto 2003). Rpd3 is also part of a small complex, which is targeted to gene open reading frames (ORFs) via chromodomain association with H3K36me (see Section 4 for further discussion of chromodomains). This results in histone deacetylation, in part to suppress internal RNA polymerase II (pol II) initiation, and also to regulate different steps during the transcription cycle (Carrozza et al. 2005; Joshi and Struhl 2005).
3 Phosphorylation Phosphorylation is the most well known PTM because it has long been understood that kinases regulate signal transduction from the cell surface, through the cytoplasm, and into the nucleus, leading to changes in gene expression. Histones were among the first proteins found to be phosphorylated. By 1991, it was discovered that when cells were stimulated to proliferate, the so-called "immediate-early" genes were induced to become transcriptionally active and to function in stimulating the cell cycle. This increased gene expression correlates with histone H3 phosphorylation (Mahadevan et al. 1991). The histone H3 Serine 10 residue (H3SlO) has turned out to be an important phosphorylation site for transcription from yeast to humans, and appears to be especially important in Drosophila (Nowak and Corces 2004). Many kinases have been identified that target this site, including Mskl/2 and the related Rsk2 in
mammals, and SNFI in s. cerevisiae (Sassone-Corsi et al. 1999; Lo et al. 2001; Soloaga et al. 2003). Studies of linker histone HI in Tetrahymena have revealed that phosphorylation of this histone may also affect transcriptional control. Perhaps counterintuitively, phosphorylation of certain residues correlates with chromosome condensation, during both mitosis and meiosis. It is unclear how phosphorylation contributes to the process, but recently, H3S10 phosphorylation acts like a temporal switch, ejecting HPI bound to the adjacently methylated H3K9 residue, referred to as the methyl-phos binary switch (FiscWe et al. 2005; Hirota et al. 2005). It remains to be seen whether this, perhaps in concert with the phosphorylation of H3S28 and H3Tl1, may effect chromatin condensation by recruiting the condensin complex and the mitotic spindle (Nowak and Corces 2004). Less is known about the precise mechanistic role of histone phosphorylation. There is evidence to support all three models for the role of HPTMs. First, histone phosphorylation alters chromatin compaction in vivo (Model 1). Indeed, work in Tetrahymena demonstrated that the collective negative "charge patch" resulting from phosphorylation of clusters of nearby residues within linker histone HI affects the affinity of its binding to DNA, positively increasing the transcriptional potential of the local chromatin environment (Dou and Gorovsky 2002). In support of Model 2, proteins bound to chromatin can be dislodged by phosphorylation. This was recently demonstrated by the lowered binding affinity of HPI during mitosis subsequent to mitosis-specific H3SlO phosphory-
C H ROM A TIN
MOD I Fie A T ION 5
lation (FiscWe et a1. 2005; Hirota et a1. 2005). In support of Model 3, the 14-3-3 adapter protein, a known phospho-binding protein, recognizes H3S10ph at promoters of inducible genes (Macdonald et a1. 2005). 4 Methylation
Methylation as a histone covalent modification is more complex than any other, since it can occur on either lysines or arginines. Additionally, unlike any other modification in Group 1, the consequence of methylation can be either positive or negative toward transcriptional expression, depending on the position of the residue within the histone (Table l). A further level of complexity lies in the fact that there can be multiple methylated states on each residue. Lysines can be mono- (mel), di- (me2), or tri(me3) methylated, whereas arginines can be mono- (mel) or di- (me2) methylated. Given that there are at least 24 identified sites of lysine and arginine methylation on H3, H4, H2A, and H2B, the number of distinct nucleosomal methylated states is enormous. Such combinational potential of methylated nucleosomes may be necessary, at least partly, to allow for the regulation of complex and dynamic processes such as transcription, which requires sequential and precisely timed events (Jenuwein and Allis 2001; y. Zhang and Reinberg 2001; Lee et a1. 2005; Martin and Zhang 2005; Wysocka et al. 2006a).
TRANSCRIPTION REGULATION Silent heterochromatin Transcriptional
activation ' "
AND
THE I R
ME C HAN ISM
0 F ACT ION
197
4.1 Methylation of Lysines The fact that lysine residues within histones are methylated has been known for many decades. The biological significance of this modification has only come to light recently, however, following the identification of the first lysine methyltransferase that uses histones as its substrate (Rea et a1. 2000). Now, more histone lysine methyltransferases (HKMTs) have been identified, and their sites of modification on histones are defined (Martin and Zhang 2005). All of these enzymes, except Dot 1, share the SET domain, which contains the catalytically active site and allows binding to the S-adenosyl-Lmethionine cofactor. Of the many known methylated sites, six have been well characterized to date: five on H3 (K4, K9, K27, K36, K79) and one on H4 (K20). Methylation at H3K4, H3K36, and H3K79 has, in general, been linked to activation of transcription, and the rest to repression (Table 1). In addition, two of these sites-H3K79me and H4K20me-have been implicated in the process of DNA repair. Specific protein binders have been identified that recognize each of the six characterized methylation sites (Fig. 4). These proteins have one of three distinct types of methyl lysine recognition domains: the chromo, tudor, and PHD repeat domains. Below, each of these characterized modifications is discussed in more detail.
DNA REPAIR
Transcriptional elongation
Transcriptional / memory
1 PC
Histone
HI~3~:--~K~9~-""'~ Figure 4. Sites of Histone Methylation, Their Protein Binders, and Functional Role in Genomic Processes Methylation of histones occurs at lysine residues in histones H3 and H4. Certain methylated lysine residues are associated with activating transcription (green Me flag), whereas others are involved in repressive processes (red Me flag). Proteins that bind particular methylated lysine residues are indicated.
198 • C HAP T E R 7
H3K4
a
METHYLATION
Methylation of H3K4 is associated with euchromatin and, specifically, with genes that are active or destined to be so. The demonstration that H3K4 methylation correlates with active chromatin came from analysis of the chicken ~-glo bin locus and the budding yeast mating-type loci (Litt et al. 2001; Noma et al. 2001). ChIPs using antibodies specific for methylated H3K4 indicated that islands of the modified histones track active genes. Subsequent work in yeast established that different methyl states are important for activity and that the trimethyl state (H3K4me3) appears during the process of active transcription (Santos-Rosa et al. 2002). H3K4me3 is observed at the 5' ends of genes in yeast during activation of transcription. Three components of the transcriptional machinery are thought to be responsible for this mark. First, RNA pol II that has been phosphorylated at Ser-5 of the carboxy-terminal domain (CTD) can recruit the Setl HKMT that methylates H3K4 in the vicinity of promoters (Fig. 5). Such phosphorylation normally releases RNA pol II from the transcription initiation complex into an early elongating complex (often referred to as promoter clearance or escape). The second component that recruits H3K4me3 is the PAF complex, which regulates different steps of RNA metabolism and also interacts with Setl. The third component important for the establishment of H3K4me3 is monoubiquitylation of H2B at Lys-l23 (H2BK123ub1, or H2BK120ubl in humans, discussed further in Section 6). What remains unclear is what transcriptional elongation processes H3K4me3 controls (Hampsey and Reinberg 2003); however, factors that bind specifically to methylated H3K4me3 are beginning to reveal its role. Mechanistically, H3K4 methylation can lead to the recruitment of specific factors such as the CHD 1 protein, shown to bind to H3K4me2 and me3 (Fig. 4), and the
NURF complex, known to mobilize nucleosomes at active genes in Drosophila. The domains that mediate association with methylated H3K4 are a tandem set of chromodomains in Chd1 (Sims et al. 2006) and a PHD finger within NURF (Li et al. 2006). Other proteins recruited by H3K4 methylation include the ISWI ATPase, which binds indirectly via other protein(s). Conversely, there is evidence that the mammalian NuRD repressor complex no longer binds to methylated H3K4 tails (D.Y. Lee et al. 2005; Martin and Zhang 2005). Methylation at H3K4 seems to communicate with other modifications. For instance, methylation of H3K9 by the SUV39H HKMT is prevented in vitro if H3K4 is methylated and H3S10 is phosphorylated. This may well be a way to occlude the repressive H3K9 modification on actively transcribed genes. In a more elaborate "trans-tail" form of communication, the mono ubiquitylation of H2BK123 affects levels of H3K4me3. How this comes about is unclear, but one suggestion is that the Setl complex cannot tri-methylate H3K4 unless the nucleosome(s) is in a certain conformational state defined by ubiquitylation of H2B (Zhang and Reinberg 2001; D.Y. Lee et al. 2005) . The Setl/MLL/ALLl/HRX protein, which is the human hom*olog of Setl, can be recruited to HOX gene promoters. A distinct H3K4 HKMT, SMYD3, has been linked to transcriptional activation. Methylation by SMYD3 has also been linked to the induction of cell proliferation. Indeed, limited analysis of the human H3K4 methylating enzymes suggests that they are implic.ated in the genesis of cancer (D.Y. Lee et al. 2005). H3K36
METHYLATION
Evidence has led to a proposal that H3K36 methylation is necessary for efficient elongation of RNA pol II through
Figure 5. Role of Histone Lysine Methylation in Transcriptional Elongation
H3K36
PROMOTER ESCAPE
H3K36
H3K36
TRANSCRIPTION ELONGATION
RNA polymerase II recruits distinct types of HKMTs, depending on the phosphorylation state of its carboxy-terminal domain (CTD). RNA pol II is activated for transcriptional initiation in the vicinity of the promoter, when Ser-5 is phosphorylated. This recruits the Setl HKMT to methylate H3K4. Phosphorylation of Ser-2 occurs during transcriptional elongation, prompting H3K36 methylation as a result of Set2 HKMT recruitment to the chromatin template.
C H ROM A TIN
MOD I Fie A T ION SAN 0
the coding region. This modification is highly enriched on the coding region of active genes, in contrast to the 5' location of H3K4 methylation. The Set2 protein is the HKMT capable of methylating H3K36. The Set2 enzyme binds preferentially to RNA pol II that has been phosphorylated within its CTD at Ser-2 (Fig. 5). This form of RNA pol II, which, incidentally, is different from the phosphorylated state associated with promoter clearance, tends to accumulate within the transcribed regions as well as at the 3' ends of the genes. This is consistent with the finding that H3K36me3 peaks at the 3' ends of genes that are actively transcribed. The recruitment of Set2 to active genes also requires components of the PAF complex, as in the case for the recruitment of Setl. However, H2B monoubiquitylation has a negative repressive role on H3K36 methylation (Zhang and Reinberg 2001; Martin and Zhang 2005). Indeed, the SAGA complex, recruited to transcribed genes in yeast, contains Ubp8, a deubiquitinase that is specific for H2BK123. Further studies have suggested that ubiquitylation and deubiquitylation of H2BK123 is an active process during transcription elongation. Processivity of RNA pol II through coding regions requires acetylation of nucleosomes. Transcriptional regulation also needs to suppress inappropriate internal initiation of transcription from cryptic start sites that occur inside coding regions. To suppress this process, methylation at H3K36 by Set2 creates a recognition site for the EAF3 protein through its chromodomain, which in turn mediates the recruitment of the Rpd3S HDAC complex. The deacetylase activity of Rpd3S then removes histone acetylation associated with elongation, thus suppressing internal initiation (Carrozza et al. 2005; Joshi and Struhl 2005; Keogh et al. 2005). Methylation of H3K36 has also been found at much lower levels in the promoter of inducible genes, but in this case, its effect appears to be repressive (Zhang and Reinberg 2001). H3K79
METHYLATION
Methylation at H3K79 is unusual because the modification lies within the core of the nucleosome rather than in the tail, where most other characterized methylation sites are found. Global analysis has shown that H3K79 is methylated in euchromatic regions of yeast and associates primarily with the coding region of active genes. Limited analysis in higher eukaryotes shows the same profile. The mammalian enzyme that methylates H3K79, hDOTlL, has been shown to mediate the leukemogenic functions of the MLL-AFI0 fusion protein. There is, however, no protein to date that binds H3K79me and links it to
THE I R M E C HAN ISM
0 F ACT ION
199
transcriptional events. The only mechanistic evidence of how H3K79 methylation functions in transcriptional activation comes from work in budding yeast. This shows that this modification somehow limits repressive proteins such as Sir2 and Sir3 at euchromatin, thus contributing to the regulation and maintenance of silent heterochromatin by enhancing their concentration at repressive chromatin regions. A distinct function ascribed to the H3K79 HKMT DotI in yeast is the mediation of DNA repair checkpoint. Consistent with this latter finding, a protein has been identified in human cells-P53BPl-that can bind to methylated H3K79 and has a role in DNA repair checkpoint function (Martin and Zhang 2005). H3K9
METHYLATION
This has been the most studied of histone modifications to date, primarily because the enzyme that methylates H3K9-SUV39Hl-was the first HKMT to be identified (Rea et al. 2000). The Drosophila hom*olog, Su(var)3-9, was initially identified as a suppressor of variegation, indicating that it was involved in the silencing mechanism of position-effect variegation (PEV), which involves the spreading of heterochromatin into adjacent euchromatic genes (for more detail, see Chapter 5). The realization that SUV39Hl had sequence similarity to a plant methyltransferase which had Rubisco as its substrate led to the identification of the Suv39 SET domain as the catalytic domain capable of methylating H3K9. Progress has been made in defining the function of H3K9 methylation in pericentromeric heterochromatin formation, which is also discussed extensively in other chapters (for studies on Drosophila, see Chapter 5; for studies in S. pombe, see Chapter 6; and for studies on RNAi-mediated heterochromatin formation, see Chapter 8). The results have come largely from studies in fission yeast and mammals, where heterochromatic structures are thought to be reasonably well conserved (but note, H3K9 methylation has not been detected in budding yeast). To summarize, the first stage of our understanding emerged from studies on the factors involved in the establishment of heterochromatin. This involves the cooperation of two proteins: SUV39H (or Clr4 in fission yeast) and its binding partner HPI (or Swi6 in fission yeast [Nakayama et al. 2001; Noma et al. 2001]). A model has been proposed whereby SUV39H methylates H3K9, creating a binding platform for HPl, through its chromodomain (Bannister et al. 2001; Lachner et al. 2001). Once HPI binds, it can spread onto adjacent nucleosomes by its association with SUV39H, which further catalyzes neigh-
200
C HAP T E R
70
boring histone methylation (Nakayama et a1. 2001). In addition, HP1 self-associates via the chromoshadow domain facilitating the spread of heterochromatin. How HP1 spreading dictates the formation of the densely packed heterochromatic structures is, however, unknown. The above model predicts that there should be a specific heterochromatin-based recruitment mechanism for the SUV39H HKMT enzyme, before HP1 can spread. The clue as to what this may be came from a series of experiments in fission yeast which showed a link between heterochromatin formation and the production of short interfering RNAs (siRNAs) (Hall et a1. 2002; Volpe et a1. 2002). These RNAs come from the bidirectional transcription of centrometric repeats which are processed into siRNAs by the enzyme dicer. The siRNAs are then packaged into the RITS complex, which contains the chromodomain-containing protein, Chp1, which binds methylated H3K9. Thus, the targeting of the RITS complex to chromatin forms the initiation stage of heterochromatin formation. The spreading and maintenance of heterochromatin over a 20-kb region, as described above, requires the methylation of histone H3K9 by the Clr4 HKMT and the binding of Swi6 to H3K9 methylated chromatin (Martin and Zhang 2005; for more detail, see Chapter 8). The interdependence of different repressive epigenetic mechanisms has emerged from studies first in Neurospora crassa, but also in plants, notably demonstrating a link between H3K9 methylation and the process of DNA methylation (see Chapters 6 and 8). H3K9 methylation is necessary for DNA methylation to take place, and the reciprocal connection seems to be operational, whereby H3K9 methylation is dependent on DNA methylation. Moreover, recent studies in mammalian cancer cells lacking DNA methyltransferase enzymes (Dnmts) show reduced levels of H3K9 methylation, and this can be attributed to the fact that the methyl-CpG-binding protein 1 (MBD1) associates with the H3K9 HKMT, SETDB1 (Zhang and Reinberg 2001; Martin and Zhang 2005; for more detail, see Chapter 18). Methylation at H3K9 also functions in the repression of euchromatic genes. ChiPs have detected this methylation at the promoter of mammalian genes when the genes are silent. The mechanism of this repression at euchromatic sites appears to be slightly different from those encountered at heterochromatic regions. The RB repressor protein delivers the SUV39H 1 HKMT and HP 1 to euchromatic genes such as the E2F-regulated cyclin E gene. Unlike heterochromatin, however, HP1 occupancy appears to be restricted to one or a few nucleosomes
around the initiation site, even though H3K9 methylation occurs elsewhere on the promoter. In another example, the KAP 1-repressor brings the ESET/SETD B1 HKMT to the promoter of KAP1 regulated genes and silences transcription by methylation of H3K9 and HP1 recruitment. The special restriction of HP1 on these euchromatic promoters, and the prevention of spreading, suggest a distinct mechanism of action for HP1 relative to its heterochromatic role. One possible mode of action for HP1, which has some support, is that it acts as an anchor into heterochromatin-rich nuclear compartments. Movements have been observed during the repression of euchromatic genes which show that a silenced gene is displaced into a heterochromatic region, a movement which is dependent on the gamma isoform of HP1 (Martin and Zhang 2005). Heterochromatin formation at telomeres, although involving HP1 and H3K9 methylation, varies from the aforementioned pericentromeric and silent euchromatic regions. In Drosophila, HP 1 is not recruited to telomere ends through its chromo- or chromoshadow domain, and H3K9 methylation is catalyzed by an unknown HKMT. In mammals, distinct HP1 hom*ologs, CBX1, CBX3, and CBXs, are involved in binding to methylated H3K9, transduced by the SUV39H1 and SUV39H2 proteins to form repressive chromatin domains at chromosome ends (for more detail, see Chapter 14). H3K27
METHYLATION
H3K27 methylation is a repressive modification found in three distinct places in the cell: (1) euchromatic gene loci, predominantly where there are Polycomb Response Elements (PREs) in the case of Drosophila, (2) at pericentromeric heterochromatin, and (3) at the inactive X in mammals. The enzyme that mediates H3K27 methylation is EZH2 in human cells, a hom*olog of the Drosophila ENHANCER OF ZESTE [E(Z)] protein. The EZH2 enzyme is found in a number of distinctive Polycomb repressive complexes (PRCs) which associate with specific repressive Polycomb DNA elements in promoters in Drosophila (see also Chapter 11). What targets the EZH2-containing complexes to specific genes in mammals is unknown, as Polycomb repressive elements have not been identified. However, targeting of these EZH2 complexes may be mediated by a variety of transcription factors, including GAGA and MYc. The mechanisms of repression by EZH2 involve methylation of H3K27 and the recruitment of the Polycomb (Pc) protein to this modified site (as in Model 3 in Fig. 1). An
C H ROM A TIN
MOD I Fie A T ION 5
important aspect of the pathway that leads to H3K27 methylation is that it is implicated in cancer. The EZH2 H3K27 HKMT is found overexpressed in a number of cancer tissues, including breast and prostate (Martin and Zhang 2005). H4K20
METHYlATION
Very little is known mechanistically about the role of this modification in transcriptional control. What is clear is that H4K20me2 and H4K20me3 are present at pericentromeric heterochromatin and that the HKMT enzymes that mediate these modifications are SUV4-20Hl and SUV4-20H2. Methylation of H3K9 seems to be required for methylation of H4K20. Another enzyme that can mono-methylate H4K20 in higher eukaryotes is PR-Set7, which has been implicated in mitotic events. Last, there is functional evidence that H4K20 methylation has been linked to DNA repair via the binding of the DNA damage checkpoint protein CrB2 in budding yeast (Martin and Zhang 2005). 4.2 Demethylation of Lysines
Until recently, it was unclear whether histone lysine demethylation was taking place in the cell. The search for such enzymes had been fruitless, and evidence existed that methyl groups can be quite stable on heterochromatin regions. The discovery of LSDI changed all that (Shi et al. 2004). This protein was shown to be an enzyme that removes methyl groups specifically from H3K4 and is involved in repression. LSD 1 is present in a number of different repressor complexes, and some of these allow it to more efficiently demethylate lysine residues from nucleosomal histone H3 (M.G. Lee et al. 2005; Shi et al. 2005). The specificity of LSDI can be changed if it binds a partner such as the androgen receptor (AR). An LSD 1AR complex demethylates H3K9 instead of H3K4, and under these conditions, activates, rather than represses, transcription (Metzger et al. 2005). Recently, five new demethylases were identified that possess a common catalytic structure distinct from LSD 1, called the JmjC-domain (Fig. 6). This domain was previously predicted to possess enzymatic activity (Trewick et al. 2005). These new demethylases are found to demethylate distinct methyl states of H3K9 and H3K36. JHDMI demethylates H3K36mel and me2, and JHDM2A demethylates H3K9mel and me2 (Tsukada et al. 2006; Yamane et al. 2006). The tri-methyl state of these two modified residues is removed by a distinct set of enzymes. JHDM3A and JMJD2A can act on both H3K36me3 and
AND
THE I R M E C HAN ISM
0 F ACT ION
•
201
H3K9me3 (Cloos et al. 2006; Fodor et al. 2006; Klose et al. 2006; Tsukada et al. 2006; Whetstine et al. 2006). It is perhaps surprising that enzymes exist which can simultaneously demethylate an active (e.g., H3K36me) and a repressive (e.g., H3K9me) mark. This may be explained by the recent finding that H3K9 methylation also associates with actively transcribed genes (Vakoc et al. 2005). In a more classic mechanism, the JHDM2A enzyme is recruited to promoters by AR, where it is involved in activating transcription via demethylation of H3K9 (Yamane et al. 2006). Structural analysis of JMJD2A has revealed that four distinct domains UmjN, JmjC, an unusual Zing finger, and a carboxy-terminal domain) come together to form the catalytic core. A deep cleft is formed by these domains coming together, which demands a conformational change in the enzyme or substrate to accommodate the methyl group for demethylation. Such a conformational shift may explain the specificity of demethylation (Chen et al. 2006). It is interesting to note that one of the newly discovered demethylases, JMJD2C, was previously known as GASCl, a gene amplified in squamous carcinoma. Consistent with a causative role for this enzyme in cancer development, the overexpression of GASCI was shown to induce cell proliferation (Cloos et al. 2006). These results together imply that demethylases as well as HKMTs may be targets for anticancer drug development (see also Chapter 24). 4.3 Methylation of Arginines
The importance of histone arginine methylation in transcriptional control came after the identification of CARMI, an enzyme that can methylate arginines within H3 in vitro (Chen et al. 1999). In vivo, arginine methylation was subsequently demonstrated in experiments using specific antibodies to arginine-methylated sites (Strahl et al. 2001; Wang et al. 2001; Bauer et al. 2002). Arginine methylation has been implicated in the positive and negative regulation of transcription. Two methyltransferases, PRMTl (protein arginine methyltransferase) and PRMT4/CARMl, have been linked to transcriptional activation. PRMTl has the ability to methylate H4R3 (Strahl et al. 2001; Wang et al. 2001), whereas PRMT4/CARMI can catalyze the methylation of H3R2, H3R17, and H3R26 (Schurter et al. 2001; Bauer et al. 2002). Specific transcription factors (NR, p53, YYl, NFKB) recruit these enzymes to specific promoters where they activate transcription. In contrast, PRMT5 (which can methylate H3R8 and H4R3) acts as a repressor of numerous genes, including some regulated by MYC (Fabbrizio et al. 2002; Pal et al. 2003).
202 •
C HAP T E RIO
. .\.~~v. ~ '\ Histone H3
ma
tn
???
\
Histone H4
d,
mo
tn
Figure 6. Histone lysine Demethylases and Their Sites of Demethylation on Histone H3 Sites of histone lysine methylation may be mono-, di-, or tri-methylated. Known histone lysine demethylases show different specificities in demethylating histone residues or methylated states, as illustrated.
Most of our knowledge regarding arginine methylation comes from the analysis of the estrogen-signaling pathway that regulates the pS2 gene. ChIPs have indicated that a complex and cyclic set of events follows the stimulation of this gene by estrogen. The estrogen receptor is first recruited to the pS2 promoter within minutes of the stimulus and brings with it many protein complexes and enzymes that modify histones (Metivier et al. 2003). Relevant here is the recruitment of CARMI and PRMTl, which can methylate arginine residues of histones H3 and H4 (Ma et al. 2001; Bauer et al. 2002). This methylation is detected very soon after the arrival of the enzymes and coincides with the appearance of active RNA pol II on the promoter. Surprisingly, however, minutes after these events, methylation at arginines is no longer detected by specific antibodies, and RNA pol II disappears. Soon after that, methylation of arginines and RNA pol II reappears (Metivier et al. 2003; Cuthbert et al. 2004; Wang et al. 2004). The reason for these cyclic events is not known. One possibility is that it provides a mechanism for rapid shutoff of transcription if estrogen signaling fails. Experiments done on reconstituted chromatin templates have helped establish a direct role for arginine methylation in gene expression. Analysis of p53-mediated activation of transcription in vitro has shown that there is a synergistic effect of methylation transduced by PRMTl and CARMI, and acetylation by CBP/p300 (An et al. 2004). Furthermore, these assays have confirmed the in vivo observations on the pS2 gene that a specific order of events takes place during activation in which the sequen-
tial activity of PRMTl, CBP/p300, and CARMI is necessary (Metivier et al. 2003). Given that arginine methylation is such a dynamic process, several ways have been described in which the effectiveness of arginine methyltransferase is controlled. First, the interaction of the enzyme with another protein can control its substrate specificity. Second, there is potential for competition between enzymes for a given arginine substrate. Both PRMTI and PRMT5 can methylate H4R3, but the first enzyme is an activator and the second is a repressor of transcription. A third level of regulation of the methyl state may come from arginine demethylation. Such an activity has not yet been isolated, but there are clear indications of the rapid disappearance of methyl groups from arginines, making such an activity a very attractive possibility (Zhang and Reinberg 2001; D.Y. Lee et al. 2005; Wysocka et al. 2006a).
5 Deimination The lack of an arginine demethylase prompted the suggestion that other types of enzymatic reactions may antagonize arginine methylation (Bannister et al. 2002). One such reaction is deimination, a process by which an arginine can be converted to citrulline via the removal of an imine group. If the arginine is mono-methylated, removal of methylamine would effectively result in the removal of the methyl group from the arginine. The presence of citrulline in histones has now been demonstrated, and the enzyme, PADI4, has been identified that can convert arginines within histones into citrulline (Cuthbert et
C H ROM A TIN
MOD I Fie A T ION SAN 0
al. 2004; Wang et al. 2004). Moreover, the appearance of citrulline on histones H3 and H4 correlates with the disappearance of arginine methylation in vivo. Additionally, analyses of estrogen-regulated promoters, where arginine methylation coincides with the active state of transcription, have shown that citrulline appears when the promoter is shut off. Many unanswered questions remain regarding this modification. Is the citrulline acting to suppress active methylation at arginines, or does it repress transcription by actively recruiting proteins? What about the reversal of citrulline deposition? This clearly takes place on the promoters at a very rapid pace, but is this an enzymatically driven reaction or is it merely due to the replacement of nucleosomal histones by histone variants, which contain arginine in place of citrulline?
6 Ubiquitylation/Deubiquitylation and Sumoylation Ubiquitin and SUMO are quite distinct PTMs compared to acetylation, phosphorylation, and methylation. Whereas the latter PTMs are small chemical groups, Ub and SUMO are large polypeptides, which increase the size of the histone by approximately two-thirds. Ub and SUMO are 18% identical in sequence and share a threedimensional structure, but are dissimilar in surface charge. Histones were the first proteins shown to be monoubiquitylated, although precise positions of Ub were not identified until relatively recently (Robzyk et al. 2000; Wang et al. 2004). Like methylation, and unlike acetylation and phosphorylation (and, possibly, sumoylation), ubiquitylation can be either repressive or activating, depending on the specific sites. H2A and H2B are
TRANSCRIPTIONAL REPRESSION
THE I R M E C HAN ISM
0 F ACT ION
•
203
monoubiquitylated, which contrasts with proteolysisassociated polyubiquitylation. The effects of monoubiquitylation on each core histone are opposite (Fig. 7). H2B monoubiquitylation is activating to transcription, transduced by Rad6/Brel (and the human counterpart RNF20/RNF40 + UbcH6) (Wood et al. 2003; Kim et al. 2005; Zhu et al. 2005), and leads to H3K4 methylation, as described in the previous section and in the next section (Henry et al. 2003; Kao et al. 2004). This sequence of events, although as yet not understood mechanistically, is conserved from yeast to human (Kim et al. 2005; Zhu et al. 2005). H2AK119ubl, on the other hand, is repressive to transcription in mammals and catalyzed by the Polycomb group Bmil/RinglA protein (Wang et al. 2004). There is no evidence for evolutionary conservation of repressive H2Aub in yeast. To date, no histone-specific ubiquitin-binding proteins have been identified. However, because numerous ubiquitin interaction domains have been documented as binding to non-histone ubiquitylated substrates, it seems highly likely that effectors for ubiquitylated histones will be found. However, they may interact in a different manner than the chromatin interacting domains for acetylation and methylation; i.e., there are likely to be two simultaneous binding interactions, one to a surface on ubiquitin and a second interaction within histone sequences, to provide specificity of interaction. Deubiquitylation of the H2BK123 site is involved in both gene activation and maintenance of heterochromatic silencing, through the action of two distinct proteases, Ubp8 and Ubp 10. Ubp8 is a subunit of the SAGA histone acetylation complex (Sanders et al. 2002) and acts following ubiquitylation by Rad6 (Henry et al. 2003;
TRANSCRIPTIONAL ACTIVATION Ub
Ub
Histone H2A
~
__
.. Histone H2B
~~=-_ K123
Figure 7. Sites of Histone Ubiquitylation and Their Consequence for Transcriptional Regulation Ubiquitylation of H2A at Lys-119 is correlated with transcriptional repression. H2BK123 ubiquitylation is conversely associated with transcriptional activation.
204
C HAP T E RIO
Daniel et al. 2004). This sequence of H2B ubiquitylation followed by deubiquitylation is required to establish the appropriate levels ofH3K4 (H2Bub required) and H3K36 methylation (H2Bub not required) (Henry et al. 2003). UbpIO functions at silenced regions to maintain low levels of H3K4me and H3/H4 lysine acetylation, and thus assists in preventing transcription (Emre et al. 2005; Gardner et al. 2005). Sumoylation is the only HPTM described in yeast as repressive and is conserved in mammals (Shiio and Eisenman 2003). Its role may be generally negative-acting to prevent activating HPTMs. The inhibition of active HPTMs may occur through two mechanisms. First, SUMO-histone may directly block lysine substrate sites that are alternatively acetylated or sumoylated (as in Model 2 in Fig. 1). Second, sumoylated histones may recruit HDACs both to chromatin (Model 3) and via a SUMO group that occurs on DNA-bound repressors.
2004). Similarly, H3K9me has recently been shown to increase during gene induction (Vakoc et al. 2005), in addition to its well-characterized role in heterochromatic silencing. Finally, many of the same HPTMs occur in both transcription and DNA repair, which are mechanistically distinct processes. Based on some of these considerations, a more general hypothesis has been proposed where HPTMs serve as a nuclear DNA-associated signal transduction pathway, similar to cytoplasmic signal transduction that is generated and propagated largely through Ser/Thr phosphorylation (Schreiber and Bernstein 2002). In this model, there is not a strict histone code for specific processes, but rather HPTM recognition and binding via a plethora of proteinbinding motifs. This model explains how any site could be both activating and repressing and involved in more than one process, because different binding effector proteins are cognates for the same HPTM for distinct processes.
7 Themes in Modifications
7.2 Modification Patterns
The preceding discussion of the numerous types and sites of histone PTM occurring in transcription might lead to the conclusion that there are few overarching guiding principles or ideas. However, there do appear to be a number of broad themes that occur repeatedly, although the specifics may change depending on the histone, the sites of HPTM, and the binding proteins. Indeed, chromatin regulation may vary between promoters and distinct pathways.
Some experimental evidence points to the structural alteration of chromatin with certain HPTMs (Model 1). This can result from altering the charge of single or cluster of histone residues. This is particularly true when residues are acetylated or phosphorylated, which reduces the positive charge of histone regions (see Section 5 of Chapter 3). Such cis alterations can alter internucleosomal spacing and reduce the affinity of histones to negatively charged DNA, as exemplified by the. negative charged patches that occur on linker histone (Dou and Gorovsky 2002). These types of HPTMs may be cumulative in their effect on, for example, transcriptional activation or for creating higher-order chromatin structures, rather than producing a binary ON/OFF effect (Kurdistani et al. 2004; Henikoff 2005). Another model for the "output" of the myriad of HPTMs is that the code is complex and is read in patterns and often in temporal sequences. In this view, the intricacy of the patterns in three-dimensional space and over time during a process requiring many chromatin-associated steps, such as transcription, yields a meaningful mechanistic result. Two types of HPTM patterns have been identified. First, there are patterns on the same histone tail, or in "cis;' and second, patterns on different histone tails, or in "trans:' The most well-characterized cis pattern is between H3SIOph and H3KI4ac on the H3 amino-terminal tail (Cheung et al. 2000; Lo et al. 2000), where H3SIOph leads to H3KI4ac. The mechanism underlying the establishment of this pattern is understood in structural detail: The
7.1 Histone Code
One key question emerges after this lengthy discussion of the intricacies of HPTMs: Why are there so many modifications? Clearly, many of them correlate with transcription, and others occur during different DNA-templated processes. Thus, one hypothesis is that there is a histone "code," linking specific modifications with individual processes (Strahl and Allis 2000; Turner 2000). The simplest code would be a binary relationship between HPTMs and either gene activation or repression, and distinct HPTMs for other processes. The evidence supporting such a code is the observed tendency, as described above, for certain HPTMs to be positive-acting and others negative-acting. However, there are observations that are inconsistent with a simple binary code. For example, phosphorylation ofH3SlO is both activating to transcription, which presumably involves opening the chromatin, and involved in chromosome condensation, making chromatin even more inaccessible (Nowak and Corces
C H ROM A TIN
MOD I Fie A T ION SAN 0
enzyme that acetylates binds to the previously phosphorylated H3 tail with increased affinity due to a greater number of amino acid side-chain contact points (Clements et al. 2003). Other cis patterns are H3K23 acetylation and H3R17 methylation (Daujat et al. 2002) and H4R3 methylation and H4K8 acetylation (Wang et al. 2001). As described above, one trans tail pattern has been identified, where initial ubiquitylation of H2BK123 leads to methylation of H3K4 (Briggs et al. 2002; Dover et al. 2002; Sun and Allis 2002). The mechanism linking these HPTMs has not been elucidated, although several possible hypotheses exist. Because the link is from one large modification (ubiquitin) to a nonadjacent HTPM, one model is that ubiquitin wedges the chromatin open, like a crowbar, to allow the methylating enzyme access to its site. A second general model is that H2BK123ubi functions to recruit effector proteins, similar to the role of the other HPTMs. The noncatalytic portion of the proteosome requires H2B ubiquitylation for chromatin association (Ezhkova and Tansey 2004), and the function of the elongation complex FACT is stimulated by H2B ubiquitylation (Pavri et al. 2006), although neither complex has yet been shown to directly bind to ubiquitylated H2B.
THE I R M E C HAN ISM
a
205
F ACT ION
REPRESSED (DEFAULT) promoter
gene body
-~ DEREPRESSED I ACTIVATED
Model 2:
Detamar eviction
Model 3: Histone variant
exchange H2A
7.3 Changes in Chromatin 5tructure Associated with Transcription Activation and Elongation
The transcriptionally active euchromatic regions contain nucleosomes, but in an "unfolded" state, denoted as "beads-on-a-string" or II-nm fiber. The nucleosomes in this state still impose an intrinsic inhibition to the transcription machinery. Some transcription factors, be they activators or repressors, can gain access to their sites when contained in nucleosomes, but others cannot. Moreover, the machinery recruited by the DNA-bound regulators and responsible for delivering RNA pol II to promoters is constrained by the presence of nucleosomes. A number of distinct mechanisms serve to reconfigure the chromatin, poising genes for subsequent transcription, or promoting initiation or elongation. Some of these mechanisms are illustrated in Figure 8. The nucleosome problem during transcription is solved in part by the recruitment of protein complexes to mobilize and/or alter the structure of the nucleosome. These complexes fall into two different families, one represented by SNF2H (or ISWI and ISW2 in yeast), and one by the Brahma-Swi/Snf family (Narlikar et al. 2002; Peterson 2002; Flaus and Owen-Hughes 2004). The first family mobilizes nucleosomes, whereas Swi/Snf also transitorily alters the structure of the nucleosomes. Acetylated nucle-
H2AZ
wr1 H2AlH2B
H2AlH2B
e-Le-L
~
Model 4: FACT recruitment and H2NH2B dimer loss H2AZ
H2AZ
Figure 8. Models for the Involvement of Chromatin Remodeling and Histone Exchange in Transcriptional Processes In Modell, the Swi/Snf family of ATPase binds chromatin through bromodomain recognition of acetylated histones and acts to alter the local chromatin structure. Model 2 depicts the reported octamer eviction that occurs at certain loci such as PH05 by an unknown mechanism. In Model 3, the ATPase SWRl catalyzes the replacement of histone H2A with H2AZ, which poises chromatin for transcription. Model 4 focuses on the involvement of FACT in transcriptional elongation, assisting in nucleosome unraveling by the displacement of an H2A/H2B dimer. Concomitantly, histone H3 may be exchanged with H3.3 during the process.
osomes are recognized by the Swi/Snf complex through bromodomain interaction (Model 1 in Fig. 8). A second mechanism involved in gene activation is selective octamer loss at promoters. For example, histone octamers are evicted at the promoter of the PROS gene in S. cerevisiae during transcriptional induction (Model 2 in Fig. 8) (Boeger et al. 2003; Reinke and Horz 2003). In
206
•
C HAP T E RIO
addition, promoters of S. cerevisiae have a constitutively low density of nucleosomes, which allows access for transcription factors (Sekinger et al. 2005). It is not yet known whether or how ATP-dependent remodeling complexes assist in generating and maintaining this low occupancy. A third major mechanism involved in setting up transcriptional states is the presence of histone variants. There are two types of histone variants associated with gene activity. First, a variant of H2A called H2AZ is found in nucleosomes around the promoter gap, and poises the gene for activation (Santisteban et al. 2000; Raisner et al. 2005; Zhang et al. 2005); a specific ATP-dependent remodeling complex, called Swrl, replaces H2A with H2AZ (Model 3 in Fig. 8) (Mizuguchi et al. 2004; for more detail, see Chapter 13). Second, one H3 isoform, called H3.1, is incorporated into chromatin during replication, whereas isoform H3.3 is incorporated in a replication-independent manner (Ahmad and Henikoff 2002) with the aid of the HIRA (histone regulator A) chaperone. This variant is predominantly found within gene ORFs (Mito et al. 2005), suggesting that its deposition is a transcription-coupled process. There are additional mechanisms to overcome the nucleosomal barrier to elongating RNA pol II (and RNA pol I). A large number of factors have been isolated that affect transcription elongation (Sims et aI. 2004). One of these factors was found to allow the RNA pol II to traverse nucleosomes. This factor is known as FACT (for FAcilitate Chromatin Transcription). Importantly, FACT functions exclusively through nucleosomes, binds to them, and then promotes the displacement of one H2A/H2B dimer (Model 4 in Fig. 8) (Belotserkovskaya et al. 2003). As transcription ceases, FACT also promotes the reconstitution of the nucleosome. Interestingly, FACT performs its functions in the absence of energy, but physically interacts with CHDl, a protein that hydrolyzes ATP to mobilize nucleosomes and bind to the active H3K4me mark. Moreover, FACT also interacts with NuA4, a complex that contains HAT activity. Although FACT can promote displacement of the H2A/H2B dimer in vitro in an ATPindependent manner, it is possible that this is promoted by its interaction with factors such as CHDl, which mobilize or alter the structure of nucleosomes in vivo and also the interplay with HPTMs (Reinberg and Sims 2006). 8 Concluding Remarks
We have come a long way in this "modern era" of histone modifications which covers the last 10 years. In this time, there have been six distinct types of histone modification pathways characterized and numerous sites of modifications identified. Yet this is clearly still the beginning of our
understanding. Mechanistically we know that modifications affect the binding of proteins, but we are still unaware precisely how these proteins result in reorganization of chromatin structure. We still do not know whether there is a code or whether modifications are simply part of a signaling pathway. In addition, our knowledge is lacking on the many cellular processes, other than transcription, that modifications are involved in. Thus, in short, we have become aware of the complexity of the system, but we are a long way from making sense of the complexity. One thing is for sure, it is worth the effort to find out, since histone modifications playa fundamental role in both normal and diseased processes. References Ahmad K. and Henikoff S. 2002. The histone variant H3.3 marks active chromatin by replication-independent nucleosome assembly. Mol. Cell 9: 1191-1200. An W., Kim J., and Roeder R.G. 2004. Ordered cooperative functions of PRMTl, p300, and CARMI in transcriptional activation by p53. Cell1l7: 735-748. Baek S.H. and Rosenfeld M.G. 2004. Nuclear receptor coregulators: Their modification codes and regulatory mechanism by translocation. Biochem. Biophys. Res. Commun. 319: 707-714. Bannister A.J., Schneider R., and Kouzarides T 2002. Histone modification: Dynamic or static? Cell 109: 801-806. Bannister A.J., Zegerman P., Partridge J.P., Miska E.A., Thomas J.O., Allshire R.C, and Kouzarides T 2001. Selective recognition of methylated lysine 9 on histone H3 by the HPI chromo domain. Nature 410: 120-124. Bauer U.M., Daujat S., Nielsen S.J., Nightingale K., and Kouzarides T. 2002. Methylation at arginine 17 of histone H3 is linked to gene activation. EMBO Rep. 3: 39-44. Belotserkovskaya R., Oh S., Bondarenko V.A., Orphanides G., Studitsky V.M., and Reinberg D. 2003. FACT facilitates transcriptiondependent nucleosome alteration. Science 301: 1090-1093. Boeger H., Griesenbeck J., Strattan J.S., and Kornberg R.D. 2003. Nucleosomes unfold completely at a transcriptionally active promoter. Mol. Cellll: 1587-1598. Braunstein M., Rose A.B., Holmes S.G., Allis CD., and Broach J.R. 1993. Transcriptional silencing in yeast is associated with reduced nucleosome acetylation. Genes Dev. 7: 592-604. Briggs S.D., Xiao T, Sun Z.W., Caldwell J.A., Shabanowitz J., Hunt D.P., Allis CD., and Strahl B.D. 2002. Gene silencing: Trans-histone regulatory pathway in chromatin. Nature 418: 498. Brownell J.E., Zhou J., Ranalli T, Kobayashi R., Edmondson D.G., Roth S.Y, and Allis CD. 1996. Tetrahymena histone acetyltransferase A: A hom*olog to yeast Gcn5p linking histone acetylation to gene activation. Cell 84: 843-851. Carrozza M.J., Li B., Florens L., Suganuma T, Swanson S.K., Lee K.K., Shia W.J., Anderson S., Yates J., Washburn M.P., and Workman J.L. 2005. Histone H3 methylation by Set2 directs deacetylation of coding regions by Rpd3S to suppress spurious intragenic transcription. Cell 123: 581-592. Chen D., Ma H., Hong H., Koh S.S., Huang S.M., Schurter B.T, Aswad D.W., and Stallcup M.R. 1999. Regulation of transcription by a protein methyltransferase. Science 284: 2174-2177. Chen Z., Zang J., Whetstine J., Hong X., Davrazou P., Kutateladze TG.,
C H ROM A TIN
MOD I Fie A T ION SAN 0
Simpson M., Mao Q., Pan CH., Dai S., et al. 2006. Structural insights into histone demethylation by JMJD2 family members. Cell 125: 691-702. Cheung P., Tanner K.G., Cheung W.L., Sassone-Corsi P., Denu J.M., and Allis CD. 2000. Synergistic coupling of histone H3 phosphorylation and acetylation in response to epidermal growth factor stimulation. Mol. CellS: 905-915. Clark-Adams CD., Norris 0., Osley M.A., Fassler J.S., and Winston E 1988. Changes in histone gene dosage alter transcription in yeast. Genes Dev. 2: 150-159. Clements A., Poux A.N., Lo W.S., Pill us L., Berger S.L., and Marmorstein R. 2003. Structural basis for histone and phosphohistone binding by the GCN5 histone acetyltransferase. Mol. Cell 12: 461--473. Cloos P.A., Christensen J., Agger K., Maiolica A., Rappsilber J., Antal T., Hansen K.H., and Helin K. 2006. The putative oncogene GASCI demethylates tri- and dimethylated lysine 9 on histone H3. Nature 442: 307-311. Cuthbert G.L., Daujat S., Snowden A.W., Erdjument-Bromage H., Hagiwara T, Yamada M., Schneider R., Gregory P.D., Tempst P., Bannister A.J., and Kouzarides T 2004. Histone deimination antagonizes arginine methylation. CellllS: 545-553. Daniel J.A., Torok M.S., Sun Z.W, Schieltz D., Allis CD., Yates J.R., Ill, and Grant P.A. 2004. Deubiquitination of histone H2B by a yeast acetyltransferase complex regulates transcription. ]. BioI. Chern. 279: 1867-1871. Daujat S., Bauer U.M., Shah v., Turner B., Berger S., and Kouzarides T 2002. Crosstalk between CARMI methylation and CBP acetylation on histone H3. Curro BioI. 12: 2090-2097. Dhalluin C, Carlson J.E., Zeng L., He C, Aggarwal A.K., and Zhou M.M. 1999. Structure and ligand of a histone acetyl transferase bromodomain. Nature 399: 491--496. Dou Y. and Gorovsky M.A. 2002. Regulation of transcription by HI phosphorylation in Tetrahymena is position independent and requires clustered sites. Proc. Natl. Acad. Sci. 99: 6142-6146. Dover J., Schneider J., Tawiah-Boateng M.A., Wood A., Dean K., Johnston M., and Shilatifard A. 2002. Methylation of histone H3 by COMPASS requires ubiquitination of histone H2B by Rad6.]. BioI. Chern. 277: 28368-28371. Durrin L.K., Mann R.K., Kayne P.S., and Grunstein M. 1991. Yeast histone H4 N-terminal sequence is required for promoter activation in vivo. Cell 65: 1023-1031. Emre N.C, Ingvarsdottir K., Wyce A., Wood A., Krogan N.J., Henry K.W., Li K., Marmorstein R., Greenblatt J.E, Shilatifard A., and Berger S.L. 2005. Maintenance of low histone ubiquitylation by Ubpl0 correlates with telomere-proximal Sir2 association and gene silencing. Mol. CellI?: 585-594. Ezhkova E. and Tansey W.P. 2004. Proteasomal ATPases link ubiquitylation of histone H2B to methylation of histone H3. Mol. Cell 13: 435--442. Fabbrizio E., El Messaoudi S., Polanowska J., Paul C, Cook J.R., Lee J.H., Negre v., Rousset M., Pestka S., Le Cam A., and Sardet C 2002. Negative regulation of transcription by the type II arginine methyltransferase PRMT5. EMBO Rep. 3: 641-645. Fischle W., Tseng B.S., Dormann H.L., Ueberheide B.M., Garcia B.A., Shabanowitz J., Hunt D.E, Funabiki H., and Allis CD. 2005. Regulation of HP I-chromatin binding by histone H3 methylation and phosphorylation. Nature 438: 1116--1122. Flaus A. and Owen-Hughes T 2004. Mechanisms for ATP-dependent chromatin remodelling: Farewell to the tuna-can octamer? Curro Opin. Genet. Dev. 14: 165-173. Fodor B.D., Kubicek S., Yonezawa M., O'Sullivan R.J., Sengupta R.,
THE I R M E C HAN ISM
0 F ACT ION
•
207
Perez-Burgos L., Opravil S., MechtJer K., Schotta G., and Jenuwein T 2006. Jmjd2b antagonizes H3K9 trimethylation at pericentric heterochromatin in mammalian cells. Genes Dev. 20: 1557-1562. Gardner R.G., Nelson Z.W, and Gottschling D.E. 2005. Ubpl0/Dot4p regulates the persistence of ubiquitinated histone H2B: Distinct roles in telomeric silencing and general chromatin. Mol. Cell. Bioi. 25: 6123-6139. Glozak M.A., Sengupta N., Zhang X., and Seto E. 2005. Acetylation and deacetylation of non-histone proteins. Gene 363: 15-23. Grant P.A., Sterner D.E., Duggan L.J., Workman J.L., and Berger S.L. 1998. The SAGA unfolds: Convergence of transcription regulators in chromatin-modifying complexes. Trends Cell BioI. 8: 193-197. Hall I.M., Shankaranarayana G.D., Noma K., Ayoub N., Cohen A., and Grewal S.l. 2002. Establishment and maintenance of a heterochromatin domain. Science 297: 2232-2237. Hampsey M. and Reinberg D. 2003. Tails of intrigue: Phosphorylation of RNA polymerase II mediates histone methylation. Cell 113: 429--432. Han M. and Grunstein M. 1988. Nucleosome Joss activates yeast downstream promoters in vivo. Cell 55: 1137-1145. Hassan A.H., Prochasson P., Neely K.E., Galasinski S.C, Chandy M., Carrozza M.J., and Workman J.L. 2002. Function and selectivity of bromodomains in anchoring chromatin-modifying complexes to promoter nucleosomes. Celllll: 369-379. Hebbes TR., Clayton A.L., Thorne A.W., and Crane-Robinson C 1994. Core histone hyperacetylation co-maps with generalized DNase I sensitivity in the chicken beta-globin chromosomal domain. EMBO]. 13: 1823-1830. Henikoff S. 2005. Histone modifications: Combinatorial complexity or cumulative simplicity? Proc. Natl. Acad. Sci. 102: 5308-5309. Henry K.W, Wyce A., Lo WS., Duggan L.J., Emre N.C, Kao CE, Pilius L., Shilatifard A., Osley M.A., and Berger S.L. 2003. Transcriptional activation via sequential histone H2B ubiquitylation and deubiquitylation, mediated by SAGA-associated Ubp8. Genes Dev. 17: 2648-2663. Hirota T, Lipp J.J., Toh B.H., and Peters J.M. 2005. Histone H3 serine 10 phosphorylation by Aurora B causes HPI dissociation from heterochromatin. Nature 438: 1176-1180. Jenuwein T and Allis Co. 2001. Translating the histone code. Science 293: 1074-1080. Joshi A.A. and Struhl K. 2005. EaD chromodomain interaction with methylated H3-K36 links histone deacetylation to Pol II elongation. Mol. Cell 20: 971-978. Kao C.E, Hillyer C., Tsukuda T, Henry K., Berger S., and Osley M.A. 2004. Rad6 plays a role in transcriptional activation through ubiquitylation of histone H2B. Genes Dev. 18: 184-195. Keogh M.C, Kurdistani S.K., Morris S.A., Ahn S.H., Podolny v., Collins S.R., Schuldiner M., Chin K., Punna T, Thompson N.J., et al. 2005. Cotranscriptional set2 methylation of histone H3 lysine 36 recruits a repressive Rpd3 complex. Cell 123: 593-605. Kim J., Hake S.B., and Roeder R.G. 2005. The human hom*olog of yeast BREI functions as a transcriptional coactivator through direct activator interactions. Mol. Cell 20: 759-770. Klose R.J., Yamane K., Bae Y., Zhang D., Erdjument-Bromage H., Tempst P., Wong J., and Zhang Y. 2006. The transcriptional repressor JHDM3A demethylates trimethyl histone H3 lysine 9 and lysine 36. Nature 442: 312-316. Kurdistani S.K. and Grunstein M. 2003. Histone acetylation and deacetylation in yeast. Nat. Rev. Mol. Cell. BioI. 4: 276--284. Kurdistani S.K., Tavazoie S., and Grunstein M. 2004. Mapping global histone acetylation patterns to gene expression. Ce1l1l7: 721-733. Lachner M., O'Carroll D., Rea S., MechtJer K., and Jenuwein T 2001.
208
C HAP T E R
70
Methylation of histone H3 lysine 9 creates a binding site for HPI proteins. Nature 410: 116-120. Lee D.Y, Teyssier C, Strahl B.D., and Stallcup M.R. 2005. Role of protein methylation in regulation of transcription. Endocr. Rev. 26: 147-170. Lee M.G., Wynder C, Cooch N., and Shiekhattar R. 2005. An essential role for CoREST in nucleosomal histone 3 lysine 4 demethylation. Nature 437: 432-435. Li H., llin S., Wang W, Duncan E.M., Wysocka )., Allis CD., and Patel D.). 2006. Molecular basis for site-specific read-out of histone H3K4me3 by the BPTF PHD finger of NURE Nature 442: 91-95. Litt M.D., Simpson M., Gaszner M., Allis CO., and Felsenfeld G. 2001. Correlation between histone lysine methylation and developmental changes at the chicken beta-globin locus. Science 293: 2453-2455. Lo W.S., Duggan L., Emre N.C, Belotserkovskya R., Lane W.S., Shiekhattar R., and Berger S.L. 2001. Snfl-A histone kinase that works in concert with the histone acetyl transferase GenS to regulate transcription. Science 293: 1142-1146. Lo WS., Trievel R.C, Rojas ).R., Duggan L., Hsu ).Y, Allis CO., Marmorstein R., and Berger S.L. 2000. Phosphorylation of serine 10 in histone H3 is functionally linked in vitro and in vivo to Gcn5mediated acetylation at lysine 14. Mol. Cell 5: 917-926. Ma H., Baumann CT, Li H., Strahl B.D., Rice R., Jelinek M.A., Aswad D.W., Allis CD., Hager G.L., and Stallcup M.R. 2001. Hormonedependent, CARM I-directed, arginine-specific methylation of histone H3 on a steroid-regulated promoter. Curro BioI. 11: 1981-1985. Macdonald N., Welburn ).P., Noble M.E., Nguyen A., Yaffe M.B., Clynes D., Moggs ).G., Orphanides G., Thomson S., Edmunds ).W, et al. 2005. Molecular basis for the recognition of phosphorylated and phosphoacetylated histone H3 by 14-3-3. Mol. Cell 20: 199-211. Mahadevan L.C, Willis A.C, and Barratt M.). 1991. Rapid histone H3 phosphorylation in response to growth factors, phorbol esters, okadaic acid, and protein synthesis inhibitors. Cell 65: 775-783. Martin C and Zhang Y. 2005. The diverse functions of histone lysine methylation. Nat. Rev. Mol. Cell. BioI. 6: 838-849. Metivier R., Penot G., Hubner M.R., Reid G., Brand H., Kos M., and Gannon E 2003. Estrogen receptor-a directs ordered, cyclical, and combinatorial recruitment of cofactors on a natural target promoter. Cell 115: 751-763. Metzger E., Wissmann M., Yin N., Muller j.M., Schneider R., Peters A.H., Gunther T., Buettner R., and Schule R. 2005. LSDI demethylates repressive histone marks to promote androgen-receptordependent transcription. Nature 437: 436-439. Mito Y, Henikoff ).G., and Henikoff S. 2005. Genome-scale profiling of histone H3.3 replacement patterns. Nat. Genet. 37: 1090-1097. Mizuguchi G., Shen X., Landry)., Wu W.H., Sen S., and Wu C 2004. ATP-driven exchange of histone H2AZ variant catalyzed by SWRI chromatin remodeling complex. Science 303: 343-348. Nakayama )., Rice ).C, Strahl B.D., Allis CO., and Grewal S.l. 2001. Role of histone H3 lysine 9 methylation in epigenetic control of heterochromatin assembly. Science 292: 11 0-113. Narlikar G.)., Fan H.Y., and Kingston R.E. 2002. Cooperation between complexes that regulate chromatin structure and transcription. Cell 108: 475-487. Noma K., Allis CD., and Grewal S.l. 2001. Transitions in distinct histone H3 methylation patterns at the heterochromatin domain boundaries. Science 293: 1150-1155. Nowak S.). and Corces v.G. 2004. Phosphorylation of histone H3: A balancing act between chromosome condensation and transcriptional activation. Trends Genet. 20: 214-220.
Pal S., Yun R., Datta A., Lacomis L., Erdjument-Bromage H., Kumar )., Tempst P., and Sif S. 2003. mSin3A/histone deacetylase 2- and PRMT5-containing Brgl complex is involved in transcriptional repression of the Myc target gene cad. Mol. Cell. BioI. 21: 7475-7487. Pavri R., Zhu B., Li G., Trojer E, Mandal S., Shilatifard A., and Reinberg D. 2006. Histone H2B monoubiquitination functions cooperatively with FACT to regulate elongation by RNA polymerase II. Cell 125: 703-717. Peterson CL. 2002. Chromatin remodeling: Nucleosomes bulging at the seams. Curro BioI. 12: R245-R247. Raisner R.M., Hartley P.D., Meneghini M.D., Bao M.Z., Liu CL., Schreiber S.L., Rando 0.)., and Madhani H.D. 2005. Histone variant H2A.Z marks the 5' ends of both active and inactive genes in euchromatin. Cell 123: 233-248. Rea S., Eisenhaber E, O'Carroll D., Strahl B.D., Sun Z.W., Schmid M., Opravil S., Mechtler K., Ponting CP., Allis CD., and )enuwein T 2000. Regulation of chromatin structure by site-specific histone H3 methyltransferases. Nature 406: 593-599. Reinberg D. and Sims R.)., III. 2006. de FACTo nucleosome dynamics. /. BioI. Chem.(in press). Reinke H. and Horz W. 2003. Histones are first hyperacetylated and then lose contact with the activated PHOS promoter. Mol. Cell 11: 1599-1607. Robzyk K., Recht )., and Osley M.A. 2000. Rad6-dependent ubiquitination of histone H2B in yeast. Science 287: 501-504. Roth S.Y, Denu ).M., and Allis CD. 2001. Histone acetyltransferases. Annu. Rev. Biochem. 70: 81-120. Sanders S.L., Jennings )., Canutescu A., Link A.)., and Weil P.A. 2002. Proteomics of the eukaryotic transcription machinery: Identification of proteins associated with components of yeast TFIID by multidimensional mass spectrometry. Mol. Cell. BioI. 22: 4723-4738. Santisteban M.S., Kalashnikova T, and Smith M.M. 2000. Histone H2A.Z regulates transcription and is partially redundant with nucleosome remodeling complexes. Cell 103: 411-422. Santos-Rosa H., Schneider R., Bannister A.)., Sherriff )., Bernstein B.E., Emre N.C, Schreiber S.L., Mellor )., and Kouzarides T 2002. Active genes are tri-methylated at K4 of histone H3. Nature 419: 407-411. Sassone-Corsi P., Mizzen CA., Cheung P., Crosio C, Monaco L., Jacquot S., Hanauer A., and Allis CD. 1999. Requirement of Rsk-2 for epidermal growth factor-activated phosphorylation of histone H3. Science 285: 886-891. Schreiber S.L. and Bernstein B.E. 2002. Signaling network model of chromatin. Cell 111: 771-778. Schurter B.T, Koh S.S., Chen D., Bunick G.)., Harp ).M., Hanson RL., Henschen-Edman A., Mackay D.R., Stallcup M.R., and Aswad D.W. 2001. Methylation of histone H3 by coactivator-associated arginine methyltransferase 1. Biochemistry 40: 5747-5756. Sekinger E.A., Moqtaderi Z., and Struhl K. 2005. Intrinsic histone- DNA interactions and low nucleosome density are important for preferential accessibility of promoter regions in yeast. Mol. Cell 18: 735-748. Shi Y., Lan E, Matson C, Mulligan E, Whetstine ).R., Cole EA., Casero R.A., and Shi Y. 2004. Histone demethylation mediated by the nuclear amine oxidase hom*olog LSD1. Cell 119: 941-953. Shi Y.)., Matson C, Lan E, Iwase S., Baba T, and Shi Y 2005. Regulation of LSD 1 histone demethylase activity by its associated factors. Mol. Cell. 19: 857-864. Shiio Y and Eisenman R.N. 2003. Histone sumoylation is associated with transcriptional repression. Proc. Natl. Acad. Sci. 100: 13225-13230.
C H ROM A TIN
MOD I Fie A T ION 5
Shim E.Y, Woodco*ck C, and Zaret K.S. 1998. Nucleosome positioning by the winged helix transcription factor HNF3. Genes Dey. 12: 5-10. Shogren-Knaak M., Ishii H., Sun T.M., Pazin M.J., Davie T.R., and Peterson CL. 2006. Histone H4-K16 acetylation controls chromatin structure and protein interactions. Science 311: 844-847. Sims. R.T., Belotserkovskaya R., and Reinberg D. 2004. Elongation by RNA polymerase II: The short and long of it. Genes Dev. 18: 2437-2468. Sims R.T., Chen CE, Santos-Rosa H., Kouzarides T., Patel 5.5., and Reinberg D. 2006. Human but not yeast CHD 1 binds directly and selectively to histone H3 methylated at lysine 4 via its tandem chromodomains.]. BioI. Chern 51: 41789-41792. Soloaga A., Thomson 5., Wiggin G.R., Rampersaud N., Dyson M.H., Hazzalin CA., Mahadevan L.C, and Arthur T.S. 2003. MSK2 and MSKI mediate the mitogen- and stress-induced phosphorylation of histone H3 and HMG-14. EMBO]. 22: 2788-2797. Sterner D.E. and Berger S.L. 2000. Acetylation of histones and transcription-related factors. Microbial. Mol. BioI. Rev. 64: 435-459. Strahl B.D. and Allis CD. 2000. The language of covalent histone modifications. Nature 403: 41-45. Strahl B.D., Briggs S.D., Brame CT., Caldwell T.A., Koh 5.5., Ma H., Cook R.G., Shabanowitz J., Hunt D.E, Stallcup M.R., and Allis CD. 2001. Methylation of histone H4 at arginine 3 occurs in vivo and is mediated by the nuclear receptor coactivator PRMT1. Curro BioI. 26: 996-1000. Sun Z.W. and Allis CD. 2002. Ubiquitination of histone H2B regulates H3 methylation and gene silencing in yeast. Nature 418: 104-108. Svaren J., Schmitz T., and Horz W. 1994. The transactivation domain of Ph04 is required for nucleosome disruption at the PH05 promoter. EMBO]. 13: 4856-4862. Taunton T., Hassig CA., and Schreiber S.L. 1996. A mammalian histone deacetylase related to the yeast transcriptional regulator Rpd3p. Science 272: 408-411. Trewick S.C, McLaughlin P.T., and Allshire R.C 2005. Methylation: Lost in hydroxylation? EMBO Rep. 6: 315-320. Tsukada Y., Fang ]., Erdjument-Bromage H., Warren M.E., Borchers CH., Tempst P., and Zhang Y 2006. Histone demethylation by a family of TmjC domain-containing proteins. Nature 439: 811-816. Turner B.M. 2000. Histone acetylation and an epigenetic code. Bioessays 22: 836-45. Vakoc CR., Mandat S.A., Olenchock B.A., and Blobel G.A. 2005. Histone H3 lysine 9 methylation and HPly are associated with transcription elongation through mammalian chromatin. Mol. Cell 19: 381-391. Vettese-Dadey M., Grant P.A., Hebbes T.R., Crane-Robinson C, Allis CO., and Workman J,L. 1996. Acetylation of histone H4 plays a primary role in enhancing transcription factor binding to nucleo-
AND
THE I R
M E C HAN ISM
0 F ACT ION
•
209
somal DNA in vitro. EMBO]. 15: 2508-2518. Volpe T.A., Kidner C, Hall I.M., Teng G., Grewal 5.1., and Martienssen R.A. 2002. Regulation of heterochromatic silencing and histone H3 lysine-9 methylation by RNAi. Science 297: 1833-1837. Wang H., Wang L., Erdjument-Bromage H., Vidal M., Tempst P., Tones R.S., and Zhang Y. 2004. Role of histone H2A ubiquitination in Polycomb silencing. Nature 431: 873-878. Wang H., Huang Z.Q., Xia L., Feng Q., Erdjument-Bromage H., StraW B.D., Briggs S.D., Allis CD., Wong T., Tempst P., and Zhang Y. 2001. Methylation of histone H4 at arginine 3 facilitating transcriptional activation by nuclear hormone receptor. Science 293: 853-857. Wang Y, Wysocka T., Sayegh T., Lee YH., Perlin T.R., Leonelli L., Sonbuchner L.S., McDonald CH., Cook R.G., Dou Y., et al. 2004. Human PAD4 regulates histone arginine methylation levels via demethylimination. Science 306: 279-283. Whetstine T.R., Nottke A., Lan E, Huarte M., Smolikov 5., Chen Z., Spooner E., Li E., Zhang G., Colaiacovo M., and Shi Y 2006. Reversal of histone lysine trimethylation by the TMTD2 family of histone demethylases. Cel/125: 467-481. Wood A., Krogan N.T., Dover J., Schneider T., Heidt T., Boateng M.A., Dean K., Golshani A., Zhang Y., Greenblatt T.E, et al. 2003. BreI, an E3 ubiquitin ligase required for recruitment and substrate selection of Rad6 at a promoter. Mol. Cel/11: 267-274. Workman T.L. and Roeder R.G. 1987. Binding of transcription factor TFIID to the major late promoter during in vitro nucleosome assembly potentiates subsequent initiation by RNA polymerase II. Cel/51: 613-622. Wysocka J., Allis CD., and Coonrod S. 2006a. Histone arginine methylation and its dynamic regulation. Front. Biosci. 11: 344-355. Yamane K., Toumazou C, Tsukada Y, Erdjument-Bromage H., Tempst P., Wong T., and Zhang Y 2006. THDM2A, a ]mjC-containing H3K9 demethylase, facilitates transcription activation by androgen receptor. CeLL 125: 483-495. Yang X.T. and Seto E. 2003. Collaborative spirit of histone deacetylases in regulating chromatin structure and gene expression. Curro Opin. Genet. Dey. 13: 143-153. Zhang H., Roberts D.N., and Cairns B.R. 2005. Genome-wide dynamics of Htzl, a histone H2A variant that poises repressed/basal promoters for activation through histone loss. CeLL 123: 219-231. Zhang Y and Reinberg D. 2001. Transcription regulation by histone methylation: Interplay between different covalent modifications of the core histone tails. Genes Dey. 15: 2343-2360. Zhu B., Zheng Y, Pham A.D., MandaI 5.5., Erdjument-Bromage H., Tempst P., and Reinberg D. 2005. Monoubiquitination of human histone H2B: The factors involved and their roles in HOX gene regulation. Mol. Cel/20: 601-611.
c
H
A
p
T
E
11
R
Transcriptional Silencing by Polycomb Group Proteins Ueli Grossniklaus 1 and Renata Paro 2 IInstitute of Plant Biology and Zurich-Basel Plant Science Center, University of Zurich, CH-8008 Zurich, Switzerland 2ZMBH, University of Heidelberg, D-69120 Heidelberg, Germany
CONTENTS 1. Introduction, 213
3.2
Targeting PRCl to Silenced Genes, 222
1. 1
The Concept of Cellular Memory, 213
3.3
1.2
Genetic Identification of the Polycomb Group, 213
Establishment of Repressive Functions by PRCl,223
3.4
Preventing Heritable Repression by Anti-silencing, 224
2. Establishing Silencing Marks on Chromatin, 215 2.1 2.2 2.3
Components and Evolutionary Conservation of PRC2, 215
4.1
Chromatin-modifying Activity of PRC2, 219
From Gene to Chromosome Repression, 224
4.2
Dynamic Function of PRO during Development, 21 9
Consequences of Aberrant Transcriptional Activation 225
4.3
Maintaining Stem Cell Fate, 226
3. Maintaining Transcriptional Silencing, 220 3.1
4. PcG Repression in Mammalian Development, 224
Components of PRCl, 220
5. Conclusion and Outlook, 227 References, 228
Title figures reprinted, with permission, from Yamamoto et al. 1997 (©Company of Biologists Ltd.).
211
GENERAL SUMMARY The organs of humans, animals, and plants are constructed from a large pool of distinct cell types, each performing a specialized physiological or structural function. With very few exceptions, all cell types contain the same genetic information encoded in their DNA. Thus, the distinctiveness of a given cell type is achieved through specific gene expression programs. However, cell lineages need to have these programs of gene expression maintained during growth and cell division. This implies the existence of a memory system that ensures a faithful transmission of information for which gene has to be active or repressed from mother to daughter cells. The existence of such a system is illustrated by the fact that cultured tissues of plants and animals usually maintain their differentiated characters even if grown in a foreign environment. By way of example, ivy plants regenerated after tissue culture produce the type of leaf corresponding to the phase of development from which the original tissue was taken (i.e., juvenile or adult leaf). The major question to be addressed in this and the following chapter concerns the molecular identity of factors contributing to the mechanism(s) which maintains determined states over many cell divisions (a process termed "cellular memory" or "transcriptional memory"). Genetic analyses in Drosophila have identified regulators crucial in maintaining the fate of individual body segments that are determined by the action of the HOX genes. In Drosophila males, the first thoracic segment has legs with sex combs. Legs on the second and third thoracic segment lack these structures (see the left panel of the title images). In the 1940s, Drosophila mutants were identified (Polycomb and extra sex combs) where males had sex combs on all legs (see the right panel of the title images). They correspond to homeotic transformations
of the second and third leg identities into the first leg identity. Genetic and molecular studies showed that these mutations did not affect the products of the HOX genes themselves, but rather the way HOX gene activity was spatially controlled. Over the years, a large number of such regulatory genes were identified, which could be classified into two antagonistic groups, the Polycomb (PcG) and trithorax (trxG) groups. Whereas the PcG proteins are required to maintain the silenced state of developmental regulators such as the HOX genes, the trxG proteins are generally involved in maintaining the active state of gene expression. Thus, PcG and trxG proteins form the molecular basis for cellular memory. Proteins of the PcG and trxG are organized into large multimeric complexes that act on their target genes by modulating chromatin structure. In this chapter, we focus on the molecular nature and function of two major Polycomb Repressive Complexes, PRC1 and PRC2; the molecular nature of the trxG complexes is described in the next chapter. PcG complexes are recruited to target genes through a DNA sequence called a PcG Response Element (PRE) in Drosophila. Once recruited, they establish a silent chromatin state that can be inherited over many cell divisions. Members of PRC2 are conserved between plants and animals, whereas PRC1 proteins are only present in Drosophila and vertebrates. This implies conservation but also diversity in the basic building blocks of the cellular memory system. In addition to their function in the maintenance of cell types, PcG complexes may also play important roles in stem cell plasticity. Their deregulation can lead to neoplastic transformation and cancer in vertebrates. Thus, PcG proteins are crucial for many fundamental processes of normal development and disease in multicellular eukaryotes.
214
C HAP T E R
/ /
Figure 2. Homeotic Transformations in PeG Mutants of Various Species (a-d) Drosophila melanogaster, (e, f) Mus musculus, (g, h) Arabidopsis thaliana. (a, b) Leg imaginal discs undergoing a transdetermination event as indicated by the expression of the wing-specific gene vestigial (which is marked by GFP). (c, d) Cuticles of a wild-type (c) and a Su(z)/2 mutant embryo (d). In the Su(z)/2 mutant embryo, all abdominal, thoracic, and several
head segments (not all visible in this focal plane) are homeotically transformed into copies of the eighth abdominal segment due to misexpression of the Abd-B gene in every segment. (e, f) Axial skeleton of newborn wild-type (e) and Ring/A-/mice (f). Views of the thoracic regions of cleared skeletons showing bone (red) and cartilage (blue). The mutant displays anterior transformation of the eighth thoracic vertebra as indicated by the presence of eight (1-8) vertebrosternal ribs, instead of seven (1-7) as in the wild type. (g, h) Wild-type (g) and c1f-2 mutant (h) flowers. The wild-type flower shows the normal arrangement of sepals, petals, stamens, and carpels. In the c1f-2 flower, petals are absent or reduced in number. (a,b, Courtesy of N. Lee and R. Paro; c,d, reprinted, with permission, from Birve et al. 2001 [©Companyof Biologists Ltd.]; e,f, reprinted, with permission, from del Mar Lorente et al. 2000 [©Company of Biologists Ltd.]; h, courtesy of J. Goodrich.)
Drosophila, the actIvIty of maternally (i.e., inherited through the oocyte) and zygotically produced transcription factors generates a specific combination of HOX expression required for each segment. This segmentspecific profile of HOX gene activity is maintained throughout the development of the fly, long after the early transcriptional regulators have disappeared. When the function of HOX genes was genetically characterized, many trans-acting regulators were isolated. Among them, Polycomb (Pc) was identified and genetically analyzed by Pam and Ed Lewis (Lewis 1978). Heterozygous Pc mutant males have additional sex combs on the second and third legs, whereas wild-type males only carry sex combs on the first leg (see title figure). hom*ozygous mutants are embryonic lethal, exhibiting a transformation of all cuticular segments toward the most posterior abdominal segment (Fig. 2c,d). These classic PcG phenotypes were interpreted as being caused by ectopic expression of HOX genes. Thus, Pc and the other genes with similar phenotypes were defined as repressors of HOX gene activity. Detailed analyses subsequently uncovered the fact that the PcG proteins are only required for the maintenance of HOX repression, rather than the position-specific establishment of HOX activity
during pattern formation. This latter task is performed by the transcription factors encoded by the early acting segmentation genes. Based on their repressing or activating effect on HOX gene expression, these newly identified trans-acting regulators were divided into two antagonistic classes, the PcG and trxG, respectively (Fig. 1) (Kennison 1995). The molecular isolation of Drosophila PcG genes has made it possible to study the function of vertebrate orthologs in mice, where they were also found to be key regulators of HOX gene expression (van der Lugt et al. 1994; Core et al. 1997). In mammals, mutations in PcG genes lead to homeotic transformations of the vertebrae (Fig. 2e,f). In addition, PcG genes playa crucial role in the control of cell proliferation, stem cell maintenance, and cancer (see Sections 4.2 and 4.3). The remarkable conservation ofPcG genes between flies and mammals has facilitated biochemical analyses and led to the identification of some novel members of PcG complexes, e.g., the RING 1 protein (Satijn and Otte 1999). Targeted mutation of RlNG1a in the mouse, for instance, led to the classic homeotic transformation phenotype. Only subsequently was it found to correspond to the PcG gene Sex combs extra in Drosophila.
T RAN S C RIP T ION A LSI LEN C I N G
In two other model organisms, namely the worm Caenorhabditis elegans and the flowering plant Arabidopsis thaliana, the molecular characterization of mutants isolated in various genetic screens revealed the existence of other PcG protein orthologs. In C. elegans, PcG members were identified in screens for maternal effect sterile (mes) mutants and were shown to be involved in X-chromosome silencing in the hermaphrodite germ line (Fong et a1. 2002; see Chapter 15). In Arabidopsis, PcG genes were identified in several genetic screens investigating distinct developmental processes (Hsieh et a1. 2003). The first PcG gene in plants, CURLY LEAF (CLF) , was identified as a mutant with homeotic transformations of floral organs (Fig. 2g,h) (Goodrich et a1. 1997). Mutations in the FERTILIZATIONINDEPENDENT SEED (PIS) class of genes were found in screens for mutants showing maternal-effect seed abortion (Grossniklaus et a1. 1998), or allowing aspects of seed development to occur in the absence of fertilization (Luo et a1. 1999; Ohad et a1. 1999). Finally, PcG genes were identified in screens for flowering time mutants, e.g., mutants that flower directly after germination (Yoshida et a1. 2001) or that disrupt the vernalization response, i.e., the process rendering plants competent to flower after prolonged exposure to cold (Gendall et a1. 2001). The variety of processes regulated by PcG proteins illustrates the importance of maintaining the repressed state of key developmental regulators in different organisms. On the one hand, there is an amazing conservation of some biological functions from plants to mammals, e.g., the regulation of key developmental regulators such as homeotic genes or involvement in the tight regulation of cell proliferation. On the other hand, PcG complexes appear to be versatile and dynamic molecular modules that have been employed to control a large variety of developmental and cellular processes. 2 Establishing Silencing Marks on Chromatin
PcG proteins fall into two biochemically characterized classes, which form the Polycomb repressive complexes 1 and 2 (PRCI and PRC2). The two complexes are required for consecutive steps in the repression of gene expression. First, PRC2 has histone-modifying activity and methylates H3K27 and/or H3K9 at genes targeted for silencing. PRCI components can then recognize and bind to such modifications and induce appropriate structural changes in chromatin. Whereas PRC2 proteins are present in all multicellular model species, PRCI components have not been identified in C. elegans and Arabidopsis.
B Y POL
yeo
M B G R0 U P
PRO TEl N S
•
215
2.1 Components and Evolutionary Conservation of PRC2
Several variants of PRC2 have been purified from Drosophila embryos, but all of these complexes contain four core proteins (Levine et a1. 2004): the SET histonemethyltransferase Enhancer of Zeste (E(Z)), the WD40 protein ESC, the histone-binding protein p55, and Suppressor of Zestel2 (SU(Z)12) (Table 1 and Fig. 3). Based on this composition, PRC2 was originally referred to as the E(Z)-ESC complex. This section highlights the molecular and biochemical details known about the different PRC2 components identified to date in different model organisms. The E(z) gene encodes a 760-amino acid protein, containing a SET domain that confers histone lysine methyltransferase (HKMT) activity. The SET domain is preceded by a CXC or Pre-SET domain (Tschiersch et a1. 1994), which contains nine conserved cysteines that bind three zinc ions and is thought to stabilize the SET domain. Such a structural role is supported by the fact that several temperature-sensitive E(z) alleles affect one of the conserved cysteines (Carrington and Jones 1996). In addition, E(z) contains SANT domains implicated in histone binding, and a C5 domain required for the physical interaction with SU(Z)12. ESC is a short protein of 425 amino acids that contains five WD40 repeats, shown to form a ~ propeller structure. This serves as a platform for protein-protein interactions, hence giving ESC a central role in PRC2, to physically interact with both E(z) and p55 in all model systems analyzed. The SU(Z)12 protein is 900 amino acids long and characterized by a C2 H 2 -type zinc finger and a carboxyterminal VEFS domain. The VEFS domain was identified as a conserved region between SU(Z) 12 and its three hom*ologs in plants: VRN2, EMF2, FIS2 (see Fig. 3). Several mutant Su(z)12 alleles alter this domain, showing it is required for the interaction with the C5 domain of E(Z) (Chanvivattana et a1. 2004; Yamamoto et a1. 2004). The p55 protein was not identified as a PcG member by genetic approaches, possibly because it takes part in a multitude of other protein complexes associated with chromatin (Hennig et a1. 2005). The p55 protein was, however, identified biochemically as part of PRC2. It is 430 amino acids long and contains six WD40 repeats, which physically interact with ESC or its orthologs in mammals and plants (Tie et a1. 2001; Kohler et a1. 2003a). In addition to the core PRC2 proteins, some variants of the complex contain the RPD3 histone deacetylase
216 •
C HAP T E R 1 1
(HDAC) or the Polycomb-like (PCL) protein. The interaction with RPD3 is noteworthy, because histone deacetylation is correlated with a repressed state of gene expression (see Chapter 10). The different compositions of PRC2 likely reflect dynamic changes during development or tissue-specific variants. PRC2 is highly conserved in invertebrates, vertebrates, and plants (Fig. 3). In C. elegans, only hom*ologs of E(Z) and ESC are present: MES-2 and MES-6. Together with another nonconserved protein, MES-3, they form a small complex of about 230 kD required for X-chromosome silencing in the hermaphrodite germ line (see Chapter 15). In plants and mammals, all four core proteins of PRC2 are present. As in Drosophila, the mammalian complex is about 600 kD and plays a role not only in regulating homeotic gene expression, but also in the control of cell proliferation, X-chromosome inactivation, and
imprinted gene expression (see Section 4 and relevant chapters for more detail). In plants, several genes encoding PRC2 components have undergone duplications such that they now are present as small gene families. In Arabidopsis there is only one hom*olog of ESC, FERTILIZATION-INDEPENDENT ENDOSPERM (FIE), but three hom*ologs of E(Z), three hom*ologs ofSU(Z) 12, and five hom*ologs of p55 (referred to as MSIl-5) (Table 1). Varying combinations of these proteins form at least three distinct complexes that control specific developmental processes (Figs. 3 and 4) (Reyes and Grossniklaus 2003; Chanvivattana et al. 2004). The best studied of these complexes is formed by members of the FERTILIZATION-INDEPENDENT SEED (PIS) class, which playa crucial role in the control of cell proliferation in the seed (Grossniklaus et al. 2001). This FIS or MEA-FIE complex contains MEDEA, FIE, FIS2,
Table 1. Core PcG genes in model systems
M. musculus
D. melanogaster
A. thaliana
C. elegans
PcG DNA-binding proteins phD
Pleiohomeotic
zinc finger
phol
Pleiohomeotic-like
zinc finger
Psq
Pipsqueak
BTB-POl domain
Dsp1
Dorsal Switch Protein 1
HMG domain protein
esc
Extra sex combs
WD 40 repeats
Eed
FIE
MES-6
E(z)
Enhancer of zeste
SET domain
Ezh1/Enx2, Ezh2/Enxl
CLF MEA SWN
MES-2
Yy1
HMGB2
PRC2 core complex
Su(z) 12
p55
Suppressor of zeste12
p55
zinc finger VEFS box
mSU(Z) 12
histone-binding domain
RbAp48 RbAp46
FIS2 VRN2 EMF2 MSI1 (MS/2,3,4,5)
PRC1 core complex Pc
Polycomb
chromodomain
Cbx2/M33 Cbx4/MPc2 Cbx6 Cbx8/MPc3 Cbx7
Ph
Polyhomeotic
zinc finger SAM/SPM domain
Edr1/Mph I/Rae28 Edr2/Mph2
Psc
Posterior Sex Combs
zinc finger HTH domain
Bmil Rnf11 O/Zfp 144/ Me/18
dRing / Sce
dRing / Sex combs extra
RING zinc finger
Ring1/Ringla Rnf2/ Ring 1b
SOP-2
T RAN 5 C RIP T ION A LSI LEN C I N G
O. melanogaster
M. musculus
A. thaliana
B Y POL
yeo
M B G Ra U P
PRO TEl N 5
21 7
C. elegans
FIS complex
Figure 3. Conserved PRC2 Core Complexes
EMF complex
VRN complex
and MSIl. The FIS complex was found to regulate the genes encoding PHERES1 (PHE1), a MADS domain transcription factor; and MEIDOS, a hom*olog of Skp1, which in yeast plays a crucial role in the control of cell proliferation (Kohler et al. 2003b). Interestingly, the paternal allele of PHEl is expressed at higher levels than the maternal allele. This regulation of gene expression by genomic imprinting is under the control of the FIS complex, which specifically represses the maternal allele (Kohler et al. 2005). Thus, as outlined below, the FIS complex shares with its mammalian counterpart functions in regulating cell proliferation as well as imprinted gene expression. The EMF complex contains CLF and EMBRYONIC FLOWER2 (EMF2) (Chanvivattana et al. 2004). Mutations in either of these show weak homeotic transformations and an early flowering phenotype. The EMF complex is required to repress homeotic genes, whose combinatorial action determines the identity of floral organs (Goodrich et al. 1997). Thus, the EMF complex has a similar function in maintaining the repressed state of homeotic genes as PRC2 in Drosophila and vertebrates (Fig. 2). However, homeotic genes in plants do not encode homeodomain proteins, but rather other transcription factors belonging to the MADS-domain and the plant-specific AP2-domain families. Strong mutants of EMF2, however, have more severe phenotypes where their seedlings produce flowers directly after germination, bypassing the vegetative phase of development (Yoshida et al. 2001). Thus, the EMF complex plays a role both early in development, where it prevents immediate flowering, and later in floral organogenesis (Chanvivattana et al. 2004). At both stages, the
The core members of PRC2 in D. melanogaster, M. musculus, A. thaliana, and C. elegans are shown. In A. thaliana, an ancestral complex is proposed to have diversified into three variants with discrete functions in development. In C. elegans, the PRC2 core complex contains only three proteins: MES-3 does not have hom*ology with any other identified PRC2 protein. The colors indicate hom*ology, the contacts indicate interactions. (Adapted from Reyes and Grossniklaus 2003 and Chanvivattana et al. 2004.)
EMF complex represses floral homeotic genes such as AG and APETALA3 (AP3) (Fig. 4). The FIS class proteins, FIE and MSIl, have also been implicated in the control of homeotic gene expression (Figs. 3 and 4). Because mutations in both cause maternal-effect embryo lethality, this function was only revealed when partialloss-of-function alleles could be studied at later stages of development (Kinosh*ta et al. 2001; Hennig et aL 2003). Finally, the VRN complex plays a key role in a wellknown epigenetic process: vernalization (extended exposure to low temperature). Vernalization induces flowering in winter annuals, but the effect is only seen after many cell divisions (Fig. 4). A plant cell will remember that it was vernalized for many months, or even years, after the cold period. This cellular memory is maintained through passages in cell culture but not from one generation to the next (Sung and Amasino 2004a). The VERNALIZATION (VRN) genes mediate the response to vernalization. VRN2 was found to encode a SU(Z) 12 hom*olog (Gendall et al. 2001), which interacts with the plant E(Z) hom*ologs CLF and SWiNGER (SWN) in yeast two-hybrid assays (Chanvivattana et al. 2004). The transition to flowering is not only controlled by vernalization, but involves the perception of endogenous (developmental stage and age) as well as exogenous factors (day length, light conditions, temperature). Four pathways have been defined by genetic analyses: (1) The autonomous pathway constitutively represses flowering, (2) the photoperiod pathway accelerates flowering under long days, (3) the vernalization pathway induces flowering in response to exposure to cold temperature, and (4) gibberellins promote flowering. The flowering time
218
C HAP T E R
7 7
~:~e~:OPhyte ~opment
AGIAP3
ge,m;~ Figure 4. Involvement of Distinct PRC2s at Various Stages of Plant Development During the plant life cycle, distinct variants of PRC2 (see Fig. 3) control developmental progression. (A) A cleared wildtype ovule harboring the female gametophyte in its center. The FIS complex represses target genes that control proliferation of the central cell; as in all fis class mutants, this cell proliferates in the absence of fertilization. Around fertilization, MEA is also required to maintain a low level of MEA m expression, but this activity is independent of other components of the FIS complex. (B) Section of a wild-type seed harboring the embryo and endosperm, enclosed by the seed coat. After fertilization, the FIS complex is involved in the control of cell proliferation in embryo and endosperm. It maintains a low level of expression of PHE7 and is required to keep the paternal MEAP allele silent. (C) Wild-type (right) and emf2 mutant (left) seedling 21 days after germination. The emf2 seedling produced a flower with homeotic transformations but no leaves. The EMF complex prevents flowering and represses floral homeotic genes such as AG, AP3, and others. (0) Vernalized (right) and non-vernalized (left) plants, the latter being characterized by a prolonged vegetative phase and the production of many leaves. During the vegetative phase of development, exogenous and endogenous signals induce flowering. Vernalization leads to the repression of the floral repressor FLC and thus promotes flowering. The maintenance of this repression depends on the VRN complex. (E) Wild-type Arabidopsis flower. During flower organogenesis, the EMF complex regulates floral homeotic genes that determine the identity of floral organs. (A, Courtesy of j.M. Moore and U. Grossniklaus; B, courtesy of J.-P. Vielle-Calzada and U. Grossniklaus; C, reprinted, with permission, from Moon et al. 2003 [©ASPB]; 0, reprinted, with permission, from Sung and Amasino 2004a [©Elsevier]; E, reprinted, with permission, from Page and Grossniklaus 2002 [©Macmillan].)
T RAN S C R / P T / 0 N A L S / LEN C / N G
gene FIC, which contains a MADS box, is a key integrator of the flowering response: It represses flowering. FIC expression is reduced by both the vernalization and the autonomous pathway. Whereas the initial repression of FIC is independent of the VRN complex, the maintenance of repression requires VRN2, which alters chromatin organization at the FIC locus (Gendall et al. 2001). Note that one of the components of the autonomous pathway is a p55 hom*olog, FVE (or MSI4), which affects flowering time response but does not act in the vernalization pathway (Ausin et al. 2004; Kim et al. 2004). Because no biochemical studies on the VRN complex have been reported, its exact composition is currently unknown (Figs. 3 and 4). 2.2 Chromatin-modifying Activity of PRCl
How does PRC2 mediate its repressive effect? Several proteins of the PcG and trxG have SET domains, including the PRC2 component E(Z). The discovery that SET domain proteins possess HKMT activity (Rea et al. 2000) suggested an involvement of histone methylation in PcG function. Indeed, mammalian and Drosophila PRC2 complexes were shown to methylate histone H3 at lysine 27 (H3K27) and, to a lesser extent, H3K9 both in vivo and in vitro (Cao et al. 2002; Czermin et al. 2002; Kuzmichev et al. 2002; Muller et al. 2002). These histone marks are usually associated with a transcriptionally silent state. Furthermore, H3K9 and H3K27 methylation has been associated with repressed homeotic genes of the bithorax complex (Miiller et al. 2002). However, only H3K27 methylation was lost in E(z) mutants, stressing the importance of H3K27 methylation in PcG silencing. Unlike the SU(VAR)3-9 protein, which methylates H3K9 on its own, E(Z) proteins on their own do not have H3K27 HKMT activity. The smallest complex acting as a HKMT also requires ESC and SU(Z) 12, which may have modulating functions. It was recently shown that PRC2 complexes can also methylate H1K26 (Kuzmichevet al. 2004). Distinct isoforms of the mammalian ESC hom*olog, Eed, determine the specificity of mammalian PRC2 for H1K26 versus H3K27 methylation (Kuzmichev et al. 2004). However, the functional relevance of H1K26 methylation for PcG silencing remains unclear. In plants, the HKMT activity of PRC2 complexes has not yet been demonstrated in vitro. However, studies of FIC regulation have shown that vernalization induces a loss of acetylation and an increase of H3K9 and H3K27 methylation, mainly in the first intron of the gene (Bastow et al. 2004; Sung and Amasino 2004b). Both methylation marks were lost in vrn2 mutants, implicating the VRN complex in setting these repressive histone methyla-
BY
POL
yeo
M B
G R 0 UP
PRO T E / N S
•
219
tion marks. In two other mutants, vrnl and vernalization insensitive3 (vin3), only the H3K9me2 mark is missing. VRNl and VIN3 encode transcription factors of the B3domain and homeodomain families, respectively, but the exact molecular mechanism of their involvement in modifying chromatin is currently unclear. From numerous studies to date, the main function of PRC2 seems to involve HKMT activity, but there are other chromatin-modifying activities present in some PRC2 variants. The Rpd3 gene encodes a HDAC that has been implicated in PcG silencing (Tie et al. 2001). However, although rpd3 mutations enhance PcG phenotypes, they do not show the typical homeotic transformations by themselves. The fact that RPD3 is not present in all PRC2 preparations may thus reflect either a weak overall interaction, or a tissue- and stage-specific interaction with the PRC2 core components. The interaction of RPD3 with PRC2 represents an interesting partnership, as both HKMT and HDAC activities associate with silent chromatin, and in combination may reinforce transcriptionally silent states. 2.3 Dynamic Function of PRC2 during Development
As pointed out above, the PRC1 and PRC2 core complexes are associated with distinct factors that may playa role in recruiting PcG complexes to tissue-specific target loci or in modulating their activity (Otte and Kwaks 2003). The different steps of PcG repression shown in Figure 5 illustrate the stage-specific compositions.of PcG complexes during Drosophila embryogenesis. So far, it has been difficult to characterize differences with respect to distinct tissues or cell types in flies because whole embryonic extracts are usually used for biochemical purifications. Studies performed in mammals and plants, however, clearly show that PcG complexes have distinct memberships in specific tissues and that their composition changes during cellular differentiation (Chanvivattana et al. 2004; Kuzmichev et al. 2005; Baroux et al. 2006). In mammals, expression levels of PcG genes differ tremendously from one cell line to the next. PcG complexes may even differ between target genes in the same cell, suggesting a highly dynamic behavior at different developmental stages. In Drosophila, PcG proteins maintain repressed states of homeotic genes, established during early embryogenesis, thereby fixing developmental decisions. Once the silent state of a PcG target has been fixed, it will remain in that state for the remainder of an individual's life span. In plants, a similar situation may occur with the VRN com-
220 •
C HAP T E R
1 1
Activation of PRE regulated gene
PRE
b PRE
plex: Once vernalized, the target gene(s) will be permanently inactivated and only reset in the next generation. Other plant PRC2 complexes, however, seem to respond more quickly to developmental or environmental stimuli. For instance, one function of the PIS complex is to repress cell proliferation in the absence of fertilization. Upon fertilization, however, cell proliferation is rapidly induced, presumably through the derepression of PcG target genes. This indicates that PcG repression is the default state, which has to be overcome by some unknown mechanism to allow normal developmental progression. The inactivation of PcG complexes as part of the normal plant life cycle may explain the absence of PRC1 proteins in plants (Fig. 4). PRC1 plays an important role in the permanent, stable, and long-lasting inactivation of target genes. Such permanent inactivation would be detrimental to plant development, where often PcG repression is released upon appropriate stimuli.
3 Maintaining Transcriptional Silencing 3.1 Components of PRC1
e Figure 5. Sequence of Events Leading to the PcG-dependent Repressed State of Gene Expression in Drosophila Embryos The original gene expression state of a PRE-regulated gene is determined by the activity of transcriptional regulators, either transcriptional repressors (TR) or activators (TA). Transcription through the PRE prevents the establishment of the "OFF" state and leads to the trxG-dependent "ON" state (for details, see Fig. 8 in Chapter 12). (a-b) A nontranscribed PRE binds specific DNA-binding proteins (e.g., PHO, PHOL, DSP1, or GAF) that are involved in the recruitment of the early PcG complex containing proteins of both PRC1 and PRC2. (e) This early PeG complex marks chromatin by E(Z)dependent histone methylation. (d) Maintenance of the silent state occurs through interactions of the two distinct complexes, PRC1 and PRC2, in the absence of the original transcriptional repressor. Maintenance of PRC1 is stabilized through binding of H3K27me3 via the chromodomain of Pc. (e) PRC1 can compact chromatin, further establishing tightly condensed, silent chromatin.
The molecular analysis of the PcG gene products has revealed a structurally diverse group of chromatin-associated proteins. PRC1 contains four PcG proteins; Polycomb (PC), Polyhomeotic (PH), Posterior Sex Combs (PSC), and Ring 1 (dRing1/SCE) (see Table 1) (Francis et al. 2001). They occur in stoichiometric amounts, and additional partner proteins have been identified depending on the material used for purification. A related complex has been purified from mammalian cells, suggesting that these four subunits form the core of PRC1 (Levine et al. 2002). Immunostaining of Drosophila polytene chromosomes, using antibodies directed against PRe1 proteins, showed overlapping localization patterns, which indicated that these proteins cooperate at a defined and common set of target genes (Fig. 6a). Additionally, the approximately 100 bands observed on the chromosomes provided evidence that the HOX genes are just part of a larger regulatory network, including other gene targets subject to PcG silencing. The PC gene encodes a 390-amino acid protein containing a chromodomain at its amino-terminal end. This conserved motif has hom*ology with HP1, a Drosophila protein required for heterochromatin formation (Paro and Hogness 1991; see also Chapter 5). The chromodomain was subsequently found to bind to methyl moieties at H3K27 and H3K9 (Bannister et al. 2001; Fischle et al. 2003). Another conserved domain is present at the carboxy-terminal end. The conservation, as well as the occur-
T RAN 5 C RIP T ION A LSI LEN C I N G
8 Y POL
yeo
M8
G R0 U P
PRO TEl N 5
221
x 2L
t
ANT-C
b
• PC binding sites • predicted PREs
Figure 6. Targeting of PRCl to PREs on Polytene Chromosomes (a) Immunostaining of Drosophila polytene chromosomes to visualize the distribution of the PC protein. (b) Alignment of chromosome arms showing the overlap between predicted PRE sites on the Drosophila genome and the cytologically mapped PC-binding sites on polytene chromosomes. The two HOX gene clusters (ANT-C and BX-C) are prominent bind-
ing sites for PRCl s.
rence of several aberrations in mutant alleles, suggests an important but as-yet-unknown regulatory function in this part of the protein. The carboxyl terminus of PC is dispensable for targeting the protein to silenced genes (fulfilled by the chromodomain) but was found to interact in vitro with nucleosomes (Breiling et al. 1999). Whether this indicates an undiscovered recognition motif for another histone modification remains to be seen. For human Pc2, a SUMO E3 ligase activity has been demonstrated, pointing to SUMO modifications as important marks in the PcG silencing process (Kagey et al. 2003). The amino-terminal part of the PSC protein is conserved in the vertebrate proto-oncogene bmi-l and the tumor suppressor gene mel-18. This region contains a C3HC 4 ring finger motif, which may mediate protein-protein interactions. The ring finger motif has been implicated in subnuclear localization of Bmi-l/Mel-18, which is correlated with cellular transformations. In Drosophila, the polyhomeotic (ph) locus is duplicated, consisting of a proximal (ph-p) and a distal (ph-d) gene sharing extensive hom*ology. hom*ologous mouse PH proteins have been identified. All share a conserved single zinc finger and a SAM (also known as SEP or SPM) domain. This domain is also found in another PcG protein, Sex Combs on the Midleg (SCM). SAM domains are involved in protein-protein interactions, as it has been demonstrated that they participate in hom*o- or heterotypic interactions with other proteins. These findings support a possible function in generating large nuclear
complexes, required for silencing. Indeed, PcG proteins have been localized in subnuclear foci called PcG bodies, which might function as silencing compartments (Saurin et al. 1998). As mentioned above, dRING 1 was not initially recognized as a PcG member. Only biochemical purification uncovered the presence of this factor with a RING finger motif in PRCl, in which it is thought to playa structural role (Francis et al. 2001; Lavigne et al. 2004). The Ring1A and Ring1B proteins of mammalian PRC1 have been found to be associated with ubiquitylated H2A on the inactive X chromosome, and the maintenance of this histone mark was dependent on the Ring1 proteins (de Napoles et al. 2004; Fang et al. 2004; Cao et al. 2005; for more detail, see Section 4.1 and Chapter 17). These four proteins comprise the core structure of PRCl. However, other PcG proteins like SCM or the Zeste protein were found to be associated with the complex (OUe and Kwaks 2003). Their molecular function in PRCl remains unclear, as they seem to have additional roles in the nucleus; e.g., a transcriptional activator function of Zeste. Still other PcG genes were identified by virtue of their role as transcriptional regulators of the core PcG genes (Ali and Bender 2004). Namely, three PcG genes are upstream regulators of genes encoding PRC1 components. Negative feedback loops among PRC1 components, as well as positive regulation of PRC1 components by PRC2, further suggest a complicated cross-regulatory network among the PcG genes to ensure the fine-tuning of protein
222 • C HAP T E R
J J
levels in the complexes (Fig. 7a). Similarly, complex regulatory interactions have been described for the genes of the FIS complex in Arabidopsis (Baroux et al. 2006). 3.2 Targeting PRC1 to Silenced Genes
Transgene analyses of Drosophila homeotic gene clusters uncovered regulatory elements that are required for the maintenance of appropriate segment-specific expression of the HOX genes. These DNA elements-called Polycomb Response Elements or PREs-maintain the segment-specific expression of HOX genes beyond the embryonic ini-
a
Su(z)2-??
dRing1
E(PC)1t PSC Asx
ph
Pcl
tlatlOn phase. PREs attract proteins of the PRCI when integrated at ectopic sites in the polytene chromosomes, suggesting that they define sequence specificity for the recognition and anchoring of PRCIs to target genes. However, the issue of PcG targeting appears to be a complex one. The size of functionally characterized PREs ranges from a few hundred to several thousand base pairs, containing consensus binding sites for many different DNAbinding proteins, and usually two or more PREs are found at a given target locus. So far, all characterized PREs come from Drosophila, and no PREs have yet been defined in mammals or plants. Despite the complexity of PREs, four
b
tHOXgenes
Pc
PRE
( esc/E(z)
c
PRE 2 - "ON"
• +~ • +cell
division
PRE 1 - "OFF"
PRE 2 - "ON"
cell cycle
PRE 1 - "OFF"
progression
Figure 7. PRCl Regulation and Function during Cell Division (0) Cross-regulatory interactions among the PcG genes, as suggested from genetic evidence. E(Pc), Pel, and Asx are positive regulators of the core PRCl members acting upstream. PRC2 members Esc and E(z) act as positive regulators of Pc transcription. A negative feedback by core PRCl members on Psc and dRingJ, as well as on Su(z)2, is observed. The finetuning of gene product level is probably required for well-balanced processes based on chemical equilibrium. (b) Sequence-specific transcription factors (TF) tether components of PRCl to a PRE. A stable silencing complex requires anchoring of PRCl via the chromodomain of PC to neighboring methylated histone tails. (c) Possible model for how differential gene expression states can be inherited. The process of intergenic transcription places positive epigenetic marks (e.g., acetylated histone tails, histone variants) at PREs that control active genes (PRE 2). All other PREs are silenced by default (PRE l). During DNA replication and mitosis, only the positive epigenetic signal needs to be transmitted to the daughter cells, ensuring that in the next interphase intergenic transcription is restarted at PRE 2 before default silencing is reestablished at all other PREs. (0, Adapted from Ali and Bender 2004.)
T RAN S C RIP T ION A LSI LEN C I N G
consensus sequence motifs could be identified and were shown to playa role in Drosophila PRE function. One of these motifs (GCCAT) is bound by both the Pleiohomeotic (PHO) and Pleiohomeotic-like (PHOL) proteins, which have partially redundant functions. PHO and PHOL function in PcG targeting, as they are found in PcG complexes isolated from early embryonic extracts, coimmunprecipitate with members of both PRCI and PRC2, and bind PREs in vitro (Fig. Sa) (Poux et al. 2001). Recently, a role in PcG recruitment was also demonstrated for DSPI, which binds the GAAA motif found in many PREs (Dejardin et al. 2005). Finally, the trxG proteins Zeste and GAF (encoded by the Trithorax-like gene) may help to recruit PcG proteins to their targets. A newly developed algorithm, based on the finding that clustered pairs of GAF, Zeste, and PHO/PHOL sites characterize a PRE, predicts known PREs with high probability and thus can identify new potential PcG target genes in the Drosophila genome (Fig. 6b) (Ringrose et al. 2003). The family of PRE-controlled genes ranges from the wellknown developmentally important transcription factor genes required for pattern formation to genes encoding factors involved in cell cycle regulation and senescence. PRCI, once bound, interacts with neighboring histones to generate stable silencing complexes at PREs (Fig. 7b). The H3K27me3 marks provided by the PRC2 act as additional binding sites for the chromodomain of PC (Fig. 7c). In their absence, as shown by competition with a soluble methylated histone tail peptide, the PRCls dissociate from their target genes (Czermin et al. 2002; Ringrose et al. 2004; Wang et al. 2004). The discovery of the HKMT activity of PRC2 and the associated histone marks typical of silent chromatin has suggested a new mechanism for the establishment of PcG repression. Following PRC2-catalyzed modification of H3K27me3, PRCI binds through the chromodomain of the PC protein to stabilize silencing. This is corroborated by the findings that (1) H3K27me3 marks and PC colocalize on polytene chromosomes and (2) PC binding is lost in £(z) mutants, which lack HKMT activity that modifies H3 with H3K27me3 marks at PREs, serving to recruit PC to its targets (Fig. 5). Although such a model is certainly attractive, the situation at PREs seems more complex because PRC2s and PRCls do not act sequentially, but rather are present together on PREs in early embryogenesis (Fig. 5b, c). Thus, it seems likely that H3K27 methylation is a downstream event after PcG recruitment, but plays a crucial role in establishing the silenced state. The model described above shows parallels to heterochromatin formation, where the Heterochromatin Pro-
B Y POL
yeo M B G R 0
UP
PRO TEl N S
•
223
tein I (HP1) is recruited VIa its chromodomain to H3K9me marks generated by SU(VAR)3-9 (see Chapter 5). Thus, a productive silencing complex is targeted by transcription factors to defined DNA sequence elements but requires, in addition, an appropriately modified histone layer in the vicinity to generate a higher-order repressive chromatin structure (Fig. 5). During evolution, PREs have retained remarkably little sequence conservation. Even within closely related Drosophila species, the number, position, and composition of PREs vary substantially (L. Ringrose and R. Paro, unpubl.). This suggests that the sequence requirements as well as the position of the PREs are flexible and may be adapted to species-specific requirements. Nevertheless, the components of PRCI are highly conserved, and they presumably utilize the same basic molecular mechanism(s) to induce higher-order chromatin changes at silenced target genes.
3.3 Establishment of Repressive Functions by PRC1 The way in which PRE-bound PRCls interact with the promoter to prevent transcription is still unknown. The anchoring of paused RNA polymerase complexes at promoters, preventing initiation, has been attributed to PRE-PRCI interactions described for reporter constructs (Dellino et al. 2004). Additionally, PRCl was shown to counteract remodeling of nucleosomes in vitro and to induce a compact chromatin structure. Thus, PRCl potentially blocks the accessibility to DNA of transcription factors and other complexes required for transcription (Francis et al. 2004). Using the algorithm described above, PRE-like sequences are predicted to exist at almost all promoters of PcG-controlled Drosophila target genes. This suggests that PRCI occupation at both promoter and regulatory sites might foster interactions between PREs and promoters, and establish stably repressed chromatin structures unfavorable for transcription (Ringrose et al. 2003). The stability of silencing complexes, as demonstrated by anchoring via methylated histone tails, appears to be a hallmark of the long-term repressive function of the PcG proteins. However, when analyzed in vivo at the cellular level, a remarkably dynamic behavior is observed. PcG proteins cluster in PcG bodies, which vary in size and composition between cells, suggesting an interaction of silencing complexes in the nucleus in a developmentally regulated manner. Furthermore, dynamic in vivo analyses of GFPmarked PC and PH proteins uncovered a very high exchange rate of unbound proteins with their complexes at
224
•
C HAP T E R
7 7
silenced targets (Ficz et al. 2005). These results suggest that long-term repression is primarily based on a chemical equilibrium between bound and unbound proteins rather than on high-affinity protection of DNA-binding sites. 3.4 Preventing Heritable Repression by Anti-silencing
The binding of PRCls to PREs appears to be induced by default, as many of the anchoring PcG components and DNA-binding proteins are expressed in all cells, and PREs globally silence reporter genes in transgenic constructs. The counteracting proteins of the trxG do not, in fact, function as activators, but rather as anti-repressors (Klymenko and Mi.iller 2004; see Chapter 12). Thus, to maintain active transcription of a PRE-controlled gene, the silencing at that PRE has to be prevented in a tissue- and stage-specific manner. In Drosophila, for example, the activation of HOX genes is controlled by the early cascade of transcription factors encoded by the segmentation genes. Interestingly, these factors induce transcription not only of the HOX genes, but also of intergenic, noncoding RNAs that are transcribed through the associated PREs often found upstream or downstream (Fig. 5). It was demonstrated that transcription through PREs is required to prevent silencing and to maintain the active state of a reporter gene using transgenic constructs (Schmitt et al. 2005). The process of transcription most probably remodels PRE chromatin to generate an active state characterized, for instance, by a lack of repressive histone methylation and the presence of histone acetylation. Thus, even though the DNA-binding proteins attract PRC1 to this particular activated PRE, the histone environment does not allow anchoring of PC via H3K27me3, and no stable silencing will be established. Since silencing is induced by default in the PcG system, epigenetic inheritance of a differential gene expression pattern only requires the transmission of the active PRE state during DNA replication and mitosis (Fig. 7c). How this is achieved at the molecular level, and which epigenetic mark(s) is responsible for maintaining an active PRE state, are still open questions. Interestingly, it was recently shown that at a Drosophila PRE of the homeotic Ubx gene, noncoding RNAs produced at the PREs stay associated with chromatin and recruit the trxG regulator Absent Small or Homeotic discs 1 (ASH1). Destruction of these RNAs by RNAi attenuates ASH1 recruitment to the PRE, suggesting that this interaction plays an important role in the epigenetic activation of the homeotic genes, by overriding default PcG-induced silencing (SanchezElsner et al. 2006).
4 PeG Repression in Mammalian Development 4.1 From Gene to Chromosome Repression
Mutations in members of the murine PRC1 exhibit homeotic transformations of the axial skeleton. This can cause the appearance of additional vertebrae as a consequence of a derepression of HOX genes (Fig. 2e,f) (Core et al. 1997). In addition, the mutant mice display severe combined immunodeficiencies, caused by a lack of proliferative responses of hematopoietic cells (Raaphorst 2005). The role of PcG proteins has been particularly well studied in blood cells, in light of the fact that most bloodcell lineages are characterized by their well-described celltype-specific transcription programs. However, lineage commitment and restriction somehow need to be faithfully maintained through cell division. It turns out that in PcG knockout mice, B- and T-cell precursor populations are produced normally, indicating that PcG control is not involved in establishing lineage-specific gene expression patterns. PcG proteins, however, contribute to the irreversibility of the lineage choice, rather than being involved in the decision to follow a particular developmental pathway. Besides the control of the HOX genes, whose expression patterns characterize different blood-cell lineages, PcG proteins playa major role in controlling projiferation. The bmil gene, an ortholog of Drosophila Pse, was initially identified as an oncogene that, in collaboration with mye, induces murine lymphomagenesis (van Lohuizen et al. 1991). The Bmi1 protein controls the cell cycle regulators p161NK4a and p19 ARF (Jacobs et al. 1999). Both Bmi1 and the related protein Mel-18 are negative regulators of the INK4c-ARF locus required for normal lymphoid proliferation control. Misregulation of this important cell cycle checkpoint affects apoptosis and senescence in mice (Akasaka et al. 2001). Mammalian PcG proteins are also associated with the classic epigenetic phenomenon of X-chromosome inactivation (see Chapter 17). The inactivation of one X chromosome in female XX cells is accompanied by a series of chromatin modifications that involve PcG proteins (Heard 2004). In particular, components of PRC2, like the ESC hom*olog, Eed (Embryonic ectoderm expression), or the E(Z) hom*olog, Enx1 (Table 1), playa major role in the establishment of histone marks associated with transcriptional silencing. Transient association of this PRC2 with the X chromosome, coated by Xist RNA, is accompanied by H3K27 methylation. In contrast, eed mutant mouse embryos show no recruitment of the Enx1 HKMT, nor can any H3K27 methylation be
T RAN 5 C RIP T ION A LSI LEN C I N G
observed. However, the absence of these PRC2 components does not lead to a complete derepression of the entire inactive X chromosome; rather, the sporadic reexpression of X-linked genes and an increase in epigenetic marks associated with an active state (H3K9ac and H3K4me3) are observed in some cells. This is likely because other partially redundant epigenetic mechanisms are in place to ensure the maintenance of one inactive X chromosome. Recruitment of PRC2 to the inactive X chromosome appears to be dependent on Xist RNA. Because association of PRC2 to the inactive X is only transient, it appears that the complex is only required to set epigenetic marks (i.e., H3K27me3) for the maintenance of silencing. Currently, it is not known whether PRCI directly recognizes these marks and is required for the permanent silencing of the inactive X chromosome, but PRCI components are found to be associated with the inactive X chromosome. However, DNA methylation is known to accompany the maintenance phase and is required for permanent X inactivation. PRC2 is specifically involved in the regulation of monoallelic expression of the X chromosome both in the embryo, where X-chromosome inactivation is random, and in extraembryonic tissues, where the paternally inherited X chromosome is always inactivated (imprinted X-chromosome inactivation). In addition, it was recently found that PRC2 is involved in the regulation of some autosomal imprinted genes. For instance, an analysis of 14 imprinted loci from six unlinked imprinting clusters showed that four of these were biallelically expressed in eed mutant mice (Mager et al. 2003; for more detail, see Chapter 19). Interestingly, all loci that lost imprinted expression were normally repressed when paternally inherited, whereas none of the maternally repressed loci was affected. Because it was recently shown that Ezh2 directly interacts with the mammalian DNA methyltransferases and is required for their activity (Vin~ et al. 2006), it is possible that PRC2 plays a role in the regulation of these imprinted genes via DNA methylation (see Chapter 18). An involvement of PRC2 in the regulation of imprinted gene expression has also been reported in Arabidospis, where the PRE] locus is expressed at much higher levels from the paternal allele (Kohler et al. 2005). In mutants for the E(z) hom*olog MEA, the maternal PRE] allele is specifically derepressed. Similarly, MEA also regulates its own imprinted expression: Early in reproductive development, the maternal MEA allele is strongly derepressed in mea mutants. This effect, how-
B Y POL
yeo
M B
GR0 UP
PRO TEl N 5
•
225
ever, is independent of the other components of the FIS complex (Baroux et al. 2006). In contrast, later in development, the FIS complex ensures the stable repression of the paternal MEA allele (Baroux et al. 2006; Gehring et al. 2006; Jullien et al. 2006). In this latter case, the PIS complex is involved in the silencing of paternally repressed imprinted genes similar to the situation in mammals. In addition, MEA also has a role in keeping expression of the maternal PRE] and MEA alleles at low levels as described above (Fig. 4). Because PRC2 components are present in plants, invertebrates, and mammals, PRC2 represents an ancient molecular module suitable for gene repression that was already present in the unicellular ancestor of plants and animals, prior to the evolution of multicellularity. Thus, these examples suggest that PRC2 was recruited independently for the regulation of imprinted gene expression in plants and mammals, the two lineages where genomic imprinting evolved (Grossniklaus 2005). 4.2 Consequences of Aberrant Transcriptional Activation
The finding that Emil misregulation causes malignant lymphomas in mice raises the question of whether human BMIl (a PRCI component) itself contributes to the development of cancer in a similar fashion. There is accumulating evidence that altered PcG gene expression is widespread in human malignant lymphomas (Raaphorst 2005). For instance, the level of BMIl overexpression in B-cell lymphomas correlates with the degree of malignancy, suggesting that PRCI components do play a role in the development of human cancer. However, the target genes of BMIl in human cells appear to be different from those of mouse lymphocytes, as no obvious down-regulation of p161NK4a could be correlated to the overexpression of the oncogenes. PcG gene overexpression is not only observed in hematological malignancies, but is also found in solid tumors, including medulloblastomas, and tumors originating from liver, colon, breast, lung, penis, and prostate (Fig. 8). The high expression of a PRC2 marker, Ezh2, is often found in early stages of highly proliferative lung carcinomas. This suggests that the well-known cascade of PRC2 initiation and PRCI maintenance (Fig. 5) might also accompany the development of a tumor cell lineage. Interestingly, PRC2 components also play a crucial role in the control of cell proliferation in Arabidopsis. Although aberrant growth does not lead to cancer and death in plants, a strict control of cell proliferation is
226
C HAP T E R
7 7
type is shared with rbr1 mutants, providing a link to the Rb pathway. Remarkably, a connection between the Rb pathway and PRC2 has also been reported in mammals (Bracken et a1. 2003), illustrating conserved regulatory networks between plants and mammals. 4.3 Maintaining Stem Cell Fate
Figure 8. PRC2 Regulates Cell Proliferation in Mammals and Plants
(a, b) Plant embryos derived from wild-type and mea mutant egg cells. MEA encodes a protein of the FIS complex and regulates cell proliferation. The giant mea embryo is much larger than the corresponding wild-type embryo at the same stage of development (late heart stage). Mutant embryos develop more slowly and have approximately twice the number of cell layers. (c, d) Normal and cancerous prostate epithelium. In the cancerous epithelium, Ezh2 expression is highly increased (labeled with an anti-Ezh2 antibody). Thus, both loss of E(Z) function in plants and overexpression of E(Z) function in humans can lead to defects in cell proliferation. (e, f) Control and RING1 overexpressing rat 1a fibroblast cells. Overexpression of RING1 leads to anchorage-independent growth in soft agar, typical of neoplastically transformed cells. (a,b, Courtesy of J.-P. Vielle-Calzada and U. Grossniklaus; c,d, reprinted, with permission, from Kuzmichev et al. 2005 [©National Academy of Sciences]; e,f, reprinted, with permission, from Satijn and Otte 1999 [©American Society for Microbiology].)
essential for normal development. In mutants of the fis class, the two fertilization products of flowering plants, the embryo and endosperm, overproliferate, and the resulting seeds abort (Fig. 8) (GrossnikJaus et al. 2001; Hsieh et a1. 2003; Guitton and Berger 2005). Effects on cell proliferation are also observed in double mutants of elf and swn, two of the plant £(z) hom*ologs. Such plants undergo normal seed development after germination but produce a mass of proliferating, undifferentiated tissue (callus) rather than leaves (Chanvivattana et al. 2004). Although it is not known how PRC2 controls cell proliferation in plants, it is likely to involve interactions with RBR1, the plant hom*olog of the Retinoblastoma (Rb) protein (Ebel et a1. 2004; Mosquna et a1. 2004). Mutants in the PIS class of genes not only show proliferation defects during seed development after fertilization, but are also required to prevent proliferation of the endosperm in the absence of fertilization. This latter aspect of the pheno-
Stem cells play an ever-increasing role in medicine. Their potential to provide progenitors for the healing of damaged tissue places them into a well-treasured toolbox of regenerative medicine. Not surprisingly, it is in the very well characterized blood-cell lineage where we know most about the identity and location of stem cells. Hematopoietic stem cells (HSCs) maintain the pool of blood cells by self-renewing as well as by producing daughter cells that differentiate into the lymphoid, myeloid, and erythroid lineages. The stem cell niche in the adult bone marrow provides the cells with specific external signals to maintain their fate. On the other hand, cell-intrinsic cues for the maintenance of the "stem cellness" state seem to rely on the PcG system. Mouse mutants affecting PRCI genes (e.g., bmi1/mel18, mphl/rae28, and m33; see Table 1) suffer from various defects in the hematopoietic system, such as hyperplasia (i.e., increased cell proliferation) in spleen and thymus, reduction in Band T cells, and an impaired proliferative response of lymphoid precursors to cytokines. The requirements for Bmi1 and Mel-I8 in stem cell selfrenewal during different stages of development suggest a changing pool of target genes between embryonic and adult stem cells. The PcG system is also required for neural stem cells (NSCs) as indicated by the neuronal defects observed in bmi1 mouse mutants (Bruggeman et a1. 2005; Zencak et a1. 2005). In particular, the mice are depleted of cerebral NSCs postnatally, indicating an in vivo requirement of Bmil in NSC renewal. As was found for the hematopoietic system, it appears that embryonal NSC maintenance is under a different PcG network control than adult NSC self-renewal. External signals like the sonic-Hedgehog signaling cascade modulate the Bmil response in NSCs and ensure a proliferative/self-renewal capacity (Leung et a1. 2004). The identification of these external cues controlling PcG repression came through the analysis of the development of cerebellar granule neuron progenitors (CGNPs). A postnatal wave of proliferation is induced by the signaling factor Sonic hedgehog (Shh), secreted by the Purkinje cells. The Shh signal branches to control N-Myc and Bmi1 levels (Fig. 9). Thus, Bmi1-deficient CGNPs have a
T RAN 5 C RIP T faN A LSI LEN C I N G
Shh
1
cerebellar granule neuron
~/oo~ N-Myc
B Y POL yeo M B
G Ra U P
PRO TEl N 5
•
227
Conceivably, however, the reprogramming of plant cells, which are totipotent and have the potential to form a complete new organism under appropriate conditions, could involve PcG regulation. Indeed, plants lacking the £(z) hom*ologs eLF and SWN produce a mass of undifferentiated cells after germination, suggesting that PcG genes are required to maintain a differentiated state (Chanvivattana et al. 2004).
5 Conclusion and Outlook It has been remarkable to follow the development of our
Cyc-01 102
1
1
Rb
proliferation 1 self-renewal pathway in stem cells Figure 9. Sonic Hedgehog Signaling Maintains Proliferation/Self-renewal of Cerebellar Progenitor Cells The Shh signaling cascade regulates both the Rb pathway and the p53 pathway via Bmil control of the p16/p19 proliferation checkpoint. Inhibition of Smoothened (Smoh) by the Shh receptor Patched (Ptch) results in downstream signaling in the nucleus. One part of the signal induces N-Myc, Cyclin Dl, and D2, whereas the other part activates Bmil via the Gli effectors. (Adapted from ValkLingbeek et al. 2004.)
defective proliferative response upon Shh stimulation. The Shh signal is able to control proliferation of these stem cells ultimately by modulating both the downstream Rb pathway (via N-myc and Bmillpl6 INK4 ') and the p53 pathway (via Bmillpl9 ARF ). This mechanism explains why hyperactivation of Shh signaling leads to the development of medulloblastomas (Leung et al. 2004). HSCs are regulated by a similar Indian hedgehog-controlled pathway. In NSCs, expression of the Hoxd8, Hoxd9, and Hoxc9 loci is under the control of Bmil. The appropriate HOX expression profile confers the necessary stem cell fate. Indeed, because stem cells represent a defined and committed cellular fate, it is not surprising that the PcG system maintains this particular fate in a mitotically heritable fashion. In the future, it will be interesting to identify the pool of targets of the PcG system in the different stem cell populations, and to learn how to influence the maintenance system to allow a controlled reprogramming of stem cell fates. At the moment, it is not clear whether the PcG plays a role in stem cell maintenance in plants.
understanding of PcG epigenetic regulation from the initial genetic identification of a Drosophila mutant possessing additional sex combs on the second and third leg. This eventually led to the discovery of a new class of regulators found to be required for fundamental epigenetic processes such as vernalization in plants and silencing of the mammalian X chromosome. Control of genetic information is highly influenced by chromatin structure and composition of histones in their various modified forms. The proteins of the PcG are directly involved in generating epigenetic marks, for instance, H3K27me3, as a consequence of developmental decisions. The same group "reads" (i.e., shows high affinity to), through the action of the PRCl proteins, these epigenetic marks and translates them into a stable, transcriptionally repressed state. In the model organism Drosophila, we have a relatively clear picture of how PcG complexes are anchored at PREs, for a defined group of target genes that are subject to longterm repression. However, to date, no PREs have been identified in other organisms. Although the basic function of PcG proteins remains the same, it is unclear which part of the plant and vertebrate genomes is subjected to their repression and how they are targeted to their site of action. Additionally, we need to get a better understanding of how an apparently dynamic group of proteins can impose a stable state of transcriptional repression through a chemical equilibrium. The other major question of the PcG research focuses on the heritability of the repressed state, the very essence of epigenetics. What is the identity of the molecular marks required to transmit a state of gene expression through DNA replication and mitosis? We know that the cooperation of trxG and PcG proteins maintains active or silent states of gene expression. Do both states need a corresponding epigenetic mark that is transmitted to daughter cells, or is only one sufficient, while the other represents the default state? The mechanism by which PcG proteins impose silencing on transcription during
228 •
C HAP T E R
1 1
the interphase of the cell cycle has become increasingly clear. In the future, the focus of research will be on how the information regarding a state of gene expression endures the DNA replication process and is faithfully transmitted to the daughter cells following mitosis.
References Akasaka T., van Lohuizen M., van der Lugt N., Mizutani-Koseki Y., Kanno M., Taniguchi M., Vidal M., Alkema M., Berns A., and Koseki H. 2001. Mice doubly deficient for the Polycomb Group genes Me1l8 and Bmil reveal synergy and requirement for maintenance but not initiation of HOX gene expression. Development 128: 1587-1597. Ali J.Y. and Bender W. 2004. Cross-regulation among the Polycomb group genes in Drosophila melanogaster. Mol. Cell. BioI. 24: 7737-7747. Ausin 1., Alonso-Blanco c., Jarillo J.A., Ruiz-Garcia 1., and MartinezZapater J.M. 2004. Regulation of flowering time by FVE, a retinoblastoma-associated protein. Nat. Genet. 36: 162-166. Bannister A.J., Zegerman E, Partridge J.E, Miska E.A., Thomas J.O., Allshire R.C., and Kouzarides T 2001. Selective recognition of methylated lysine 9 on histone H3 by the HPI chromodomain. Nature 410: 120-124. Baroux c., Gagliardini v., Page D., and Grossniklaus U. 2006. Dynamic regulatory interactions of Polycomb group genes: MEDEA autoregulation is required for imprinted gene expression in Arabidopsis. Genes Dev. 20: 1081-1086. Bastow R., Mylne J.S., Lister c., Lippman Z., Martienssen R.A., and Dean C. 2004. Vernalization requires epigenetic silencing of FLC by histone methylation. Nature 427: 164-167. Birve A., Sengupta A.K., Beuchle D., Larsson J., Kennison J.A., Rasmuson-Lestander A., and MUller J. 2001. Su(z)12, a novel Drosophila Polycomb group gene that is conserved in vertebrates and plants. Development 128: 3371-3379. Bracken A.P., Pasini D., Capra M., Prosperini E., Colli E., and Helin K. 2003. EZH2 is downstream of the pRB-E2F pathway, essential for proliferation and amplified in cancer. EMBO f. 22: 5323-5335. Breiling A., Bonte E., Ferrari S., Becker P.B., and Paro R. 1999. The Drosophila Polycomb protein interacts with nucleosomal core particles in vitro via its repression domain. Mol. Cell. BioI. 19: 8451-8460. Bruggeman S.W.M., Valk-Lingbeek M.E., van der Stoop P.P.M., Jacobs J.J.L., Kieboom K., Tanger E., Hulsman D., Leung c., Arsenijevic Y., Marino S., and van Lohuizen M. 2005. Ink4a and Arf differentially affect cell proliferation and neural stem cell self-renewal in Bmil-deficient mice. Genes Dev. 19: 1438-1443. Cao R., Tsukada Y., and Zhang Y. 2005. Role of Bmi-l and RinglA in H2A ubiquitylation and HOX gene silencing. Mol. Cell 20: 845-854. Cao R., Wang 1.J., Wang H.B., Xia 1., Erdjument- Bromage H., Tempst P., Jones R.S., and Zhang Y. 2002. Role of histone H3 lysine 27 methylation in Polycomb-group silencing. Science 298: 1039-1043. Carrington E.A. and Jones R.S. 1996. The Drosophila Enhancer ofzeste gene encodes a chromosomal protein: Examination of wild-type and mutant protein distribution. Development 122: 4073-4083. Chanvivattana Y., Bishopp A., Schubert D., Stock c., Moon Y.H., Sung Z.R., and Goodrich J. 2004. Interaction of Polycomb-group pro-
teins controlling flowering in Arabidopsis. Development 131: 5263-5276. Core N., Charroux B., McCormick A., Vola c., Fasano 1., Scott M.P., and Kerridge S. 1997. Transcriptional regulation of the Drosophila homeotic gene teashirt by the homeodomain protein Fushi tarazu. Mech. Dev. 68: 157-172. Czermin B., Melfi R., McCabe D., Seitz v., Imhof A., and Pirrotta V. 2002. Drosophila enhancer of Zeste/ESC complexes have a histone H3 methyltransferase activity that marks chromosomal Polycomb sites. Cell Ill: 185-196. Dejardin J., Rappailles A., Cuvier 0., Grimaud c., Decoville M., Locker D., and Cavalli G. 2005. Recruitment of Drosophila Polycomb group proteins to chromatin by DSP1. Nature 434: 533-538. Dellino G.1., Schwartz Y.B., Farkas G., McCabe D., Elgin S.c., and Pirrotta V. 2004. Polycomb silencing blocks transcription initiation. Mol. Cell 13: 887-893. del Mar Lorente D., Marcos-Gutierrez c., Perez c., Schoorlemmer J., Ramirez A., Magin T, and Vidal M. 2000. Loss- and gain-of-function mutations show a Polycomb group function for RinglA in mice. Development 127: 5093-5100. de Napoles M., Mermoud J.E., Wakao R:, Tang Y.A., Endoh M., Appanah R., Nesterova TB., Silva J., Otte A.P., Vidal M., et al. 2004. Polycomb group proteins RinglA/B link ubiquitylation of histone H2A to heritable gene silencing and X inactivation. Dev. Cell 7: 663-676. Ebel c., Mariconti 1., and Gruissem W. 2004. Plant retinoblastoma hom*ologues control nuclear proliferation in the female gametophyte. Nature 429: 776-780. Fang J., Chen TP., Chadwick B., Li E., and Zhang Y. 2004. Ringl b-mediated H2A ubiquitination associates with inactive X chromosomes and is involved in initiation of X inactivation. f. BioI. Chem. 279: 52812-52815. Ficz G., Heintzmann R., and Arndt Jovin D.J. 2005. Polycomb group protein complexes exchange rapidly in living Drosophila. Development 132: 3963-3976. Fischle w., Wang Y., Jacobs S.A., Kim Y., Allis C.D., and Khorasanizadeh S. 2003. Molecular basis for the discrimination of repressive methyl-lysine marks in histone H3 by Polycomb and HPI chromodomains. Genes Dev. 17: 1870-1881. Fong Y., Bender 1., Wang W., and Strome S. 2002. Regulation of the different chromatin states of autosomes and X chromosomes in the germ line of C. elegans. Science 296: 2235-2238. Francis N.J., Kingston R.E., and Woodco*ck c.1. 2004. Chromatin compaction by a Polycomb group protein complex. Science 306: 1574-1577. Francis N.J., Saurin A.J., Shao Z., and Kingston R.E. 2001. Reconstitution of a functional core Polycomb repressive complex. Mol. CellS: 545-556. Gehring M., Huh J.H., Hsieh TE, Penterman J., Choi Y., Harada J.J., Goldberg R.B., and Fischer R.1. 2006. DEMETER DNA glycosylase establishes MEDEA Polycomb gene self-imprinting by allele-specific demethylation. CeIl 124: 495-506. Gendall A.R., Levy Y.Y., Wilson A., and Dean C. 2001. The VERNALIZATION2 gene mediates the epigenetic regulation of vernalization in Arabidopsis. Cell 107: 525-535. Goodrich J., Puangsomlee E, Martin M., Long D., Meyerowitz E.M., and Coupland G. 1997. A Polycomb-group gene regulates homeotic gene expression in Arabidopsis. Nature 386: 44-51. Grossniklaus U. 2005. Genomic imprinting in plants: A predominantly maternal affair. In Annual plant reviews: Plant epigenetics (ed. P. Meyer), pp. 174-200. Blackwell, Sheffield, United Kingdom. Grossniklaus u., Spillane c., Page D.R., and Kohler C. 2001. Genomic
T RAN 5 C R f P T f 0 N A LSI LEN C f N G
imprinting and seed development: Endosperm formation with and without sex. Curro Opin. Plant Bioi. 4: 21-27. Grossniklaus U., Vielle-Calzada J.E, Hoeppner M.A., and Gagliano W.B. 1998. Maternal control of embryogenesis by MEDEA, a Polycomb group gene in Arabidopsis. Science 280: 446-450. Guitton A.E. and Berger E 2005. Control of reproduction by Polycomb Group complexes in animals and plants. Int. ]. Dev. BioI. 49: 707-716. Hackett W.P., Cordero R.E., and Sinivasan C 1987. Apical meristem characteristics and activity in relation to juvenility in Hedera. In Manipulation of flowering (ed. J.G. Atherton), pp. 93-99. Butterworth, London. Hadorn E. 1968. Transdetermination in cells. Sci. Am. 219: 110. Heard E. 2004. Recent advances in X-chromosome inactivation. Curro Opin. Cell Bioi. 16: 247-255. Hennig L., Bouveret R., and Gruissem W. 2005. MSll-like proteins: An escort service for chromatin assembly and remodeling complexes. Trends Cell BioI. 15: 295-302. Hennig L., Taranto E, Walser M., Schonrock N., and Gruissem W. 2003. Arabidopsis MSIl is required for epigenetic maintenance of reproductive development. Development 130: 2555-2565. Hsieh T.E, Hakim 0., Ohad N., and Fischer R.L. 2003. From flour to flower: How Polycomb group proteins influence multiple aspects of plant development. Trends Plant Sci. 8: 439-445. Jacobs J.J.L., Scheijen B., Voncken J.W., Kieboom K., Berns A., and van Lohuizen M. 1999. Bmi-l collaborates with c-Myc in tumorigenesis by inhibiting c-Myc-induced apoptosis via INK4a/ARE Genes Dev. 13: 2678-2690. Jullien P.E., Katz A., Oliva M., Ohad N., and Berger E 2006. Polycomb group complexes self-regulate imprinting of the Polycomb group gene MEDEA in Arabidopsis. Curro BioI. 16: 486-492. Kagey M.H., Melhuish T.A., and Wotton D. 2003. The Polycomb protein Pc2 is a SUMO E3. Cell 113: 127-137. Kennison J.A. 1995. The Polycomb and trithorax group proteins of Drosophila: Trans-regulators of homeotic gene function. Annu. Rev. Genet. 29: 289-303. Kim H.J., Hyun Y., Park J.Y., Park M.J" Park M.K., Kim M.D., Kim H.J., Lee M.H., Moon J" Lee I., and Kim J. 2004. A genetic link between cold responses and flowering time through FVE in Arabidopsis thaliana. Nat. Genet. 36: 167-171. Kinosh*ta T., Harada J,J., Goldberg R.B., and Fischer R.L. 2001. Polycomb repression of flowering during early plant development. Proc. Natl. Acad. Sci. 98: 14156-14161. Klebes A., Sustar A., Kechris K., Li H., Schubiger G., and Kornberg T.B. 2005. Regulation of cellular plasticity in Drosophila imaginal disc cells by the Polycomb group, trithorax group and lama genes. Development 132: 3753-3765. Klymenko T. and Muller J. 2004. The histone methyltransferases Trithorax and Ashl prevent transcriptional silencing by Polycomb group proteins. EMBO Rep. 5: 373-377. Kohler C, Page D.R., Gagliardini v., and Grossniklaus U. 2005. The Arabidopsis thaliana MEDEA Polycomb group protein controls expression of PHERESI by parental imprinting. Nat. Genet. 37: 28-30. Kohler C, Hennig L., Bouveret R., Gheyselinck J" Grossniklaus u., and Gruissem W. 2003a. Arabidopsis MSIl is a component of the MEA/FIE Polycomb group complex and required for seed development. EMBO f. 22: 4804-4814. Kohler C, Hennig L., Spillane C, Pien S., Gruissem W., and Grossniklaus U. 2003b. The Polycomb group protein MEDEA regulates seed development by controlling expression of the MADS-box gene PHERESI. Genes Dev. 17: 1540-1553.
8 Y
POL
yeo
M8
GR0 UP
PRO TEl N 5
•
229
Kuzmichev A., Jenuwein T., Tempst P., and Reinberg D. 2004. Different EZH2-containing complexes target methylation of histone HI or nucleosomal histone H3. Mol. Cell 14: 183-193. Kuzmichev A., Nishioka K., Erdjument-Bromage H., Tempst P., and Reinberg D. 2002. Histone methyltransferase activity associated with a human multiprotein complex containing the Enhancer of Zeste protein. Genes Dev. 16: 2893-2905. Kuzmichev A., Margueron R., Vaquero A., Preissner T.S., Scher M., Kirmizis A., Ouyang X., BrockdorffN., Abate Shen C, Farnham P., and Reinberg D. 2005. Composition and histone substrates of Polycomb repressive group complexes change during cellular differentiation. Proc. Natl. Acad. Sci. 102: 1859-1864. Lavigne M., Francis N.J., King I.E, and Kingston R.E. 2004. Propagation of silencing; recruitment and repression of naive chromatin in trans by Polycomb repressed chromatin. Mol. Cell 13: 415-425. Lee N., Maurange C, Ringrose L., and Paro R. 2005. Suppression of Polycomb group proteins by JNK signalling induces transdetermination in Drosophila imaginal discs. Nature 438: 234-237. Leung C, Lingbeek M., Shakhova 0., Liu J" Tanger E., Saremaslani E, van Lohuizen M., and Marino S. 2004. Bmil is essential for cerebellar development and is overexpressed in human medulloblastomas. Nature 428: 337-341. Levine S.S., King I.E, and Kingston R.E. 2004. Division of labor in Polycomb group repression. Trends Biochem. Sci. 29: 478-485. Levine S.S., Weiss A., Erdjument Bromage H., Shao Z., Tempst E, and Kingston R.E. 2002. The core of the Polycomb Repressive Complex is compositionally and functionally conserved in flies and humans. Mol. Cell. Bioi. 22: 6070-6078. Lewis E.B. 1978. A gene complex controlling segmentation in Drosophila. Nature 276: 565-570. Luo M., Bilodeau P., Koltunow A., Dennis E.S., Peaco*ck W.J., and Chaudhury A.M. 1999. Genes controlling fertilization-independent seed development in Arabidopsis thaliana. Proc. Natl. Acad. Sci. 96: 296-301. Mager J., Montgomery N.D., de Villena EEM., and Magnuson T. 2003. Genome imprinting regulated by the mouse Polycomb group protein Eed. Nat. Genet. 33: 502-507. Marx J. 2005. Developmental biology-Combing over the Polycomb group proteins. Science 308: 624-626. Moon Y.H., Chen L., Pan R.L., Chang H.S., Zhu T., Maffeo D.M., and Sung Z.R. 2003. EMF genes maintain vegetative development by repressing the flower program in Arabidopsis. Plant Cell 15: 681-693. Mosquna A., Katz A., Shochat S., Graft G., and Ohad N. 2004. Interaction of FIE, a Polycomb protein, with pRb: a possible mechanism regulating endosperm development. Mol. Genet. Genomics 271: 651-657. Muller J., Hart CM., Francis N.J" Vargas M.L., Sengupta A., Wild B., Miller E.L., O'Connor M.B., Kingston R.E., and Simon JA 2002. Histone methyltransferase activity of a Drosophila polycomb group repressor complex. Cell 111: 197-208. Ohad N., Yadegari R., Margossian L., Hannon M., Michaeli D., Harada J,J., Goldberg R.B., and Fischer R.L. 1999. Mutations in FIE, a WD Polycomb group gene, allow endosperm development without fertilization. Plant Cell 11: 407-415. Otte A.E and Kwaks T.H. 2003. Gene repression by Polycomb group protein complexes: A distinct complex for every occasion? Curro Opin. Genet. Dev. 13: 448-454. Page D.R. and Grossniklaus U. 2002. The art and design of genetic screens: Arabidopsis thaliana. Nat. Rev. Genet. 3: 124-136. Paro R. and Hogness D.S. 1991. The Polycomb protein shares a hom*ologous domain with a heterochromatin-associated protein of Drosophila. Proc. Natl. Acad. Sci. 88: 263-267.
230 •
C HAP T E R 7 7
Poux S., McCabe D., and Pirrotta V. 2001. Recruitment of components of Polycomb group chromatin complexes in Drosophila. Development 128: 75-85. Raaphorst EM. 2005. Deregulated expression of Polycomb-group oncogenes in human malignant lymphomas and epithelial tumors. Hum. Mol. Genet. 14: R93-100. Rea S., Eisenhaber E, O'Carroll N., Strahl B.D., Sun Z.W., Schmid M., Opravil S., MechtJer K., Ponting e.P., Allis e.D., and Jenuwein T 2000. Regulation of chromatin structure by site-specific histone H3 methyltransferases. Nature 406: 593-599. Reyes J.e. and Grossniklaus U. 2003. Diverse functions of Polycomb group proteins during plant development. Semin. Cell Dev. BioI. 14: 77-84. Ringrose 1., Ehret H., and Paro R. 2004. Distinct contributions of histone H3 lysine 9 and 27 methylation to locus-specific stability of Polycomb complexes. Mol. Cell 16: 641-653. Ringrose 1., Rehmsmeier M., Dura J.M., and Paro R. 2003. Genomewide prediction of Polycomb/Trithorax response elements in Drosophila melanogaster. Dev. CellS: 759-771. Sanchez-Elsner T, Gou D., Kremmer E., and Sauer E 2006. Noncoding RNAs of trithorax response elements recruit Drosophila Ashl to Ultrabithorax. Science 311: 1118-1123. Satijn D.P. and Otte A.P. 1999. RINGI interacts with multiple Polycomb-group proteins and displays tumorigenic activity. Mol. Cell. BioI. 19: 57-68. Saurin A.J., Shiels e., Williamson J., Satijn D.P.E., Otte A.P., Sheer D., and Freemont P.S. 1998. The human polycomb group complex associates with pericentromeric heterochromatin to form a novel nuclear domain. f. Cell Bioi. 142: 887-898. Schmitt S., Prestel M., and Paro R. 2005. Intergenic transcription through a Polycomb group response element counteracts silencing. Genes Dev. 19: 697-708. Sung S. and Amasino R.M. 2004a. Vernalization and epigenetics: How plants remember winter. Curro Opin. Plant BioI. 7: 4-10. - - - . 2004b. Vernalization in Arabidopsis thaliana is mediated by the PHD finger protein VIN3. Nature 427: 159-164. Tie E, Furuyama T, Prasad-Sinha J., Jane E., and Harte P.J. 2001. The Drosophila Polycomb group proteins ESC and E(Z) are present in a complex containing the histone-binding protein p55 and the histone deacetylase RPD3. Development 128: 275-286. Tschiersch B., Hofmann A., Krauss v., Dorn R., Korge G., and Reuter
G. 1994. The protein encoded by the Drosophila position-effect variegation suppressor gene Su(var)3-9 combines domains of antagonistic regulators of homeotic gene complexes. EMBO f. 13: 3822-3831. Valk-Lingbeek M.E., Bruggeman S.W.M., and van Lohuizen M. 2004. Stem cells and cancer: The Polycomb connection. Cell 118: 409-418. van der Lugt N.M., Domen J., Linders K., van Roon M., RobanusMaandag E., te Riele H., van der Valk M., Deschamps J., Sofroniew M., van Lohuizen M., et al. 1994. Posterior transformation, neurological abnormalities, and severe hematopoietic defects in mice with a targeted deletion of the bmi-l proto-oncogene. Genes Dev. 8: 757-769. van Lohuizen M., Verbeek S., Scheijen B., Wientjens E., van der Gulden H., and Berns A. 1991. Identification of cooperating oncogenes in E~-myc transgenic mice by provirus tagging. Cell 65: 737-752. Vire E., Brenner e., Deplus R., Blanchon 1., Fraga M., Didelot c., Morey 1., van Eynde A., Bernhard D., Vanderwinden J.M., et al. 2006. The Polycomb group protein EZH2 directly controls DNA methylation. Nature 439: 871-874. Wang 1., Brown J.L., Cao R., Zhang Y, Kassis J.A., and Jones R.S. 2004. Hierarchical recruitment of Polycomb group silencing complexes. Mol. Cell 14: 637-646. Yamamoto Y, Girard E, Bello B., Affolter M., and Gehring W.J. 1997. The cramped gene of Drosophila is a member of the Polycombgroup, and interacts with mus209, the gene encoding proliferating cell nuclear antigen. Development 124: 3385-3394. Yamamoto K., Sonoda M., Inokuchi J., Shirasawa S., and Sasazuki T. 2004. Polycomb group Suppressor of zeste 12 links Heterochromatin Protein la and Enhancer of zeste 2. f. Bioi. Chem. 279: 401-406. Yoshida N., Yanai Y, Chen L.J., Kato Y., Hiratsuka J., Miwa T, Sung Z.R., and Takahashi S. 2001. EMBRYONIC FLOWER2, a novel Polycomb group protein hom*olog, mediates shoot development and flowering in Arabidopsis. Plant Cell 13: 2471-2481. Zencak D., Lingbeek M., Kostic e., Tekaya M., Tanger E., Hornfeld D., Jaquet M., Munier EL., Schorderet D.E, van Lohuizen M., and Arsenijevic Y. 2005. Bmil loss produces an increase in astroglial cells and a decrease in neural stem cell population and proliferation. J. Neurosci. 25: 5774-5783.
c
H
A
p
T
E
R
12
Transcriptional Regulation by Trithorax Group Proteins Robert E. Kingston 1 and John W. Tamkun 2 1Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts 02114 2Department of Molecular, Cell and Developmental Biology, University of California, Santa Cruz, California 95064
CONTENTS 1. Introduction, 233 7.7
Identification of Genes Involved in the Maintenance of the Determined State, 233
7.2
trxG Proteins in Other Organisms, 236
7.3.
trxG Proteins Play Diverse Roles in Eukaryotic Transcription, 237
2. Connections between trxG Proteins and Chromatin, 237 2.7.
trxG Proteins Involved in 238
2.2.
trxG Proteins That Covalently Modify Nucleosomal Histones, 242
3. Connections between trxG Proteins and the General Transcription Machinery, 243 4. Biochemical Functions of Other trxG Proteins, 244 5. Functional Interactions between trxG Proteins, 244 6. trxG Proteins: Activators or Anti-repressors?, 244 7. Conclusion and Outlook, 245 References, 246
231
GENERAL SUMMARY All cells in an organism must be able to "remember" what type of cell they are meant to be. This process, referred to as "cellular memory" or "transcriptional memory," requires two basic classes of mechanisms. The first class, discussed in the previous chapter, functions to maintain an "OFF" state for genes that, if turned on, would specify an inappropriate cell type. The Polycomb-Group (PcG) proteins have as their primary function this repressive role in cellular memory. The second class of mechanisms are those that are required to maintain key genes in an "ON" state. Any cell type requires the expression of master regulatory proteins that direct the specific functions required for that cell type. The genes that encode these master regulatory proteins must be maintained in an "ON" state throughout the lifetime of an organism in order to maintain the proper cell types within that organism. The striking multiple-winged fly in the left title figure illustrates the dramatic phenotypes that can result from the failure to maintain the "ON" state of a master regulatory gene. The proteins that are involved in maintaining the "ON" state are called trithorax-Group (trxG) proteins in honor of the trithorax gene, the founding member of this group of regulatory proteins. A large group of proteins with diverse functions make up the trxG. The roles these proteins play in the epigenetic mechanisms that maintain the "ON" state appear more complex at this juncture than the roles for PcG proteins in repression. The first complexity is that a very large number of proteins and mechanisms are needed to actively transcribe RNA from any gene. Thus, in contrast to repression, which might be accomplished by comparatively simple mechanisms that block access of all proteins, activation of a gene requires numerous steps, any
of which might playa role in maintaining an "ON" state. Thus, there are numerous possible stages at which a trxG protein might work. A second complexity in thinking about trxG proteins is that proteins which function in activation can also, in different contexts, function in repression. This might appear counterintuitive, but, depending on the precise architecture of a gene, the same protein carrying out its function might in one case help a gene become activated, and in another case help a different gene become repressed. At this time, it does not appear that trxG proteins are dedicated solely to the maintenance of gene expression, but that these proteins can also play multiple roles in the cell. These complexities evoke several interesting unanswered questions. Why are only some of the proteins needed to activate transcription also critical for maintenance of transcription? Do these proteins have functions that are uniquely suited to maintaining the active state? Or are some of these proteins needed for maintenance, solely due to an evolutionary accident that made them key regulators of a gene(s) particularly important to development? As shown below, some of the trxG proteins are involved in regulating chromatin structure in opposition to the mechanisms used by the PcG proteins. trxG proteins can place covalent modifications on chromatin or can alter chromatin by changing the structure and position of the nucleosomes that are the building blocks of chromatin. Other trxG proteins function as part of the transcription machinery. Thus, these proteins are found in a wider variety of complexes than the PcG proteins and are likely to play more complicated roles in epigenetic mechanism.
T R f THO R A X
1 Introduction
Numerous developmental decisions-including the determination of cell fates-are made in response to transient positional information in the early embryo. These decisions are dependent on changes in gene expression. This allows cells with identical genetic blueprints to acquire unique identities and to follow distinct pathways of differentiation. The changes in gene expression underlying the determination of cell fates are heritable; a cell's fate rarely changes once it is determined, even after numerous cell divisions and lengthy periods of developmental time. Understanding the molecular mechanisms underlying the maintenance of the determined state has long been a goal of developmental and molecular biologists. Many of the regulatory proteins involved in the maintenance of heritable states of gene expression were identified in studies of Drosophila homeotic (Hox) genes. Hox genes encode homeodomain transcription factors that regulate the transcription of batteries of downstream target genes, which in turn specify the identities of body segments (Gellon and McGinnis 1998). In Drosophila, Hox genes are found in two gene complexes: the Antennapedia complex (ANT-C), which contains the Hox genes labial (lab), Deformed (Dfd), Sex combs reduced (Scr), and Antennapedia (Antp); and the bithorax complex (BX-C), which contains the Hox genes Ultrabithorax (Ubx), abdominalA (abdA), and AbdominalB (AbdB) (Duncan 1987; Kaufman et al. 1990). Each Hox gene specifies the identity of a particular segment, or group of segments, along the anterior-posterior axis of the developing fly. For example, Antp specifies the identity of the second thoracic segment, including the second pair of legs, whereas Ubx specifies the identity of the third thoracic segment, including the balancer organs located behind the wings. Thus, the transcription factors encoded by Hox genes function as master regulatory switches that direct the choice between alternative pathways of development. The transcription of Hox genes must be regulated precisely, because dramatic alterations in cell fates can result from their inappropriate expression (Simon 1995; Simon and Tamkun 2002). For example, the derepression of Antp in head segments transforms antennae into legs, and the inactivation of Ubx in thoracic segments transforms balancer organs into wings. In Drosophila, the initial patterns of Hox transcription are established early in embryogenesis by transcription factors encoded by segmentation genes. The proteins encoded by segmentation genesincluding the gap, pair-rule, and segment polarity genessubdivide the early embryo into 14 identical segments.
GR
a
U P PRO T E f N 5
•
233
These proteins also establish the initial patterns of Hox transcription, the first step toward the development of segments with distinct identities and morphology. Once established, the segmentally restricted patterns of Hox transcription must be maintained throughout subsequent embryonic, larval, and pupal stages in order to maintain the identities of the individual body segments. Because the majority of segmentation genes are transiently expressed during early development, this function is carried out by two other groups of regulatory proteins: the Polycomb group of repressors (PcG) and the trithorax group of transcriptional regulators (trxG) (Fig. 1). The regulation of Hox transcription therefore consists of at least two distinct phases: establishment (by segmentation genes) and maintenance (by PcG and trxG genes) (Fig. 2). 1.1 Identification of Genes Involved in the Maintenance of the Determined State
Because of their roles in the maintenance of cell fates, Drosophila PcG and trxG genes have been the subject of
intense study for decades. As discussed in the previous chapter, the majority of PcG genes were identified by mutations that cause homeotic transformations due to the failure to maintain repressed states of Hox transcription. A classic example of a phenotype associated with PcG mutations is the transformation of second and third legs to first legs. This homeotic transformation results from the derepression of the ANT-C gene Scr and is manifested by the appearance of first leg bristles known as sex comb teeth on the second and third legs of the adult. This "Polycomb" or "extra sex combs" phenotype-together with other homeotic transformations resulting from the failure to maintain repression of Hox genes-led to the identification of more than a dozen PcG genes in Drosophila. The majority of PcG genes encode subunits of two complexes involved in transcriptional repression: Polycomb Repressive Complex (PRC) 1 and PRC2 (Levine et al. 2004). PRCl and PRC2 are targeted to the vicinity of Hox (and other) promoters via cis-regulatory elements known as Polycomb-response elements (PREs). A large body of evidence suggests that PcG complexes repress transcription by modulating chromatin structure (Francis and Kingston 2001; Ringrose and Paro 2004). Members of the trxG, including trithorax (trx); absent, small or homeotic 1 (ashl), absent, small or homeotic 2 (ash2), and female-sterile homeotic (fsh), were initially iden-
tified by mutations that mimic loss-of-function Hox mutations in Drosophila (Fig. 3) (Kennison 1995). For example, mutations in trx-the founding member of the trxG-
234 •
C HAP T E R
72
/
Figure 1. The Concept of Cellular Memory Schematic illustration highlighting the role of trxG complexes in maintaining heritable states of active gene expression in contrast to heritable silencing by PcG complexes, as defined originally for the Drosophila Hox gene cluster.
cause the partial transformation of halteres to wings (due to decreased Ubx transcription); first legs to second legs (due to decreased Scrtranscription); and posterior abdominal segments to more anterior identities (due to decreased abdA and AbdB transcription). Numerous other trxG members were identified in screens for extragenic suppressors of Pc (Su(Pc)) mutations (Kennison and Tamkun
1988). The rationale behind these genetic screens was that a reduction in the level of a protein that maintains an active state should compensate for a reduction in the level of a PcG repressor (Fig. 4). brahma (brm) and numerous other Su(Pc) loci were identified using this approach, bringing the total number of trxG members to more than 16 (Table 1). Many other proteins have been classified as trxG
Establishment ---..... Maintenance (segmentation genes)
trxG early embryo
embryonic, larval, pupal development
adult
Figure 2. Regulation of Hox Transcription The boundaries of abd-A transcription and other Hox genes are established by segmentation proteins. These include the products of gap and pair-rule genes, which subdivide the embryo into 14 identical segments. During subsequent development, the "OFF" or "ON" states of Hox transcription are maintained by the ubiquitously expressed members of the trxG of activators and the PcG of repressors via mechanisms that remain poorly understood.
T R I THO R A X
G R a U P PRO TEl N 5
Figure 3. Examples of Developmental Cell Fate Transformations Associated with Mutations in Drosophila trxG Genes (A) Wild-type first leg. The sex comb, unique to the first leg, is marked by an arrow. (8) A patch of kis mutant tissue (marked by an arrow) is partially transformed from the first leg to the second leg due to decreased Sa transcription, albeit incomplete, as evidenced by a reduction in the number of sex comb teeth. (C) A patch of mor mutant tissue (marked by an arrow) displays the partial transformation from balancer organ to wing, due to decreased Ubx expression. (D) A patch of kis mutant tissue (marked by an arrow) in the fifth abdominal segment is partially transformed to a more anterior identity
due to decreased Abd 8 expression, as evidenced by the loss of the dark pigmentation characteristic of this segment. (A,8,D, Reprinted, with permission, from Daubresse et al. 1999.)
adult phenotype
Scr expression in leg imaginal discs
b
wild-type
PcG mutant
PcG/trxG double mutant
J
~.
first leg
second leg
third leg
first leg
second leg
third leg
Figure 4. trxG Mutations Block the Derepression of Hox Genes in PeG Mutants (a) Leg imaginal discs stained with antibodies against the protein encoded by the Hox gene, Sa, which specifies the identity of the labial and first thoracic segments, including the first leg. (b) Basitarsal segments of the legs of Wild-type and mutant adults. Note the presence of sex comb teeth on the first leg, but not the second and third legs of wild-type adults. The Scr gene is partially derepressed in the second and third leg discs, where it is normally silent, in individuals heterozygous for mutations in PcG genes, leading to the appearance of ectopic sex comb teeth on the second and third legs. These phenotypes are suppressed by mutations in brm and many other trxG genes (a, Reprinted, with permission, from Tamkun et al. 1992 [© Elsevier]; b, portion modified, with permission, from Kennison 2003 [©Elsevier].).
235
236 •
C HAP T E R 1 2
Table 1. Biochemical functions of trxG proteins
yeast
Complexed with non-trxG proteins?
Organism Known function ATP-dependent chromatin remodeling
Histone methyltransferases
Mediator subunits
Drosophila
human
BRM
BRG1/HBRM
Swi2/Snf2, Sth 1
yes (5-10)'
OSA
BAF250
Swi1/Adr6
yes (5-10)
MOR
BAF155, BAF170
Swi3, Rsc8
yes (5-10)
SNR1
hSNF5/INI1
Snf5, Sfh 1
yes (5-10)
Set1
yes (5-20)
Kismet (KIS)
CHD7
Trithorax (TRX)
MLLl, MLL2, hSET1
not known
Absent, small or homeotic 1(ASH1)
hASH1
Kohtalo (KTO)
TRAP230
Srb8
yes (13-24)
Skuld (SKD)
TRAP240
Srb9
yes (13-24)
not known
Transcription factor
Trithorax-like (TRL)
BTBD14B
no
Growth factor receptor
Breathless (BTL)
FGFR3
not known
Sallimus (SLS)
Titin
ASH2
hASH2Lb
Other
not known Bre2
yes (5-20)
'BRM, OSA, MOR, and SNRl can all be found in stable association with each other in a single complex. bRelatively low sequence similarity to ASH2.
members based on other, less stringent criteria, including sequence hom*ology with known trxG proteins, physical association with trxG proteins, biochemical activity, or effects on Hox transcription in vitro or in vivo. The functional relationship between members of the trxG, and the mechanistic connection between trxG function and maintenance of cell fate, is complicated. There are numerous mechanisms via which a protein might maintain an appropriately high level of expression of a homeotic gene (the genetic definition of a trxG protein) without being a devoted transcriptional activator, or a protein devoted to epigenetic control. Formal possibilities for trxG function (in addition to the ability to directly activate transcription) include the ability to increase function of direct activators, the ability to block function of PcG repressors, and the ability to create a "permissive" chromatin state that facilitates the function of numerous other regulatory complexes. Furthermore, as discussed below, some trxG proteins play complicated mechanistic roles that on some genes contribute to activation and that on other genes can contribute to repression. Two brief examples illustrate the complexity of potential roles for trxG proteins. ATP-dependent remodeling complexes such as the one that contains trxG proteins BRM and MOR have been proposed to increase the ability of any sequence-specific DNA-binding protein to bind to chromatin. An unsettled issue is whether this ATP-depend-
ent remodeling complex can therefore use this ability to promote both activation of genes through increased binding of activators and repression through increased binding of repressors. Other studies have led to the hypothesis that some trxG protein complexes might function primarily by blocking the ability of a PcG repressor complex to function, and that repression by PcG proteins is the. default state. Thus, in this latter instance, the role in maintaining an active state by some trxG proteins might reflect indirect, as opposed to direct, actions. The evolutionary conservation of this family, and the conserved functions of this family, offer hints concerning what types of mechanisms are needed to maintain the appropriate level of activation of master regulatory genes that determine cell fate. 1.2 trxG Proteins in Other Organisms
Functional counterparts of virtually all Drosophila trxG proteins are present in mammals, including humans (Table 1). Genetic and biochemical studies have shown that the fly and mammalian proteins play higWy conserved roles in both gene expression and development. A good example of the functional conservation of trxG proteins is provided by MLL, the mammalian ortholog of Drosophila trx. Mutations in MLL cause homeotic transformations of the axial skeleton of mice due to the failure to maintain active transcription of Hox genes (Yu et al.
T R I THO R A X
1995, 1998). Both MLL and trx function as histone lysine methyltransferases (HKMTs), and direct evidence of functional hom*ology between the two proteins was provided by the use of human MLL to partially rescue developmental defects resulting from the loss of trx function in flies (Muyrers-Chen et al. 2004). Thus, the mechanisms underlying the maintenance of the determined state have been highly conserved during evolution. Cancer and other human diseases can result from the failure to maintain a heritable state of gene expression. Not surprisingly, many human PcG and trxG genes function as proto-oncogenes or tumor suppressor genes. For example, the human trxG gene MLL was originally identified by l1q23 chromosome translocations associated with acute lymphoblastic (ALL) or myeloid (AML) leukemia. Mutations in other mammalian trxG genes are also associated with a variety of cancers (for more detail, see Chapter 23). For example, BRG 1, the human counterpart of Drosophila brm, physically interacts with the retinoblastoma tumor suppressor protein; disruption of this interaction leads to increased cell division and malignant transformation in certain human tumor cell lines (Dunaief et al. 1994; Strober et al. 1996). Consistent with a role of BRG1 in tumor suppression, mice heterozygous for mutations in this gene are prone to develop a variety of tumors (Bultman et al. 2000). Mutations in INIl, the human counterpart of the Drosophila trxG gene SNF5related gene 1 (SNRl), also predispose individuals to cancers and have been identified in a large percentage of malignant rhabdoid tumors, an aggressive cancer of children (Versteege et al. 1998). These and other connections to human disease have provided researchers with additional motivation to understand the mechanism of action of trxG proteins.
1.3 trxC Proteins Play Diverse Roles in Eukaryotic Transcription
The trxG of activators is a large and functionally diverse group of regulatory proteins. This may reflect the complexity of eukaryotic transcription, which involves highly regulated interactions between gene-specific transcriptional activators, the numerous components of the general transcription machinery, and the DNA template that is transcribed. Transcriptional activation involves the binding of sequence-specific activating proteins, the recruitment of the general transcription machinery by those proteins, the formation of a pre-initiation complex in which RNA polymerase II is bound to the promoter, the opening of the DNA helix near the promoter, the effi-
G R 0 UP
PRO TEl N S
•
237
cient escape of RNA polymerase from the promoter, and efficient elongation of RNA polymerase through the gene. The ability to maintain an active transcriptional state might involve any of the numerous steps required for activation, because on any given gene, different steps might play a rate-determining role for transcriptional activity. The packaging of eukaryotic DNA into chromatin provides another level at which trxG proteins can regulate transcription. Nucleosomes and other components of chromatin tend to inhibit the binding of general and gene-specific transcription factors to DNA, as well as inhibit the elongation of RNA polymerase. Alterations in chromatin structure-including changes in the structure or positioning of nucleosomes-can influence virtually every step in the process of transcription. Any protein that is required for transcription is required for the maintenance of the active state. Indeed, some trxG proteins play relatively general roles in transcription and are not dedicated solely to the maintenance of the determined state. Other trxG proteins, however, may play specialized roles in this process, either by directly counteracting PcG repression or by maintaining heritable states of gene activity through DNA replication and mitosis. The latter class of trxG proteins is of particular interest to developmental biologists. 2 Connections between trxG Proteins and Chromatin
Genetic studies indicating that trxG genes play key roles in transcription and development stimulated significant work to understand the biochemical function of their products. Many of these experiments have used, as their conceptual basis, the hypothesis that chromatin will be the biologically relevant substrate of trxG proteins. All genes are packaged into chromatin, and that packaging can create a compacted and inaccessible state or can be in an open and permissive state. Both the permissive and inaccessible states may conceivably be heritable. These considerations led to the simple hypothesis that trxG proteins might modulate chromatin structure to affect regulation. Furthermore, as trxG genes were cloned and sequenced, it became apparent that some of their products are related to proteins involved in ATP-dependent chromatin remodeling or the covalent modification of nucleosomal histones in other organisms, including the yeast Saccharomyces cerevisiae. Thus, although yeast lack either Hox genes or PcG repressors, this organism has provided valuable clues about potential roles for trxG proteins in eukaryotic transcription.
238 •
C HAP T E R
12
One of the first connections between the trxG and chromatin was provided by the discovery that the Drosophila trxG gene brm is highly related to yeast SWI2/SNF2 (Tamkun et al. 1992). SWI2/SNF2 was identified in screens for genes involved in mating-type switching (switch [swi] genes) and sucrose-fermentation (sucrose-nonfermenting [snj] genes). It was subsequently shown to be required for the activation of numerous inducible yeast genes (Holstege et al. 1998; Sudarsanam et al. 2000). The transcription defects observed in swi2/snf2 mutants are suppressed by mutations in nucleosomal histones, an early observation which first suggested that SWI2/SNF2 activates transcription by counteracting chromatin repression (Kruger et al. 1995). Biochemical studies conducted in the early 1990s confirmed this hypothesis; SWI2/SNF2 and many of the other proteins identified in the swi/snf screens function as subunits of a large protein complex (SWI/SNF) that uses the energy of ATP hydrolysis to increase the ability of proteins to bind to nucleosomal DNA (Cote et al. 1994; Imbalzano et al. 1994; Kwon et al. 1994). SWI2/SNF2 functions as the ATPase subunit, or "engine:' of this chromatin-remodeling machine; other subunits of the SWI/SNF complex mediate interactions with regulatory proteins or its chromatin substrate (Phelan et al. 1999). Another connection between trxG and chromatin was suggested by the presence of SET domains in the trxG proteins Trithorax (TRX) and Absent, small or homeotic (ASHl). The SET domain was originally defined by a stretch of amino acids that shows hom*ology between Su(var)3-9, Enhancer of zeste (E(z)), and TRX, the latter two proteins being, respectively, PcG and trxG members. In the late 1990s, the SET family of proteins was shown to have HKMT activity. Su(var)3-9 methylates H3K9, whereas (E(z)) methylates H3K27 (Rea et al. 2000; Levine et al. 2004; Ringrose and Paro 2004). As discussed elsewhere, H3K9 methylation promotes heterochromatin assembly, whereas H3K27 methylation appears to be required for PcG repression (for more detailed discussion, see Chapters 5 and 11, respectively). The presence of SET domains in trxG proteins suggested that the methylation of histone tails might also be important for the maintenance of active transcriptional states. These findings, together with the growing realization that chromatin-remodeling and -modifying enzymes play key roles in transcriptional activation, motivated biochemists to identify protein complexes that contain trxG proteins and to examine the effect of these complexes on chromatin structure in vitro. Other experiments tested the hypothesis that trxG proteins might interact directly
with the transcriptional machinery, another well-established method of affecting regulation. As described below, these studies revealed that some trxG proteins affect regulation by modifying chromatin structure whereas others function via direct interactions with components of the transcription machinery.
.
2.1 trxG Proteins Involved in ATP-dependent Chromatin Remodeling
Chromatin-remodeling complexes have been implicated in a wide variety of biological processes, including transcriptional repression and activation, chromatin assembly, the regulation of higher-order chromatin structure, and cellular differentiation. The most extensively studied trxG proteins involved in chromatin remodeling are BRM and its human counterparts, BRG1 and HBRM. As predicted, these proteins function as the ATPase subunits of complexes that are highly related to yeast SWI/SNF (Kwon et al. 1994; Wang et al. 1996). SWI/SNF complexes contain between 8 and 15 subunits and have been highly conserved during evolution (Fig. 5). The ATPase of each of these complexes is able to function as an isolated subunit. Although this family of proteins is historically referred to as containing a "helicase" motif, due to the similarity of their ATPase domain to that of true helicases, the proteins related to BRM have never been shown to possess helicase activity, but rather appear to use other mechanisms such as translocation along the DNA to effect changes in chromatin structure (Whitehouse et al. 2003; Saha et al. 2005). A second trxG gene identified in this screen, moira (mar), encodes another key member of this ATP-dependent remodeling complex in Drosophila, and hom*ologs of BRM and MaR interact directly to form a functional core of SWI/SNF in humans (Phelan et al. 1999). SWI/SNF and other chromatin-remodeling complexes use the energy of ATP hydrolysis to alter the structure or positioning of nucleosomes. By catalyzing ATP-dependent changes in chromatin structure, chromatin-remodeling complexes help transcription factors and other regulatory proteins gain access to DNA sequences that would normally be occluded by the histone proteins (Polach and Widom 1995; Logie and Peterson 1997). Models to create access to specific sites include "sliding" the histones along the DNA to move a site into a linker region, looping DNA away from the histone octamer or, most dramatically, evicting the entire histone octamer to a different place in the nucleus (Fig. 6). ATP-dependent remodeling can also lead to changes in the position of the nucleosome along a DNA
T R I THO R A X
GR0 UP
PRO TEl N 5
239
a BRM ATPase domain
b
yeast
c
Drosophila
d
human
BAP
SWI/SNF
BAF
RSC2
polybromo
PBAP
PBAF
Figure 5. The SWI/SNF Family of Remodeling Complexes Each complex contains a member of the SNF2/SWI2 family of ATPases and at least 8 other subunits. (a) Schematic diagram of the BRM protein, showing the location of the ATPase domain and carboxy-terminal bromodomain (which shows affinity to acetylated lysine residues in histone tails) that are conserved in all SNF2/SWI2 family members. SWI/SNF complexes in yeast (b), Drosophila (c), and human (d) are shown. Drosophila trxG proteins (BRM, MOR, and OSA) and their counterparts in other organisms are shown in color. Further information about these complexes and their subunits may be found in Mohrmann and Verrijzer (2005).
sequence, to changes in the spacing of nucleosomes, and to the exchange of histones into and out of the histone octamer that is the core of the nucleosome. Different remodeling complexes display different proclivities for each of these functions. SWI/SNF complexes are abundant in higher eukaryotes; for example, each mammalian nucleus contains about 25,000 copies of SWI/SNF-family complexes. Biochemical analyses show that SWI/SNF complexes are able to create access to an unusually large spectrum of sites within the nucleosome when compared with other ATPdependent chromatin-remodeling complexes (Fan et al. 2003). For example, SWI/SNF complexes are able to efficiently create access to sites at the center of a mononucleosome, which is energetically difficult because sites at the center of the nucleosome have approximately 70 base pairs of constrained nucleosomal DNA on both sides. Whether this is caused by an unusually potent ability to utilize the
energy of ATP hydrolysis relative to other remodelers, or instead represents a distinct mechanism of remodeling, is a topic of ongoing research (Kassabov et al. 2003). SWI/SNF complexes do not display measurable ability to evenly space nucleosomes, a hallmark of other chromatinremodeling complexes. They also do not display the same degree of efficiency in "swapping" H2A/H2B dimers as some other chromatin-remodeling complexes, although they are able to do this and to evict octamers when tested in vitro (Lorch et al. 1999). Which of these abilities is related to the function of these complexes in the maintenance of the active state is not yet clear. SWI/SNF complexes have been implicated in transcriptional activation in every species that has been examined. This family of complexes can be targeted to genes by interactions with transcriptional activators, can remodel nucleosomes to assist in the initial binding of general transcription factors and RNA polymerase II, and can
240 • C HAP T E R 7 2
b. histone exchange
a. nucleosome sliding
, ,,
,
j:
c. nucleosome eviction
d. altered nucleosome structure
Figure 6. Mechanisms for ATP-dependent Remodeling Models for chromatin remodeling are illustrated by showing the change in position or composition of nucleosomes relative to the DNA wrapped around it. The central panel indicates a starting chromatin region where linker DNA is indicated in yellow and nucleosomal DNA in red. (0) Movement of a nucleosome translationally along the DNA (sliding) to expose a region (marked in red) that was previously occluded; (b) exchange of a variant histone for a standard histone to create a variant nucleosome; (c) eviction of nucleosomes to open a large region of DNA. This mechanism might depend on other proteins, such as histone chaperones or DNA-binding factors, in addition to remodeling proteins; (d) creating a loop on the surface of the nucleosome. Remodelers in the SWI/SNF family have been hypothesized to use alternative mechanisms, such as creating stable loops of DNA on the surface of the nucleosome, to make sites available that are central to the nucleosome.
become targeted later in the activation process to assist with transcriptional elongation. Thus, SWl/SNF complexes appear to function at every step in the process of transcriptional activation, although there appears to be an emphasis on function at the early steps that lead to loading of RNA polymerase II. Microarray analysis in yeast shows that, in addition to these effects promoting activation, SWl/SNF complexes can also facilitate repression of some genes (Sudarsanam et al. 2000). One simple hypothesis to explain these broad in vivo functions is that these remodeling complexes alter nucleosome structure in a manner that facilitates binding and function of a wide variety of regulatory factors and complexes. Thus, the potent remodeling characteristics observed in vitro might reflect an ability to significantly expand access to regulatory factors in vivo. It is possible that SWl/SNF complexes are uniquely able to broadly create access, which may account for their importance in the maintenance of the active state. Each species studied has at least two distinct SWl/SNF complexes, all of which contain BRM or a highly related
chromatin-remodeling ATPase. Another trxG protein, OSA, provides distinction between the complexes, in that one class of complexes contains OSA and another evolutionarily conserved complex contains a Polybromodomain protein (Fig. 6) (Mohrmann and Verrijzer 2005). The biochemical function of OSA is not clear. One attractive possibility is that it might target the SWl/SNF complex in which it resides to a specific set of genes. SWI/SNF is not the only chromatin-remodeling factor that is present in eukaryotic cells. Dozens of different chromatin-remodeling complexes have been identified, including NURF, NURD, ACF, and CHRAC (Vignali et al. 2000). These complexes can be subdivided into several major groups based on the identities of their ATPase subunits. SWI/SNF complexes contain ATPases related to SWI2/SNF2; ISWI complexes (e.g., NURF, CHRAC, and ACF) contain ATPases related to Imitation-SWI (ISWI); and CHD complexes (e.g., NURD) contain ATPases related to CHD 1 and Mi2. Recent studies have implicated a Drosophila member of the CHD family of chromatin-remodeling factors-
TRITHORAX
kismet (kis)-in the maintenance of the active state. Like brm, mar, and osa, kis was identified in a screen for extragenic suppressors of Pc, suggesting that it acts antagonistically to PcG proteins to maintain active states of Hox transcription (Kennison and Tamkun 1988). Genetic studies revealed that kis is required for both segmentation and the maintenance of Hox transcription during Drosophila development (Daubresse et al. 1999). The molecular analysis of kis revealed that it encodes several large proteins, including an approximately 575-kD isoform (KIS-L) that contains an ATPase domain characteristic of chromatin-remodeling factors (Daubresse et al. 1999; Therrien et al. 2000). Conserved domains outside the ATPase domain (including bromodomains and chromodomains) contribute to the functional specificity of chromatin-remodeling factors by mediating interactions with nucleosomes or other proteins. BRM and other ATPase subunits of SWI/SNF complexes contain a single bromodomain (a protein motif associated with the binding of certain acetylated histones), whereas KIS-L contains two chromodomains (protein motifs that bind certain methylated histones) and is therefore more similar to Mi2 and other members of the CHD family of chromatin-remodeling factors. Although the large size of KIS-L (-575 kD) has made it difficult to analyze this protein biochemically, its sequence strongly suggests that it activates transcription by remodeling chromatin. KIS-L is not physically associated with BRM and behaves chromatographically as if it is in a distinct protein complex (Srinivasan et al. 2005). The two proteins overlap extensively with each other and RNA polymerase II on polytene chromosomes, however, suggesting that both play relatively global roles in transcription (Fig. 7) (Armstrong et al. 2002; Srinivasan et al. 2005). Loss of BRM function blocks a relatively early step in transcrip-
GROUP
PROTEINS
241
tion (Armstrong et al. 2002), whereas the loss of KIS-L function leads to a decrease in the level of elongating, but not initiating, forms of RNA polymerase II (Srinivasan et al. 2005). These findings suggest that BRM and KIS-L facilitate distinct steps in transcription by RNA polymerase II by catalyzing ATP-dependent alterations in nucleosome structure or spacing. An important question for future research concerns the role that ATP-dependent remodeling plays in maintenance of the activated state. It is intriguing that four trxG members are known (BRM, KIS, and OSA) or suspected (KIS) members of large ATP-dependent chromatin-remodeling complexes, but none of the other numerous ATP-dependent remodeling complexes has been identified in genetic screens for Drosophila trxG proteins. Two predominant hypotheses, not mutually exclusive, to eA'Plain this are that the BRM and KIS chromatin-remodeling complexes are targeted to genes important for developmental progression or that they have special remodeling characteristics which are uniquely required for maintenance. Thus, it is possible that generic ATP-dependent remodeling is required for all active states, and that maintenance of the active state of developmentally important genes happens to require these trxG members because they are targeted to these genes. It is also possible that maintenance requires special ATPdependent functions that can only be carried out by the complexes that contain trxG members. It is also intriguing to think about the mechanisms that remodelers might use to contribute to epigenetic regulation of the active state. At least three classes of mechanisms can be envisioned that might apply. First, remodeling functions might be required in a somewhat indirect manner to facilitate the binding (or re-binding following replication) of gene-specific activating proteins that are needed to maintain active transcription. In this case, the remodelers
Figure 7. Chromosomal Distribution of trxG Proteins The genome-wide distribution of trxG proteins was examined by staining Drosophila salivary gland polytene chromosomes with antibodies against BRM (a) or TRX (b). Consistent with a relatively global role in transcriptional activation, BRM is associated with hundreds of sites in a pattern that overlaps extensively with RNA pol II. In contrast, strong TRX signals are detected at a much smaller number of sites on polytene chromosomes.
242
C HAP T E R 7 2
would not be the "brains" of the epigenetic mechanism, but instead would act as a necessary tool to allow the proteins required to function efficiently. Second, remodelers could work alone or with histone chaperones to evict nucleosomes from a region, and this lack of occupancy by nucleosomes would hypothetically cause the region to remain non-nucleosomal following replication. As mentioned above, the ability of the replication/nucleosome deposition machinery to accurately recapitulate nucleosome modification or location is an important unanswered issue in epigenetics. Finally, remodeling machineries could reposition nucleosomes to create a structure that is amenable to activation. This latter mechanism has experimental support from studies of the albumin gene (Chaya et al. 2001; Cirillo et al. 2002). Several DNA-binding factors are required to maintain activity of this key gene in the liver. One of these factors, FoxA, binds to a site on a nucleosome, and the specific nucleosome-FoxA architecture is key to maintaining the active state of the albumin gene. Although it is not clear whether there is a required role for ATP-dependent remodeling to position this specific nucleosome in the liver, this example demonstrates the potential for specific nucleosome positioning to playa key epigenetic role. 2.2 trxG Proteins That Covalently
Modify Nucleosomal Histones
A second common method of regulating gene expression involves covalent modification of the amino-terminal tails of the core histones that comprise the protein component of the nucleosome. These tails, which protrude from the surface of the nucleosome, can mediate interactions with other nucleosomes, as well as with a wide variety of structural and regulatory proteins. The covalent modification of histone tails by acetylation, methylation, or phosphorylation can help target regulatory complexes to chromatin and can also directly change the ability of nucleosomes to compact into repressive structures by changing the charge on the tails. Covalent modification might also provide a mark to help maintain a specific regulated state, as the covalently modified histones have the potential to divide to the two daughter strands and thereby propagate the information contained in the covalent mark to both mother and daughter cells following replication. Whether histones remain associated with one or both daughter strands following replication is an issue key to potential mechanisms of epigenetic regulation that remains controversial, in large part due to the challenge of finding techniques that will allow accurate tracking of individual histones in living cells.
Several trxG proteins are able to covalently modify histone tails, and these proteins are frequently found in complexes that are able to perform more than one type of modification reaction. For example, Drosophila TRX and its counterparts in other organisms methylate histone H3 at lysine 4 (H3K4): This covalent mark is tightly associated with active genes in a wide variety of organisms, including yeast, flies, and humans. A second trxG protein, ASHI (see below), also has H3K4 methyltransferase activity (Beisel et al. 2002; Byrd and Shearn 2003). H3K4 methylation has been implicated in maintenance of active gene expression in yeast by the timing of its appearance and removal on active genes (Santos-Rosa et al. 2002; Pokholok et al. 2005). The finding that trxG members have this histone modification activity further ties the H3K4 mark to maintenance of the active state. In yeast and in humans, counterparts of TRX are found in a complex that also contains a third trxG protein, Ash2, which is not related in sequence to Ashl. The yeast hom*olog of trithorax, Setl, is found in a complex (COMPASS or SetlC) that is approximately 400 kD in size and contains five other proteins in addition to Set! and Ash2 (Miller et al. 2001; Roguev et al. 2001). The only known biochemical activity of this complex is methylation of H3K4; it is not yet clear what the function of each of the other proteins might be, although one component might help propagate the methylation mark (see below). In humans, there are three TRX hom*ologs, called MLLl, MLL2, and hSETl. The MLLl protein has received the most attention in biochemical analyses and is found in a large complex (> 10 members) that also contains the human hom*olog of ASH2 (Hughes et al. 2004; Yokoyama et al. 2004). This complex and the yeast complex both contain a WD40 repeat protein which is called WDR5 in humans (Dou et al. 2005; Wysocka et al. 2005). Recently, it has been shown that the WDR5 protein can bind to histone H3 that has been methylated at lysine 4 (Wysocka et al. 2005). Thus, binding of this protein to the mark created by the MLLl complex in which it resides might provide a mechanism to facilitate spreading of the mark. This is similar to proposals made concerning the repressive complexes that methylate H3K9, which contain HP1, a protein that binds specifically to methylated K9 (for more details, see Chapters 5 and 6). There is evidence from both Drosophila and humans that the complex containing TRX/MLL is also involved in acetylation. In humans, MLL is associated with the MOF acetyltransferase, which acetylates lysine 16 of histone H4, another modification normally linked to activation (Dou et al. 2005). In flies, TRX is associated with
T R I THO R A X
dCBP, an acetyltransferase with broad specificity that is involved in activation (Petruk et al. 2001). Acetylation might work synergistically with H3K4 methylation to direct an active state following function of these trxG complexes. Acetylation is also known to prevent the methylation of residues such as H3K9 and H3K27 that direct repression of the template. The ASHI protein, another trxG member, is also a histone methyltransferase that methylates H3K4 (Beisel et al. 2002; Byrd and Shearn 2003). The composition of any ASHI-containing complexes has not been established, nor is it understood how the activities of ASHI and the complexes containing TRX/MLLl/SETl are coordinated. However, ASH 1 has also been seen to colocalize and associate with the CBP family of acetyltransferases (Bantignies et al. 2000), once again suggesting that methylation and acetylation go hand in hand. There are numerous fascinating, as yet unanswered, questions concerning how covalent modification of histones might contribute to trxG function. What functional role do the marks play? Covalent modification can contribute to epigenetic regulation via a wide spectrum of mechanisms. Methylation and acetylation marks might serve to directly alter chromatin compaction (sometimes termed cis-effects, as in Fig. 8 of Chapter 3). The ability of chromatin to enter a compacted state, which is generally assumed to be repressive for transcription, is influenced by the charge distribution on the histone tails. Modifications that occur on lysine (e.g., acetylation) can eliminate the positive charge normally found with this residue, and therefore might directly decrease the ability of nucleosomes to form compacted structures, thus increasing the ability of the template to be transcribed. Covalent marks have been proposed to create strong binding sites for complexes that direct transcriptional activation. These covalent modifications are able to create specific "knobs" on the surface of the nucleosome that fit into pockets on the complexes that promote activation, thus increasing binding energy and function of these complexes. For example, acetylation of histone tails increases binding by hom*ologs of the BRM protein, thus promoting ATP-dependent remodeling of acetylated templates (Hassan et al. 2001). This type of mechanism, frequently referred to as the "histone code" or trans-effects of covalent histone modifications, has the potential to be a central epigenetic function. Further studies are needed to determine which marks created by trxG proteins enhance binding of which complexes, to determine the extent to which the energy of binding to a single modified residue can influence function and
GR0 UP
PRO TEl N S
243
targeting, and to determine the temporal order of addition of the marks and whether they are maintained across mitosis. The flip side of this mechanism is that the marks could inhibit binding by repressive complexes. A covalent mark on a key residue required for optimal binding by a repressive complex could strongly inhibit binding by the repressive complex. For example, it is known that binding by repressive complexes is increased by methylation of histone H3 at K9 and K27 (Khorasanizadeh 2004). Acetylation of these residues would both block methylation and create an ill-shaped "knob" on the histone that impairs binding by the repressive complex. Thus, the ability of modifications to influence function of other complexes can cut in both diJ;ections, increasing the potency of this potential mode of epigenetic regulation. These mechanisms not only ';lre not mutually exclusive, but are likely to work together to help maintain an active state. Marks that chemically increase the ability to form a compacted state (a cis-effect) might also increase the ability of complexes to bind (a trans-effect), and further promote a compacted state. Conversely, marks that chemically decrease compaction might increase binding of complexes that also decompact nucleosomes. This mechanistically parsimonious use of covalent marks to alter several characteristics of chromatin structure and of the ability of regulatory complexes to bind could create a powerful means of maintaining an active state.
3 Connections between trxG Proteins and the General Transcription Machinery
The theme that trxG proteins frequently are found in the same complex is continued with the skuld (skd) and kohtalo (kto) proteins. These two proteins are hom*ologs of the proteins identified biochemically as TRAP240 (Skuld) and TRAP 230 (Kohtalo), which are both members of the "Mediator" complex (Janody et al. 2003). The mediator complex is a large complex that functions at the interface between gene-specific activator proteins and formation of the pre-initiation complex that contains RNA polymerase II (Lewis and Reinberg 2003). Thus, these proteins are involved in general activation processes, much in the same way that the SWI/SNF-family remodelers are involved in general activation. SKD and KTO might have some special function involved in maintenance, because other components of the mediator complex were not identified in screens for trxG genes. The observation that SKD and KTO interact with each other, and that skd kto double mutants have the same phenotype
244
•
C HAP T E R 1 2
as either single mutant, has led to the hypothesis that the two proteins together form a functional module that somehow alters mediator action (Janody et al. 2003). 4 Biochemical Functions of Other trxG Proteins
The biochemical activities of the majority of other trxG proteins remain relatively mysterious. Ring3, a human counterpart of the Drosophila trxG gene female-sterile homeotic (fsh), encodes a nuclear protein kinase with two bromodomains that has been implicated in cell cycle progression and leukemogenesis, but the substrates of this kinase are currently unknown (Denis and Green 1996). The trxG gene Tonalli (Tna) encodes a protein related to SP-RING finger proteins involved in sumoylation, suggesting that it may also regulate transcription via the covalent modification of other, as yet unidentified, proteins (Gutierrez et al. 2003). The trxG gene sallimus (sis) was identified in a screen for extragenic suppressors of Pc (Kennison and Tarnkun 1988) and subsequently found to encode Drosophila Titin (Machado and Andrew 2000). Like its vertebrate counterpart, Drosophila Titin helps maintain the integrity and elasticity of the sarcomere. In addition, Titin is a chromosomal protein that is required for chromosome condensation and segregation (Machado and Andrew 2000). These intriguing findings suggest a potential role for trxG proteins in the regulation of higher-order chromatin structure. 5 Functional Interactions between trxG Proteins
Now that the basic biochemical activities of many trxG and PcG members have been identified, attention has shifted to the way in which their activities are coordinated to regulate transcription and maintain the active state. Despite the lack of in vitro systems for studying the maintenance of the determined state, good progress has been made toward addressing this issue. One popular hypothesis is that the trxG and PcG members facilitate a sequence of dependent events required for the maintenance of the active or repressed state. Support for this idea has come from recent studies of the PcG complexes PRC1 and PRC2; by methylating H3K27, the E(z) histone methyltransferase subunit of PRC2 creates a covalent mark that is directly recognized by the chromodomain of the Pc subunit of PRC1 (Jacobs and Khorasanizadeh 2002; Min et al. 2003). Thus, one PcG complex appears to directly promote the binding of another PcG complex to chromatin. By analogy, it is possible that the covalent modification of nucleosomes by trxG members with histone methyltransferase or acetyltransferase activities (e.g.,
TRX or ASH 1) directly regulates the targeting or activities of trxG members involved in ATP-dependent chromatin remodeling (e.g., BRM [SWI/SNFJ, or KIS) (Fig. 8). Consistent with this possibility, BRM and other subunits of SWI/SNF complexes contain bromodomains that can directly interact with acetylated histone tails, and KIS contains two chromodomains that may directly interact with methylated histone tails. This model, which is supported by recent studies of chromatin-remodeling factors in both yeast and mammals (Agalioti et al. 2000; Hassan et al. 2001), is particularly attractive because it provides a mechanism by which a heritable 'histone modification could perpetuate a constitutively "open" chromatin configuration that is permissive for active transcription. 6 trxG Proteins: Activators or Anti-repressors?
Another important issue concerns the functional relationship between PcG repressors and trxG activators. Do these regulatory proteins have independent roles in activation and repression, or do they act in direct opposition to maintain the heritable state? Recent genetic studies show that removal of PcG complexes will reactivate genes even in the absence of TRX and ASH1 (Klymenko and Muller 2004), suggesting that trxG proteins with histone methyltransferase activity may function as PcG antirepressors, as opposed to activators (Fig. 8). Both biochemical and genetic analyses provide evidence that there might be direct connections between trxG function and PcG function. One interesting property of PcG proteins is that they are capable of repressing transcription when tethered near virtually any gene transcribed by RNA polymerase II. trxG members that play global roles in transcription-including BRM, KIS, and other trxG members involved in chromatin-remodelingare thus excellent candidates for direct targets of PcG repressors (Fig. 8). One of the major PcG complexes, PRCl, blocks the function of SWI/SNF- family remodeling complexes, apparently by blocking access of this complex to the template (Francis et al. 2001). This is consistent with the notion that one mechanism for PcG repression might be to prevent ATP-dependent remodeling by trxG members. The Brahma complex and PRC1 are further connected by the fact that both directly interact with the Zeste protein, a protein which plays a complicated role in regulation of gene expression in Drosophila that might help direct cross talk between the two complexes. A second protein that connects PcG proteins and trxG proteins is the GAGA factor, which is encoded by the Trithorax-like gene and is thus a trxG member (Farkas et al. 1994). This protein can function as a
T R I THO R A X
G R a U P PRO TEl N 5
•
245
/ anti-repression
Modification of nucleosomes by PcG proteins (PRC2)
Modification of nucleosomes by trxG proteins (ASH1, TRX)
11
11
ATP
inhibition of remodeling
Recognition of modification by trxG proteins with chr9matin-remodeling activity (SWI/SNF, KIS)
Recognition of modifications by PcG proteins (PRC1)
1
1 heritable repression
active transcription Figure 8. Trithorax Group and Polycomb Group Functions and Interactions
Both trxG and PcG families include proteins that covalently modify histones and proteins that noncovalently modify chromatin. Covalent modifications on histones can increase binding by noncovalent modifying complexes such as SWI/SNF, KIS, or PRCl. Binding by these latter complexes has the potential to lead to further covalent modification, thus leading to iterative cycles of covalent modification and recognition of the covalent marks.
sequence-specific activator protein at some promoters, but also is a prominent member of the proteins that bind to the Polycomb Repressive Element (PRE, see Chapter 8). PRE sequences direct PeG function, and at least one PRE can act as a memory module when affixed to a reporter construct, emphasizing the importance of these sequences. Sequences that bind the GAGA factor play an important role in PRE function, and tethering the GAGA protein to DNA has been proposed to enhance binding and function of PRC1 (Mahmoudi and Verrijzer 2001). Thus, the GAGA factor might play key roles in maintenance of activation (via its transcrip-
tional activating properties) and maintenance of repression (via interactions with the PeG proteins). An important issue for future research is to understand why proteins such as GAGA and Zeste appear to interact with both the activating and repressing machineries of maintenance.
7 Conclusion and Outlook
Two of the major issues regarding function of trxG proteins remain largely a matter for conjecture. First, why does a relatively small subset of the proteins required for tran-
246 • C HAP T E R 7 2
scriptional activation score genetically as being important for maintenance' of the active state? Is this because these proteins play global roles in transcription but are expressed in limiting quantities or happen, by evolutionary serendipity, to be especially important for developmentally important genes? Second, how can the active state be maintained across replication and mitosis? Replication will create two daughter strands that must both be regulated identically, and mitosis requires condensation and thereby inhibition of transcription of most genes in a celL What mechanisms create the epigenetic mark(s) that ensures reactivation of a gene on both daughter strands following mitosis? The majority of trxG proteins are part of complexes that are broadly used in gene expression, and most of these complexes also contain many other proteins not in. the trxG (see Table 1). This raises the important question as to whether there are special functions that are used for maintenance of active gene expression. It is possible that SWIISNF remodelers are able to perform a special remodeling function, that H3K4 methylation targets special complexes and/or chromatin conformations, and that Skuld/Kohtalo alter function of Mediator in a specific manner important for maintenance. Alternatively, it is possible that each of these proteins performs a reaction that is normally used in activation of all types of genes, and that these complexes are among those that have emerged as being important for maintenance for a relatively uninteresting reason (e.g., because even relatively subtle changes in the expression of Drosophila Hox genes cause homeotic transformations). To resolve these issues, considerably more information is needed about the precise mechanisms that each of these proteins uses in activation. For example, do the SWIISNF complexes harness the energy of ATP hydrolysis in the same manner as other ATP-dependent remodeling complexes, or do they differ in an important way in how this energy is used to alter nucleosome structure? Structural techniques including crystallography, biophysical techniques such as single-molecule analysis and FRET (fluorescence resonance energy transfer), and detailed imaging in vivo might help to shed light on whether there are mechanisms specially designed for epigenetic maintenance of activation. The initial functional studies that have been done with trxG complexes on simple model templates are just the beginning of the process for answering these important questions. The epigenetic mechanisms that might maintain an active state are even less well understood. Are covalent marks distributed to help create an active mark? Are nucleosome positions maintained following replication to
create "open" stretches of chromatin, or specially positioned nucleosomes, that increase binding of activators? Does trxG function cause active genes to compartmentalize within the nucleus to regions that favor active transcription? These are all viable hypotheses; more hypotheses exist, and others have not yet even been envisioned. The incredible complexity of the machinery that transcribes DNA offers numerous possibilities for regulation, and for the development of mechanisms that allow an epigenetic maintenance of active transcription. This intersection of two fields rich in intellectual history, transcriptional activation and epigenetic mechanism, will provide fertile ground for experimentalists for many years.
References Agalioti T., Lomvardas S., Parekh B., Yie J., Maniatis T., and Thanos D. 2000. Ordered recruitment of chromatin modifying and general transcription factors to the IFN-~ promoter. Cell 103: 667-678. Armstrong J.A., Papoulas 0., Daubresse G., Sperling A.5., Lis J.T., Scott M.P., and Tamkun J.W. 2002. The Drosophila BRM complex facilitates global transcription by RNA polymerase II. EMBO ]. 21: 5245-5254. Bantignies E, Goodman R.H., and Smolik S.M. 2000. Functional interaction between the coactivator Drosophila CREB-binding protein and ASHI, a member of the trithorax group of chromatin modifiers. Mol. Cell. BioI. 20: 9317-9330. Beisel C, Imhof A., Greene J., Kremmer E., and Sauer E 2002. Histone methylation by the Drosophila epigenetic transcriptional regulator Ash1. Nature 419: 857-862. Bultman S., Gebuhr T., Yee D., La Mantia C, Nicholson J., Gilliam A., Randazzo E, Metzger D., Chambon P., Crabtree G., and Magnuson T. 2000. A Brgl null mutation in the mouse reveals functional differences among mammalian SWI/SNF complexes. Mol. Cell 6: 1287-1295. Byrd K.N. and Shearn A. 2003. ASH I, a Drosophila trithorax group protein, is required for methylation oflysine 4 resid ues on histone H3. Proc. Natl. Acad. Sci. 100: 11535-11540. Chaya D., Hayamizu T., Bustin M., and Zaret K.S. 2001. Transcription factor FoxA (HNF3) on a nucleosome at an enhancer complex in liver chromatin.]. BioI. Chern. 276: 44385-44389. Cirillo L.A., Lin ER., Cuesta I., Friedman D., Jarnik M., and Zaret K.S. 2002. Opening of compacted chromatin by early developmental transcription factors HNF3 (FoxA) and GATA-4. Mol. Cell 9: 279-289. Cote J., Quinn J., Workman J.L., and Peterson CL. 1994. Stimulation of GAL4 derivative binding to nucleosomal DNA by the yeast SWI/SNF complex. Science 265: 53-60. Daubresse G., Deuring R., Moore L., Papoulas 0., Zakrajsek I., Waldrip W.R., Scott M.P., Kennison J.A., and Tamkun J.W. 1999. The Drosophila kismet gene is related to chromatin-remodeling factors and is required for both segmentation and segment identity. Development 126: 1175-1187. Denis G.Y. and Green M.R. 1996. A novel, mitogen-activated nuclear kinase is related to a Drosophila developmental regulator. Genes Dev. 10: 261-271. Dou Y., Milne T.A., Tackett A.J., Smith E.R., f*ckuda A., Wysocka J., Allis CD., Chait B.T., Hess J.L., and Roeder R.G. 2005. Physical associa-
T R I THO R A X
tion and coordinate function of the H3 K4 methyltransferase MLLI and the H4 K16 acetyl transferase MOE Cell 121: 873-885. Dunaief J.L., Strober RE., Guha S., Khavari P.A., Alin K., Luban J., Begemann M., Crabtree G.R., and Goff S.P. 1994. The retinoblastoma protein and BRG1 form a complex and cooperate to induce cell cycle arrest. Cell 79: 119-130. Duncan 1. 1987. The bithorax complex. Annu. Rev. Genet. 21: 285-319. Fan H.Y., He X., Kingston R.E., and Narlikar G.J. 2003. Distinct strategies to make nucleosomal DNA accessible. Mol. Cell 11: 1311-1322. Farkas G., Gausz J., Galloni M., Reuter G., Gyurkovics H., and Karch E 1994. The Trithorax-like gene encodes the Drosophila GAGA factor. Nature 371: 806-808. Francis N.J. and Kingston R.E. 2001. Mechanisms of transcriptional memory. Nat. Rev. Mol. Cell. BioI. 2: 409--421. Francis N.J., Saurin A.J., Shao Z., and Kingston R.E. 2001. Reconstitution of a functional core polycomb rep'ressive complex. Mol. Cell 8: 545-556. Gellon G. and McGinnis W. 1998. Shaping animal body plans in development and evolution by modulation of Hox expression patterns. Bioessays 20: 116-125. Gutierrez L., Zurita M., Kennison J.A., and Vazquez M. 2003. The Drosophila trithorax group gene tonalli (tna) interacts genetically with the Brahma remodeling complex and encodes an SP-RING finger protein. Development 130: 343-354. Hassan A.H., Neely K.E., and Workman J.L. 2001. Histone acetyltransferase complexes stabilize swi/snf binding to promoter nucleosomes. Cell 104: 817-827. Holstege EC., Jennings E.G., Wyrick J.J., Lee T.1., Hengartner c.J., Green M.R., Golub T.R., Lander E.S., and Young R.A. 1998. Dissecting the regulatory circuitry of a eukaryotic genome. Cell 95: 717-728. Hughes C.M., Rozenblatt-Rosen 0., Milne T.A., Copeland TD., Levine S.S., Lee J.c., Hayes D. ., Shanmugam K.S., Bhattacharjee A., Biondi c.A., et al. 2004. Menin associates with a trithorax family histone methyltransferase complex and with the hoxcS locus. Mol. Cell 13: 587-597. 1mbalzano A.N., Kwon H., Green M.R., and Kingston R.E. 1994. Facilitated binding of TATA-binding protein to nucleosomal DNA. Nature 370: 481--485. Jacobs S.A. and Khorasanizadeh S. 2002. Structure of HP1 chromodomain bound to a lysine 9-methylated histone H3 tail. Science 295: 2080-2083. Janody E, Martirosyan Z., Benlali A., and Treisman J.E. 2003. Two subunits of the Drosophila mediator complex act together to control cell affinity. Development 130: 3691-3701. Kassabov S.R., Zhang B., Persinger J., and Bartholomew B. 2003. SWI/SNF unwraps, slides, and rewraps the nucleosome. Mol. Cell 11: 391--403. Kaufman TC., Seeger ·M.A., and Olsen G. 1990. Molecular and genetic organization of the antennapedia gene complex of Drosophila melanogaster. Adv. Genet. 27: 309-362. Kennison J.A. 1995. The Polycomb and trithorax group proteins of Drosophila: Trans-regulators of homeotic gene function. Annu. Rev. Genet. 29: 289-303. - - - . 2003. Introduction to Trx-G and Pc-G genes. Methods Enzymol. 377: 61-70. Kennison J.A. and Tamkun J.w. 1988. Dosage-dependent modifiers of Polycomb and Antennapedia mutations in Drosophila. Proc. Natl. Acad. Sci. 85: 8136-8140. Khorasanizadeh S. 2004. The nucleosome: From genomic organization to genomic regulation. Cell 116: 259-272.
GR0 UP
PRO TEl N S
•
247
Klymenko T and Muller J. 2004. The histone methyltransferases Trithorax and Ash 1 prevent transcriptional silencing by Polycomb group proteins. EMBO Rep. 5: 373-377. Kruger W., Peterson c.L., Sil A., Coburn c., Arents G., Moudrianakis E.N., and Herskowitz I. 1995. Amino acid substitutions in the structured domains of histones H3 and H4 partially relieve the requirement of the yeast SWl/SNF complex for transcription. Genes Dev. 9: 2770-2779. Kwon H., Imbalzano A.N., Khavari P.A., Kingston R.E., and Green M.R. 1994. Nucleosome disruption and enhancement of activator binding by a human SWl/SNF complex. Nature 370: 477--481. Levine S.S., King I.E, and Kingston R.E. 2004. Division of labor in Polycomb group repression. Trends Biochem. Sci. 29: 478-485. Lewis B.A. and Reinberg D. 2003. The mediator coactiva tor complex: Functional and physical roles in transcriptional regulation. f. Cell Sci. 116: 3667-3675. Logie C. and Peterson c.L. 1997. Catalytic activity of the yeast SWl/SNF complex on reconstituted nucleosome arrays. EMBO f. 16: 6772-6782. Lorch Y., Zhang M., and Kornberg R.D. 1999. Histone octamer transfer by a chromatin-remodeling complex. Cell 96: 389-392. Machado C. and Andrew D.J. 2000. D-Titin: A giant protein with dual roles in chromosomes and muscles. f. Cell BioI. 151: 639-652. Mahmoudi T and Verrijzer c.P. 2001. Chromatin silencing and activation by Polycomb and trithorax group proteins. Oncogene 20: 3055-3066. Miller T, Krogan N.J., Dover J., Erdjument-Bromage H., Tempst P., Johnston M., Greenblatt J.E, and Shilatifard A. 2001. COMPASS: A complex of proteins associated with a trithorax-related SET domain protein. Proc. Natl. Acad. Sci. 98: 12902-12907. Min J., Zhang Y., and Xu R.M. 2003. Structural basis for specific binding of Polycomb chromodomain to histone H3 methylated at Lys 27. Genes Dev. 17: 1823-1828. Mohrmann L. and Verrijzer c.P. 2005. Composition and functional specificity of SWI2/SNF2 class chromatin remodeling complexes. Biochim. Biophys. Acta 1681: 59-73. Muyrers-Chen 1., Rozovskaia T., Lee N., Kersey ].H., Nakamura T, Canaani E., and Paro R. 2004. Expression of leukemic MLL fusion proteins in Drosophila affects cell cycle control and chromosome morphology. Oncogene 23: 8639-8648. Petruk S., Sedkov Y., Smith S., Tillib S., Kraevski v., Nakamura T, Canaani E., Croce C.M., and Mazo A. 2001. Trithorax and dCBP acting in a complex to maintain expression of a homeotic gene. Science 294: 1331-1334. Phelan M.L., Sif S., Narlikar G.J., and Kingston R.E. 1999. Reconstitution of a core chromatin remodeling complex from SWI/SNF subunits. Mol. Cell 3: 247-253. Pokholok D.K., Harbison C.T, Levine S., Cole M., Hannett N.M., Lee TI., Bell G.W., Walker K., Rolfe P.A., Herbolsheimer E., et al. 2005. Genome-wide map of nucleosome acetylation and methylation in yeast. Cell 122: 517-527. Polach K.J. and Widom J. 1995. Mechanism of protein access to specific D A sequences in chromatin: A dynamic equilibrium model for gene regulation. f. Mol. BioI. 254: 130-149. Rea S., Eisenhaber E, O'Carroll D., Strahl B.D., Sun Z.w., Schmid M., Opravil S., Mechtler K., Ponting c.P., Allis C.D., and Jenuwein T 2000. Regulation of chromatin structure by site-specific histone H3 methyltransferases. Nature 406: 593-599. Ringrose L. and Paro R. 2004. Epigenetic regulation of cellular memory by the polycomb and trithorax group proteins. Annu. Rev. Genet. 38: 413-443.
248 •
C HAP T E R
12
Roguev A., Schaft D., Shevchenko A., Pijnappel w.w., Wilm M., Aasland R., and Stewart A.E 2001. The Saccharomyces cerevisiae Setl complex includes an Ash2 hom*ologue and methylates histone 3 lysine 4. EMBO]. 20: 7137-7148. Saha A., Wittmeyer J., and Cairns B.R. 2005. Chromatin remodeling through directional DNA translocation from an internal nucleosomal site. Nat. Struct. Mol. BioI. 12: 747-755. Santos-Rosa H., Schneider R., Bannister A.J., Sherriff J., Bernstein B.E., Emre N.C, Schreiber S.L., Mellor J., and Kouzarides T 2002. Active genes are tri-methylated at K4 of histone H3. Nature 419: 407-411. Simon J. 1995. Locking in stable states of gene expression: Transcriptional control during Drosophila development. Curro Opin. Cell BioI. 7: 376-385. Simon J.A. and Tamkun J.W. 2002. Programming off and on states in chromatin: Mechanisms of Polycomb and trithorax group complexes. Curro Opin. Genet. Dev. 12: 210-218. Srinivasan S., Armstrong J.A., Deuring R., Dahlsveen 1.K, McNeill H., and Tamkun J.W. 2005. The Drosophila trithorax group protein Kismet facilitates an early step in transcriptional elongation by RNA Polymerase II. Development 132: 1623-1635. Strober B.E., Dunaief J.L., Guha S, and Goff S.P. 1996. Functional interactions between the hBRM/hBRGI transcriptional activators and the pRB family of proteins. Mol. Cell. BioI. 16: 1576-1583. Sudarsanam P., Iyer V.R., Brown P.O., and Winston E 2000. Wholegenome expression analysis of snflswi mutants of Saccharomyces cerevisiae. Proc. Natl. Acad. Sci. 97: 3364-3369. Tamkun J.W., Deuring R., Scott M.P., Kissinger M., Pattatucci A.M., Kaufman TC, and Kennison J.A. 1992. brahma: A regulator of Drosophila homeotic genes structurally related to the yeast transcriptional activator SNF2/SWI2. Cell 68: 561-572. Therrien M., Morrison D.K, Wong A.M., and Rubin G.M. 2000. A
genetic screen for modifiers of a kinase suppressor ofRas-dependent rough eye phenotype in Drosophila. Genetics 156: 1231-1242. Versteege 1., Sevenet N., Lange J., Rousseau-Merck M.E, Ambros P., Handgretinger R., Aurias A., and Delattre O. 1998. Truncating mutations of hSNF5/INIl in aggressive paediatric cancer. Nature 394: 203-206. Vignali M., Hassan A.H., Neely KE., and Workman J.L. 2000. ATPdependent chromatin-remodeling complexes. Mol. Cell. BioI. 20: 1899-1910. Wang W., Cote J., Xue Y., Zhou S., Khavari P.A., Biggar S.R., Muchardt C, Kalpana G.v., Goff S.P., Yaniv M., et al. 1996. Purification and biochemical heterogeneity of the mammalian SWI-SNF complex. EMBO f. 15: 5370-5382. Whitehouse 1., Stockdale C, Flaus A., Szczelkun M.D., and OwenHughes T 2003. Evidence for DNA translocation by the ISWI chromatin-remodeling enzyme. Mol. Cell. BioI. 23: 1935-1945. Wysocka J., Swigut T, Milne TA., Dou Y., Zhang X., Burlingame A.I.., Roeder R.G., Brivanlou A.H., and Allis CD. 2005. WDR5 associates with histone H3 methylated at K4 and is essential for H3 K4 methylation and vertebrate development. Cell 121: 859-872. Yokoyama A., Wang Z., Wysocka J., Sanyal M., Aufiero D.J., Kitabayashi 1., Herr W., and Cleary M.L. 2004. Leukemia proto-oncoprotein MLL forms a SETI-like histone methyltransferase complex with menin to regulate Hox gene expression. Mol. Cell. BioI. 24: 5639-5649. Yu B.D., Hanson R.D., Hess J.L., Horning S.E., and Korsmeyer S.J. 1998. MLL, a mammalian trithorax-group gene, functions as a transcriptional maintenance factor in morphogenesis. Proc. Natl. Acad. Sci. 95: 10632-10636. Yu B.D., Hess J.L., Horning S.E., Brown G.A., and Korsmeyer S.r. 1995. Altered Hox expression and segmental identity in Mil-mutant mice. Nature 378: 505-508.
c
H
APT
E
R
13
Histone Variants and Epigenetics Steven Henikoff' and M. Mitchell Smith 2 Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109-1024 2Department ofMicrobiology, University of Virginia, Charlottesville, Virginia 22908
1
CONTENTS 1. DNA Is Packaged by Architectural Proteins in All Organisms, 251
7. Phosphorylation of H2AX Functions in DNA Double-Strand Break Repair, 258
2. Eukaryotic Core Histones Evolved from Archaeal Histones, 251
8. H2AZ Plays Roles in Transcriptional Regulation, 259
3. Bulk Histones Are Deposited after DNA Replication, 253
9. Protein Complexes for the Deposition and Replacement of H2A Variants, 261
4. Variant Histones Are Deposited Throughout the Cell Cycle, 254 5. Centromeres Are Identified by a Special H3 Variant, 254 6. The Replacement Histone Variant H3.3 Is Found at Active Chromatin, 256
10. Other H2A Variants Differentiate Chromatin but Their Functions Are As Yet Unknown, 262 11. Many Histones Have Evolved to More Tightly Package DNA, 262 12. Conclusions and Future Research, 263 References, 263
249'
GENERAL SUMMARY Histones package DNA by assembling into nucleosome core particles while the double helix wraps around. Over evolutionary time, histone fold domain proteins have diversified from archaeal ancestors into the four distinct subunits that comprise the familiar octamer of the eukaryotic nucleosome. Further diversification of histones into variants results in differentiation of chromatin that can have epigenetic consequences. Investigations into the evolution, structure, and metabolism of histone variants provide a foundation for understanding the participation of chromatin in important cellular processes and in epigenetic memory. Most histones are synthesized at S phase for rapid deposition behind replication forks to fill in gaps resulting from the distribution of preexisting histones. In addition, the replacement of canonical S-phase histones by variants, independent of replication, can potentially differentiate chromatin. The differentiation of chromatin by a histone variant is especially conspicuous at centromeres, where the H3 variant, CENP-A, is assembled into specialized nucleosomes that form the foundation for kinetochore assembly (see left panel of title figure). A centromeric H3 (CenH3) counterpart of CENP-A is found in all eukaryotes. In plants and animals, the faithful assembly of CenH3-containing nucleosomes at centromeres does not appear to require centromeric DNA sequences, a spectacular example of epigenetic inheritance. Some CenH3s have evolved adaptively in regions that contact DNA, which suggests that centromeres compete with each other, and that CenH3s and other centromere-specific DNA-binding proteins have adapted in response. This process could account for the large size and complexity of centromeres in plants and animals. Chromatin can also be differentiated outside of centromeres by incorporation of a constitutively expressed form of H3, called H3.3, which is the substrate for repli-
cation-independent nucleosome assembly. Replacement with H3.3 occurs at active genes (see right panel of title figure, showing H3.3 in green on a fruit fly chromosome), a dynamic process with potential epigenetic consequences. Differences between H3 and H3.3 in their complement of covalent modifications might underlie changes in the properties of chromatin at actively transcribed loci. Several H2A variants can also differentiate or regulate chromatin. H2A.X is defined as a variant by a 4-amino acid carboxy-terminal motif whose serine residue is the site for phosphorylation at sites of DNA double-stranded breaks. Phosphorylation of H2AX is an early event in double-strand break repair, where it is thought to concentrate components of the repair machinery. H2AX phosphorylation also marks the inactive XY bivalent during mammalian spermatogenesis and is required for condensation, pairing, and fertility. H2AZ is a structurally diverged variant that has long presented an enigma. Studies in yeast have implicated H2AZ in establishing transcriptional competence and in counteracting heterochromatic silencing. The biochemical complex that replaces H2A with H2AZ in nucleosomes is an ATP-dependent nucleosome remodeler, providing the first example of a specific function for a member of this diverse class of chromatin-associated machines. Two vertebrate-specific variants, macroH2A and H2ABbd , display contrasting features when packaged into nucleosomes in vitro, with macroH2A impeding and H2ABbd facilitating transcription. These features are consistent with their localization patterns on the epigenetically inactivated mammalian X chromosome, with macroH2A showing enrichment and H2ABbd showing depletion. The emerging view from these studies is that histone variants and the processes that deposit them into nucleosomes provide a primary differentiation of chromatin that might serve as the basis for epigenetic processes.
HIS TON E
V A R I ANT 5
AND
E PIG ENE TIC 5
•
251
1 DNA Is Packaged by Architectural Proteins in All Organisms
2 Eukaryotic Core Histones Evolved from Archaeal Histones
The enormous length of the DNA double helix relative to the size of the organelle that contains it requires tight packaging, and architectural proteins have evolved for this purpose. The first level of packaging shortens the double helix and protects it from damage, while still allowing DNA polymerase to gain full access to each base pair every cell cycle. In addition, these architectural proteins facilitate higher-order folding to further reduce the length of a chromosome. Perhaps because of stringent requirements for packaging DNA, only two structural classes of architectural proteins are found in nearly all cellular life forms (Malik and Henikoff 2003). Bacterial DNA is packaged by HU proteins, eukaryotic DNA is packaged by histones, and archaeal DNA is packaged by either HU proteins or histones. Histones package DNA into nucleosome particles, and this architectural role can account for the fact that histones comprise half of the mass of a eukaryotic chromosome. However, histones have also been found to play diverse roles in gene expression, chromosome segregation, DNA repair, and other basic chromosomal processes in eukaryotes. Specific requirements of these chromosomal processes have led to the evolution of distinct histone variants. The incorporation of a variant histone into a nucleosome represents a potentially profound alteration of chromatin. Indeed, recent work has revealed that some histone variants are deposited by distinct nucleosome assembly complexes, which suggests that chromatin is diversified, at least in part, by the incorporation and replacement of histone variants. The four core histones differ with respect to their propensity to diversify into variants. For example, humans have only one H4 isotype but several H2A paralogs with different properties and functions. Evidently, the different positions of the core histones within the nucleosome particle have subjected them to different evolutionary forces, leading to important diversifications of H2A and H3 but not of H2B and H4. The availability of genomic sequences from a wide variety of eukaryotes allows us to conclude that these diversifications have occurred at various times during eukaryotic evolution. However, the evident diversification of an ancestral histone fold protein into the familiar four core histones must have occurred early in the evolution of the eukaryotic nucleus or perhaps before. By considering these ancient events, we gain insight into the forces that have resulted in subsequent diversification into present-day variants.
The eukaryotic nucleosome is a complex structure, consisting of an octamer of four core histones wrapped nearly twice by DNA, with histone tails and linker histones mediating a variety of packaging interactions outside the core particle (Arents et al. 1991; Wolffe 1992; Luger et al. 1997). Archaeal nucleosomes are much simpler, and it is evident that they resemble the ancestral particle from which eukaryotic nucleosomes evolved (Malik and Henikoff 2003). An archaeal nucleosome consists of histone fold domain proteins that lack tails and form a tetrameric particle that is wrapped only once by DNA. The kinship between archaeal and eukaryotic nucleosomes can be seen by comparing their structures: The backbone of the archaeal tetramer nearly superimposes over that of the (H3'H4)z tetramer (Fig. 1). When archaeal nucleosomes are reconstituted to form chromatin, the resulting fiber behaves similarly to "tetrasomes" of (H3' H4)z. Therefore, it is thought that eukaryotic nucleosomes evolved from archaeal nucleosomes by addition of H2A'H2B dimers on either side of the tetrasorne to allow a second DNA wrap, and by acqui-
Archaeal histones
Archaeal 'doublet' histones
Eukaryotic tetramer
Eukaryotic octamer
Figure 1. Model for the Evolution of the Eukaryotic Nucleosome from an Archaeal Doublet Histone Ancestor An archaeal tetra mer with interchangeable subunits A and B (AlB) may have evolved into a dimer of fused dimers ("doublet"). This could have been followed by a gene split to give rise to the eukaryotic tetra mer of H3 and H4, forming an (H3·H4)2 "tetrasome" that occupies a single turn of DNA. H2A and H2B may have arisen from a similar event, assembling above and below the tetramer as suggested in the cartoon so being able to accommodate two turns of DNA (not illustrated). Single dots in the top part of the diagram represent dimeric contacts and double dots represent four-helix bundles between adjacent dimers (Reprinted from Malik and Henikoff 2003).
252 • C HAP T E R 7 3
s1t1On of histone tails. In addition, DNA wraps into a right-handed superhelix around archaeal cores, but into a left-handed superhelix around eukaryotic cores. Further insight into the origin of the eukaryotic nucleosomes comes from examination of the subunit structures of archaeal nucleosomes. Whereas most archaeal histones are undifferentiated monomers or are differentiated into structurally interchangeable variants that come together to form a tetramer, some are head-totail dimeric fusions that come together to form a dimer of fused dimers (Fig. 1). When two of these fused dimers assemble into a nucleosome particle, each member of the fused pair is in a structurally distinguishable position. By occupying distinct positions in the particle, each member of the archaeal fused dimer evolves independently, allowing it to adapt to a single position in the nucleosome particle. In contrast, monomers that occupy interchangeable positions are not free to adapt to particular positions. Indeed, the two members of archaeal dimers have diverged from one another in both independent lineages in which they are found. This process provides a possible scenario for the differentiation of an ancestral histone fold domain protein into four distinct subunits that occupy distinct positions in the eukaryotic nucleosome. Like their presumed archaeal ancestors, eukaryotic histones form dimers, where H2A dimerizes with H2B and H3 with H4 (which also stably tetramerizes in solution). The structural backbone of an archaeal histone dimer superimposes with those of H2A'H2B and H3'H4 at 2 A resolution, with the first member of the dimeric repeat superimposing on H2A or H3 and the second member superimposing on H2B or H4. So, although all four eukaryotic histones lack significant sequence similarity to one another and to archaeal histones, the striking structural superposition of dimeric units suggests that eukaryotic histones evolved and differentiated from simpler archaeal ancestors. The asymmetry of H2A'H2B and H3'H4 dimers, which appears to have originated from archaeal tandem dimers, could have led the way to subsequent diversification of eukaryotic histone variants. Both H2A and H3 correspond to the first member of archaeal tandem histone dimers, and both have subsequently diversified multiple times in eukaryotic evolution. In contrast, H2B and H4 correspond to the second member and have shown little (H2B) or no (H4) functional diversification. Both H3 and H2A make hom*odimeric contacts in the octamer (Fig. 2), whereas H4 and H2B only contact other histones. As a result, changes in the residues involved in hom*odimerization of either H2A or H3 can
Loop 1 CenH3
N-terminal tail H3.3 CenH3 .
------=
Loop 1""'-< CenH3
--r.....---~~
N
N-terminal tail H3.3 CenH3
Loop
~~ii:~~- a-helix 2 H3.3
r.
~
N-terminal tail H3.3 CenH3 N
1_-+-~_,
CenH3
Figure 2. location of Histone H3 (blue) and H2A (brown) in the Nucleosome Core Particle The four residues that differ between H3 variants are indicated in yellow. (Reprinted, with permission, from Henikoff and Ahmad 2005.)
potentially resist formation of mixed octamers, allowing nucleosomes containing an H2A or H3 variant to evolve independently of parental nucleosomes. For example, the four-helix bundle comprising the interface between H3s determines the left-handed supercoiling of the DNA around the nucleosome (Arents et al. 1991; Luger et al. 1997), whereas DNA supercoils are right-handed in archaeal nucleosomes (Marc et al. 2002). Evidently, mutation of the four-helix bundle in an H3 ancestor was responsible for this reversal. In general, structural features that facilitated independent evolution of subunits may have been prerequisites for diversification of nucleosome particles.
HIS TON E
Although we can rationalize the descent of the eukaryotic core histones from archaeal tandem dimers, other basic questions remain. Where did histone tails come from? When did H2A·H2B arrive on the scene? Did these events occur before, during, or after the evolution of the eukaryotic nucleus? Why do all known archaeal nucleosomes consist of tetramers with one wrap, whereas eukaryotic nucleosomes consist of octamers with two wraps? Why did the superhelical handedness switch? Perhaps the sequences of more archaea or of primitive eukaryotes will reveal intermediate forms that can answer these questions. 3 Bulk Histones Are Deposited after DNA Replication
The packaging of essentially all DNA in a eukaryotic cell into nucleosomes requires that chromatin is duplicated when DNA replicates (Fig. 3). Thus, canonical histones are produced during the DNA synthesis (S) phase of the cell cycle. S-phase coupling of histone synthesis to DNA synthesis is under tight cell cycle control (Marzluff and Duronio 2002). This is especially evident in animals, where special processing of histone transcripts by the U7 small
Figure 3. Old Nucleosomes (dark disks) Are Randomly Distributed behind the Replication Fork and New Nucleosomes (light disks) Are Deposited in the Gaps CAF-l-mediated nucleosome assembly is depicted on the leading and lagging strand in magnification. DNA polymerase (green); replication processivity clamp, PCNA (blue ring); histone H3·H4 tetramers (pink); newly synthesized DNA (red).
V A R I ANT 5
AND
E PIG ENE TIC 5
253
nuclear ribonuclear protein complex, and mRNA stabilization by the stem-loop-binding protein (SLBP), contribute to the tight coordination of histone synthesis with DNA replication. The need for rapid and massive production of histones during S phase is very likely responsible for the fact that replication-coupled (RC) histones in animals are encoded in clusters that comprise many histone genes. For example, there are 14 H4 genes in the human genome, most of which are found in two major clusters, where these H4 genes are interspersed with other RC histone genes (Marzluff et al. 2002). In animals, RC histones are recognizable by the presence of a 26-bp 3' sequence that forms a stem-loop for recognition by SLBP when transcribed into histone mRNA. Canonical plant histones are also encoded by multiple genes and are deposited during S phase, although plant histone transcripts are polyadenylated and there does not appear to be a counterpart to SLBP. To the extent that epigenetic inheritance results from inheritance of a chromatin "state," the process of RC nucleosome assembly has been of intense interest. The biochemistry of the process was elucidated with the development of in vitro systems that could assemble nucleosomes onto replicating DNA. These studies revealed that a three-subunit complex, chromatin assembly factor 1 (CAF-l), acts as a histone chaperone that facilitates the incorporation of H3·H4 as a first step in nucleosome assembly (Loyola and Almouzni 2004). CAF-1 was shown to interact with the replication processivity clamp, PCNA, which implies that DNA replication and RC assembly occur in close proximity. Work in budding yeast revealed that none of the subunits of complexes involved in RC assembly in vitro is essential for growth, suggesting that, in vivo, there are redundant mechanisms for RC assembly. The fact that much of yeast chromatin is assembled in a replication-independent (RI) manner (Altheim and Schultz 1999) provides a rationale for this evident redundancy. As shown below, histone variants are typically deposited by RI nucleosome assembly. RC assembly is not completely redundant in budding yeast. An intriguing finding is that absence of the large CAF-1 subunit leads to loss of epigenetic silencing at telomeres (Loyola and Almouzni 2004). The connection between RC assembly and epigenetic silencing has been extended to Arabidopsis, where loss of CAP-1 subunits results in a variety of defects attributable to loss of epigenetic memory. Although the mechanistic basis for these observations is unknown, it seems clear that the proper deposition of new nucleosomes behind the replication fork is important for maintaining an epigenetically silenced state.
254 •
C HAP T E R
73
A prerequisite for epigenetic inheritance of a nucleosome state is that preexisting nucleosomes must be distributed to daughter chromatids following replication (Fig. 3). Indeed, this is the case: Extensive studies have shown that old nucleosomes are inherited intact and evidently at random to daughter chromatids (Fig. 3) (Annunziato 2005). However, this process of inheritance is poorly understood, as is the process by which new histones might acquire epigenetic information. A popular model is that new nucleosomes are modified by their proximity to old nucleosomes (Jenuwein 2001); however, evidence for this hypothetical process is lacking, and alternative means of perpetuating an epigenetic state must be considered (Henikoff and Ahmad 2005). How epigenetic information is inherited to daughter cells remains a major unanswered question in biology, and the study of histone variants and the mechanisms of their deposition may provide clues. 4 Variant Histones Are Deposited Throughout the Cell Cycle
As we have seen, core histones can be classified on the basis of their ancestral sequence and position in the nucleosome. Linker histones are characterized by a winged helix domain, rather than a histone fold domain, and bind to the linker DNA that separates nucleosomes (Wolffe 1992). Although minor variants of these canonical histones exist, they appear to be interchangeable with the major form. For example, mammalian H3.1 and H3.2 differ by a single amino acid that is not known to impart different biological properties to the two isoforms. The existence of multiple genes that produce large amounts of canonical histones for S-phase deposition is typical of eukaryotic genomes. The near ubiquity and overwhelming abundance of canonical S-phase histones has resulted in relatively little attention being paid to histone variants until recently. The renaissance of interest in histone variants came in part from the realization that they differ from canonical S-phase histones in ways that can lead to profound differentiation of chromatin. One way that they differ is in their mode of incorporation into chromatin. RC assembly incorporates new nucleosomes into gaps between old nucleosomes genome-wide, whereas RI assembly involves local replacement of an existing nucleosome or subunit (Marzluff et al. 2002). RI assembly therefore has the potential of switching a chromatin state by replacing a canonical histone with a variant. Replacing one histone with another also could erase or alter the pattern of posttranslational modifications. Therefore, RI assembly can potentially reset epigenetic states that are thought to be
mediated by histones and their modifications. Recent progress in studying histone variants and the processes by which they are deposited has led to new insights into the basis for epigenetic inheritance and remodeling. Below, we discuss features of particular histone variants that contribute to chromatin differentiation and might be involved in propagating epigenetic information. 5 Centromeres Are Identified by a Special H3 Variant
A defining feature of the eukaryotic chromosome is the centromere, which is the site of attachment of spindle microtubules at mitosis. The first centromeres to be described in molecular detail were those of budding yeast (Saccharomyces cerevisiae), where a 125-bp sequence is necessary and sufficient for centromere formation (Amor et al. 2004a). However, centromeres of plants and animals are very different, typically consisting of megabase arrays of short tandem repeats. Unlike the situation for budding yeast, the role of DNA sequence at these complex centromeres is uncertain, because fully functional human neocentromeres are known to form spontaneously at ectopic sites that entirely lack sequences resembling centromeric repeats (Fig. 4). These and other observations argue against a direct role of
Figure 4. Human Neocentromeres (Indicated by an Arrow) Lack Centromeric a-Satellite DNA but Have CENP-A and Heterochromatin Anti-CENP-A staining in green and anti-CENP-B staining in red (which marks a-satellite DNA) identify a Chromosome 4 neocentromere that lacks a-satellite (main pane!). This Chromosome 4 is otherwise normal, having been transmitted for at least three meiotic generations in normal individuals. Inset shows anti-HPl staining, which indicates that despite the lack of satellite DNA, heterochromatin forms around active neocentromeres. (Reprinted, with permission, from Amor et al. 2004b [© National Academy of Sciences].)
HIS TON E
DNA sequence in determining the location of centromeres (see Chapter 6). A key insight into the basis for centromere identity and inheritance came from the identification of a histone H3 variant, CENP-A (title figure), which was found to localize specifically to centromeres and to be incorporated into nucleosomal particles in place of H3 itself (Palmer et al. 1991). Remarkably, CENP-A remains associated with centromeres during the transition from histones to protamines during spermatogenesis, when essentially all other histones are lost (Palmer et al. 1990). This early observation in the study of CENP-A suggested that CENP-A contributes to centromere identity of the male genome. The generality of this insight was not fully appreciated until it was realized that CENP-A is a much better marker for centromeres than is DNA sequence (Amor et al. 2004a) and that counterparts of CENP-A can be found in the genomes of all eukaryotes (Fig. 5) (Malik and Henikoff 2003). Thus, although budding yeast centromeres are determined by a 12S-bp consensus sequence, this is also the site of a centromeric nucleosome that contains the Cse4 centromeric H3 (CenH3) variant. In fission yeast (Schizosaccharomyces pombe), an array of CenH3-containing nucleosomes occupies the central core region of the centromere flanked by H3-containing nucleosomes that display heterochromatic features (Amor et al. 2004a). In flies and vertebrates, CenH3s are present in arrays which alternate with H3-containing arrays which display a unique pattern of histone modifications (Sullivan and Karpen 2004). Alternation can account for the fact that centromeres occupy only the outside edge of the cen-
.v A R I ANT 5
AND
E PIG ENE TIC 5
255
tromeric constnctlOn of metaphase chromosomes (title figure). This is consistent with the observation that in worm "holokinetic" chromosomes, microtubules attach throughout the length of each anaphase chromosome, and CenH3 occupies the leading edge all along its length (Fig. 5, right) (Malik and Henikoff 2003). Indeed, a unique CenH3 variant is found to precisely mark the centromere in all eukaryotes that have been examined. This apparent ubiquity, and the presence of centromeres to perform mitosis in all eukaryotes, raises the possibility that the first canonical H3 evolved from a CenH3. Genetic experiments in a variety of eukaryotes have confirmed the essentiality of CenH3 for formation of the kinetochore and for chromosome segregation (Amor et al. 2004a). Because they remain in place throughout the cell cycle, CenH3-containing nucleosomes form the foundation for assembly of other kinetochore proteins during mitosis and meiosis (see Chapter 6). An outstanding question in chromosome research is just how these proteins interact to provide a linkage between the centromere and spindle microtubules that can hold up to the strong pulling forces exerted on kinetochores at anaphase. Several dozen kinetochore-specific proteins have been identified in yeast (for more detail, see Chapter 6), although how they interact with CenH3-containing nucleosomes and other foundation proteins, such as CE P-C, is currently unknown. An additional challenge is elucidation of the process that assembles CenH3 into nucleosomes. The fact that centromeres account for such a small proportion of chromatin overall has hampered biochemical approaches to this outstanding problem, but we expect
Figure 5. Centromeric H3 Variants in Model Eukaryotes (Left) Human chromosome stained with an antibody against the centromere-specific histone H3 variant CENP-A (green) and anti-CENP-B (red) marking a-satellite DNA (image courtesy of Peter Warburton). (Center) Drosophila melanogaster antiCenH3 antibody (red) stains centromeres in metaphase chromosomes and throughout interphase (image courtesy of Suso Platero). (Right) Caenorhabditis elegans anti-CenH3 antibody (green) stains the end-to-end holocentromeres of prophase chromosomes (red) (image courtesy of Landon Moore).
256 • C HAP T E R 7 3
that improving technologies will lead to a better understanding of kinetochore structure and dynamics. The evolution of CenH3s is unlike that of any other histone class. Whereas histone H3 is almost invariant in sequence, which reflects extraordinarily strong purifying selection on every residue, CenH3s are evolving rapidly, especially in plant and animal lineages (Malik and Henikoff2003). This is most evident from the amino-terminal tails, which differ in length and sequence to such an extent that they cannot be aligned between the CenH3s of different taxonomic groups. Even the histone fold domain of CenH3 is evolving orders of magnitude faster than that of H3. What is the reason for this striking evolutionary difference between an H3 that functions at centromeres and an H3 that functions everywhere else? Rapidly evolving regions of Drosophila and Arabidopsis CenH3 genes display an excess of replacement nucleotide substitutions over what would be expected from the rate of synonymous substitutions (Malik and Henikoff 2003). This excess is a hallmark of adaptive evolution. Adaptive evolution in plants and animals is also seen for another major centromere foundation protein, CENP-C (Talbert et al. 2004). Although adaptive evolution is well documented for genes involved in genetic conflicts, such as arms races between host and parasite interactions, these are the only known essential singlecopy genes that are adaptively evolving in any organism. In the case of CenH3 and CENP-C, the regions of adaptive evolution correspond to regions of DNA binding and targeting. This suggests that the major centromere-binding proteins are adapting to the evolving centromeric DNA, thus allowing centromeric chromatin to interact with the conserved kinetochore machinery that connects the centromere to spindle microtubules. It has been proposed that centromeres compete during female meiosis to be included in the egg nucleus rather than being lost as polar bodies (Talbert et al. 2004). An arms race would develop leading to expansion of centromeres, probably by unequal crossing-over between sister chromatids. Host suppression of this meiotic drive process by CenH3 and CENP-C would lead to an excess of replacement changes in regions that interact with DNA. Organisms in which there is no opportunity for centromeres to compete, such as budding yeast, would not undergo centromere drive, and this might account for the fact that they have small centromeres and their CenH3 and CENP-C proteins are under strong purifying selection. Thus, we see that a special region of the genome, the centromere, is distinguished by a single histone variant class, whose sequences reveal remnants of an arms race
that may have led to the extraordinary complexity of centromeres. The RI assembly process that targets new CenH3-containing nucleosomes to centromeres every cell cycle remains unknown (Amor et al. 2004a). Centromeric nucleosomes show a remarkable lack of sequence specificity in that they not only can faithfully localize to neocentromeres that are completely unlike native centromeres (Fig. 4), but also the yeast hom*olog Cse4 can functionally replace human CENP-A (Wieland et al. 2004) (neither of which is adaptively evolving; Talbert et al. 2004). It is extraordinary that our centromeres have remained in the same positions for tens of millions of years without any evident sequence determinants involved in the process that maintains them. To the extent that epigenetics refers to inheritance that does not depend on DNA sequence, the inheritance of centromeres on a geological timescale is the most extreme form imaginable. Yet, we are still seeking a mechanism to explain how they have maintained themselves for even a single cell cycle (topic discussed further in Chapter 14). 6 The Replacement Histone Variant H3.3 Is Found at Active Chromatin
Like centromeres, transcriptionally active chromatin is thought to be maintained epigenetically, and like centromeres, active chromatin is enriched in an H3 variant, called H3.3 (Henikoff and Ahmad 2005). H3.3 is very similar in sequence to the canonical forms of H3, differing by only four amino acids. With so few differences, it might be assumed that these two forms are interchangeable. However, in Drosophila, H3.3 is deposited by either RC or RI nucleosome assembly, whereas H3 is deposited only at replication foci in a RC manner. This difference between the two variants is encoded in the protein itself, with three of the four differences between H3 and H3.3 evidently involved in preventing H3 from being deposited by an RI pathway (in a-helix 2, Fig. 2). Purification of soluble human assembly complexes confirmed that these two forms participate in distinct assembly processes: H3.1 copurified with CAF-1 for RC assembly, and H3.3 copurified with other components, including HirA, and participated in RI assembly. Although four-amino acid differences might seem practically insignificant, when one considers that humans, flies, and clams have precisely the same H3.3 sequence, these differences from H3 stand out. Phylogenetic analysis reveals that the H3/H3.3 pair evolved at least four separate times during eukaryotic evolution: in plants, animals/fungi, ciliates, and apicomplexans (Malik
HIS TON E
and Henikoff 2003). Despite having a separate ongm from animals and fungi, the animal H3/H3.3 pair and the pair from plants (called H3.1 [RC] and H3.2 [RI]-to avoid confusion, we refer to all RC isoforms as H3 and all RI isoforms as H3.3) are strikingly similar. The same cluster of amino acids (positions 87-90) that prevents RI deposition of H3 in Drosophila is found to differ in plants, and the remaining difference in animals (position 31 is Ala for H3 and either Ser or Thr for H3.3) is also found in plants. Fungi are especially interesting. Ancestrally, they have both H3 and H3.3; however, ascomycetes, which include yeasts and molds, have lost the H3 form. Thus, the obligate RC form of histone 3 that has received the most attention in animals is not even present in yeast. Studies of H3.3 in bulk chromatin showed that it is enriched in active fractions (Henikoff and Ahmad 2005). However, various factors contributed to the obscurity of this potential "mark" of active chromatin during a time of great excitement in the chromatin field when it was realized that histone modifications can distinguish active from silent chromatin. For one thing, no antibodies were available that could effectively distinguish H3 from H3.3 in chromatin (positions 87-90 are blocked by the DNA gyres in the nucleosome), whereas excellent antibodies against many different posttranslational modifications were readily available. In addition, the seemingly slight sequence differences between H3 and H3.3 did not suggest any fundamental distinctions in chromatin, whereas histone modifications were mostly on tail lysines that were known to affect chromatin interactions or to bind chromatin-associated proteins. This perception that the two histone-3 forms should be interchangeable was confirmed by the finding in Tetrahymena that the S-phase form can substitute for its replacement counterpart. Finally, the influential "histone code" hypothesis envisioned nucleosomes as fixed targets of modification enzymes during chromatin differentiation (Jenuwein and Allis 2001). However, it has become increasingly evident that chromatin is highly dynamic, and even heterochromatin-associated proteins bind with residence times of a minute or less (Phair et al. 2004). It appears that the chromatin of actively transcribed genes is in constant flux, characterized by continual histone replacement (Henikoff and Ahmad 2005). The three core differences that distinguish H3 and H3.3 make H3.3-H4 dimers the substrate for RI assembly, and RI assembly itself profoundly changes chromatin. As a result of this process, actively transcribed regions become marked by H3.3 (Fig. 6), and evidence for this process comes from the observation of RI replacement
V A R I ANT SAN 0
E PIG ENE TIC S
257
Figure 6. H3.3 Preferentially localizes to Actively Transcribed Regions of Drosophila Polytene Chromosomes DAPI staining (red) shows the DNA banding pattern (left), and H3.3GFP (green) localizes to interbands (middle), which are sites of RNA polymerase II localization. Right shows the merge. (Reprinted from Schwartz and Ahmad 2005).
of H3 methylated on lysine 9 (H3K9me) with tagged H3.3 at RNA polymerase I and II (pol I and II) transcribed loci (Schwartz and Ahmad 2005). The dynamic nature of chromatin at active loci results in the erasure of preexisting histone modifications. This provides a potential solution to the problem of how silent chromatin can become activated when it is hypermethylated on H3K9 and H3K27 (histone modifications commonly associated with repressive chromatin). Time-course studies showed that methyls on histones are as stable as the histones themselves (Waterborg 1993), although the recent discovery of a demethylase specific for mono- and di-methyl H3K4 (Shi et al. 2004) indicates that some methyls can be enzymatically removed from histones. In general, patterns of histone covalent modifications might result from modifications already present on the histones at the time that they are deposited. In this way, modification enzymes would track with the assembly machinery, perhaps facilitating the process (Henikoff and Ahmad 2005). This dynamic assembly model predicts that histone modifications found to be enriched on active chromatin should be enriched on H3.3, and bulk measurements of modifications on H3 and H3.3 have shown this to be the case for both plants and animals. Furthermore, it is expected from this model that active lysine modifications such as acetylation of H3 and H4 and methylation of H3K4 and H3K79 will be strongly correlated with one another, as has been observed in diverse systems (O'Neill et al. 2003; Kurdistani et al. 2004; Schubeler et al. 2004). Finally, dynamic RI assembly at active genes can explain why CAF-l mutations cause a loss of silencing (Loyola and
258
C HAP T E R
13
Almouzni 2004): Only about 10% of the yeast genome is considered to be in a silent state, and this may be the only chromatin that is not dynamically replaced in the yeast genome. In the absence of CAF-l-mediated RC assembly, RI assembly would occur over the entire yeast genome, activating previously silent regions. Perhaps the existence of an H3 variant dedicated to RC assembly in multicellular eukaryotes is an adaptation to keep the large majority of the chromatin in a cell in an epigenetically silent state. Replacement by differentially modified H3.3·H4 dimers suggests a simple model for inheritance of active chromatin in dividing cells (Henikoff and Ahmad 2005). Active chromatin would remain active following dilution by ordinary nucleosomes after RC assembly if this random mixture of RI-deposited and RC-deposited nucleosomes does not obstruct active processes such as transcriptional initiation and elongation. Continuation of transcriptional activity as a result would restore chromatin in the next cell cycle, leading to perpetual maintenance of active chromatin throughout development. The possibility that a histone variant is perpetually maintained by an RI assembly process may also hold for CenH3s, which would incorporate into gaps caused by the unraveling of ordinary nucleosomes resulting from anaphase tension. When cells exit the cell cycle and differentiate, they no longer produce or incorporate S-phase histones, and H3.3 accumulates as a result. For example, H3.3 accumulates in rat brains to a level of 87% of the histone 3 by the time that rats are 400 days old (Henikoff and Ahmad 2005). Whether or not this gradual replacement of chromatin is of functional significance is unknown. It is also unknown whether the active process that allows replacement to occur is the same as that seen at transcriptionally active loci. One possibility is that disruption of chromatin by a
transltmg RNA polymerase or chromatin-remodeling machine causes local unraveling of the nucleosome and occasional loss of an H3.3·H4 dimer (Fig. 7). This would be followed by reassembly of the nucleosome in the wake of the polymerase with replacement of the lost dimer with an H3.3·H4 dimer by the HirA complex. Only when polymerases are too densely packed for assembly to occur would nucleosomes completely unravel. 7 Phosphorylation of H2AX Functions in DNA Double-Strand Break Repair
The H2A histones also comprise a family of distinct variants found throughout eukaryotes. The H2AX variant is defined by the presence of a carboxy-terminal amino acid sequence motif, SQ(E or D)8, where 8 indicates a hydrophobic amino acid. The serine in this sequence motif is the site of phosphorylation producing a modified protein designated "y-H2AX." The dynamic nature of chromatin, and H2AX phosphorylation, is especially evident when double-strand (ds) breaks occur in DNA (Morrison and Shen 2005). The lethality of even a single ds break requires immediate action to repair the lesion and restore the continuity of the double helix. The detection of a ds break normally occurs within a minute or so of its formation and this, in turn, triggers the rapid phosphorylation of H2AX in the immediate vicinity of a break site. This phosphorylation is carried out by members of the phosphoinositol 3-kinase-like kinase family. Following this initial event, H2AX phosphorylation then spreads quickly along the chromosome marking a relatively large chromatin domain surrounding the break. Finally, the ds break is eventually repaired by either hom*ologous recombination or nonhom*ologous end-joining, and the phosphorylation mark is removed.
Figure 7. Model for Replicationindependent Replacement or Exchange A large molecular machine (either the SWRl complex or RNA polymerase II) partially or completely unravels a nucleosome during transit. The result is either retention of heterodimeric subunits, such as the FACT-facilitated transfer of H2A'H2B from in front of RNA polymerase to behind (Formosa et al. 2002; Belotserkovskaya et al. 2003), or loss of a heterodimer. In the latter case, chromatin repair replaces the lost heterodimer with either H2AZ'H2B (top) or H3.3·H4 (bottom).
H f 5 TON E
Phosphorylation of H2AX is not essential for detection or repair of ds breaks, because deletion of the gene or mutation of the target serine residue does not abolish repair. However, H2AX is not just a marker of damage, since such mutants have reduced efficiency of repair and are hypersensitive to radiation damage and genotoxic agents. Currently, H2AX is thought to function in ds break repair in at least two ways. First, it may help recruit or retain proteins required for repair at the site of the break (Morrison and Shen 2005). Second, it may stabilize the chromosome surrounding the broken ends, through the recruitment of cohesin, the protein complex responsible for keeping sister chromatids together (Lowndes and Toh 2005). The evolution ofH2AX is unlike that of other histone variants. Although a gene for H2AX is found in nearly all eukaryotes, it has had multiple relatively recent origins (Malik and Henikoff 2003). For example, the version of H2AX found in Drosophila is different from that found in another dipteran insect, Anopheles. Presumably, the ability to evolve a new H2AX from the canonical form of H2A is a consequence of the simple SQ(E or D)8 motif. Evolving such a motif at the carboxyl terminus of the canonical H2A is expected to occur repeatedly over evolutionary time. Occasional loss of an existing H2AX with a newly minted version might be fueled by the need for H2AX to be very uniformly distributed, because ds breaks can occur anywhere in the genome. If mutations occur in an existing H2AX gene that reduce its similarity to the canonical H2A in such a way that its assembly becomes less efficient or uniform, there will be strong selection to replace it with a version that is more similar to canonical H2A. This rationale could help account for the exceptional case of Drosophila H2AX, which, unlike other eukaryotes, is not derived from its canonical H2A, but rather from the distant H2AZ lineage (described below). If all that is necessary to be an H2AX is to be in the H2A position in a nucleosome and to have the carboxy-terminal motif for phosphorylation, an H2AZ can evolve this capability. ds break repair is clearly the universal function of H2AX phosphorylation, and there would seem to be no stable epigenetic aspect to this process. However, H2AX null mice are sterile, and cytological examination of mammalian spermatogenesis has revealed a striking epigenetic feature, in which H2AX is specifically phosphorylated on the XY bivalent (Fig. 8) (Fernandez-Capetillo et al. 2003). This chromosome pair occupies a distinct "sex body" during meiotic prophase which has been implicated in silencing of sex-linked genes during male meio-
V A R fAN T 5
AND
SCP3
E PIG ENE TIC 5
259
SCP3 + XMR
H2AX+/+
H2AX-/-
Figure 8. Pachytene Stage of Spermatogenesis Showing the Dependence of Sex Body Formation on H2AX In normal mammalian spermatocytes, a nuclear structure, the sex body (arrow, labeled green in right panels), is seen to encompass the unpaired XY bivalent (labeled in left panels). The synaptonemal complex, which aligns paired chromosomes, is stained red. H2AX is normally enriched in the sex body (H2AX+/+). In H2AX-I - spermatocytes, the sex body does not form and a sex body epitope becomes dispersed over autosomes (lower right pane0. Bar, 10 Jlm. Images courtesy of Shantha Mahadevaiah and Paul Burgoyne (FernandezCapetillo et al. 2003).
sis. H2AX phosphorylation is essential for normal sex body formation, and H2AX-deficient spermatocytes fail to pair or condense and fail to inactivate X and Y genes during meiosis. H2AX phosphorylation of the XY bivalent is distinct from the process that occurs at ds breaks. XY phosphorylation in the sex body does not require breaks, but rather occurs most conspicuously at unpaired regions of the chromosomes. The mechanisms whereby H2AX phosphorylation is targeted to unpaired chromosomes, and how this event leads to condensation, pairing, and silencing, are currently unknown. However, it is interesting to speculate that this role may be related to its ability to interact with and recruit cohesin. 8 H2AZ Plays Roles in Transcriptional Regulation
The renaissance of interest in histone variants has been especially strong in the case of H2AZ (or H2A.Z) (Kamakaka and Biggins 2005). H2AZ is nearly ubiquitous, and it diverged from an ancestral H2A early in eukaryotic evolution. Consistent with this separate lineage, genetic experiments in budding yeast and flies have shown that histones H2A and H2AZ have evolved to perform separate nonoverlapping functions. H2AZ is an essential histone in most organisms, from ciliated proto-
260 • C HAP
T ER
73
zoans to mammals. However, in budding and fission yeasts, deletion of the H2AZ gene produces viable cells, although the null mutants exhibit a variety of phenotypes. These properties have facilitated its genetic and biochemical characterization in yeast. H2AZ makes up approximately 10% of the total H2A protein in most organisms tested to date. It is widely, but not uniformly, distributed throughout the chromosomes. This is most elegantly visualized in the case of Drosophila polytene chromosomes, where it produces a distinct banding pattern. The results of chromatin immunoprecipitation experiments using yeast and mouse cells are consistent with this pattern. Although H2AZ is preferentially localized to the promoter regions of yeast genes, this specificity is not true for all sites of deposition. In Drosophila, there is no discernible relationship between H2AZ localization and gene expression. Thus, although the mechanism of H2AZ deposition is known (discussed below), at present, the rules that determine where it is concentrated are not. A variety of observations point to important roles for H2AZ in regulating gene expression (Kamakaka and Biggins 2005). Mutational analysis of budding yeast revealed that the function of H2AZ is partially redundant with two different classes of global transcription factors, the nucleosome-remodeling complex, Swi/Snf, and the histone modification complex, SAGA. Although the individual loss of function of H2AZ, Swi/Snf, or SAGA is viable, the simultaneous loss of any combination of two pathways is lethal. Additional genetic and biochemical experiments suggest that these roles include functions in both transcription initiation and elongation (for more detail, see Chapter 10). Moreover, the balance ofH2AZ deposition is causally linked to epigenetics through its role as an antisilencing factor. Deletion of the H2AZ gene results in extended spreading of silent chromatin inward from the telomeres, and this defect can be suppressed by the additional deletion of genes encoding the silencing factors themselves (Fig. 7) (see Chapter 4). The effect of deleting H2AZ on global gene expression has been assayed using yeast gene microarrays. Although the majority of regulated genes show decreased expression in the H2AZ null mutant, a substantial fraction show an increase in expression. Since it is not yet clear which changes reflect direct regulation and which are indirect, it may be that H2AZ nucleosomes function both positively and negatively to regulate gene transcription. Furthermore, it is not known whether the diverse roles of H2AZ in transcription and heterochromatin stem from a single unifying mechanism or a more complex combination of pathways.
In contrast to the current picture in budding yeast, H2AZ is preferentially located in heterochromatic regions of mammalian cells. Indeed, it has been shown to physically interact with Heterochromatin-associated Protein 1 (HPl) (Fan et al. 2004). Although this might suggest a role for H2AZ in silencing in metazoans, it is worth noting that the subset of expressed genes located in heterochromatin in Drosophila actually requires HP1 for expression (Weiler and Wakimoto 1995). If the location of H2AZ in mammalian cells reflects a similar process, then the clearly established roles for this variant in facilitating transcription and counteracting silencing in yeast would likely reflect general fundamental properties of this variant. H2AZ may have an additional role in the epigenetics of chromosome segregation. One of the first phenotypes to be recognized for an H2AZ null mutant was a defect in mitotic chromosome segregation observed in fission yeast. More recent experiments have strengthened this connection. The experimental depletion of H2AZ in mammalian cells by RNA interference (RNAi) causes defects in pericentric HP1 association, genome instability, and chromosome mis-segregation (Kamakaka and Biggins 2005). Similarly, in budding yeast, H2AZ null mutants show increased mitotic chromosome loss and significant genetic interactions with genes encoding known components of the centromere and mitotic spindle (Krogan et al. 2004). It remains formally possible that the effect of H2AZ on chromosome segregation is an indirect consequence of its role in setting the program of gene transcription. However, an intriguing hypothesis is that mechanisms of chromosome segregation have evolved to exploit not only an H3 variant, but an H2A variant as well. How does H2AZ affect transcriptional competence, silencing, heterochromatin, and perhaps chromosome segregation? The high-resolution structure of an H2AZcontaining nucleosome reveals several unique properties of the variant (Suto et al. 2000). Compared with H2A nucleosomes, H2AZ presents an extended acidic patch domain on the surface of the nucleosome, and this difference is likely to have functional significance. For example, it is part of the "docking domain" (Fig. 2) that interacts with histone H4 and defines the segment essential for function in Drosophila. Furthermore, the results of mutational studies and binding experiments in vitro argue that this extended acid patch makes a major contribution to the interaction of the nucleosome with HP1 (Dryhurst et al. 2004). Interestingly, HP1 contains a chromodomain, a protein motif that can bind to methylated H3lysine 9 (see Chapters 3 and 4). Thus, H2AZ may act in synergy with
HIS TON E
histone H3 methylation to provide a binding platform for chromatin-associated proteins. In addition to its extended acid patch, H2AZ has a pair of histidine residues that coordinate an additional metal ion in the structure which, in vivo, might provide a unique physiological response that is unavailable to nucleosomes containing H2A. Finally, the crystal structure predicts that an asymmetric histone octamer, made up of one major H2A·H2B dimer plus one variant H2AZ·H2B dimer, would produce a clash of protein structures at Loop 1 (Fig. 2) and seems unlikely to occur in vivo. Together, these novel features of H2AZ nucleosomes argue that the variant should confer unique physical properties to chromatin. This prediction is borne out experimentally. For example, H2AZ may stabilize dimertetramer interactions within the nucleosome, and nucleosome arrays composed of H2AZ nucleosomes can show enhanced higher-order folding and decreased intermolecular aggregation (Dryhurst et al. 2004). Thus, H2AZ is likely to modulate chromatin function in at least three ways. First, it undoubtedly alters the physical properties of its chromatin environment, thus influencing access or activity of trans-acting factors. Second, as is the case for other histones, posttranslational modifications within its amino-terminal and carboxy-terminal domains are likely to provide unique docking sites for chromatin-associated proteins (so-called trans-effects introduced in Chapter 3), or regulated changes in charge density (cis-effects). Third, its restricted and specific deposition in chromatin is likely to target unique functions to specific loci.
9 Protein Complexes for the Deposition and Replacement of H2A Variants
Although important questions still remain as to how H2A histone variants function, recent studies have elucidated the basis for their incorporation into chromatin. The first breakthrough came with the biochemical purification of the complex that catalyzes the transfer of H2AZ·H2B dimers into chromatin (Sarma and Reinberg 2005). This multisubunit complex, termed SWR1-C, contains as its catalytic subunit the protein Swrl, a member of the SWl/SNF family of ATP-dependent chromatin remodelers. In vivo, SWR1-C appears to be dedicated to this task, because the effects of deleting the gene SWRl are similar to the effects of deleting the gene encoding H2AZ itself. Furthermore, in a swrl null mutant, the preferential deposition of H2AZ at specific loci is completely lost. In vitro, when purified SWR1-C is presented with a nucleosomal array, it specifically replaces H2A·H2B dimers with
V A R I ANT SAN 0
E P f G ENE TIC S
•
261
H2AZ·H2B dimers in an ATP-dependent reaction (Fig. 7). An interesting aspect of this reaction stems from a prediction of the crystal structure mentioned above: Mixed nucleosomes containing both H2A and H2AZ should not be stable. Thus, the dimer replacement mediated by SWR1-C may be a concerted reaction in which the substitution of one H2AZ·H2B dimer facilitates the ejection and replacement of the remaining H2A·H2B dimer. A second multisubunit protein complex carries out the replacement of phosphorylated H2AX with an unphosphorylated molecule in Drosophila (Morrison and Shen 2005). Remarkably, this single Drosophila complex, termed dTip60, is composed of proteins ordinarily found in two separate complexes: SWR1-C, the ATP-dependent chromatin-remodeling complex described above, and NuA4/Tip60, a histone modification complex with acetyltransferase activity. In vitro, the reaction requires both ATP and acetyl-CoA. Thus, this one complex integrates histone acetylation, nucleosome remodeling, and histone variant replacement. This combination likely reflects the fact that Drosophila H2AX is also its H2AZ, whereas H2AX in other eukaryotes evolved from canonical H2A. Despite this difference, there are reasons to expect that the basic pathway is conserved. In budding yeast and mammalian cells, SWR1-C, NuA4/Tip60, and another ATPdependent nucleosome-remodeling complex, IN080-C, share common subunits. One of these is the actin-related protein Arp4. Interestingly, Arp4 has been shown to interact with phosphorylated H2AX in bud9ing yeast and to result in the sequential recruitment of NuA4, SWR1, and IN080 complexes (Downs et al. 2004). This suggests that these complexes catalyze the replacement of both H2AX and H2AZ in this organism as well. This prediction remains to be demonstrated directly. The discovery that chromatin-remodeling complexes are dedicated to RI nucleosome assembly is important not just for understanding how histone variants are incorporated, but also for providing the first specific in vivo functions for chromatin-remodeling machines. Prior to these discoveries, it was not clear why cells would have such an abundance of large machines that facilitate the movement of nucleosomes (Becker and Horz 2002). The diversity of SWl/SNF ATPases presented a puzzle that now can perhaps be better understood if some remodeling machines are dedicated to the assembly of different variants into nucleosomes. Perhaps nucleosome assembly is a concerted process in which histone-modifying enzymes act on their substrates while ATP-dependent chromatin remodelers provide the power stroke and specificity needed for RI replacement.
262 •
C HAP T E R
1 3
10 Other H2A Variants Differentiate Chromatin but Their Functions Are As Yet Unknown
Further diversification ofH2A has occurred in vertebrates. Bbd In mammals, macroH2A and H2A (H2A Barr body deficient) represent unique lineages that appear to play roles in the epigenetic phenomenon of dosage compensation (discussed in detail in Chapter 17). macroH2A is socalled because in addition to the histone fold domain and amino- and carboxy-terminal tails, it contains a large carboxy-terminal globular domain (Ladurner 2003). Considering that the H2A carboxy-terminal tail exits near the linker DNA, it is possible that this globular domain interacts with linkers, H3 tails, or linker proteins such as HI and High Mobility Group (HMG) proteins. Just what this interaction would be is unknown, although an intriguing possibility is that it has an enzymatic activity. This possibility is encouraged by the resemblance of the 200-amino acid globular domain to proteins with hydrolytic activities on polynucleotides and peptides. Alternatively, the globular domain might simply act as an impediment to transcriptional initiation, a role suggested by its ability to block transcription factors from binding in vitro (Sarma and Reinberg 2005). The histone fold domain of macroH2A also has distinct properties, as it is not acted upon by chromatin remodelers. These observations suggest that macroH2A-containing nucleosomes are less mobile and so may be resistant to active transcription. This might account for the enrichment of macroH2A in discrete regions of the facultatively inactive X chromosome of human females that alternate with regions of constitutive heterochromatin (Fig. 9a) (Chadwick and Willard 2004).
Figure 9. H2A Variants and the Inactive X Chromosome of Human Females (a) macroH2A (red) stains discrete regions of the inactive X chromosome that alternate with a marker for heterochromatin (histone H3K9me3). (b) H2ABbd (green) is excluded from the inactive X chromosome (red dot with arrow pointing to it). (c) Same nucleus as in b, but stained with DAPI to show chromatin. (a, Reprinted, with permission, from Chadwick and Willard 2004 [© National Academy of Sciences]; b,c, reprinted, with permission, from Chadwick and Willard 2001 [© The Rockefeller University Press].)
In contrast to macroH2A, H2A Bbd appears to be undetectable on the Barr body, but otherwise ubiquitous throughout the nucleus (Fig. 9b) (Chadwick and Willard 2001). The in vitro behavior of H2A Bbd -containing nucleosomes is consistent with its playing a role in facilitating transcription (Sarma and Reinberg 2005). H2A Bbd is rapidly evolving relative to other known H2A isoforms, although the reason for this accelerated evolution is not clear. 11 Many Histones Have Evolved to More Tightly Package DNA
When it is no longer necessary to gain access to DNA for replication and transcription, chromatin typically becomes further condensed, and this often involves replacement of canonical histones. This is obviously the case for sperm, and in some lineages, histone paralogs have evolved specialized packaging roles. For example, sea urchin sperm contains HI and H2B variants with repeated tail motifs that bind to the minor grooves of DNA (Malik and Henikoff 2003), presumably an adaptation to tightly package chromosomes for inclusion into sperm heads. A similar adaptation is found in pollen-specific H2A variants in flowering plants. In vertebrates, sperm-specific specialized histone variants are found in mammalian testes, including an H2B paralog (SubH2Bv) that localizes to the acrosome and a testes-specific H3 variant (Witt et al. 1996). The replacement of histones during sperm maturation by protamines and other proteins provides a potential means of erasing epigenetic information in th~ male germ line. However, evidence for trans-generational inheritance (Rakyan and Whitelaw 2003), especially in animals that lack DNA methylation, raises the possibility that a subset of nucleosomal histones survive this transition and transmit epigenetic information. As already pointed out, this is just what occurs for CENP-A at centromeres (Palmer et al. 1990), and it is possible that a small fraction of other variants, such as H3.3, remain with sperm for epigenetic inheritance of gene expression information. Although our understanding of the process that replaces histones during sperm development is rudimentary, we expect that much can be learned by understanding how CENP-A survives this transition. Increased compaction also occurs in somatic cells that have finished dividing and undergo differentiation. In some cases, compaction involves quantitative and qualitative changes in linker histones. The stoichiometry of histone HI relative to nucleosomes determines the average spacing within nucleosome arrays in vivo (Fan et al. 2003). In addition, the presence of HI in chromatin pro-
HIS TON E
motes higher-order chromatin structure that generally inhibits transcription (Wolffe 1992). Linker histones are much more mobile than core histones in vivo. Residence times for H2A and H2B are hours in length, and cannot even be measured for H3 and H4, whereas the residence time of HI is a few minutes (Phair et al. 2004). As a result, the incorporation of variant linker histones is unlikely to differentiate chromatin in a heritable manner. Rather, the role of HI variants is thought to change the bulk properties of chromatin that can affect overall compaction (Wolffe 1992). HI variants share with core histones a distinction between RC and RI forms (Marzluff et al. 2002). RC variant forms of HI appear to be interchangeable with one another, based on the fact that knock-out mice lacking one or two of the five RC HI variants are phenotypically normal (Fan et al. 2003). In birds, the H5 linker histone variant is deposited during erythrocyte maturation, which accompanies extreme compaction of the nucleus. Another variant that is deposited at high levels in nondividing cells is HI 0, which is highly diverged from the canonical forms. Overexpression of HI 0 renders chromatin less accessible to nucleases than similar overexpression of a canonical form. The natural accumulation of Hl° in nondividing cells might be a general mechanism for chromatin compaction as cells become quiescent. 12 Conclusions and Future Research
Histone variants provide the most fundamental level of differentiation of chromatin, and alternative mechanisms for depositing different variants can potentially establish and maintain epigenetic states. Histones H2A, H2B, H3, and H4 occupy distinct positions in the core particle as a result of an evolutionary process that began before the last common ancestor of eukaryotes. Key evolutionary innovations remain uncertain, including the emergence of an octamer from an ancestral H3 o H4-like tetramer, and we look forward to the sequencing of more archaeal and primitive eukaryotic genomes that might provide missing links. Subsequent elaborations of the four core histones into distinct variants have provided the basis for epigenetic processes, including development and chromosome segregation. For a full understanding of epigenetic inheritance, we need a better understanding of the processes that incorporate variants by replacing canonical histones. An important recent development is the initial characterization of replication-independent assembly pathways dedicated to particular variants. Centromeres are the most conspicuous examples of profoundly different chromatin that is attributable to spe-
V A R I ANT SAN 0
E PIG ENE TIC S
•
263
cial properties of a histone variant. Although it is clear that CenH3-containing nucleosomes form the foundation of the centromere, just how they are deposited in the same place every cell generation without any hint of sequence specificity is a major challenge for future research. It is becoming evident that histone variants are also involved in epigenetic properties of active genes. Both H3.3 and H2AZ are enriched at transcriptionally active loci, and understanding the assembly processes that are responsible for their enrichment is an exciting area of current research. The dynamic behavior of chromatin leads to the realization that transcription, chromatin remodeling, and histone modification might be coupled to nucleosome assembly and disassembly. The study of dynamic processes coupled to histone turnover is only at an early stage, and we look forward to technological advances in molecular biology, cytogenetics, biochemistry, and structural biology that can be harnessed to better understand the dynamic nature of chromatin. In addition to these universal processes, histone variants are also involved in particular epigenetic phenomena. In the case of the mammalian X chromosome, three different H2A variants, phospho-H2AX, macroH2A, and H2A Bbd , have been recruited to participate in silencing or activation of genes for purposes of germ-line inactivation or dosage compensation. Understanding the function of these variants in epigenetic processes remains a major challenge for the future. The availability of the first high-resolution structure of the nucleosome core particle (Luger et al. 1997). was a seminal advance in elucidating the properties of chromatin. By elaborating this basic structure in a way that has biological consequences, histone variants provide an opportunity to deepen our understanding of how these fascinating architectural proteins have evolved to play diverse roles in epigenetic processes. References Altheim B.A. and Schultz M.e. 1999. Histone modification governs the cell cycle regulation of a replication-independent chromatin assembly pathway in Saccharomyces cerevisiae. Proc. Nat/. Acad. Sci. 96: 1345-1350. Amor D.L Kalitsis P., Sumer H., and Choo K.H. 2004a. Building the centromere: From foundation proteins to 3D organization. Trends Cell Bioi. 14: 359. Amor D.J., Bentley K., Ryan J., Perry J., Wong 1., Slater H., and Choo K.H. 2004b. Human centromere repositioning "in progress': Proc. Nat/. Acad. Sci. 101: 6542-6547. Annunziato A.T. 2005. Split decision: What happens to nucleosomes during DNA replication?]. Bio/. Chern. 280: 12065-12068. Arents G., Burlingame R.W., Wang B.e., Love W.E., and Moudrianakis E.N. 1991. The nucleosomal core histone octamer at 3.1 A resolution: A tripartite protein assembly and a left-handed superhelix.
264 •
C HAP T E R
13
Proc. Natl. Acad. Sci. 88: 10148-10152. Becker P.B. and Horz W. 2002. ATP-dependent nucleosome remodeling. Annu. Rev. Biochem. 71: 247-273. Belotserkovskaya R., Oh S., Bondarenko V.A., Orphanides G., Studitsky V.M., and Reinberg D. 2003. FACT facilitates transcriptiondependent nucleosome alteration. Science 301: 1090-1093. Chadwick B.P. and Willard H.E 2001. A novel chromatin protein, distantly related to histone H2A, is largely excluded from the inactive X chromosome.]. Cell BioI. 152: 375-384. - - - . 2004. Multiple spatially distinct types of facultative heterochromatin on the human inactive X chromosome. Proc. Natl. Acad. Sci. 101: 17450-17455. Downs J.A., Allard S., Jobin-Robitaille 0., Javaheri A., Auger A., Bouchard N., Kron S.J., Jackson S.P., and Cote J. 2004. Binding of chromatin-modifying activities to phosphorylated histone H2A at DNA damage sites. Mol. Cell 16: 979-990. Dryhurst D., Thambirajah A.A., and Ausio J. 2004. New twist.§ on H2A.Z: A histone variant with a controversial structural and functional past. Biochem. Cell BioI. 82: 490-497. Fan y., Nikitina T., Morin-Kensicki E.M., Zhao J., Magnuson T.R., Woodco*ck CL., and Skoultchi A.I. 2003. HI linker histones are essential for mouse development and affect nucleosome spacing in vivo. Mol. Cell. BioI. 23: 4559-4572. Fernandez-Capetillo 0., Mahadevaiah S.K., Celeste A., Romanienko P.J., Camerini-Otero R.D., Bonner W.M., Manova K., Burgoyne P., and ussenzweig A. 2003. H2AX is required for chromatin remodeling and inactivation of sex chromosomes in male mouse meiosis. Dev. Cell 4: 497-508. Formosa T., Ruone S., Adams M.D., Olsen A.E., Eriksson P., Yu Y., Roades A.R., Kaufman P.D., and Stillman D.J. 2002. Defects in SPT16 or POB3 (yFACT) in Saccharomyces cerevisiae cause dependence on the HirlHpc pathway: Polymerase passage may degrade chromatin structure. Genetics 162: 1557-1571. Henikoff S. and Ahmad K. 2005. Assembly of variant histones into chromatin. Annu. Rev. Cell Dev. BioI. 21: 133-153. Jenuwein T. 2001. Re-SET-ting heterochromatin by histone methyltransferases. Trends Cell BioI. 11: 266-273. Jenuwein T. and Allis CD. 2001. Translating the histone code. Science 293: 1074-1080. Kamakaka R.T. and Biggins S. 2005. Histone variants: Deviants? Genes Dev. 19: 295-310. Krogan N.J., Baetz K., Keogh M.C, Datta N., Sawa C, Kwok T.C, Thompson N.J., Davey M.G., Pootoolal J., Hughes T.R., et al. 2004. Regulation of chromosome stability by the histone H2A variant Htz1, the Swr1 chromatin remodeling complex, and the histone acetyltransferase NuA4. Proc. Natl. Acad. Sci. 101: 13513-13518. Kurdistani S.K., Tavazoie S., and Grunstein M. 2004. Mapping global histone acetylation patterns to gene expression. Cell 117: 721-733. Ladurner A.G. 2003. Inactivating chromosomes: A macro domain that minimizes transcription. Mol. Cell 12: 1-3. Lowndes N.E and Toh G.W. 2005. DNA repair: The importance of phosphorylating histone H2AX. Curro BioI. IS: R99-R102. Loyola A. and Almouzni G. 2004. Histone chaperones, a supporting role in the limelight. Biochim. Biophys. Acta 1677: 3-11. Luger K., Mader A.W., Richmond R.K., Sargent D.E, and Richmond T.J. 1997. Crystal structure of the nucleosome core particle at 2.8 A resolution. Nature 389: 251-260. Malik H.S. and Henikoff S. 2003. Phylogenomics of the nucleosome. Nat. Struct. Bioi. 10: 882-891. Marc E, Sandman K., Lurz R., and Reeve J.N. 2002. Archaeal histone
tetramerization determines DNA affinity and the direction of D A supercoiling.]. BioI. Chem. 277: 30879-30886. Marzluff W.E and Duronio R.J. 2002. Histone mRNA expression: Multiple levels of cell cycle regulation and important developmental consequences. Curro Opin. Cell BioI. 14: 692-699. Marzluff W.E, Gongidi P., Woods K.R., Jin J., and Maltais L.J. 2002. The human and mouse replication-dependent histone genes. Genomics 80: 487-498. Morrison A.J. and Shen X. 2005. DNA repair in the context of chromatin. Cell Cycle 4: 568-571. O'Neill L.P., Randall T.E., Lavender J., Spotswood H.T., Lee J.T., and Turner B.M. 2003. X-linked genes in female embryonic stem cells carry an epigenetic mark prior to the onset of X inactivation. Hum. Mol. Genet. 12: 1783-1790. Palmer D.K., O'Day K., and Margolis R.L. 1990. The centromere specific histone CENP-A is selectively retained in discrete foci in mammalian sperm nuclei. Chromosoma 100: 32-36. Palmer D.K., O'Day K., Trong H.L., Charbonneau H., and Margolis R.L. 1991. Purification of the centromere-specific protein CENPA and demonstration that it is a distinctive histone. Proc. Natl. Acad. Sci. 88: 3734-3738. Phair R.D., Scaffidi P., Elbi C, Vecerova J., Dey A., Ozato K., Brown D.T., Hager G., Bustin M., and Misteli T. 2004. Global nature of dynamic protein-chromatin interactions in vivo: Three-dimensional genome scanning and dynamic interaction networks of chromatin proteins. Mol. Cell. BioI. 24: 6393-6402. Rakyan V. and Whitelaw E. 2003. Transgenerational epigenetic inheritance. Curro BioI. 13: R6. Sarma K. and Reinberg D. 2005. Histone variants meet their match. Nat. Rev. Mol. Cell Bioi. 6: 139-149. Schubeler D., MacAlpine D.M., Scalzo D., Wirbelauer C, Kooperberg C, van Leeuwen E, Gottschling D.E., 0' eill L.P., Turner B.M., Delrow J., et a1. The histone modification pattern of active genes revealed through genome-wide chromatin analysis of a higher eukaryote. Genes Dev. 18: 1263-1271. Schwartz B.E. and Ahmad K. 2005. Transcriptional activation triggers deposition and removal of the histone variant H3.3. Genes Dev. 19: 804-814. Shi Y., Lan E, Matson C, Mulligan P., Whetstine J.R., Cole P.A., Casero R.A., and Shi Y. 2004. Histone demethylation mediated by the nuclear amine oxidase hom*olog LSD1. Cell 119: 941-953. Sullivan B.A. and Karpen G.H. 2004. Centromeric chromatin exhibits a histone modification pattern that is distinct from both euchromatin and heterochromatin. Nat. Struct. Mol. BioI. 11: 1076-1083. Suto R.K., Clarkson M.J., Tremethick D.J., and Luger K. 2000. Crystal structure of a nucleosome core particle containing the variant histone H2A.Z. Nat. Struct. BioI. 7: 1121-1124. Talbert P.B., Bryson T.D., and Henikoff S. 2004. Adaptive evolution of centromere proteins in plants and animals.]. BioI. 3: 18. Waterborg J.H. 1993. Dynamic methylation of alfalfa histone H3. ]. BioI. Chern. 268: 4918-4921. Weiler K.S. and Wakimoto B.T. 1995. Heterochromatin and gene expression in Drosophila. Annu. Rev. Genet. 29: 577--605. Wieland G., Orthaus S., Ohndorf S., Diekmann S., and Hemmerich P. 2004. Functional complementation of human centromere protein A (CENP-A) by Cse4p from Saccharomyces cerevisiae. Mol. Cell. BioI. 24: 6620-6630. Witt 0., Albig W., and Doenecke D. 1996. Testis-specific expression of a novel human H3 histone gene. Exp. Cell Res. 229: 301-306. Wolffe A.P. 1992. Chromatin: Structure and function. Academic Press, San Diego.
c
H
A
P
T
E
14
R
Epigenetic Regulation of Chromosome Inheritance Gary H. Karpen 1 and-R. Scott Hawl ey 2 Lawrence Berkeley National Laboratory, Department of Genome Biology, University of California at Berkeley, Department of Molecular and Cell Biology, Berkeley, California 94720 ]Stowers Institute for Medical Research, Kansas City, Missouri 64110 and Department of Physiology, University of Kansas Medical Center, Kansas City, Missouri 66160 1
CONTENTS 1. Introduction, 267 1.1
How Is Chromosome Inheritance Accomplished?, 267
1.2
What Elements Are Required for Chromosome Inheritance?, 267
2. Epigenetic Regulation of DNA Replication, Repair, and Telomeres, 268
4.2
Heterochromatin Pairing Facilitates Segregation in Drosophila Females, 281
4.3
Role of the Centromere in Facilitating Achiasmate Segregation in Budding Yeast, 282
4.4
The Heterochromatin-associated Ph1 Locus in Maizl;! and Its Role in Mediating hom*ologous Versus Homeologous Pairing, 283
2.1
Initiation of DNA Replication Is Controlled by Epigenetic Mechanisms, 268
2.2
DNA Repair Involves Epigenetic Alterations in Chromatin Structure, 269
5.1
Segregation Distorter in Drosophila Males, 283
5.2
Epigenetic Control of Telomere Structure and Function, 270
Paternal Chromosome Loss in Sciara and Mapping of the Response Element, 284
5.3
Paternal Chromosome Loss in Nasonia, 284
5.4
Knob 10 in Corn-The Role of Heterochromatic "Knob" Sequences in Facilitating Chromosome Segregation at Meiosis I, 284
2.3
3. Epigenetic Regulation of Centromere Identity and Function, 273 3.1
Centromere Structure and Function in Different Eukaryotes, 273
5.
Heterochromatin and Meiotic Drive, 283
6. The Silencing of Genes by Unpaired DNA during Meiosis, 285
3.2
Centromeric Sequences Are Not Necessary or Sufficient for Kinetochore Formation and Function, 274
6.1
3.3
The Unusual Composition of Centromeric Chromatin, 275
Meiotic Silencing of Unpaired DNA during Meiosis in Neurospora, 285
6.2
3.4
Models for Centromere Structure, Function, and Propagation, 278
Silencing of Unsynapsed Chromosomes in the Mouse, 286
6.3
Sex Chromosome Dysfunction in Drosophila, 286
3.5
Epigenetics and Centromere Evolution, 279
4. Heterochromatin and Meiotic Pairing/Segregation, 280
4.1
7. Perspectives and Conclusions, 286 References, 287
Discovery of a Heterochromatic Pairing Site in Drosophila Male~, 280
265
GENERAL SUMMARY The duplication and transmission of genetic information are accomplished by two types of cell division, mitosis and meiosis, both of which are fundamental to life and evolution. Mitosis is the nuclear division that occurs in somatic cells, involving the identical partitioning of duplicated genetic material by way of chromosomes to daughter cells. Meiosis is a reductional nuclear division that occurs only in the germ cells of multicellular organisms or at particular stages of a unicellular life cycle to produce cells with a haploid genome prior to fertilization (or conjugation in some eukaryotes). Abnormal DNA replication or repair results in mutations and chromosome rearrangements. Perhaps more importantly, chromosome missegregation during nuclear division causes loss or gain of whole chromosomes (aneuploidy). These kinds of "genome instability" affect the viability of cells and fertility of all eukaryotes. Moreover, they play key roles in the etiology of human birth defects and cancer. Early studies suggested that DNA sequence played a predominant role in specifying the sites and functions of chromosomal elements required for proper mitosis and meiosis, such as origins of DNA replication, sites of spin-
die attachment (centromeres and kinetochores), chromosome ends (telomeres), and meiotic pairing sites. However, in the last decade we have come to understand that epigenetic mechanisms regulate many key functions required for genome stability and chromosome inheritance. These include roles in the initiation of DNA replication, DNA repair and recombination, chromosome end protection (telomeres), chromosome movement (centromeres), and segregation of hom*ologous chromosomes in meiosis. At first glance, epigenetic regulation appears to be at odds with the fact that these chromosomal functions are essential for cell and organismal viability, which implies that they should be "hard-wired" in the DNA sequence. However, when viewed through the lens of evolution, epigenetic "plasticity" of chromosomes during mitosis and meiosis appears to be important to compensate for the types of sequence changes and chromosome rearrangements associated with speciation. Understanding the molecular basis for epigenetic regulation of inheritance is fundamental to elucidating these basic biological processes, and for the diagnosis and treatment of human diseases.
E PIG ENE TIC
REG U L A T ION
F
C H ROM
a5a
MEl N HER I TAN C E
1.1 How Is Chromosome Inheritance Accomplished?
Mitosis is a basic type of cell division that produces identical, diploid daughter cells, and is utilized by somatic cells and premeiotic germ cells (Fig. 1a). There are four phases to the mitotic cell cycle, called G I (gap 1, a "resting" stage after mitosis), S (synthesis, DNA replication and gene expression), G z (gap 2, "resting" after S, preparation for mitosis), and M (mitosis, consisting of prophase, metaphase, anaphase, and telophase). During S phase, DNA is replicated, and the duplicated sister chromatids are held together by the establishment of cohesion. At the beginning of mitosis, chromosomes condense, and histone H3 becomes phosphorylated at Ser-10 (H3SlOph) (see Chapter 10). In addition, in most organisms, a single site (the centromere) on each sister chromatid forms a structure referred to as the kinetochore, which mediates attachment to spindle microtubules and serves as a cell cycle checkpoint (Fig~ 1a). Pairs of sister chromatids congress to the metaphase plate in prometaphase, and segregate to the poles in anaphase. These movements are achieved by both the activities of kinetochore-associated
1.2 What Elements Are Required for Chromosome Inheritance?
Both mitotic and meiotic cell divisions require the activities of specific chromosomal elements and binding proteins to accomplish accurate genome duplication and
S-PHASE - - - - - - - - - - - - - -MITOSIS -------------------prophase
metaphase
anaphase
telophase
I
. - "X. :~ " . J.,,''''r*'
DAPI CEN TUBULIN
.'"
b
\, >""t..
'10
replication cohesion
PREMEIOTIC S-PHASE
condensation congression
MEIOSIS I PROPHASE
replication cohesion
lose cohesion segregation
----------------------------------
leptotene
zygotene
condensation
267
microtubule motors (kinesins and dyneins) and regulation of microtubule assembly and disassembly, and also require destruction of sister chromatid cohesion during the metaphase-anaphase transition. Meiosis occurs only in germ cells and is characterized by one round of replication followed by two divisions (meiosis I and II, Fig. 1b); this produces haploid eggs in the female germ line and haploid sperm in the male germ line of metazoans, rather than diploid daughter cells. In meiosis I, the replicated hom*ologs are paired and segregate together. The sister chromatids of each hom*olog do not segregate from each other until meiosis II. Normal segregation during meiosis requires frequent recombination between hom*ologs, as well as specialized cohesion in the centromere region that ensures the association of sister chromatids during meiosis I (Watanabe 2005).
1 Introduction
a
a
pachytene
diplotene
.............................. condensation
initiate pairing & synapsis recombination
diakinesis
~
complete recombination
Figure 1. Stages of Mitosis and Meiosis (a) Images from Drosophila cells indicate the behaviors of chromosomes (blue, text descriptions below), microtubules (green), and centromeres (red) in interphase and mitosis. (b) Chromosome behaviors are shown for maize meiosis I prophase, which is the stage in which hom*olog pairing, synapsis, and recombination occur (images supplied by Hank Bass and Shaun Murphy, Florida State University). Key chromosome functions that occur during each stage are indicated below (blue text). Subsequently, hom*ologs segregate to opposite poles during meiosis I anaphase, completing a reductional division. Sister chromatids only separate during meiosis II (see Fig. 9).
268
C HAP T E R
74
replication origin
sister chromatid cohesion
•
centromere
!
peri centromeric heterochromatin
/'
---+J~r--Lr-"""-Mrf-J:...1o.J.Jlr....J...I,I~~"'-L--''''''-,-t...
chromosome segregation (Fig. 2). DNA replication is initiated at "origins," which in most eukaryotes are not strictly sequence dependent (discussed in Section 2). Sister chromatid cohesion then becomes visible along the entire length of the chromatids in mitotically dividing cells, although there is a higher concentration of cohesins in pericentromeric heterochromatin. Centromeres are large regions composed of DNA and specialized chromatin proteins that serve as the foundation for kinetochore formation and are critical for spindle attachments and normal meiotic and mitotic chromosome segregation (discussed in Section 3). In most eukaryotes, there is one and only one centromere per chromosome. Loss of the centromere results in spindle attachment failures and chromosome loss, and the presence of more than one centromere leads to attachments of the same chromatid to both poles, which causes chromosome bridges arrd fragmentation during anaphase. Rarely, organisms (e.g., the roundworm Caenorhabditis elegans) contain "polycentric" or "holocentric" chromosomes, in which kinetochores are present in multiple regions (for example, see Fig. 5 of Chapter 13). These chromosomes utilize special mechanisms to ensure attachment and segregation of sister chromatids to opposite poles. Telomeres are specialized chromatin structures found at the ends of chromosomes to protect them from degradation or recombination, and ensure complete DNA duplication. Meiotic segregation also requires centromeres, telomeres, cohesion, and origins of replication. However, additional elements and modification of centromere behavior are required to ensure hom*olog pairing and segregation in meiosis I (discussed in Section 4). The essential nature of chromosome inheritance suggests that the specification and localization of inheritance elements should be "hard-wired" in the DNA sequence. Thus, it is surprising that many elements, including those outlined in this section, are instead regulated epigenetically, especially in multicellular eukaryotes. In summary, elements that are prone to epigenetic regulation to ensure faithful chromosome inheritance include DNA replication origins, telomeres, sister chromatid cohesion sites, centromeres, and hom*olog pairing sites.
Figure 2. Chromosome Inheritance Elements The diagram indicates the chromosomal elements essential for normal duplication (replication origins) and inheritance (centromeres, cohesion, telomeres) through mitosis and meiosis. Normal meiotic segregation also requires hom*olog pairing sites (not shown) and, in most cases, recombination.
2 Epigenetic Regulation of DNA Replication, Repair, and Telomeres
The first step in ensuring inheritance of genetic information involves the faithful duplication of the entire genome, which is accomplished by a process known as DNA replication. Unfortunately, errors occur during replication, causing changes in the DNA (mutations), which can be harmful to organismal viability. In addition, environmental agents such as radiation produce mutations, including DNA base changes, deletions, insertions, and rearrangements. Cells respond to DNA damage by activating DNA repair pathways, which do an amazing job of maintaining genome fidelity and stability. Finally, duplication of linear DNA molecules poses challenges that are overcome by the presence of specialized sequences and structures at chromosome ends, known as telomeres. Recent studies have shown that these basic processes required for accurate duplication and maintenance of DNA sequences are affected by epigenetic mechanisms that regulate chromatin. 2.1 Initiation of DNA Replication Is Controlled by Epigenetic Mechanisms
The faithful copying of DNA is accomplished by the 5' to 3' action of DNA polymerases, and starts at specific sites called "origins." A "bubble" consisting of two replication "forks" is formed at origins, and replication proceeds bidirectionally until forks generated by the next origins are met. Domains replicated from a single origin are usually quite large, covering hundreds of kilobases (Aladjem and Fanning 2004). Origins in the yeast Saccharomyces cerevisiae (also known as ARSs, for autonomously replicating sequences) function ectopically when cloned into plasmids, and usually upon integration into other chromosomal sites, indicating that initiation of replication is regulated by specific DNA sequences. ARSs are approximately 100-150 bp in size and contain one or more copies of an essential approximately ll-bp AT-rich sequence, plus other conserved elements (Weinreich et al. 2004). A protein complex known as the origin recognition complex (ORC) is required for initiation and is
E PIG ENE TIC
REG U L A T ION
responsible for the recruitment of the prereplication complex (PRC) that includes minichromosome maintenance (MCM) proteins (Prasanth et al. 2004). For metazoans, replication origins can be identified in situ, but they are typically inactive upon cloning and reintroduction, suggesting that initiation in these organisms is regulated epigenetically, rather than by strict DNA sequence dependence (Aladjem and Fanning 2004). Although ORC and MCM proteins are conserved in metazoans, the factors and mechanisms responsible for regulating origin activity in these organisms remain mysterious. In addition, it is unclear whether chromatin structure affects the processivity of replication forks. Clues about how metazoan origins might be regulated epigenetically come from detailed studies in S. cerevisiae. Although DNA sequences at origins are necessary and, for the most part, sufficient for replication initiation in this organism, there is clear evidence for chromatin structure effects on origin activity (Weinreich et al. 2004). Microarray analysis has shown that not all of the 332 sites of bidirectional replication, or the 429 sites bound by ORC and MCM proteins, are active in every cell cycle. Chromosomal context and chromatin structure affect the ability of a putative origin to be active in replication initiation. For example, an ARS located in the silenced mating-type loci cannot initiate replication unless moved to other chromosome locations. Another example of epigenetic regulation of origins involves the approximately 100-200 genes encoding ribosomal RNA (rDNA), which are present in a tandemly repeated array (Pasero et al. 2002). Each 9.1-kb rDNA unit contains an ARS, which initiates replication when inserted into a plasmid or elsewhere in the genome. However, only about 20% of rDNA origins are active during each S phase. Origins can also be regulated to fire at different stages during S phase (Weinreich et al. 2004). For example, origins near telomeres are normally active in late S phase
telomere
centromere
late
•
269
(see Section 2.3), but insertion of the same sequences into circular plasmids results in replication early in S phase (Fig. 3) (Ferguson and Fangman 1992). Late replication can be recapitulated by linearization of the plasmid and telomere addition, which places the ARSs near the ends. These results demonstrate that both origin activity and the timing of origin firing during S phase are regulated epigenetically. Further evidence comes from the observations that S-phase timing, and origin silencing at rDNA, telomeres, and mating-type loci, are regulated by many of the histone modifications and proteins responsible for epigenetic gene silencing, including the SIR proteins (see Chapter 4). 2.2 DNA Repair Involves Epigenetic Alterations in Chromatin Structure
Accumulation of DNA damage, due to replication errors or exposure to environmental agents, can lead to deleterious mutations, genome instability, cancer, cell senescence, and death. DNA damage is repaired by error-correction mechanisms during DNA replication, as well as independent pathways that act during G2 • Cells contain "checkpoints" that identify the presence of DNA damage and arrest or delay the cell cycle until repair is complete; these pathways also induce cell death (apoptosis) if the damage is not repaired, which contributes to organismal viability by removing defective cells. These processes are normally extremely efficient; for example, human skin cells exposed to the UV radiation present in sunlight con~ tain a surprisingly large number of DNA lesions, the vast majority of which are properly repaired (Friedberg et al. 1995). Individuals who are deficient in repair, due to mutations in one or more components of the checkpoint or repair pathways, suffer from a variety of diseases, including predispositions to colon, breast, and skin cancers, and premature aging.
0 -ns-e-rt-or. ;i~ in· 0
repii~:~in~" origin
0 F C H ROM 0 S 0 MEl N HER I TAN C E
,
_L-..L....L.l.1.-.....-...,..-...-...IL..l...I.-..L--'·....-i
into plasmid
linearize
~d
•
add telomeres
early
late
replicating
repl icati ng
Figure 3. Epigenetic Regulation of Replication Timing in Yeast One of the best examples of epigenetic effects on replication was demonstrated in the yeast Saccharomyces cerevisiae. Insertion of a late-replicating origin into a circular plasmid results in early replication in 5 phase, and late replication is restored upon linearization of the plasmid and addition of telomeres (Ferguson and Fangman 1992; Weinreich et al. 2004).
270 •
C HAP T E R
74
Early studies successfully identified molecules and pathways that recognize different types of lesions in DNA, such as double-strand breaks (DSBs) and pyrimidine dimers, and that recruit specific complexes to repair the damage. However, the packaging of DNA into chromatin could potentially block access of factors involved in recognizing sites of damage, or effecting repair, similar to the repressive impact of heterochromatin on gene expression (Hassa and Hottiger 2005). Repair, however, employs ATP-dependent chromatin-remodeling complexes, which presumably act to "expose" the defective DNA for repair. More recent studies have shown that specific changes to the chromatin template, such as the presence of histone variants and posttranslational histone modifications, play key roles in the recognition of DNA lesions and the recruitment of the appropriate repair machinery (Hassa and Hottiger 2005). For example, the presence of DSBs results in rapid phosphorylation of the histone H2A variant H2AX at serines 136 and 139 (known as yH2AX). H2AX phosphorylation is required for the accumulation of repair proteins to large (megabase) regions that surround DSBs and for the assembly of repair "foci," rather than the initial recruitment of repair complexes to the primary sites of DNA damage (Bassing et al. 2002; Celeste et al. 2002). These observations suggest that yH2AX "spreading" from DSBs acts to amplify the signal emanating from DSBs, enhancing the recruitment and perhaps retention of repair factors (Fernandez-Capetillo et al. 2004). In addition to involvement in DSB repair, H2AX phosphorylation affects V(D)J recombination in mammalian lymphocytes and also acts as a suppressor of genome instability and tumors in mice (Fernandez-Capetillo et al. 2004). Other types of chromatin changes, such as histone acetylation, SUMOylation, ubiquitination, and methylation have also demonstrated roles in successful repair of DNA lesions. For example, methylation of histone H3 at lysine 79 (H3K79me) is required for recruitment of the repair checkpoint protein 53BP1 to DSBs, and is mediated by the DOn histone lysine methyltransferase (HKMT) (Huyen et al. 2004). Interestingly, induced DNA damage does not change H3K79 methylation levels, suggesting that this modification is not added in response to DSBs. One possibility is that 53BP 1 recruitment and "sensing" of DSBs involves exposure of preexisting H3K79 methylations in response to chromatin remodeling at sites of DNA lesions. Although our current understanding of the impact of these and other chromatin factors on DNA repair suggests roles in signaling and recruitment of appropriate complexes to DNA lesions, it is likely that future studies will reveal more ways in which epigenetic mechanisms regulate
pathways that maintain genome stability. It is important to note that the role of chromatin in DNA repair is dynamic and occurs in response to damage. Although histone modifications and other epigenetic regulatory proteins play key roles, the changes are not heritable through cell division, unlike the other examples discussed in this chapter. 2.3 Epigenetic Control of Telomere 5tructure and Function
The ends of linear eukaryotic chromosomes are specialized sites known as telomeres, which serve three essential functions. First, telomeres ensure that DNA replication includes the very ends of the chromosomes, overcoming the "endreplication problem" (Lue 2004). Second, telomeres protect the ends of the chromosomes from degradation and inhibit fusions with other chromosomes. Third, in many but not all organisms, telomeres facilitate chromosome pairing in meiosis. In most eukaryotes, telomeres are composed of simple, short repeats that are restored by the enzyme, telomerase. Telomere functions are regulated by both sequence-based and epigenetic mechanisms. The end-replication problem arises because DNA polymerases require a primer to initiate 5' to 3' "lagging strand" synthesis; the consequence of this restricted enzyme activity is that replication cannot proceed all the way to the end of the chromosome (Lue 2004). Two mechanisms are utilized to overcome this problem. The predominant mechanism used by most organisms, including yeasts, mammals, and plants, involves an unusual enzyme complex known as telomerase. Telomeres in most eukaryotes are composed of simple, 6-bp repeats extending for tens to hundreds of kilobases. Telomerase complexes contain a reverse transcriptase-like enzymatic activity, as well as RNAs that have hom*ology with the telomeric repeats. In essence, end replication is accomplished by targeting of the complex to telomeric repeats via the RNA component, followed by reverse transcription (3' to 5') to produce new repeats. Interestingly, loss of telomerase activity and shortened telomeres are correlated with cell senescence and aging, and conversely, cancer cells display enhanced telomerase activity and elongated telomeres (Blasco 2005). There are also telomerase-independent mechanisms that maintain chromosome ends (Louis and Vershinin 2005). One well-studied alternative system appears to be restricted to Drosophila and other dipterans. These organisms lack an identified telomerase, and the ends do not contain the simple, short repeats found in most other eukaryotes. Instead, the ends of Drosophila chromosomes are composed of scrambled clusters of different non-LTR
E PIG ENE TIC
REG U L A T ION
(long terminal repeat) retrotransposons ranging in size from 3 to 5 kb, and other repeats (TAS, for telomere-associated repeats) (Biessmann and Mason 2003). These transposons encode reverse transcriptase enzymes (hence retrotransposon), suggesting that there may be an evolutionary relationship with the more standard telomerase mechanisms. The major difference, however, is that Drosophila chromosomes are not replicated to the very end; they lose about 70 bp per fly generation, roughly the amount expected from the end-replication problem. This loss does not cause deletion of essential genes, because the telomeric and subtelomeric repeat domains are about 50-100 kb in length, and it would take many generations to lose enough DNA to reach the genic regions. The loss of telomeric sequences, however, is compensated by infrequent addition of non-LTR retrotransposons (Biessmann and Mason 2003). Epigenetic regulation affects telomere function, and gene expression of loci residing in the region. "Naked" telomeric DNA or internal DSBs both result in chromosome fusions and aneuploidy. Barbara McClintock first described a phenomenon known as the breakage-fusionbridge cycle, in which fusions between broken chromosomes, or chromosome ends, produce dicentric chromosomes and anaphase bridges, which generate further breakage. Evidence for the epigenetic regulation of telomere end protection comes from studies in Drosophila, which showed it to be independent of DNA sequence. A broken chromosome end in Drosophila can behave as a DSB in one generation, but acts as a fully functional telomere subsequently, without any addition of retrotransposons or any sequence changes (Ahmad and Golic 1998). Furthermore, any end generated in Drosophila (known as terminal deletions) can be packaged as a telomere and protected against fusion events (Karpen and Spradling 1992). Additionally, telomere function in Schizosaccharomyces pombe depends on the Taz1 protein (Miller and Cooper 2003) and telomeric chromatin, in a manner that is independent of canonical telomeric repeats (Sadaie et al. 2003). Telomeric regions contain chromatin modifications and properties that are similar to pericentromeric heterochromatin described in Section 3. Characterization of the epigenetic mechanisms that regulate telomeric and subtelomeric regions came from studies of gene expression in yeasts and Drosophila, but also occur in humans. Euchromatic genes inserted into telomeric regions are variably silenced. This is referred to as telomere position effect (TPE) and is similar to position-effect variegation (PEV) induced by adjacent centromeric heterochromatin
a
F C H ROM
a
SaM E
I N HER I TAN C E
•
271
in flies and S. pombe (for more detail, see Chapters 5 and 6, respectively). In budding yeast, many of the distinct chromatin-related factors, such as the SIR proteins that affect mating-type silencing, also affect telomere-induced silencing (see Chapter 4). Surprisingly, almost none of the genes known to regulate PEV in Drosophila (Suppressors and Enhancers of Variegation, Su(var)s and E(var)s described in Chapter 5) have any effect on telomeric silencing. This suggests that PEV and TPE are mediated at least in part by different pathways (Cryderman et al. 1999; Donaldson et al. 2002). Heterochromatin protein 1 (HP1, a Su(var) gene product) and H3K9 methylation, which are key components of heterochromatin-mediated silencing (see Chapter 8), are present at Drosophila telomeres and are required for telomere elongation (Fig. 4) (Perrini et al. 2004). Deletion ofHPl or its binding partner HOAP (for HP1/0RCassociated protein) results in a very high frequency of telomeric fusions (Cenci et al. 2003). HP1 typically is recruited to chromatin through its affinity to methylated H3K9 via the chromodomain. Interestingly, telomere capping by HP1 is independent of H3K9 methylation, suggesting that end protection is mediated by an alternative mechanism involving direct binding to telomeric DNA or non-telomeric sequences present in terminal deletions (Fig.4a) (Perrini et al. 2004). One attractive model is that HP1 binds and protects ends independent of DNA sequence, then recruits an unknown H3K9 HKMT; local methylation ofH3K9 would then recruit more HP1 to the region, which promotes the spreading of telomeric silencing (Fig. 4). This mechanism likely does not require the RNAi pathway, involved in establishing and silencing centromeric heterochromatin (see Chapter 8), but this component of the model needs to be tested directly. Recent studies have shown that telomerase-dependent telomere elongation is also regulated epigenetically in mammals (Lai et al. 2005). For example, mice deleted for both copies of the H3K9 HKMTs, Suvar39hl/2, contain telomeres with reduced levels of H3K9me2 and H3K9me3 and exhibit abnormally long telomeres (Fig. 4b) (Garcia-Cao et al. 2004). These results suggest that Suv39h1/2 HKMT activity transduces the H3K9me modification into the di- and tri-methylated forms, facilitating the binding of HP1 hom*ologs Cbx 3 and 5, which are required for the assembly of normal telomeric chromatin structure and regulation of telomere length. Finally, meiotic recombination and chromosome transmission are also affected by the epigenetic modifications that occur at telomeres. For example, loss of Ndj 1, a telomere protein necessary for both telomere
Figure 4. Telomere Function Is Epigenetically Regulated in Flies and Mammals (a) In Drosophila, Heterochromatin Protein 1 (HP1) binds telomeric DNA independent of its chromodomain, and "caps" telomeres, which ensures normal segregation by blocking telomere fusions (Fanti et al. 1998; Perrini et al. 2004). HP1 then recruits an unknown histone methyltransferase (HKMT; not Su(var)3-9) that tri-methylates H3K9 on nearby nucleosomes; HP1 binds H3K9me3 through its chromodomain, which in turn recruits more HKMT, and successive rounds of HP1 binding/HKMT recruitment promote spreading of silent chromatin through subtelomeric regions. (b) In mice, knock-outs of both Suv39 HKMT loci results in reduced levels of H3K9me3 and me2, and increased H3K9me modifications, altered chromatin structure, and changes in levels of proteins that bind di- and tri-methylated H3K9 (J. Cbx 3 and 5), H3K9me (iCbx 1), and TERFs 1 and 2 (not shown) at telomeres (Garcia-Cao et al. 2004). These changes are correlated with extended telomere length, suggesting that tri-methylation of H3K9 by Suv39hs is required for normal telomerase function and regulation of telomere size.
bouquet formation (i.e., clustering) and meiotic recombination (Wu and Burgess 2006), confers a severe reduction in telomere deletion rates in the budding yeast (Joseph et al. 2005). Joseph et al. propose that Ndjl facilitates telomere deletion "by promoting telomeric interactions during meiosis, resulting in an effective increase
in the factors required for deletion." Similarly, mutants that are defective in the transcriptional silencing of genes placed near telomeres display severe defects in meiotic pairing and recombination, resulting in chromosome missegregation during meiosis (Nimmo et al. 1998). Thus, the epigenetic events that control both
E PIG ENE TIC
REG U L A T ION
telomere length and transcriptional competence also appear to be employed in processes controlling chromosome behavior during meiosis. 3 Epigenetic Regulation of Centromere Identity and Function
Normal inheritance of genetic material requires that chromosomes segregate faithfully during mitosis and meiosis, after the genome is accurately duplicated and repaired during S phase. Centromeres were originally defined in 1880 by Flemming as a cytologically visible "primary" constriction in the chromosome. In the early 1900s, centromeres were also defined genetically as chromosomal sites essential for normal inheritance, and as regions of greatly reduced or absent meiotic recombination. We now define the centromere (CEN) as the DNA plus chromatin proteins responsible for kinetochore formation. The kinetochore is a proteinaceous structure facilitating the attachment to and travel along microtubules, plateward during prometaphase and poleward during anaphase of mitosis and meiosis. The kinetochore also serves as the site of action for a key cell cycle checkpoint, known as the spindle assembly checkpoint (SAC) or mitotic checkpoint (Cleveland et al. 2003). A key question for organisms with mono-centric chromosomes concerns how one and only one site per chromosome is associated with centromere function (known as "centromere identity") and how this information is transmitted from one cell or organismal generation to the next ("centromere propagation"). Here we present the evidence that in most eukaryotes centromere identity and propagation are regulated epigenetically through chromatin structure, rather than by specific DNA sequences (Carroll and Straight 2006). A summary of the key pieces of data includes: (1) Centromeric sequences are not conserved between even closely related species, or even among chromosomes in a single species, (2) centromeric DNA is not necessary or sufficient for kinetochore formation, and (3) centromere positioning along a chromosome displays dramatic plasticity during evolution.
3.1 Centromere Structure and Function in Different Eukaryotes
Studies in the yeast S. cerevisiae during the 1980s led to the first cloning and analysis of a eukaryotic centromere. A 125-bp structure present on all 16 S. cerevisiae chromosomes was shown to be both necessary and sufficient for normal centromere function (Bloom et al. 1989); even
0 F C H ROM 0 5 0 MEl N HER I TAN C E
•
273
single-base changes in the highly conserved elements I and III resulted in complete loss of function. Thus, centromere identity and propagation in this single-cell eukaryote are determined by DNA sequence. The hope that similar sequence-based mechanisms could regulate centromere identity in other eukaryotes was first dispelled by studies in another "simple" eukaryote, S. pombe. Centromeric sequences in this fission yeast are structurally larger and more complex than observed in S. cerevisiae (Clarke et al. 1986; Nakaseko et al. 1987). Nonhom*ologous 4- to 5-kb-long "central core" sequences, which are the sites of kinetochore formation, are flanked by various classes of inverted repeats that are shared among the three chromosomes. A minimum of 25 kb, containing the nonrepetitive central core, inner repeats, and a portion of the outer repeats is absolutely required for centromere function and stable chromosome transmission (Baum et al. 1994). Reasonable centromere function is observed for transfected plasmid constructs that carry a central core plus inner repeats (i.e., the central domain) and two flanking outer repeats. Interestingly, the deletion of inner repeats compromises meiotic sister chromatid segregation, demonstrating that centromeric regions play roles in processes other than kinetochore assembly. Indeed, both kinetochore and cohesion domains are closely linked and important for proper chromosome segregation. Although centromeric regions in multicellular eukaryotes are even larger and more complex than in S. pombe (hundreds to thousands of kilobases of repeated DNAs), the overall organization and function of fission yeast centromeres has served as an excellent model for centromeres in mammals, plants, and insects. Centromeres in these organisms are embedded in the large heterochromatic blocks present on each chromosome, which are predominantly composed of satellite DNAs (simple, short repeats) and transposons. These centromeric regions are composed of subdomains responsible for different functions, most notably kinetochore formation and sister cohesion. Centromeric sequences, however, are not conserved among eukaryotes, or even among the different chromosomes in an individual species. It is the epigenetic composition of centromere functional subdomains that shows conservation, notably through histone variant composition and histone modification patterns, which appear to be epigenetically regulated. In the nematode C. elegans and in other species, the holocentric chromosomes recruit and assemble centromeric proteins along the entire chromosome length (Dernburg 2001). Specific worm sequences are apparently not required, as concatemers of lambda and many
274 •
C HAP T E R 1 4
other types of DNA are stably transmitted. Proteins are recruited in "bundles" in prophase, but by metaphase are spread evenly on the poleward face of chromosome arms, suggesting that many areas of the C. elegans genome can support kinetochore assembly in an epigenetic manner. Despite obvious differences with monocentric chromosomes, it is possible that organizational and structural attributes, such as 3D spiraling or looping of CEN DNA, are conserved (see Section 3.3).
a Human •
The large size and complexity of centromeric sequences in multicellular eukaryotes have made it difficult to analyze DNA sequence requirements with the kinds of defined constructs used so successfully in the yeast studies. Human artificial chromosomes (HACs) have nonetheless been generated at low frequency by transfecting tissue culture cells with arrays of satellite DNAs, but they exhibit a high rate of mitotic instability (Rudd et al. 2003). We know, however, that HACs are formed by concatemerization of the introduced satellite arrays, yet some alpha satellite arrays cannot form centromeres de novo, suggesting a requirement for multiple, unknown steps or factors. More recent studies have shown that the unique properties and components of centromeric chromatin (as explained in Section 3.3) are present on both the satellite arrays and non-centromeric sequences (e.g., plasmid vector and selectable marker sequences) in HACs (Lam et al. 2006). Thus, the sufficiency of specific DNA sequences in assembling and maintaining functional human centromeres is still unclear. The first indication that centromere identity and propagation are regulated epigenetically resulted from studies of "minimal" centromere constructs in S. pombe (Steiner and Clarke 1994). A low frequency of the construct transformants exhibited a switch from reduced centromere function to high "active" centromere activity (0.6% of cells), which could subsequently be perpetuated in a lineage for many generations. Thus, the same DNA sequences can display two functionally different, heritable states, similar to observations of epigenetic effects seen for PEV (Chapter 5) or TPE (Chapter 4) on gene expression. Other observations strongly suggest a primary role for epigenetic mechanisms in determining centromere identity and forming kinetochores in multicellular eukaryotes. First, DNA sequences normally associated with centromeres are not sufficient for function. For example, only a subset of mouse and human heterochromatic satellite
telomere
o
centromere
)
kinetochore
o
neocentromere
~
~
)
deletion
-~-
• ~
(~)
mar del (10)
10
b 3.2 Centromeric Sequences Are Not Necessary or Sufficient for Kinetochore Formation and Function
(
Drosophila telomere and subtelomeric euchromatin heterochromatin\l. ~
/
~
pericentromeric heterochromatin / \..
o._-e::===:CC:~::C::::3Dp 8-23
I
320 kb
l'
. ,.
,.
inversion
/£J-~::::::=:::::r::::====tc=3 y 238
C?-0 neocentromere
Figure 5. Neocentromere Formation in Humans and Flies (0) Human chromosomes carrying neocentromeres, which exhibit
centromere function/kinetochore formation in the absence of centromeric DNA, are usually associated with gross rearrangements (Amor and Choo 2002). In this classic example, a chromosome-10derived neocentromere (mar(del)10), whose structure indicates formation via a large interstitial deletion that removed the endogenous centromere (gray dotted lines). Mar(del) 10 was recovered in an individual whose karyotype also contained a ring chromosome (ring(del)10, not shown) that contains the DNA from the deleted region. The order of events for human neocentromeres is unclear; neocentromere formation could occur first, producing a dicentric chromosome that subsequently undergoes rearrangements, or neocentromere formation could occur after deletion of the endogenous centromere. (b) Neocentromeres can be generated experimentally in flies from a molecularly defined minichromosome. A 320-kb fragment of euchromatin and telomeric chromatin, which contains no centromeric DNA, can be separated from the rest of the minichromosome by irradiation. This fragment, which should be "acentric:' can become a functional neocentromere that is propagated faithfully through mitosis and meiosis, and contains centromere and kinetochore proteins normally restricted to the endogenous centromere (Blower and Karpen 2001). However, neocentromere formation requires proximity to the endogenous centromere (420 kb), as in the inversion derivative "1238; furthermore, neocentromere formation does not occur on either side of the centromere when peri centric heterochromatin is present (Maggert and Karpen 2001). These results suggest that neocentromere formation occurs via epigenetic spreading of centromeric chromatin into adjacent euchromatin, followed by epigenetic propagation of centromere identity and function. The blocking of this process by heterochromatin is consistent with the observation that overexpressed CENP-A is incorporated ectopically into euchromatin but not heterochromatin (Heun et al. 2006) and suggests that the extent of centromeric chromatin is determined by two epigenetic processes: CENP-A loading and spreading, and heterochromatin formation/blocking.
E PIG ENE TIC
REG U L A T ION
sequences are associated with centromere function (Lam et al. 2006). Additionally, in functional chromosomes with two regions of centromeric satellites (dicentrics) observed in flies and humans, one of the regions loses the ability to form a kinetochore (Sullivan and Willard 1998). Second, centromeric sequences are not necessary for kinetochore formation, since non-centromeric DNA can acquire and faithfully propagate centromere function through a process known as "neocentromere formation" (Fig. 5a). Many functional neocentromeres have been identified in humans, and sequence analysis has shown that the new kinetochore-forming regions have not acquired satellite DNAs. The regions flanking the new kinetochore, however, have acquired epigenetic properties comparable to the corresponding regions in endogenous centromeres (i.e., pericentromeric heterochromatin), such as H3K9 methylation and HP1 binding (Lo et al. 2001). Although the mechanism for neocentromere formation in humans is unknown, neocentromeres have been generated experimentally in a model system. In Drosophila, neocentromeres are produced from minichromosomes when non-centromeric DNA and an endogenous centromere are juxtaposed (Fig. 5b) (Maggert and Karpen 2001). Thus, proximity to a functional centromere is required for neocentromere activation in Drosophila, su'ggesting that one mechanism for centromere gain is spreading of centromeric proteins in cis onto adjacent, non-centromeric regions. Once this spreading has occurred, centromere function is then propagated epigenetically at this new site. Interestingly, neocentromere formation is inhibited when heterochromatin is present between the endogenous centromere and the neocentromere-forming region, suggesting that additional epigenetic mechanisms playa role in determining centromere size.
a
0 F
C H ROM 0 S 0 MEl N HER I TAN C E
3.3 The Unusual Composition of Centromeric Chromatin
The evidence for epigenetic regulation of centromere identity and propagation points to the likelihood that chromatin structure and composition are the key determinants, rather than primary DNA sequences. Here, we discuss the distinct components and structures found in CEN chromatin, and the surprising observation that these properties are conserved among distantly related eukaryotes. The CENP-A family of centromere-specific histone H3-like proteins is present in centromeric nucleosomes in all eukaryotes (Fig. 6a). They serve as both the structural and functional foundations for the kinetochore and are excellent candidates for an epigenetic mark that establishes and propagates centromere identity (Cleveland et al. 2003). Unlike most kinetochore proteins that are assembled during mitosis, CENP-A is present at centromeres throughout the cell cycle, which is one of the first indications of its importance to centromere identity. CENP-A containing chromatin also provides the base that is essential for the recruitment of kinetochore proteins, the establishment of spindle attachments, and normal chromosome segregation in yeasts, worms, flies, and mammals (Carroll and Straight 2006). Reciprocal epistasis experiments have shown that CENP-A is the first protein in the kinetochore assembly pathway, consistent with its physical location in chromatin at the base of the kinetochore in mitotic chromosomes. Further evidence for
, euchromatin
"-
centromeric
peri centromeric
~OOlOCCic~XN)CPlC-Al()cOhOroocmooca!..xinlClCH,030000c:lOCh)CelQ;=:~oa~~
DNA
275
Finally, chromosome rearrangements are a hallmark of evolution and speciation. These changes are accompanied by centromere gains, losses, and movements with respect to genome sequences (Ferreri et al. 2005). Such plasticity is best explained if centromere identity is determined epigenetically, as described in Section 3.5.
b
CENP-A
•
«- ------ ----- ----- ----- -----»« 0.2 to 1.5 Mbs
»
many Mbs
Figure 6. The Organization of Centromeric Chromatin (a) CENP-A is a highly conserved, centromere-specific histone variant. Image shows localization exclusively to centromeres in Drosophila mitotic chromosomes. (b) CEN chromatin in flies and humans contains interspersed blocks of H3-containing and CENP-A-containing nucleosomes, and is flanked by pericentromeric heterochromatin (Blower et al. 2002).
276 •
C HAP T E R
1 4
the importance of CENP-A in kinetochore formation comes from overexpression studies in flies, in which CENP-A mislocalization to non-centromeric regions produces functional ectopic kinetochores (Heun et al. 2006). Therefore, because this histone variant is essential for centromere function, we specifically define CEN chromatin as the region of DNA and proteins associated with CENP-A. The structure of CENP-A-containing nucleosomes is unusual compared to canonical histone cores containing H3, H2A, H2B, and H4. CENP-A nucleosomes can be assembled in vitro from purified CENP-A and histones H2A, H2B, and H4, consistent with previous observations indicating that they are hom*otypic in vivo (i.e., they contain two copies of CENP-A and not one copy of H3 and one of CENP-A) (Yoda et al. 2000). Detailed biophysical analysis showed that the interface between CENP-A and H4 is different from the H3-H4 interface, and the H4 interacting domain is sufficient to target CENP-A to centromeres in the presence of endogenous CENP-A (Black et al. 2004). The replacement of H3 by CENP-A in centromeric nucleosomes initially suggested that CENP-A constituted all of the chromatin associated with the kinetochore. In S. pombe, CENP-A is uniformly distributed across the 5- to 7-kb central core regions. The large heterochromatic domains that contain H3K9 methylation and heterochromatin proteins flank these cores (Pidoux and Allshire 2004). However, detailed cytological and immunoprecipitation studies have revealed that centromeric chromatin has a more complex composition and organization in multicellular eukaryotes. Drosophila, human, and rice centromeres contain interspersed blocks of H3 and CENP-A-containing nucleosomes (collectively called CEN chromatin) flanked by even larger blocks (hundreds of kilobases to megabases) of pericentromeric heterochromatin (Fig. 6b) (Blower et al. 2002; Yan et al. 2005). The interspersion of H3 and CENP-A domains raised key questions about the epigenetic nature of CEN chromatin. In particular, are H3 subdomains within the CEN chromatin of multicellular eukaryotes modified like heterochromatin or euchromatin? Or are they uniquely marked? Furthermore, is each interspersed CENP-A/H3 unit in larger eukaryotic centromeres equivalent to a single S. pombe centromere? These questions were addressed by examining the posttranslational modifications that characterize the interspersed blocks of H3 and CENP-A nucleosomes, which revealed even greater complexity. Surprisingly, the interspersed H3 domains in humans and flies contain H3K4me2, a mark usually associated with euchromatin and, moreover, lack
the H3K9me2 and me3 associated with flanking heterochromatin (Fig. 7a) (Sullivan and Karpen 2004; Lam et al. 2006). However, like heterochromatin, multiple forms of H3 and H4 acetylation were absent from the interspersed H3 nucleosomes, as was H3K4me3. Thus, the H3 nucleosomes within CEN chromatin display a pattern of modifications that are distinct from canonical euchromatin or heterochromatin (Fig. 7b). These results also suggest that fly and human centromeres are not composed simply of repeated, S. pombe-like centromeres. However, it is important to note that the overall organization of the centromere regions is conserved, such that the entire CENP-A chromatin domain is flanked by pericentromeric heterochromatin that contains H3K9 methylation in all multicellular eukaryotes and in S. pombe. What are the possible functional roles of histone modifications in CEN and flanking chromatin? Distinct chromatin states in the CEN region are likely to contribute to the diverse properties of centromeric domains, such as differential replication timing of the CEN and flanking heterochromatin (Sullivan and Karpen 2001; Blower et al. 2002). Flanking pericentromeric heterochromatic modifications may also maintain centromere size by creating a barrier against expansion of CEN chromatin. In Drosophila, CEN chromatin readily spreads into neighboring sequences when flanking heterochromatin is removed, allowing neocentromere activation (Fig. 5b) (Maggert and Karpen 2001), and overexpression of CENP-A results in mislocalization to euchromatin, but not heterochromatin (Heun et al. 2006). Interestingly, overexpression of CENP-A in human cells results in spreading of CEN chromatin and alterations in H3K9 methylation in the flanking regions (Lam et al. 2006). Thus, centromere size appears to be determined by a balance between two epigenetic states: CEN chromatin and flanking pericentromeric heterochromatin. Proteins required for sister chromatid cohesion, which is established in conjunction with DNA replication in S phase, are most highly concentrated in the heterochromatin that flanks the centromere. This distribution appears to contribute to proper bi-orientation of kinetochores in mitosis, as well as the maintenance of cohesion during metaphase despite spindle-mediated forces concentrated at kinetochores/centromeres (Watanabe 2005). Epigenetic regulation of cohesion involves the recruitment of cohesins by HPl proteins (Swi6 in S. pombe), which is in turn mediated by the high concentrations of H3K9 methylation in the pericentromeric heterochromatin (Nonaka et al. 2002). Thus, CEN-spe-
E PIG ENE TIC
REG U LA T ION
aF
C H ROM
a
SaM E I N HER I TAN C E
277
a H3K4me2
--
-
--
-.-
.
- -.
... Figure 7. Distinct Patterns of Histone Modifications in Centromeric Chromatin
b
.
peri centromeric heterochromatin
CENP-A
+ + -. + +
+
+ +++
?
H3
euchromatin
~
H3K4me2 H3K4me3 H3K9me2 & 3 H3 & H4 acetylations cohesin proteins
c inner and outer kinetochore
microtubules cohesin
cific combinations of histone modifications could also be important for the recruitment of cohesion complexes to heterochromatin near sister kinetochores, while ensuring spatial separation of cohesion and kinetochore domains (Sullivan and Karpen 2004; see also Chapter 6). In human and Drosophila mitotic chromosomes, CENP-A subdomains merge to form a 3D cylindrical structure that largely excludes H3 nucleosomes (Blower et al. 2002). Blocks of CENP-A nucleosomes are oriented on the poleward face of the chromosome, and blocks of H3 nucleosomes are located toward the inner chromatid region. Inner and outer kinetochore proteins are wrapped around the CENP-A cylinder; this 3D arrangement is consistent with CENP-A playing a central role in the recruitment of other kinetochore pro-
(0) Immunofluorescence using antibodies that recognize specific histone modifications on extended chromosome fibers showed that the interspersed H3-containing nucleosome blocks have a pattern of modifications that are distinct from canonical euchromatin and heterochromatin (Sullivan and Karpen 2004). For example, despite the fact that centromeres in most eukaryotes are embedded in large blocks of pericentric heterochromatin, the interspersed H3 blocks contain the H3K4me2 modification normally associated with "open" euchromatin (top), and lacks the heterochromatin marker H3K9me2 present in the pericentric flanking regions (bottom). (b) Summary of "2D" organization of centromeric chromatin in interphase based on extended chromatin fiber studies in flies and humans. + and - indicate the presence and absence of the indicated histone modification (respectively) in euchromatin, pericentromeric heterochromatin, and the interspersed blocks of H3 nucleosomes in centromeric chromatin (Sullivan and Karpen 2004; Lam et al. 2006). (c) Model for the 3D organization of chromatin in the centromere region of mitotic chromosomes. Associations between similarly modified nucleosomes are proposed to contribute to the formation of distinct 3D structures in centromeric and flanking chromatin. Interspersed CENP-A/CID and distinctly modified H3 and H4 may mediate formation of the "cylindrical" 3D structures observed in metaphase chromosomes (Blower et al. 2002; Sullivan and Karpen 2004). H3K9me2 chromatin, which recruits heterochromatin proteins such as HP1, and cohesion proteins such as RAD21 /SCC1, is present in the inner kinetochore space between mitotic sister chromatids and in regions that flank centromeric chromatin. This arrangement may be necessary to "present" CENP-A toward the poleward face of the mitotic chromosome and facilitate recruitment of outer kinetochore proteins, and to promote HPl self-interaction and proper chromosome condensation/cohesion. Cohesins are presented as ringed structures, in accord with recent models.
teins (Blower et al. 2002). In order to reconcile the 2D interspersion of CENP-A and H3 blocks (Figs. 6b and 7b) with separation in 3D mitotic chromosomes, it has been proposed that CEN DNA may spiral or loop through the cylindrical structure, leading to alignment or stacking of nucleosomes with the same composition (Fig. 7c). Thus, the distinctly modified interspersed H3 nucleosomes and flanking heterochromatin could be responsible for assembling the 3D structure of CE chromatin in mitosis (Sullivan and Karpen 2004). This arrangement may be necessary to expose CE P-A chromatin to the outside of the chromosome, where it can recruit kinetochore proteins in a manner that establishes proper bi-orientation of sister kinetochores with respect to the spindle poles.
278
C HAP T E R
14
3.4 Models for Centromere Structure, Function, and Propagation
The key question under investigation at this time is how CENP-A, and other epigenetic marks, are specifically localized and propagated at centromeres. Another way to think about this question is to consider how CENP-A is assembled only at centromeric chromatin. One attractive model proposed that differential timing of both CEN DNA replication and CENP-A expression compared to bulk chromatin regulated CENP-A incorporation specifically at centromeres (Ahmad and Henikoff 2001). However, CEN DNA replication in humans and flies occurs throughout S phase, concurrent with bulk DNA replication (Shelby et al. 2000; Sullivan and Karpen 2001), and CENP-A incorporation occurs in the absence of DNA replication (Shelby et al. 2000; Ahmad and Henikoff 2001). These observations rule out a strict replication timing mechanism for propagation of CENP-A and centromere identity. A more attractive mechanism is suggested by the intriguing observation that CENP-A is actively incorporated into nucleosomes in a replication-independent manner by a histone exchange complex (Shelby et al. 2000; Ahmad and Henikoff 2001). H3.3 is an H3 variant whose replication-independent assembly (Ahmad and Henikoff 2002) is mediated by a complex known as histone regulator A (HIRA), and not the chromatin assembly factor (CAF) complexes responsible for replication-dependent incorporation of canonical H3 nucleosomes (Nakatani et al. 2004). Depletion of HIRA components results in CENP-A mislocalization in S. cerevisiae (Sharp et al. 2002). However, it is currently unclear whether HIRA components affect centromeric chromatin in multicellular eukaryotes or s. pombe, where centromere identity is determined epigenetically. More importantly, these proteins play general roles in chromatin assembly and structure, such as H3.3 deposition; thus, the broad activity of the identified HIRA components does not explain the specificity of CENP-A incorporation at centromeres. It is possible that a subset of HIRA complexes contain factors that interact only with CENP-A, and recognize existing CENP-A nucleosomes in replicated CEN chromatin; however, no such specificity factors have been identified. One way to accommodate the involvement of nonspecific assembly factors is to imagine that specificity is provided by CENP-A or CENP-A nucleosomes. For example, the distinct structural relationship between CENP-A and H4 could provide specificity for assembly of new CENP-A at
centromeres, as suggested by the ability of the interacting domain to target CENP-A to centromeres (Black et al. 2004). However, it is unclear whether these domains are sufficient for targeting centromeres in the absence of endogenous CENP-A. New ways of thinking about the epigenetic regulation of centromere identity and propagation are clearly required at this time. In S. cerevisiae, defects in CENP-A proteolysis result in misincorporation into normally non-centromeric regions, which is normally removed by an unknown "clearing" mechanism from everywhere except the endogenous centromere (Collins et al. 2004). This suggests that centromere identity may be regulated at a time subsequent to nucleosome assembly. However, mislocalization of CENP-A in flies results in ectopic kinetochore formation (Heun et al. 2006), suggesting that removal of misincorporated CENP-A may be specific to s. cerevisiae. Nevertheless, variations of this kind of "negative specificity" model are worth considering. The key question for all centromere identity models is, What provides specificity? In this case, Why would proteins such as CENP-A be retained only at one site? One novel idea arises from the fact that stable association with the spindle is one property that is unique to functional centromeres/kinetochores (Mellone and Allshire 2003). Thus, centromere propagation and the site of CENP-A incorporation may be determined during mitosis, utilizing a mechanism that senses productive kinetochorespindle attachments, or spindle-mediated tension. Another idea worth considering is that the modification pattern of interspersed H3 nucleosomes by histone modification proteins (e.g., acetyltransferases, methyltransferases, and kinases) may help propagate centromere identity, in lieu of (or in addition to) CENP-A-associated proteins (Sullivan and Karpen 2004). Distinctly modified interspersed H3 subdomains (Fig. 7) could create a "permissive" chromatin structure necessary for the assembly of new CENP-A. Identification of factors required for CENP-A deposition at centromeres, without bias for a particular model, is a strategy that is likely to provide new insights. Biochemical and genetic studies have identified some as affecting CENP-A signals at centromeres, including previously known factors involved in replication-independent chromatin assembly. However, none of the factors identified to date interacts specifically with CENP-A or other centromeric chromatin proteins or modifications. Nevertheless, it is exciting that factors are being identified, and elucidating specific mechanisms should soon follow.
E PIG ENE TIC
REG U L A T ION
3.5 Epigenetics and Centromere Evolution
Given the importance of centromeres to cell and organismal viability, there should be no room for gain or loss of centromere function. Then why would centromeres utilize epigenetic mechanisms of regulation if there are significant advantages for the individual cell and organism to contain centromeres "hard-wired" into the primary DNA sequence? A strong argument can be made that epigenetic regulation of centromere identity is necessary to accommodate changes occurring to chromosomes, sequences, and proteins during evolution. Studies in mammals (e.g., primates and marsupials), insects, and other taxa have shown that centromere gains and losses are a hallmark of chromosome evolution (Ferreri et al. 2005). Related species frequently differ in the arrangement and association of chromosome arms, even when the DNA sequences are nearly identical. These centromere gains and losses frequently accompany, and arguably are mandated by, translocations and other rearrangements. For example, the requirement for one and only one centromere would render many of the resulting
a
b
W
1---=
/- pro-B cells, indicating that Pax5 functions downstream of E2A and EBFl in B-cell development (Nutt et al. 1997; Busslinger 2004). Consistent with this observation, Pax5 transcripts are absent in E2A-/- or EBFr'- progenitors (Ikawa et al. 2004; Medina et al. 2004). E2K'- progenitors express EBFl at a low level (Ikawa et al. 2004), whereas E2A is normally transcribed in EBFr'- progenitors (Medina et al. 2004). Moreover, retroviral restoration of EBF1 expression in E2A-/- progenitors activates Pax5 transcription, thereby initiating pro-B-cell differentiation (Seet et al. 2004). Hence, the three transcription factors promote early B-cell development in the genetic hierarchy E2A-EBFI-Pax5. Within the B-lymphoid lineage, EBFl expression is, however,
Cd79a -
21
« 'i'
9
me
me
promoter
\' '( 5' '(
4( me
me me
...
EBF1
me
!
me
'( me
5'
me
Runx1
!
assembly of functional promoter
EBF1
E2A
Runx1
+,1
'(
4(
me
me
DNA demethylation and chromatin remodeling
E2A
-----'
'i' me
'(
'(
me
me
«
me
Figure 6. Epigenetic Activation of the Cd79a Gene in Early lymphopoiesis A schematic diagram of the Cd79a (mb-l, Iga) promoter is shown together with the CpG methylation (me) pattern and sequential binding of the different transcription factors during the transition of HSCs to committed pro-B cells (Maier et al. 2003).
E PIG ENE TIC
E2A and Pax5 contribute to the local formation of open acetylated chromatin through their interaction with SAGA and p300/CBP histone acetyltransferase (HAT) complexes (Massari et al. 1999; Barlev et al. 2003).
3 Epigenetic Control of Antigen Receptor Diversity 3.1 Developmental Regulation of Antigen Receptor Gene Rearrangements The guiding principle of the acquired immune system is that every newly generated lymphocyte recognizes a unique antigen and that the overall diversity of lymphocytes is great enough to counteract any possible antigen. To this end, Band T cells express lineage-specific antigen receptors that mediate antibody-dependent humoral or cellular immunity, respectively. The BCR consists of the immunoglobulin heavy chain (IgH) and an IgK or IgA. light chain (lgL). T cells of the a~ lineage, which comprise the majority of T lymphocytes in mouse and man, express the T-cell receptor (TCR) ~ polypeptide in association with TCRa, while the functionally distinct y8 T cells contain TCRy paired with TCR8 on their cell surface. These antigen receptor proteins are encoded by large gene loci containing discontinuous variable (V), diversity (D), and joining (J) gene segments, which are assembled by V(D)J recombination into a functional gene during lymphocyte development. The multiplicity of D, 1, and especially V gene segments, combined with the randomness of their recombination, is responsible for the virtually unlimited diversity of the immune repertoire (Bassing et al. 2002). The mechanics of V(D)J recombination at the DNA level is rather simple. All V, D, and J gene segments are flanked by recombination signal sequences (RSSs), which consist of relatively conserved heptamer and nonamer elements separated by a spacer of either 12 or 23 bp. The lymphoid-specific recombinase proteins RAGland RAG2, assisted by high-mobility group proteins, assemble 12-bp and 23-bp RSSs into a synaptic complex and then generate double-strand DNA breaks between the RSSs and coding segments. These DNA breaks are subsequently processed and religated by ubiquitous repair factors of the nonhom*ologous end-joining machinery to form coding and signal joints (Bassing et al. 2002). The simplicity of the V(D)J recombination process at the DNA template level poses logistic problems for the assembly of the different antigen receptors, because the RAG proteins are expressed in all immature Band T lymphocytes. Hence, stringent regulation must be in place to restrict the access of RAG proteins to only specific subsets
CON T R a L
a FLY MPH a P a I E SIS
•
403
of all the recombination substrates (Yancopoulos and Alt 1985; Stanhope-Baker et al. 1996). V(D)J recombination is tightly controlled in a lineage- and stage-specific manner. Within the B-Iymphoid lineage, the IgH locus is rearranged in pro-B cells prior to recombination of IgK and IgA. genes in pre-B cells, whereas the TCR~ and TCRa genes are rearranged in pro-T and pre-T cells, respectively. Moreover, V(D)J recombination of the IgH gene occurs in a defined temporal order with DH-J Hrearrangements preceding VH-DJ Hrecombination. Rearrangements of the TCR~ locus also proceed in the same order (D~-J~ before V~-Dh) during pro-T-cell development. Control mechanisms must therefore exist to shield all V genes from RAG-mediated cleavage during D-J recombination and to facilitate rearrangement of only one out of a hundred V genes during V-DJ recombination. Consequently, the process of antigen receptor generation entirely depends on accurate regulation of the accessibility of RSSs for the RAG 1/2 recombinase. Successful V-DJ recombination of the IgH or TCR~ gene leads to expression of the Igf.! or TCR~ protein as part of the pre-BCR or pre-TCR complex, which acts as an important checkpoint to inhibit V-DJ recombination of the second DJ-rearranged allele and to promote development to pre-B or pre-T cells that initiate IgL or TCRa gene rearrangements, respectively. Finally, the expression of a signaling-competent BCR or TCR completely arrests V(D)J recombination by transcriptional repression of the RAGi/2 genes in immature B or T cells (Jankovic et al. 2004). Signaling of an autoreactive BCR can, however, restart immunoglobulin light-chain gene rearrangement, which results in the generation of a BCR with a novel antigen specificity (receptor editing; Jankovic et al. 2004). Moreover, signaling of the cytokine IL-7 is essential for promoting recombination of the TCRy gene in pro-T cells (Schlissel et al. 2000). Hence, V(D)J recombination is controlled not only intrinsically by developmental and lineage-specific nuclear mechanisms, but also extrinsically by signals generated at the cell surface. The developmental and locus-specific constraints on V(D)J recombination are largely imposed at the epigenetic level (Krangel 2003). In non-lymphoid cells, the Ig and TCR genes are present in inaccessible chromatin, as exogenously expressed RAG proteins readily cleave transfected episomal recombination substrates but not endogenous antigen receptor genes in kidney cells (Romanow et al. 2000). Moreover, recombinant RAG proteins added to isolated lymphocyte nuclei can only cleave the Ig or TCR gene that is actively undergoing V(D)J recombination at the developmental stage used for
404 •
C HAP T E R
2 1
nucleus preparation (Stanhope-Baker et al. 1996). Hence, the lineage specificity and temporal ordering of gene rearrangements is caused by the sequential opening of local chromatin that renders specific RSSs accessible to the V(D)J recombinase. The ability of chromatin to both protect RSSs and to direct their cleavage suggests the existence of a "chromatin code" that marks the sites of recombination and/or facilitates RAG-mediated cleavage. Acetylation of histones on lysine residues is not only a characteristic feature of open chromatin, but also plays an important role in determining the chromatin accessibility of Ig and TCR loci, as it demarcates domains of recombination-competent gene segments (McMurry and Krangel 2000; Chowdhury and Sen 2001). Analysis of the histone acetylation state has revealed a stepwise activation of discrete chromatin domains in the IgH locus (Chowdhury and Sen 2001). A 120-kb genomic region encompassing the D H, JH, and C~ gene segments is first hyperacetylated prior to V(D)J recombination. DH-J H rearrangements subsequently induce histone acetylation and rearrangements of the DH-proximal VHgenes (Chowdhury and Sen 2001). Finally, the distal 2-Mb domain containing the majority of VHgenes appears to be activated by IL-7 signaling (Chowdhury and Sen 2001). Detailed analysis of the TCRa/8 locus in developing T cells also revealed a complete overlap between regions displaying histone H3 hyperacetylation and accessibility to the V(D)J recombinase (McMurry and Krangel2000). Hence, histone acetylation appears to be an essential part of the chromatin modification pattern that controls the initiation and/or progression of recombination (Krangel 2003). Acetylation per se is, however, insufficient to facilitate recombination, as inhibitors of histone deacetylases have little impact on V(D)J recombination in vivo (McBlane and Boyes 2000). Furthermore, a striking dichotomy between high levels of histone H3 acetylation and poor V(D)J recombination has been observed in pro-B cells (Hesslein et al. 2003; Su et al. 2003). Normal levels of histone acetylation in the DH-distal VHgene cluster fail to support distal VH-DJH recombination in pro-B cells lacking the histone lysine methyltransferase (HKMT) Ezh2 that trimethylates histone H3 at lysine 27 (H3K27me3) (Su et al. 2003). The observation that higher levels ofH3K27me3 are associated with distal compared to proximal VHgenes suggests a domain-specific role of H3K27 methylation in VHgene recombination (Su et al. 2003). The selectivity of Ezh2-mediated regulation for the IgH locus is underscored by the equal efficiency of TCR~ gene recombination in wild-type and Ezh2-deficient pro-T cells (Su et al. 2005). Hence, additional chromatin modifications are likely to be
involved in controlling V(D)J recombination of proximal VH genes in pro-B cells and TCR~ rearrangements in pro-T cells. Dimethylation of histone H3 on lysine 4 (H3K4me2) is an active histone mark, which also correlates with the accessible state of IgH and TCR~ gene segments (Morshead et al. 2003). In contrast, dimethylation ofH3 on lysine 9 (H3K9me2) is a repressive chromatin mark that inversely correlates with V(D)J recombination of IgH and TCR~ gene segments (Morshead et al. 2003). An essential role for H3K9me2 in suppressing recombination was recently demonstrated by targeting the H3K9 HKMT G9a to the PD~l germ-line promoter of a TCR~ minilocus, which prevented V(D)J rearrangements by rendering the local chromatin inaccessible (Osipovich et al. 2004). The histone modification pattern facilitating V(D)J recombinase access must be established by processes that occur within the antigen receptor loci prior to rearrangement. Before the mapping of histone modifications became experimentally feasible, it was already known that germ-line transcription of short sense RNA from unrearranged gene segment precedes V(D)J recombination (Yancopoulos and Alt 1985). A possible role of transcription in controlling locus accessibility was furthermore supported by findings demonstrating that enhancers and promoters located within the antigen receptor loci are essential for V(D)J recombination to occur. Deletion of endogenous enhancers and promoters reduces or abolishes V(D)J recombination of antigen receptor loci, whereas the insertion of additional lineage-specific enhancers leads to a novel V(D)J recombination pattern (Bassing et al. 2002; Krangel 2003). Numerous promoters, associated with V, D, and J segments, control rearrangements of promoter-proximal sequences within relatively short distances, whereas enhancers exert long-range control ofV(D)J recombination (Bassing et al. 2002; Krangel 2003). The assembly of a pre-initiation complex at a promoter may locally disrupt nucleosomes and thereby facilitate access to recombination enzymes, even in the absence of histone modification changes. More likely, however, promoters actively contribute to the establishment of a recombination-permissive chromatin structure, as the elongating RNA polymerase II complex carries its own histone acetyltransferase that may help to spread histone acetylation along transcribed regions (Orphanides and Reinberg 2000). Gene transcription also results in local exchange of the replication-dependent histone H3 by the replacement variant H3.3, which has been implicated in maintaining the accessible chromatin state of transcribed regions (Chow et al. 2005; Mito et al. 2005).
E PIG ENE TIC
As mentioned above, every antigen receptor locus contains hundreds of RSSs, although only a few of them will be cleaved in an inclividuallymphocyte at a defined developmental stage. It is thus conceivable that DNA sequence variations of individual RSS sites may also contribute to their cleavage efficiency. The analysis of artificial V(D)J recombination substrates indeed demonstrated that naturally occurring differences in RSS heptamer and nonamer elements, as well as in the less well conserved spacer and flanking coding sequences, influence the recombination frequency and thus contribute to the differential usage of particular V, D, and J gene segments in the primary antigen receptor repertoire (Lee et al. 2003). In the framework of a "histone code"-centric model, the cleavage selectivity should be determined by a process that would translate the unique features of individual RSSs or adjacent sequences into a specific histone modification pattern marking the site for RAG-mediated cleavage. This code may be established with the help of antisense transcripts. Indeed, antisense intergenic transcription throughout the entire VHgene cluster precedes VH-DJH recombination of the IgH locus in pro-B cells (Bolland et al. 2004). These long antisense transcripts may direct chromatin remodeling to open up the large VHgene domain prior to recombination. Alternatively, these antisense transcripts could form double-stranded RNA hybrids with the short sense germ-line VH transcripts and then be processed by the RNA interference machinery to generate microRNAs that recruit HKMTs to the recombination sites (Bolland et al. 2004; see Chapter 8 for detail on the RNAi machinery). As an extension of this hypothesis, we speculate that specific sense germ-line transcription of a defined RSS site may generate double-stranded RNA, which could target histone-modifying enzymes to this but not other RSS sequences. If experimentally verified, this hypothetical mechanism could account for the precision and selectivity of RAG-mediated cleavage of individual RSS sites. Interestingly, the RAG2 protein was recently shown to directly interact with histones and could thus play an important role in reading the specific histone modification pattern at individual RSS sequences (West et al. 2005). 3.2 5ubnuclear Relocation of Immunoglobulin Genes
The nuclear periphery and pericentromeric heterochromatin are two major repressive compartments in the nucleus that are important for propagating the inactive state of genes in hematopoietic cells (Brown et al. 1997; Baxter et al. 2002). Depending on their activity state, genes are repositioned between these repressive compartments
CON T R 0 L
0 FLY MPH 0 POI E 5 I 5
•
405
and central nuclear positions that facilitate gene transcription (Brown et al. 1997; Baxter et al. 2002). Interestingly, the IgH and IgK loci are located in their default state at the nuclear periphery in all non-B cells, including uncommitted lymphoid progenitors (Kosak et al. 2002). The IgH locus is thereby anchored via the distal VH genes at the nuclear periphery and is oriented with the proximal IgH domain toward the center of the nucleus, which facilitates DH-J H rearrangements in lymphoid progenitors (f*cka et al. 2004). An initial step of IgH locus activation consists of relocation of the IgH and IgK loci from the nuclear periphery to more central positions within the nucleus at the onset of B-cell development (Kosak et al. 2002). This subnuclear repositioning likely facilitates chromatin opening and germ-line transcription, leading to proximal VH-DJ H rearrangements. Circ*mstantial evidence suggests a role for EBF1 and Pax5 in the central relocation of IgH and IgK loci, respectively (f*cka et al. 2004; Sato et al. 2004). Although both alleles of the IgH and IgK loci are repositioned together to central nuclear positions in pro-B cells (Kosak et al. 2002; f*cka et al. 2004), the two alleles behave differently following successful V(D)J recombination in mature B cells (Skok et al. 2001). Following B-cell activation, the productively rearranged Ig alleles remain positioned away from centromeric clusters, thus reinforcing their expression (Skok et al. 200l). At the same time, the nonfunctional IgH and IgK alleles are relocated to, and thus silenced at, centromeric heterochromatin following B-cell activation. Interestingly, the centromeric recruitment of nonfunctional IgH and IgK loci occurs via their distal V gene region, suggesting that the same DNA sequences are involved in the recruitment of silent Ig loci to either the nuclear periphery or centromeric heterochromatin (Roldan et aI. 2005). 3.3 Locus Contraction of Immunoglobulin Genes
The approximately 200 V H genes of the IgH locus are spread over a 2.4-Mb region and can be divided into 15 distal, central, or proximal VHgene families according to their sequence similarity and position relative to the proximal D H segments. In non-B-lymphoid cells and lymphoid progenitors, the two IgH alleles are present in an extended conformation at the nuclear periphery (Kosak et al. 2002). In contrast, the IgH locus undergoes long-range contraction in committed pro-B cells, which juxtaposes distal VHgenes next to the rearranged proximal DJ H domain, thus facilitating VH-DJ H rearrangements (Fig. 7) (Kosak et al. 2002; f*cka et al. 2004). The IgK locus with its approximately 140 V genes also K
406
II
C HAP T E R
2 1
extends over a 3-Mb region and is thus as large as the IgH locus. Similar to the IgH gene in pro- B cells, the IgK locus undergoes contraction in small pre-B and immature B cells, demonstrating that both Ig loci are in a contracted state in rearranging cells (Roldan et al. 2005). Fluorescent in situ hybridization (FISH) analysis with distal, central, and proximal gene probes, furthermore, demonstrated that looping of individual Ig subdomains is responsible for long-range contraction of the IgH and IgK loci (Fig. 7) (Roldan et al. 2005). Distal VH-DJH rearrangements do not take place in paxy/- pro- B cells (Nutt et al. 1997) despite the fact that the VHgenes are accessible in a hyperacetylated chromatin state along the entire VHgene cluster including the most distal VHJ558 family (Hesslein et al. 2003). The failure of distal VH-DJ Hrearrangements correlates with the absence of IgH locus contraction (Fig. 7), which can, however, be restored by retroviral Pax5 expression in Paxy/- pro-B cells (f*cka et al. 2004). Hence, Pax5 is a key regulator of IgH locus contraction in pro- B cells. The histone methyltransferase Ezh2 has also been implicated in IgH locus contraction, as conditional Ezh2 inactivation in HSCs results in a reduction of distal VH-DJ H rearrangements despite full chromatin accessibility of distal VHgenes in Ezh2-deficient pro- B cells (Su et al. 2003). Interestingly, there is no genetic relationship between Pax5 and Ezh2 despite the similar IgH rearrangement phenotype of the respective mutant pro-B cells (f*cka et al. 2004). It is therefore possible that Pax5 functions as a sequence-specific targeting factor to recruit the Ezh2-containing Polycomb repressive complex 2 (PRC2) to selected regions in the IgH locus. The resulting methylation of local chromatin at histone H3 on lysine 27 (H3K27me3) may attract the PRCI complex to induce chromatin compaction of the targeted regions (Francis et al. 2004; discussed in Chapter 11), thus leading to looping and
contraction of the IgH locus. Alternatively, locus contraction may not depend on histone modifications in the nucleus, but rather requires lysine methylation of signaling proteins by the cytoplasmic Ezh2-containing methyltransferase complex, which is known to regulate actin polymerization by binding to the GTP/GDP exchange factor Vavl (Su et al. 2005). 3.4 Control of Allelic Exclusion at the IgH and IgK Loci
Allelic exclusion ensures the productive rearrangement of only one of the two Ig alleles, which leads to the expression of a single antibody molecule with a unique antigen specificity in B cells. The process of allelic exclusion can be divided into two distinct steps. During the initiation phase, one of the two Ig alleles is selected by differential epigenetic marking to rearrange first, which precludes simultaneous recombination of the two alleles. Expression of the productively rearranged allele subsequently prevents recombination of the second allele by feedback inhibition, thus maintaining allelic exclusion. The process of allelic exclusion is already initiated in early development at the time of implantation, when the two alleles of the antigen receptor genes start to replicate asynchronously in each cell (Mostoslavsky et al. 2001). The paternal or maternal Ig gene, which is stochastically selected for early replication by a so-far-unknown chromosomal mark, is almost invariably the first allele to undergo rearrangements in immature B lymphocytes (Mostoslavsky et al. 2001). The second VH-DJH rearrangement of the IgHlocus is thereby the regulated step, as DH-J H recombination occurs on both IgH alleles during pro-B-cell development (Bassing et al. 2002). However, nothing is yet known about how the allele-specific epigenetic mark (established in the early embryo) is translated into sequential activation of VH-DJHrecombination at the two IgH alleles. Successful
proximal domain
PaxS-I - pro-B Ezh2-1- pro-B 2.
1.
rearrangement
wild-type pro-B cell distal rearrangement
Figure 7. Contraction of the Immunoglobulin Heavy-Chain Locus in pro-B Cells The IgH locus consists of a proximal domain containing diversity (0), joining (J), and constant (C) gene segments, and a large variable (V) gene cluster with -200 V genes spread over a 204-Mb region. The IgH locus is in an extended configuration in PaxS-I - or Ezh2-1- pro-B cells, which allows V(O)j recombination to take place only in the proximal domain. In wild-type pro-B cells, all VH genes participate in VH-OjH rearrangements due to contraction of the IgH locus by looping.
E PIG ENE TIC
rearrangement of one IgH allele leads to cell-surface expression of the Ig).! protein as part of the pre-B-cell receptor (pre-BCR). This receptor functions as an important checkpoint to signal proliferative expansion of large pre-B cells, to induce subsequent differentiation to small pre-B cells, and to maintain allelic exclusion at the DJ Hrearranged IgH allele (Kitamura and Rajewsky 1992; Bassing et al. 2002). RAG protein expression is rapidly lost upon pre-BCR signaling (Fig. 8), which halts all further V(D)J recombination and prepares the ground for establishing allelic exclusion in large pre-B cells. (Grawunder et al. 1995). Pre- BCR signaling also leads to histone deacetylation and thus reduced accessibility of the VH genes in small pre-B cells, which may be a possible feedback mechanism underlying allelic exclusion (Chowdhury and Sen 2003). A more plausible mechanism is, however, provided
early pro-B
large pre-B
late pro-B
©-@~
small pre-B
~ ~
-
..................... (J) (J)
co
( :...
L...-
:
")
....
---l!
IgH rearrangement
---II
L-I
pre-BCR signaling
L-!
---I
IgL rearrangement
Figure 8. Allelic Exclusion by Decontraction of the IgH Locus in pre-B Cells In early pro-B cells, DH-J H rearrangements occur simultaneously on both IgH alleles, whereas only one allele undergoes VH-DJ H recombination at a time in late pro-B cells. The nuclei of sorted pro-B and pre-B cells were analyzed by three-dimensional DNA-FISH with fluorescent probes from the distal (red) and proximal (green) regions of the IgH locus. The two IgH alleles of the same cell are shown on two representative confocal sections. Pre-BCR signaling results not only in rapid loss of the RAG protein, but also in decontraction of the IgH locus. Although both alleles are decontracted, the IgH locus is fully extended only in the case of the incompletely DJH-rearranged allele. The two signals of the functionally rearranged allele (VDn are separated by a shorter distance due to the deletion of intervening DNA sequences. The FISH data are taken from Roldan et al. (2005).
CON T R 0 L
0 FLY MPH 0 POI E SIS
407
by the rapid reversal of IgH locus contraction in response to pre-BCR signaling, which physically separates the VH genes from the proximal IgH domain (Fig. 8), thus preventing VH-DJ H rearrangement on the second DJ Hrearranged IgH allele (Roldan et al. 2005). Pre-BCR signaling, furthermore, leads to rapid repositioning of the nonfunctional IgH allele to repressive centromeric domains (Roldan et al. 2005). Hence, locus decontraction and centromeric recruitment alter the DJH-rearranged IgH allele during the RAG-free window of pre-BCR signaling in such a way that it can no longer undergo VH-DJ H rearrangement after subsequent re-expression of RAG proteins in small pre-B cells (Fig. 8). The initiation of allelic exclusion at the IgK locus has been extensively studied by investigating the DNA methylation pattern with methyl-sensitive restriction enzymes (Mostoslavsky et al. 1998; Goldmit et al. 2002; 2005), as well as by analyzing heterozygous KO-GFP reporter mice that contain a GFP gene insertion in the J) element of the endogenous IgKlocus (Liang et al. 2004). The IgK locus is heavily methylated at CpG dinucleotides in all non-B and pro-B cells, but becomes specifically demethylated on only one allele in pre-B cells (Fig. 9) (Mostoslavsky et al. 1998; Liang et al. 2004). This monoallelic demethylation precedes rearrangement and is dependent on the activity of both the intronic and 3' Kenhancers (Mostoslavsky et al. 1998). The demethylated allele is present in accessible chromatin, as it is DNase-I-sensitive, hyperacetylated at histones H3 and H4, and positioned away from centromeric heterochromatin in pre-B cells (Fig. 9) (Goldmit et al. 2002, 2005).As a consequence, only the unmethylated IgK allele initiates germ-line transcription and VK-J Krearrangements (Goldmit et al. 2002; Liang et al. 2004), whereas both alleles undergo locus contraction in small pre-B cells (Fig. 9) (Roldan et al. 2005). Surprisingly, the second IgK allele is relocated to centromeric heterochromatin in pre-B cells (Goldmit et al. 2005) similar to the IgH locus (Roldan et al. 2005). This monoallelic centromeric recruitment (Fig. 9) may explain why the DNA-methylated allele is depleted in histone acetylation and is associated with the proteins HP1 y and Ikaros, which are enriched together with histone deacetylase complexes at centromeric heterochromatin (Goldmit et al. 2005). Interestingly, it is the late-replicating IgK allele which is repositioned to the centromeric clusters (Goldmit et al. 2005) in agreement with the finding that the asynchronous replication pattern established already in the early embryo correlates with monoallelic initiation of IgK rearrangements in pre-B cells (Mostoslavsky et al. 2001). Surprisingly, only a very small fraction (5%) of all pre-B cells undergo IgK locus activation in the KO-GFP
408
C HAP T E R 2 1
antigen
... progenitor
...
... pro-B cell
pre-B cell
B cell
Figure 9. Mechanisms Controlling Allelic Exclusion at the 19K Locus Subnuclear relocation, DNA demethylation, and histone acetylation contribute to the selection of one 19K allele for V.-]. recombination in pre-B cells. See text for detailed explanation. The distal V. region (red) and proximal I.-C. domain (green) of the 19K locus are indicated, together with their location relative to the repressive compartments at the nuclear periphery (gray) and centromeric heterochromatin (blue). The locus contraction, DNA methylation (me), and histone acetylation (ac) states of the two 19K alleles are schematically shown for different developmental stages including activated mature B cells.
reporter mice (Liang et al. 2004). On the basis of this result, it was hypothesized that certain transcription factors binding to IgK cis-regulatory elements are present in limiting amounts in pre-B cells and that the cooperative binding of such factors to IgK enhancers is a rare event, occurring stochastically at only one allele. Hence, stochastic enhancer activation by allelic competition for limiting transcription factors may contribute to allelic exclusion at the IgK locus (Liang et al. 2004). Successful rearrangement of one IgK allele leads to cell-surface expression of the BCR, which subsequently maintains allelic exclusion at the second IgK allele by repressing RAGI/2 recombinase expression (Jankovic et a1. 2004). 4 Terminal Differentiation of Mature B Cells 4.1 Plasma Cell Differentiation
Completion of V(D)J recombination and expression of the immunoglobulin (Ig) receptor on the B-cell surface mark the end of the antigen-independent phase of Blymphopoiesis. From this point on, the fate of B cells becomes dependent on antigen-induced receptor signaling (Rajewsky 1996). In the absence of antigen, peripheral B cells are maintained in a resting state where their survival is supported by tonic signals from the cell-surface BCR (Kraus et a1. 2004). A comparison of the chromatin organization in resting and activated B cells shows that Bcell quiescence is characterized by low levels of global histone methylation (Baxter et a1. 2004). The relatively high levels of histone acetylation in quiescent B cells remain stable during cell activation (Baxter et a1. 2004). The global reduction in histone lysine methylation, including the virtual absence of histone H3K9 methylation, corre-
lates with the lack of other hallmarks of constitutive heterochromatin such as Ikaros association and HP1 binding in quiescent B cells (Baxter et a1. 2004). The activation of B cells reinstates the methylation of histones, which leads to an increase of the active H3K4me3 mark on genes required for B-cell function (Pax5) and to a simultaneous increase of the repressive H3K9me2 modification on silent genes (Dntt) (Baxter et al. 2004). Activation-induced chromatin reorganization may maintain B-cell identity during the immune response. However, the B-cell genome must remain amenable to antigen-induced reprogramming, since activated B cells, following antigen encounter, are able to differentiate directly into antibody-producing plasma cells (Calame et al. 2003). Alternatively, antigen stimulation can initiate the germinal center reaction. During this reaction, mature IgM 'OW IgDhig h B cells switch their immunoglobulin isotypes and mutate their Ig genes with the help of activation-induced cytidine deaminase (AID; Honjo et al. 2004). The Ig proteins generated by these processes are perfectly suited for the differentiation and maintenance of memory B cells or the development of plasma cells, which produce antibodies with high affinity for a particular antigen (Honjo et a1. 2004). The timing of germinal center reactions and the conversion of B cells into plasma cells are regulated by two mutually exclusive transcriptional repressors, Bcl6 and Blimp1 (Fig. 10) (Turner et al. 1994; Ye et a1. 1997). Bcl6 is expressed at low levels in mature naive B cells but is rapidly up-regulated in some B cells after antigenic stimulation (f*ckuda et a1. 1997). Cells that do not up-regulate Bcl6 upon antigen encounter differentiate into plasma cells that serve as an initial source of low-affinity antibod-
E PIG ENE TIC
ies (f*ckuda et al. 1997). In contrast, B cells that up-regulate Bcl6 enter the germinal center reaction (f*ckuda et al. 1997) and are maintained as B cells by Bcl6-mediated repression of genes that control plasma cell differentiation (Shaffer et al. 2000). One key target of Bcl6 is the Prdml gene, which encodes the transcription factor Blimpl (Fig. 10). Interestingly, PaxS appears to assist Bcl6 in repressing the Blimpl (Prdml) gene (Fig. 10) (Delogu et al. 2006). However, once Blimpl is expressed, it extinguishes the B-cell transcriptional program, including Bcl6 and Pax5 expression, and simultaneously induces the transcription of plasma-cell-specific genes (Shaffer et al. 2002; Calame et al. 2003). Bcl6 and Blimp 1 use a wide arsenal of repressive mechanisms to inactivate gene transcription. One peculiar aspect of Bcl6 is the utilization of lysine acetylation beyond histone modification to control gene repression (Bereshchenko et al. 2002). Bcl6 interacts with MTA3, a subunit of the corepressor complex Mi-2/NuRD, which is highly expressed in germinal center B cells (Fujita et al. 2004). Association of Bcl6 with the MTA3-containing Mi2/NuRD complex is essential for gene repression, as RNAi-mediated depletion of MTA3leads to the reactivation of Bcl6-repressed target genes in B cells (Fujita et al. 2004). The repression function of the Bcl6/MTA3/ Mi-2/NuRD complex depends on the acetylation status of lysine residues in both Bcl6 and the histones associated with the repressed gene locus. The central domain of Bcl6 needs to be deacetylated to promote its interaction with MTA3, whereas gene repression by the MTA3/Mi2/NuRD complex depends on the class I histone deacetylases HDACI and HDAC2 (Fujita et al. 2004). Bcl6,
MTA3 Bel6 ~Blimpl Pax5
XBPl
germinal center B cell
signaling of high-affinity BCR lineage switch
•
Bcl6
G9a
t--- Blimp1
pax5/XB~P1 plasma cell
Figure 10. Transcriptional Repression Determines the Germinal Center B Cell and Plasma Cell Fates Pax5 and Bc/6 (together with MTA3) regulate the B-cell gene expression program and maintain the GC (germinal center) B-cell fate by transcriptional repression of the plasma cell regulator Blimp1. Strong BCR signaling at the end of GC B-cell development leads to degradation of the Bc/6 protein and concomitant expression of Blimpl, which subsequently represses Bc/6 and Pax5 and, together with XBP1, induces the plasma cell transcription program. Blimpl most likely activates XBP7 expression indirectly as part of the unfolded protein response, by inducing the expression of secreted immunoglobulins. For detailed description, see text.
CON T R a L
a FLY MPH a P a
f E5 I 5
409
furthermore, associates via its amino-terminal POZ domain with the three corepressors SMRT, NCoR, and BCoR in a mutually exclusive manner (Huynh and Bardwell 1998; Huynh et al. 2000). These three corepressors, which additionally interact with the class II enzyme HDAC3 (Huynh et al. 2000), may enhance MTA3-mediated repression of the same Bcl6 target genes or silence a different gene set in germinal center B cells. Antigen stimulation of the high-affinity Ig receptors on germinal center B cells is accompanied by a reduction of the Bcl6 protein level. Receptor activation thereby leads to MAP kinase-induced Bcl6 phosphorylation, which triggers rapid degradation of the Bcl6 protein by the ubiquitin/proteasome pathway (Niu et al. 1998). The drop in Bcl6 levels alleviates Prdml gene repression, resulting in increased Blimp 1 protein expression and subsequent development to plasma cells (Fig. 10) (Shaffer et al. 2000; Calame et al. 2003). Blimpl controls multiple aspects of plasma cell differentiation. First, Blimpl targets the transcriptional core program of B-cell differentiation by repressing PaxS (Fig. 10) (Shaffer et al. 2002), which is essential for the maintenance of B-cell function and identity (Horcher et al. 2001; Mikkola et al. 2002). Second, by down-regulating the expression of other transcription factors (such as Spi-B, EBF1, CIITA, Id3, Oct2, and OBF1), Blimpl indirectly terminates the transcription of genes that code for essential proteins in antigen receptor signaling and antigen presentation (Shaffer et al. 2002). Third, to ensure the resting state of plasma cells, Blimp 1 directly represses c-myc transcription (Shafferet al. 2002). Fourth, the lineage-inappropriate genes, which are repressed by PaxS in B cells, are reactivated upon Blimplmediated down-regulation of PaxS expression in plasma cells (Delogu et al. 2006). Hence, by repressing other repressors, Blimpl may indirectly activate the expression of additional genes with essential plasma cell functions. Fifth, Blimpl is essential for the expression of secreted immunoglobulins (Calame et al. 2003), which accumulate in the endoplasmic reticulum, thereby activating XBP1 expression as part of the unfolded protein response pathway. The transcription factor XBPI regulates antibody secretion and is thus indispensable for plasma cell differentiation (Reimold et al. 2001; Shaffer et al. 2004). Interestingly, Blimpl is also a key determinant of primordial germ cell specification in early embryogenesis, which is discussed in Chapter 20. The mechanism of Blimpl-mediated repression in developing primordial germ cells and plasma cells is largely unknown. It is, however, likely that Blimp 1 uses the same repression principles in plasma cells as in fibroblasts where PRDI-BF1, the
410.
CHAPTER
21
human ortholog of mouse Blimpl, is involved in postinduction repression of the interferon-~ (IFNBl) gene during viral infection (Keller and Maniatis 1991). Several mechanisms account for PRDI-BFl-mediated repression of the IFNBl gene. Binding ofPRDI-BFl is able to displace transcriptional activators from the IFNBl promoter (Keller and Maniatis 1991). In addition, PRDI-BFl interacts with corepressors of the Groucho protein family that employ histone deacetylases as a part of their repression mechanism (Ren et al. 1999). Further silencing is obtained through the association ofPRDI-BFl with the G9a protein (Gyory et al. 2004), which belongs to the subfamily of histone methyltransferases with specificity for H3 lysine 9 (Tachibana et al. 2002). In contrast to Suv39hl, which uses H3K9me3 to build a transcriptionally repressive environment at centromeric heterochromatin (Peters et al. 2001), G9a contributes to H3K9me2 and gene silencing in euchromatic regions (Tachibana et al. 2002). The catalytic activity of G9a is required for the repression function of PRDI-BFI, since a catalytically inactive G9a protein reverses the inhibitory effect of PRDI-BFl on IFNBl transcription (Gyory et al. 2004). Furth@rmore, deletion of the G9a interaction domain prevents H3K9 methylation and transcriptional silencing by PRDI-BFl (Gyory et al. 2004). In view of these data, it is likely that G9a-mediated histone methylation is an essential mechanism by which Blimp1 generates a stable gene expression pattern in plasma cells. Interestingly, the Blimpl protein also contains a SET domain of the PR (RIZ) subfamily. The SET domain of the related RIZI (Prdm2) protein has been implicated in tumor suppression and methylation of histone H3 on lysine 9 (Kim et al. 2003). Hence, it remains to be seen whether the SET domain of Blimpl also contributes to gene repression during plasma cell differentiation. 4.2 Developmental Plasticity of Mature B Cells
The generation of plasma cells is usually considered to be the terminal process of B-cell development, as the expression of immunoglobulin genes is an essential function of both B cells and plasma cells. Interestingly, the immunoglobulin genes are expressed under the combinatorial control of ubiquitous rather than B-lymphoid transcription factors in the two cell types. Apart from immunoglobulin genes, B cells and plasma cells differ radically, however, in their gene expression pattern (Shaffer et al. 2002, 2004). With regard to Pax5 function, the plasma cells even seem to go into reverse gear, as the plasma-cell-specific silencing of Pax5 expression leads to the reactivation of B-lineage-inappropriate genes that are
normally repressed by Pax5 at the onset of B-cell development (Delogu et al. 2006). These gene expression data therefore support the alternative view that the differentiation of antigen-stimulated B cells to plasma cells is a true "lineage" switch. This hypothesis predicts that the developmental potential of mature B cells should be plastic rather than being restricted to the B-lymphocyte fate, which is supported by the following evidence. Ectopic expression of the B-cell transcription factor Bcl6 and its corepressor MTA3 in established plasma cell lines leads to the repression of plasma-cell-specific genes, including the regulatory genes Blimpl and XBPl (Fujita et al. 2004). At the same time, multiple B-cell-specific genes are reactivated, including the Pax5 target genes CIITA and BLNK, and by inference, Pax5 itself (Fujita et al. 2004). Hence, Bcl6 and its partner protein MTA3 are sufficient to reprogram plasma cells to a B-cell fate, at least under the in vitro culture conditions analyzed (Fig. 11). The transcription factor C/EBPa, which is essential for granulocyte development, is exclusively expressed in myeloid progenitors and their differentiated progeny (Akashi et al. 2000). Forced expression of C/EBPa in B lymphocytes from the bone marrow or spleen leads to efficient transdifferentiation of the infected B cells into functional macrophages within 5 days (Xie et al. 2004). C/EBPa thereby activates the myeloid gene program and concomitantly represses B-cell-specific genes by interfering with the transcriptional activity of Pax5 (Xie et al. 2004). Hence, the loss of Pax5 function is likely to facilitate the myeloid lineage conversion of B cells in response to ectopic C/EBPa expression (Fig. 11). Conditional gene inactivation unequivocally identified a critical role for Pax5 in controlling the identity of B cells throughout B lymphopoiesis. Cre-mediated gene deletion in committed pro-B cells demonstrated that Pax5 is required not only to initiate its B-lymphoid transcription program, but also to maintain it in early B-cell development (Mikkola et al. 2002). As a consequence of Pax5 inactivation, previously committed pro-B cells regain the capacity to differentiate into macrophages in vitro and to reconstitute T-cell development in vivo (Fig. 11) (Mikkola et al. 2002). Conditional Pax5 deletion in mature B cells also leads to loss of the Pax5-dependent gene expression program (Horcher et al. 2001; Delogu et al. 2006). More surprisingly, however, the mature Pax5-deleted B cells retrodifferentiate all the way to Pax5 mutant progenitors in vivo following injection into RAG2-deficient mice. These Pax5 mutant progenitors home to the bone marrow, from where they seed the thymus and fully reconstitute Tcell development in the RAG2-deficient host (Fig. 11). The
EPIGENETIC
CONTROL
OF
LYMPHOPOIESIS.
411
Pax5 deletion
e-e-
~ pm-S
-
prs-S
I!\ T cells
~
C/EBPa
macrophages
rearranged Ig genes Figure 11. Developmental Plasticity of B Lymphocytes Ectopic expression of Bcl6 and MTA3 in established plasma cell lines silences the transcription of plasma-cell-specific genes and simultaneously reactivates the expression program of B cells (orange arrow) (Fujita et al. 2004). CD19+ B lymphocytes, which were not further characterized with regard to their developmental stage, undergo rapid transdifferentiation in vitro to macrophages in response to forced C/EBPa expression (red arrow) (Xie et al. 2004). Conditional Pax5 deletion allows committed pro-B cells and even mature B cells to retrodifferentiate in vivo to uncommitted progenitors, which then develop into other hematopoietic cell types in the bone marrow or T cells in the thymus (black arrows) (Mikkola et al. 2002; C. Cobaleda and M. Busslinger, unpublished data). The blue color denotes Pax5 expression during B-cell development.
fact that the corresponding CD4+CD8+ double-positive thymocytes carry IgH and IgK as well as TCRa and TCR~ rearrangements unambiguously demonstrates that mature B cells, following Pax5 loss, can be converted into T cells via retrodifferentiation to an uncommitted progenitor cell stage (c. Cobaleda and M. Busslinger, unpub!.). Hence, Pax5 expression is continuously required to maintain the identity of B lymphocytes from the pro-B-cell to the mature B-cell stage. Based on the analyses of other developmental systems in flies and vertebrates, transcription factors are thought to initiate cell-fate decision by altering gene expression patterns, while the transcriptional state of committed cells is subsequently maintained by epigenetic factors encoded by the Polycomb and Trithorax group genes (Ringrose and Paro 2004; discussed further in Chapters 11 and 12). The permanent requirement of the transcription factor Pax5 could argue against an important role of these epigenetic memory systems in B-cell development. More likely, however, Pax5 may maintain Bcell identity by acting as a crucial recruitment factor to target Polycomb or Trithorax protein complexes to gene regulatory elements.
5 Concluding Remarks
In summary, various epigenetic mechanisms are involved in regulating and guiding lymphocyte development. Of all the different epigenetic regulators, we currently know most about the role of transcription factors,
which control entire gene expression patterns by recruiting chromatin-modifying activities (such as histone acetyltransferases or deacetylases) to gene regulatory elements. Less is known about the control of gene expression by histone methyltransferases, by Trithorax and Polycomb group proteins, or by microRNA and siRNA pathways. Unraveling the role of these regulatory systems will require experimentally engineered conditional gene inactivation, because histone methyltransferases, Trithorax and Polycomb proteins, as well as components of the RNAi machinery, are of fundamental importance not only for lymphopoiesis, but also for embryonic development. Moreover, the development and availability of global ChIP-on-chip technologies will allow high-resolution mapping of epigenetic modifications along entire chromosomes and complex loci (such as the antigen receptor loci) at different stages of lymphopoiesis. These recent advances are likely to provide important novel insight into the epigenetic control mechanisms underlying lymphocyte development. References Adolfsson J., Mansson R., Buza-Vidas N., Hultquist A., Liuba K., Jensen c.T., Bryder D., Yang 1., Borge OJ, Thoren LAM., et al. 2005. Identification of Flt3+ lympho-myeloid stem cells lacking erythromegakaryocytic potential: A revised road map for adult blood lineage commitment. Cell 121: 295-306. Akashi K., Traver D., Miyamoto T., and Weissman 1.1. 2000. A elonogenic common myeloid progenitor that gives rise to all myeloid lineages. Nature 404: 193-197.
412
.. CHAPTER
21
Allman D., Sambandam A., Kim S., Miller J.P., Pagan A., Well D., Meraz A., and Bhandoola A. 2003. Thymopoiesis independent of common lymphoid progenitors. Nature Immunol. 4: 168-174. Barlev N.A., Emelyanov A.V., Castagnino P., Zegerman P., Bannister A.I., Sepulveda M.A., Robert F., Tora L., Kouzarides T., Birshtein B.K., and Berger S.L. 2003. A novel human Ada2 hom*ologue functions with Gcn5 or Brgl to coactivate transcription. Mol. Cell. BioI. 23: 6944-6957. Bassing e.H., Swat W., and Alt F.W. 2002. The mechanism and regulation of chromosomal V(D)J recombination. Cell (supp!.) 109: S45-S55. Baxter I., Merkenschlager M., and Fisher A.G. 2002. Nuclear organisation and gene expression. Curro Opin. Cell BioI. 14: 372-376. Baxter I., Sauer S., Peters A., John R., Williams R., Caparros M.L., Arney K., Otte A., Jenuwein T., Merkenschlager M., and Fisher A.G. 2004. Histone hypomethylation is an indicator of epigenetic plasticity in quiescent lymphocytes. EMBO]. 23: 4462-4472. Bereshchenko O.R., Gu W., and Dalla-Favera R. 2002. Acetylation inactivates the transcriptional repressor BCL6. Nat. Genet. 32: 606-613. Bolland D.J., Wood A.L., Johnston e.M., Bunting S.F., Morgan G., Chakalova L., Fraser P.J. and Corcoran A.E. 2004. Antisense intergenic transcription in V(D)J recombination. Nat. Immunol. 5: 630-637. Brown K.E., Guest S.S., Smale S.T., Hahm K., Merkenschlager M., and Fisher A.G. 1997. Association of transcriptionally silent genes with Ikaros complexes at centromeric heterochromatin. Cell 91: 845-854. Busslinger M. 2004. Transcriptional control of early B cell development. Annu. Rev. Immunol. 22: 55-79. Calame K.L., Lin K.I., and Tunyaplin e. 2003. Regulatory mechanisms that determine the development and function of plasma cells. Annu. Rev. Immunol. 21: 205-230. Chow e.M., Georgiou A., Szutorisz H., Maia e Silva A., Pombo A., Barahona I., Dargelos E., Canzonetta e., and Dillon N. 2005. Variant histone H3.3 marks promoters of transcriptionally active genes during mammalian cell division. EMBO Rep. 6: 354-360. Chowdhury D. and Sen R. 2001. Stepwise activation of the immunoglobulin fL heavy chain gene locus. EMBO ]. 20: 6394-6403. ---.2003. Transient IL-7/IL-7R signaling provides a mechanism for feedback inhibition of immunoglobulin heavy chain gene rearrangements. Immunity 18: 229-241. Dakic A., Metcalf D., Di Rago L., Mifsud S., Wu L., and Nutt S.L. 2005. PU.l regulates the commitment of adult hematopoietic progenitors and restricts granulopoiesis. J. Exp. Med. 201: 1487-1502. Delogu A., Schebesta A., Sun Q., Aschenbrenner K., Perlot T., and Busslinger M. 2006. Gene repression by Pax5 in B cells is essential for blood cell homeostasis and is reversed in plasma cells. Immunity 24: 269-281. Fisher A.G. 2002. Cellular identity and lineage choice. Nat. Rev. Immunol. 2: 977-982. Francis N.J., Kingston R.E., and Woodco*ck e.L. 2004. Chromatin compaction by a polycomb group protein complex. Science 306: 1574-1577. Fujita N., Jaye D.L., Geigerman e., Akyildiz A., Mooney M.R., Boss J.M., and Wade P.A. 2004. MTA3 and the Mi-2/NuRD complex regulate cell fate during B lymphocyte differentiation. Cell 119: 75-86. f*ckuda T., Yoshida T., Okada S., Hatano M., Moo T., Ishibashi K., Okabe S., Koseki H., Hirosawa S., Taniguchi M., et al. 1997. Disruption of the Bcl6 gene results in an impaired germinal center formation. ]. Exp. Med. 186: 439--448.
f*cka M., Skok J., Souabni A., Salvagiotto G., Roldan E., and Busslinger M. 2004. Pax5 induces V-to-DJ rearrangements and locus contraction of the immunoglobulin heavy-chain gene. Genes Dev. 18: 411-422. Goldmit M., Ji Y., Skok I., Roldan E., Jung S., Cedar H., and Bergman Y. 2005. Epigenetic ontogeny of the Igk locus during B cell development. Nat. Immunol. 6: 198-203. Goldmit M., Schlissel M., Cedar H., and Bergman Y. 2002. Differential accessibility at the K chain locus plays a role in allelic exclusion. EMBO J. 21: 5255-5261. Grawunder u., Leu T.M.J., Schatz D.G., Werner A., Rolink A.G., Melchers F., and Winkler T.H. 1995. Down-regulation of RAGl and RAG2 gene expression in preB cells after functional immunoglobulin heavy chain rearrangement. Immunity 3: 601-608. Gyory I., WU J., Fejer G., Seto E., and Wright K.L. 2004. PRDI-BFl recruits the histone H3 methyltransferase G9a in transcriptional silencing. Nat. Immunol. 5: 299-308. Hesslein D.G.T., Pflugh D.L., Chowdhury D., Bothwell A.L.M., Sen R., and Schatz D.G. 2003. Pax5 is required for recombination of transcribed, acetyJated, 5' IgH V gene segments. Genes Dev. 17: 37--42. Honjo T., Muramatsu M., and fa*garasan S. 2004. AID: How does it aid antibody diversity? Immunity 20: 659-668. Horcher M., Souabni A., and Busslinger M. 2001. Pax5/BSAP maintains the identity of B cells in late B lymphopoiesis. Immunity 14: 779-790. Huynh K.D. and Bardwell v.J. 1998. The BCL-6 POZ domain and other POZ domains interact with the co-repressors N-CoR and SMRT. Oncogene 17: 2473-2484. Huynh K.D., Fischle w., Verdin E., and Bardwell v.J. 2000. BCoR, a novel corepressor involved in BCL-6 repression. Genes Dev. 14: 1810-1823. Ikawa T., Kawamoto H., Wright L.Y.T., and Murre e. 2004. Long-term cultured E2A-deficient hematopoietic progenitor cells are pluripotent. Immunity 20: 349-360. Jankovic M., Casellas R., Yannoutsos N., Wardemann H., and Nussenzweig M.e. 2004. RAGs and regulation of autoantibodies. Annu. Rev. Immunol. 22: 485-501. Keller A.D. and Maniatis T. 1991. Identification and characterization of a novel repressor of ~-interferon gene expression. Genes Dev. 5: 868-879. Kim K.e., Geng L., and Huang S. 2003. Inactivation of a histone methyltransferase by mutations in human cancers. Cancer Res. 63: 7619-7623. Kitamura D. and Rajewsky K. 1992. Targeted disruption of fL chain membrane exon causes loss of heavy-chain allelic exclusion. Nature 356: 154-156. Kosak S.T., Skok J.A., Medina K.L., Riblet R., Le Beau M.M., Fisher A.G., and Singh H. 2002. Subnuclear compartmentalization of immunoglobulin loci during lymphocyte development. Science 296: 158-162. Kovanen P.E. and Leonard W.J. 2004. Cytokines and immunodeficiency diseases: Critical roles of the Yo-dependent cytokines interleukins 2, 4,7,9,15, and 21, and their signaling pathways. Immunol. Rev. 202: 67-83. Krangel M.S. 2003. Gene segment selection in V(D)J recombination: Accessibility and beyond. Nat. Immunol. 4: 624-630. Kraus M., Alimzhanov M.B., Rajewsky N., and Rajewsky K. 2004. Survival of resting mature B lymphocytes depends on BCR signaling via the Iga/~ heterodimer. Cell 117: 787-800. Lee A.I., Fugmann S.D., Cowell L.G., Ptaszek L.M., Kelsoe G., and Schatz D.G. 2003. A functional analysis of the spacer of V(D)J recombination signal sequences. PioS BioI. 1: 56-69.
EPIGENETIC
Liang H.E., Hsu L.Y., Cado D., and Schlissel M.S. 2004. Variegated transcriptional activation of the immunoglobulin K: locus in pre-B cells contributes to the aLlelic exclusion of light-chain expression. Cell 118: 19-29. Linderson Y, Eberhard D., Malin S., iohansson A., Busslinger M., and Pettersson S. 2004. Corecruitment of the Grg4 repressor by PUI is critical for Pax5-mediated repression of B-cell-specific genes. EMBO Rep. 5: 291-296. Maier H., Colbert i., Fitzsimmons D., Clark D.R., and Hagman J. 2003. Activation of the early B-cell-specific mb-I (Ig-a) gene by Pax-5 is dependent on an unmethylated Ets binding site. Mol. Cell. BioI. 23: 1946-1960. Maier H., Ostraat R., Gao H., Fields S., Shinton SA, Medina K.L., Ikawa T., Murre e., Singh H., Hardy R.R., and Hagman i. 2004. Early B cell factor cooperates with Runxl and mediates epigenetic changes associated with mb-I transcription. Nat. Immunol. 5: 1069-1077. Massari M.E., Grant P.A., Pray-Grant M.G., Berger S.L., Workman j.L., and Murre e. 1999. A conserved motif present in a class of helixloop-helix proteins activates transcription by direct recruitment of the SAGA complex. Mol. Cell. 4: 63-73. McBlane E and Boyes j. 2000. Stimulation of V(D)i recombination by histone acetylation. Curro BioI. 10: 483-486. McMurry M.T. and Krangel M.S. 2000. A role for histone acetylation in the developmental regulation of V(D)J recombination. Science 287: 495-498. Medina K.L., Pongubala j.M., Reddy K.L., Lancki D.W., DeKoter R., Kieslinger M., Grosschedl R., and Singh H. 2004. Assembling a gene regulatory network for specification of the B cell fate. Dey. Cell 7: 607-617. Mikkola 1., Heavey B., Horcher M., and Busslinger M. 2002. Reversion of B cell commitment upon loss of Pax5 expression. Science 297: 1l0-113. Mito Y, Henikoff i.G., and Henikoff S. 2005. Genome-scale profiling of histone H3.3 replacement patterns. Nat. Genet. 37: 1090-1097. Miyamoto T., Iwasaki H., Reizis B., Ye M., Graf T., Weissman 1.L., and Akashi K. 2002. Myeloid or lymphoid promiscuity as a critical step in hematopoietic lineage commitment. Dey. Cell 3: 137-147. Morshead K.B., Ciccone D. ., Taverna S.D., Allis e.D., and Oettinger M.A. 2003. Antigen receptor loci poised for V(D)i rearrangement are broadly associated with BRG 1 and flanked by peaks of histone H3 dimethylated at lysine 4. Pmc. Natl. Acad. Sci. USA 100: 11577-11582. Mostoslavsky R., Singh N., Kirillov A., Pelanda R., Cedar H., Chess A., and Bergman Y. 1998. K: chain monoallelic demethylation and the establishment of allelic exclusion. Genes Dev. 12: 1801-1811. Mostoslavsky R., Singh ., Tenzen T., Goldmit M., Gabay e., Elizur S., Qi P., Reubinoff B.E., Chess A., Cedar H., and Bergman Y 2001. Asynchronous replication and allelic exclusion in the immune system. Nature 414: 221-225. Niu H., Ye B.H., and Dalla-Favera R. 1998. Antigen receptor signaling induces MAP kinase-mediated phosphorylation and degradation of the BCL-6 transcription factor. Genes Dey. 12: 1953-1961. Nutt S.L., Heavey B., Rolink A.G., and Busslinger M. 1999. Commitment to the B-lymphoid lineage depends on the transcription factor Pax5. Nature 401: 556-562. Nutt S.L., Urbanek P., Rolink A., and Busslinger M. 1997. Essential functions of Pax5 (BSAP) in pro-B cell development: Difference between fetal and adult B lymphopoiesis and reduced V-to-Dj recombination at the IgH locus. Genes Dev. 11: 476-491. Orphanides G. and Reinberg D. 2000. RNA polymerase II elongation through chromatin. Nature 407: 471-475.
CONTROL
OF
LYMPHOPOIESIS
413
Osipovich 0., Milley R., Meade A., Tachibana M., Shinkai Y, Krangel M.S., and Oltz E.M. 2004. Targeted inhibition of V(D)J recombination by a histone methyltransferase. Nat. Immunol. 5: 309-316. Peters A.H., O'Carroll D., Scherthan H., Mechtler K., Sauer S., Schafer e., Weipoltshammer K., Pagani M., Lachner M., Kohlmaier A., et al. 2001. Loss of the Suv39h histone methyltransferases impairs mammalian heterochromatin and genome stability. Cell 107: 323-337. Rajewsky K. 1996. Clonal selection and learning in the antibody system. Nature 381: 751-758. Reimold A.M., Iwakoshi N.N., Manis i., Vallabhajosyula P., SzomolanylTsuda E., Gravallese E.M., Friend D., Grusby M.J., Alt E, and Glimcher L.H. 2001. Plasma cell differentiation requires the transcription factor XBP-1. Nature 412: 300-307. Ren B., Chee K.j., Kim T.H., and Maniatis T. 1999. PRDI-BFlIBlimp-l repression is mediated by corepressors of the Groucho family of proteins. Genes Dey. 13: 125-137. Ringrose L. and Paro R. 2004. Epigenetic regulation of cellular memory by the Polycomb and Trithorax group proteins. Annu. Rev. Genet. 38: 413-443. Roldan E., f*cka M., Chong W., Martinez D., Novatchkova M., Busslinger M., and Skok j.A. 2005. Locus 'decontraction' and centromeric recruitment contribute to allelic exclusion of the immunoglobulin heavy-chain gene. Nat. Immunol. 6: 31-41. Romanow W.i., Langerak A.W., Goebel P., Wolvers-Tettero I.L.M., van Dongen j.i.M., Feeney A.J., and Murre e. 2000. E2A and EBF act in synergy with the V(D)J recombinase to generate a diverse immunoglobulin repertoire in nonlymphoid cells. Mol. Cell 5: 343-353. Sato H., Saito-Ohara E, Inazawa i., and Kudo A. 2004. Pax-5 is essential for K: sterile transcription during IgK: chain gene rearrangement.]. Immunol. 172: 4858-4865. Schlissel M.S., Durum S.D., and Muegge K. 2000. The interleukin 7 receptor is required for T cell receptor y locus accessibility to the V(D)i recombinase.]. Exp. Med. 191: 1045-1050. Seet e.S., Brumbaugh R.L., and Kee B.L. 2004. Early B cell factor promotes B lymphopoiesis with reduced interleukin 7 responsiveness in the absence of E2A.]. Exp. Med. 199: 1689-1700. Shaffer A.L., Yu X., He Y, Boldrick j., Chan E.P., and Staudt L.M. 2000. BCL-6 represses genes that function in lymphocyte differentiation, inflammation and cell cycle control. Immunity 13: 199-212. Shaffer A.L., Lin K.1. Kuo T.e., Yu X., Hurt E.M., Rosenwald A., Giltnane J.M., Yang L., Zhao H., Calame K., and Staudt L.M. 2002. Blimp-l orchestrates plasma cell differentiation by extinguishing the mature B cell gene expression program. Immunity 17: 51--62. Shaffer A.L., Shapiro-Shelef M., Iwakoshi N.N., Lee A.-H., Qian S.B., Zhao H., Yu X., Yang L., Tan B.K., Rosenwald A., et al. 2004. XBPl, downstream of Blimp-I, expands the secretory apparatus and other organelles, and increases protein synthesis in plasma cell differentiation.Immunity 21: 81-93. Skok j.A., Brown K.E., Azuara v., Caparros M.L., Baxter j., Takacs K., Dillon N., Gray D., Perry R.P., Merkenschlager M., and Fisher A.G. 2001. Nonequivalent nuclear location of immunoglobulin alleles in B lymphocytes. Nat. Immunol. 2: 848-854. Smale S.T. 2003. The establishment and maintenance of lymphocyte identity through gene silencing. Nat. Immunol. 4: 607-615. Stanhope-Baker P., Hudson K.M., Shaffer A.L., Constantinescu A., and Schlissel M.S. 1996. Cell type-specific chromatin structure determines the targeting ofV(D)J recombinase activity in vitro. Cell 85: 887-897. Su I.H., Basavaraj A., Krutchinsky A. ., Hobert 0., Ullrich A., Chait B.T., and Tarakhovsky A. 2003. Ezh2 controls B cell development
414.
CHAPTER
27
through histone H3 methylation and Igh rearrangement. Nat. Immunol. 4: 124-13l. Su LH., Dobenecker M.W., Dickinson E., Osler M., Basavaraj A., Marqueron R., Viale A., Reinberg D., Wulfing C, and Tarakhovsky A. 2005. Polycomb group protein Ezh2 controls actin polymerization and cell signaling. Cell 121: 425-436. Tachibana M., Sugimoto K., Nozaki M., Ueda J., Ohta T., Ohki M., f*ckuda M., Takeda N., Niida H., Kato H., and Shinkai Y. 2002. G9a histone methyltransferase plays a dominant role in euchromatic histone H3 lysine 9 methylation and is essential for early embryogenesis. Genes Dev. 16: 1779-179l. Turner CA.]., Mack D.H., and Davis M.M. 1994. Blimp-I, a novel zinc finger-containing protein that can drive the maturation of B lymphocytes into immunoglobulin-secreting cells. Cell 77: 297-306. Vosshenrich CA.J., Cumano A., Muller W., Di Santo J.P., and Vieira P. 2003. Thymic stroma-derived lymphopoietin distinguishes fetal from adult B cell development. Nat. Immunol. 4: 773-779. Waskow C, Paul S., Haller C, Gassmann M., and Rodewald H. 2002.
Viable c_Kit WIW mutants reveal pivotal role for c-Kit in the maintenance of lymphopoiesis. Immunity 17: 277-288. West K.L., Singha N.C, De Ioannes P., Lacomis L., Erdjument-Bromage H., Tempst P., and Cortes P. 2005. A direct interaction between the RAG2 C terminus and the core histones is required for efficient V(D)J recombination. Immunity 23: 203-212. Xie H., Ye M., Feng R., and Graf T. 2004. Stepwise reprogramming of B cells into macrophages. CellIl7: 663-676. Yancopoulos G.D. and Alt EW. 1985. Developmentally controlled and tissue-specific expression of unrearranged VH gene segments. Cell 40: 271-28l. Ye B.H., Cattoretti G., Shen Q., Zhang J., Hawe N., de Waard R., Leung C., Nouri-Shirazi M., Orazi A., Chaganti R.S.K., et al. 1997. The BCL-6 proto-oncogene controls germinal-centre formation and Th2-type inflammation. Nature Genet. 16: 161-170. Zhang S., f*ckuda S., Lee Y., Hangoc G., Cooper S., Spolski R., Leonard W.J., and Broxmeyer H.E. 2000. Essential role of signal transducer and activator of transcription (Stat)5a but not Stat5b for Flt3dependent signaling.]. Exp. Med. 192: 719-728.
c
HAP
T
E
R
22
Nuclear Transplantation and the Reprogramming of the Genome Rudolf Jaenisch 1 and John Gurdon 2 Whitehead Institute, and Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02142-1479 2The Wellcome Trust/Cancer Research UK Gurdon Institute, The Henry Wellcome Building of Cancer and Developmental Biology, University of Cambridge, Cambridge CB2 1QN, United Kingdom J
CONTENTS 1. History, 41 7
4.1
Amphibians, 425
2. Nuclear Transfer Procedures, 41 7
4.2
Mammals, 427
2.1
Amphibians, 417
2.2
Mammals, 417
3. Phenotype of Cloned Animals, 419
5. Epigenetic Memory, 428 6. Medical Implications of Nuclear Transplantation, 429
3.1
Amphibians, 419
6.1
Reproductive Cloning, 430
3.2
Mammals, 420
6.2
3.3
Derivation of Cloned Mammals from Terminally Differentiated Cells, 421
Therapeutic Application of Nuclear Transplantation, 431
6.3
Reproductive Versus Therapeutic Cloning: What Is the Difference? , 432
4. Changes Associated with Nuclear Reprogramming, 425
References, 432
415
GENERAL SUMMARY The body plan of animals is constructed from hundreds of different cell types that perform the various physiological functions of the organism. A key question posed early on was the mechanism of differential gene expression which assures that the appropriate genes are active or silent, respectively, for the normal function of a given differentiated cell. In the early days, before the molecular basis of gene expression was appreciated, it had been hypothesized that the basis for tissue-specific gene expression might be the genetic elimination or permanent inactivation of silent genes from those tissues that do not express silent genes and retention of those that are expressed. Indeed, in some organisms, such as the insect Sciara,
genetic material is eliminated from somatic tissues with the full genetic complement being retained only in the germ-line cells. This raised the question of "nuclear equivalence," i.e., whether the genome of somatic cells retains the full complement of genetic material. The most unbiased approach to this question is nuclear cloning, where the potency of a somatic donor cell nucleus to direct the development of a new organism is tested by transplantation into an enucleated egg. Indeed, the generation of cloned animals from somatic cell nuclei proved beyond doubt that major genetic changes which would prevent a somatic nucleus from generating all tissue types are not part of the normal developmental process.
NUCLEAR
1 History
The first success in transplanting the nucleus of one cell into another in a multicellular organism was obtained by Briggs and King in 1952 (Briggs and King 1952), although nuclear transfer had been achieved before in single-celled organisms, including amoeba, ciliates, and Acetabularia (Gurdon 1964). Briggs and King (1952) obtained normal swimming tadpoles by transplanting the nuclei of a blastula cell into an enucleated egg of Rana pipiens. In this and in subsequent work with R. pipiens, up to 30% of blastula nuclear transfers developed to morphologically normal postneurula stages. The importance of this early success was that it opened the way to testing whether the nuclei of differentiating cells could also support normal development of recipient eggs; that is, substitute for the zygote nucleus of normally fertilized eggs. The next important paper by Briggs and King (1957) reported that very soon after the blastula stage, somatic cell nuclei (in this case of the endoderm) lose their ability to support normal development. Indeed, by the tail-bud stage, endoderm nuclei no longer gave any normal development. Soon after this, successful nuclear transfer was reported in Xenopus (Fischberg et a1. 1958). In this species, a genetic marker was used to prove that nuclear transplant embryos were derived entirely by activity of the transplanted nucleus with no contribution of the egg nucleus, which had been killed by ultraviolet irradiation. Nuclear transplant embryos developed more normally in Xenopus than in Rana, and developed quite rapidly to the adult stage. The first sexually mature adult cloned animals were obtained by the transfer of endoderm nuclei in Xenopus (Gurdon et a1. 1958), and most appeared normal in all respects. For nuclear cloning to succeed, an egg must "reprogram" the somatic donor nucleus to an "embryonic" epigenetic state so that the genetic program required for embryonic development can be activated. It is a major focus of current research to understand the molecular basis of reprogramming in nuclear transplantation experiments. Because embryonic development in amphibians and mammals is very different, each system can provide different insights into the problem of reprogramming in nuclear transplantation experiments. For example, because a great number of the large eggs can be easily obtained in amphibians, this system is particularly useful for biochemical analyses. In contrast, mammals, in particular the mouse, allow the application of tissue culture and genetic approaches as methods of choice. For these reasons, observations in the amphibian and the mammalian systems are complementary and are discussed
TRANSPLANTATION.
417
side by side to emphasize the similarities and the differences in the two systems. First, we describe the phenotype of cloned amphibians and mammals and how it is influenced by the differentiation state of the donor nucleus. This is followed by a description of molecular mechanisms that have been recognized to be important for reprogramming. Finally, the potential implications of the nuclear transplantation technology for therapy are discussed. 2 Nuclear Transfer Procedures 2.1 Amphibians
The ready availability of Xenopus eggs throughout the year, and the easy maintenance of an aquatic animal in the laboratory, have resulted in most amphibian nuclear transfer work, after the initial success with Rana, being undertaken with Xenopus. Two kinds of experiments need to be distinguished according to whether nuclei are injected into enucleated eggs or non-enucleated ooeytes (Fig. 1). Nuclear transfer requires the injection of a complete donor cell whose plasma membrane has been made permeable, either physically, by sucking the cell into a pipette too small for the size of the cell, or chemically, by a short exposure to a membrane-integrating substance. The amount of donor cell cytoplasm introduced with a nucleus is about 10-5 of the egg volume and has no effect. Eggs that have received a transplanted nucleus are cultured in a simple non-nutrient saline solution equivalent to pond water, and therefore develop independently of the culture solution. Nuclear transfer to ooeytes is entirely different. Fullgrown ooeytes taken from the ovary of a female are in the prophase of first meiosis (Fig. 1). About 50 somatic cell nuclei injected into the nucleus (germinal vesicle) of these growing oocytes do not replicate DNA or divide but become increasingly active in transcription over the course of a few days. Ooeytes that have received transplanted nuclei are cultured, like eggs, in a non-nutrient saline medium and undergo no morphological change for the whole of the culture period of 2 weeks or more. 2.2 Mammals
The earliest attempts to clone mammals were performed with rabbit embryos. In these experiments, oocytes were fused with cells from morula-stage embryos, and the resulting triploid clones were seen to undergo a few cleavage divisions (Bromhall 1975). In 1984, McGrath and Solter introduced Sendai-virus-mediated fusion to efficiently transfer donor nuclei into enucleated zygotes, a
418 •
CHAPTER
22
UV enucleation
somatic cell nucleus
~~.~~E~
Egg
Mid blastula (5h)
Unfertilized egg
Swimming tadpole (3 day) New gene expression and new cell types
somatic cell nuclei injected to oocyte germinal vesicle Culture in vitro
Oocyte
No DNA replication or cell division Intense gene transcription
Full grown oocyte
Full grown oocyte (4+ days) New gene expression
Figure 1. Diagram to Show Two Kinds of Nuclear Transplant Ellperiments in Xenopus The upper figure illustrates single nuclear transfers to eggs enucleated by ultraviolet irradiation (UV). A swimming tadpole is formed 3 days after nuclear transfer. The lower figure shows multiple nuclear transfers to growing oocytes in first meiotic prophase. (GV) Germinal vesicle or nucleus of the oocyte. Injected oocytes do not divide, and can be cultured for many days. Injected somatic cell nuclei undergo changes in gene expression during the first 4 days of culture.
method that was used for two types of experiments: (1) Uniparental embryos were generated by replacing the male with a female pronucleus (giving rise to a gynogenetic embryo) or by replacing the female with a male pronucleus (giving rise to an androgenetic embryo) (McGrath and Solter 1984a,b; Surani et a1. 1984). Uniparental embryos invariably failed to develop, providing the first evidence that normal development of mammals crucially depends on parent-specific genomic imprinting. (2) Cloned embryos were produced by transferring nuclei of cleavage-stage donor embryos into enucleated zygotes. None of the reconstructed embryos developed beyond late-cleavage stages, leading to the conclusion that totipotency of the genome is rapidly lost during early development and that cloning of mammals, in contrast to amphibians, may not be possible (McGrath and Solter 1984b). However, results obtained in other mammalian species soon challenged this conclusion. In 1986, Willadsen succeeded in cloning live lambs from 4- to 16-cell donor embryo nuclei, and this was shortly followed by the generation of cloned cattle and pigs (Robi et a1. 1987; Prather et a1. 1989). Why did cloning of farm animals succeed in contrast to the wellcontrolled mouse cloning experiments (McGrath and
Solter 1984b)? A major developmental difference between these animal species is the timing of the transition from maternal control of development (relying on maternally stored RNA) to zygotic control of development (relying on zygotically produced RNA). The major transition appears to occur at the 8- to 16-cell stage for sheep and bovine embryos (Calarco and McLaren 1976; Camous et a1. 1986) but already at the 2-cell stage for mouse embryos (Bolton et a1. 1984). Thus, the time constraints for activating the donor genome may be more relaxed in cloned sheep or bovine embryos than in cloned mouse embryos, where the donor nucleus must be activated soon after nuclear transfer in order for cleavage to proceed. The first successful derivation of cloned animals by somatic cell nuclear transfer, as opposed to nuclear transfer from donor cells of the cleavage embryo, was the generation of sheep from cultured fibroblast donor cells (Campbell et a1. 1996b). This was soon followed by the creation of "Dolly" from a mammary gland donor nucleus (Wilmut et a1. 1997), which constituted the first mammal cloned from an adult donor cell. Since then, a total of 15 mammalian species, including mice, goats, pigs, cows, rabbits, rats, cats, and dogs, have been cloned (for review, see Campbell et a1. 2005).
NUCLEAR
The procedure of mammalian cloning, like amphibian cloning, involves two steps. In most, if not all, successful somatic cell nuclear transfer experiments, the donor nuclei were transferred into enucleated oocytes (in contrast to the early mouse cloning experiments where the cleavage embryo donor nucleus was introduced into enucleated zygotes; McGrath and Solter 1984). In a first step, the metaphase spindle of the recipient egg is removed with a pipette, followed by the transfer of the donor nucleus into the enucleated egg. In most mammals, the donor nucleus is introduced into the egg by electrofusion with the enucleated egg. Mice have particularly fragile eggs, and nuclear transfer in this species is most efficient by physical transfer of the donor nucleus into the egg (Wakayama et al. 1998). As depicted in Figure 2, the donor nucleus is introduced into the enucleated egg by the use of a piezo element that facilitates the penetration of the zona pellucida and the cytoplasmic membrane (Wakayama et al. 1998). Cloned blastocysts either are implanted into the uterus of a pseudopregnant foster mother to generate cloned mice or are explanted in tissue culture to generate nuclear transfer embryonic stem (NT-ES) cells. NT-ES
TRANSPLANTATION.
419
cells are genetically identical to the donor and can, therefore, be used for "customized" cell therapy, also designated as "therapeutic cloning" (see below). 3 Phenotype of Cloned Animals
The derivation of animals by nuclear transfer from somatic donor nuclei is inefficient, and those rare animals that survive to adulthood often display multiple abnormalities. In contrast, when embryonic cells are used as nuclear donors, the survival of both amphibian and mammalian clones is much higher. The following chapter focuses on the relationship between the age of the donor nucleus and its effect on clone survival and juxtaposes the phenotype of amphibian and mammalian clones derived from adult and embryonic donor cells.
3.1 Amphibians When blastula nuclei are transplanted to good quality recipient eggs of Xenopus, over 30% of all such eggs develop into normal tadpoles, and most of these can be reared to fertile adults. As donor cells differentiate, the
a NT-mouse
isolate
~ki~
•
~ remove nucleus
.~=
enucleated oocyte
~.Nh;Q~ "customized" embryonic stem cells
b
Figure 2. Murine Nuclear Transfer by Microinjection Schematic drawing of the nuclear transfer procedure. The inner cell mass (ICM) gives rise to ES cells, when explanting the blastocyst onto irradiated feeder cells. Alternatively, when transferred to a synchronized pseudopregnant female, the blastocyst can generate a mouse. The outer cells of the blastocyst, the trophectoderm (TE), give rise to the extraembryonic tissues (placenta), and the ICM cells generate the embryo. (b) The same steps as in 0 shown by light microscopy. (Reprinted, with permission, from Meissner 2006.)
(0)
enucleation
nuclear transfer NT-mouse
NT-ESCs
420 • C HAP T E R 2 2
normality of development of nuclear transplant embryos decreases (Fig. 3), less rapidly with endoderm donor nuclei than with others. In amphibians, it has been informative to carry out serial nuclear transfers in which donor nuclei are taken from a blastula that has itself resulted from the transfer of a somatic cell nucleus (Fig. 4). This is done because the cells of a first-transfer embryo are often a mosaic of chromosomally normal and abnormal cells due to the difference in DNA replication rate between a somatic cell and an activated egg; serial nuclear transplantation shows the developmental potential of nuclei of a first-transfer embryo that are least damaged by nuclear transfer. Even with the intestinal epithelium of feeding larvae, some normal sexually mature, genetically marked, male and female frogs were obtained (Gurdon and Uehlinger 1966). This result showed that the process of differentiation does not necessarily involve the loss of ability to promote normal development, and hence, the principle of the conservation of the genome as cells differentiate. In nuclear transfer experiments with amphibians, it was not possible to obtain a normal adult animal from the nucleus of another adult. However, morphologically normal tadpoles were obtained from the nuclei of cells from many different adult tissues (Laskey and Gurdon 1970), and these tadpoles contained the normal range of functional specialized cell types. Therefore, cells that are committed to one pathway of differentiation nevertheless contain the genetic potential to promote, in combi-
nation with egg cytoplasm, most kinds of unrelated cell differentiation. The ability of nuclei of one cell type to promote other kinds of cell differentiation may be quantitated by asking to what extent functional muscle and nerve differentiation, as judged by embryos that make swimming movements after stimulation, can be generated from the nuclei of feeding larval intestinal epithelium. Such embryos can often be obtained by the serial transfer of nuclei from first-transfer embryos that are not completely normal (see Fig. 4) (Gurdon 1962). In addition, blastula cells from morphologically defective first-transfer embryos can form muscle when grafted to normal hosts grown from fertilized eggs (Fig. 4) (Byrne et al. 2003). The result shows that up to 30% of the cells of intestinal epithelium can lead, after nuclear transfer, to functional axial muscle cells (Table 1). The range of abnormalities resulting from nuclear transfer in amphibians does not show any consistent pattern that can be related at the morphological level to donor cell origin. Irrespective of the cell type and developmental stage of donor nuclei, nuclear transplant embryos die with a similar range of defects, including incomplete cleavages, failure to gastrulate, defective axis formation, and lack of head structures. The apparently haphazard nature of these defects is not surprising, since it is known that large chromosomal abnormalities are often seen in cells of first-transfer embryos (see Section 4.1, Reprogramming in clones).
3.2 Mammals COMMON ABNORMALITIES IN CLONED ANIMALS 4> Cl
'"
Ul .... .. Ul
.!! ~
4>
0 40
'" "0 a. ..
....
'"
~ ';30 4> c
"0 .-
_...~
:;,"0 C
20
'" 4> 0.1:
:: ; 10 o c '::ff!.:E o 0
e'"
B G
N
TB
HB
9 12
20
26
40
I.
ST
FT
66
144
Donor stage Hours after fertilization
Figure 3. The Survival of Xenopus Nuclear Transfer Embryos Decreases as Donor Nuclei Are Taken from More Specialized Donor Cells Donor stage abbreviations: (B) blastula; (G) gastrula; (N) neurula; (TB) tail bud; (HB) heart beat; (ST) swimming tadpole; (FT) feeding tadpole. (Reprinted, with permission, from Gurdon 1960.)
The majority of cloned mammalian embryos fail to develop soon after implantation. Those that live to birth often display common abnormalities irrespective of the donor cell type (see below; Table 2). For instance, newborn clones are frequently overgrown and show an enlarged placenta, symptoms referred to as Large Offspring Syndrome (Young et al. 1998; Hill et al. 2000; Tanaka et al. 2001). Moreover, neonate clones often suffer from respiratory distress, and kidney, liver, heart, and brain defects. Even longterm survivors can show abnormalities later in life. For example, aging cloned mice frequently become obese, develop severe immune problems, or die prematurely (Ogonuki et al. 2002; Tamashiro et al. 2002). As schematically shown in Figure 5, the two stages when the majority of clones fail are immediately after implantation and at birth. These are two critical stages of development that may be particularly vulnerable to faulty gene expression (see below). However, the generation of adult and seemingly
N U C LEA R
T RAN 5 P LAN TAT ION
E
--------~
421
•
~
Swimming tadpole
d
/
~E~
-----~
Swimming tadpole
Serial nuclear transfer to egg
~G_~E~ \.3J
Nuclear transfer (to egg)
Swimming tadpole
Tissue graft to early gastrula
x~ Uncleaved egg Figure 4. Serial Nuclear Transfers and Grafts in Xenopus For standard first transfers, a single somatic cell nucleus is transplanted to an enucleated egg, which is grown directly to a larva. For serial nuclear transfers, a first-transfer embryo at an early stage is used to provide donor nuclei for a further set of nuclear transfers to eggs. These serial nuclear transfer embryos are grown to the tadpole stage. Last, a first-transfer embryo that is partly defective is used to provide a piece of tissue for grafting to a host embryo reared from a fertilized egg. The graft contributes to part of the resulting tadpole.
healthy adult cloned animals has been taken as evidence that nuclear transfer can generate normal cloned animals, albeit with low efficiency. Importantly, serious abnormalities in cloned animals may often become manifest only when the animals age (Ogonuki et al. 2002; Tamashiro et al. 2002). The stochastic occurrence of disease and other defects at a later age in many or most adult clones implies that compensatory mechanisms which allow the survival of cloned animals do not guarantee their "normalcy." Rather, the phenotype of surviving cloned animals appears to be distributed over a wide spectrum, including abnormalities causing sudden demise at early postnatal age or more subtle abnormalities allowing survival to advanced age (Fig. 5). These considerations illustrate the complexity of defining subtle gene expression defects and emphasize the need for more sophisticated test criteria such as environmental stress or behavior tests. EPIGENETIC VERSUS GENETIC CAUSES
The abnormalities that are characteristic for cloned animals are not inherited by offspring from the clones, indicating that "epigenetic" rather than genetic aberrations are the cause. This is because epigenetic changes, in con-
trast to genetic changes, are reversible modifications of DNA or chromatin that are erased when the genome is passed through the germ line (Ogonuki et al. 2002; Tamashiro et al. 2002). Thus, the problems associated with cloning are due to faulty "epigenetic/genomic reprogramming" of the transplanted donor nucleus rather than to somatic mutations acquired in the somatic donor cells. 3.3 Derivation of Cloned Mammals from Terminally Differentiated Cells STATE OF DONOR CELL DIFFERENTIATION AND THE EFFICIENCY OF NUCLEAR REPROGRAMMING
A question already raised in the seminal cloning experiments with amphibians suggested an inverse relationship between cellular differentiation state of the donor
Table 1. Efficiency of nuclear reprogramming: Xenopus larval endoderm cells First transfers only
muscle and nerve of embryo
15%
First + serial transfers
muscle and nerve of embryo
22%
First + serial transfers with grafts to hosts
muscle and nerve of embryo
30%
422
•
C HAP T E R 2 2
Table 2. Inverse relation between differentiation state of donor cells and reprogramming efficiency Cloned mice (% of implanted blastocysts)
ES cell derivation (% of explanted blastocysts)
Reference
60-80
25-65
1
6-15
10-26
50
2
EC cells
6-15
N.D.
50
3
Sertoli
10-50
6
25
4
Cumulus
10-50
1-3
13-33
5
Fibroblast
10-50
1
13-33
5
NKT
70
4
N.D.
6
S, T cells
4
N.D.
7
7
Blastocysts (% of oocytes)
Donor cell Fertilized egg ES cells
Neurons
15
N.D.
6-28
8
Melanoma
1.5
N.D.
25
9
N.D. indicates not determined. 4 8
References: 1 Wakayama et al. 2005. 2 Wakayama et al. 1999; Rideout et al. 2000; Eggan et al. 2001; Humpherys et al. 2001; 3 Blelloch et al. 2004. Ogura et al. 2000. 5 Wakayama et al. 1998, 2005; Wakayama and Yanagimachi 1999. 6 Inoue et al. 2005. 7 Hochedlinger and )aenisch 2002. Eggan et al. 2004; Li et al. 2004. 9 Hochedlinger et al. 2004.
nucleus and its potency to direct development after transfer into the egg (see above, and Fig. 3). An important issue has been whether the state of donor cell differentiation affects the efficiency of reprogramming also in mammals. As summarized in Table 2, reprogramming can be measured functionally by evaluating clone development at several different levels, including (1) the rate
DEAD
SURVIVORS
II
Implantation
•
Birth
Age of clones
HIGHER
LOWER Degree of abnormality
Figure 5. The Survival of Mammalian Clones The phenotypes of clones are distributed over a wide range of abnormalities. Most clones fail at two defined developmental stages, implantation and birth. More subtle gene expression abnormalities result in disease and death at later ages.
of blastocyst formation following nuclear transfer into the egg, (2) the fraction of cloned embryos surviving to birth or adulthood after implantation into the uterus, and (3) the frequency with which pluripotent embryonic stem (ES) cells can be derived from cloned blastocysts explanted into culture. The efficiency of preimplantation development of reconstructed oocytes into blastocysts is particularly sensitive to experimental parameters such as the cell cycle stage and physical condition of the transferred nucleus, For example, cloning of nondividing donor cells is more efficient than cloning of actively proliferating cells (Campbell et al. 1996a; Cibelli et al. 1998). Thus, as expected from this relationship, eggs reconstructed with donor nuclei from fibroblasts, Sertoli, cumulus, or NKT cells that are in G I or Go of the cell cycle reach the blastocyst stage with relatively high efficiency, in contrast to ES or embryonal carcinoma (EC) cells that are actively dividing with a major fraction of the cells in S phase (Table 2). Due to this experimental variability during cleavage, measuring the fraction of bIastocysts derived from reconstructed oocytes is not a reliable criterion to quantify "reprogram-ability." However, once a cloned embryo has reached the blastocyst stage, the development to birth after implantation into the uterus depends on the differentiation state of the donor nucleus. Cloned embryos derived from embryonic donors such as ES or EC cells develop to term at a 10- to 20-fold higher efficiency than embryos derived from cumulus or fibroblast donor cells
N U C LEA R
(Eggan et al. 2001; Wakayama and Yanagimachi 2001), presumably because the nucleus of an undifferentiated embryonic cell is more amenable to, or requires less, reprogramming than the nucleus of a differentiated somatic cell. This indicates an inverse relationship between the stage of donor cell differentiation and the efficiency of reprogramming. Finally, once an embryo has reached the blastocyst stage, it has a rather consistent probability of giving rise to ES cells, indicating that the derivation of ES cells from explanted blastocysts is much less dependent on the state of differentiation of the donor nucleus (Table 2). CAN NUCLEI OF TERMINALLY DIFFERENTIATED CELLS BE REPROGRAMMED TO TOTIPOTENCY?
In the early amphibian and mammalian cloning experiments, the donor cell populations used for nuclear transplantation were heterogeneous, and it could not be excluded that rare adult stem cells present in the donor population instead of nuclei of the differentiated cells gave rise to the rare surviving clones. For example, the epigenetic state of somatic stem cells may resemble that of embryonic stem cells and may be easier to reprogram and thus may preferentially have generated the surviving clones. To resolve the question whether the nucleus of a terminally differentiated cell could be sufficiently reprogrammed to yield an adult animal, genetic markers were required that would retrospectively identify the donor nucleus of a surviving clone. Such markers were used to demonstrate unambiguously that nuclei from mature
lymph node cells
T RAN S P LAN TAT ION
•
423
immune cells, from terminally differentiated neurons, and from malignant cancer cells can be reprogrammed and generate adult cloned mice.
MONOCLONAL MICE FROM MATURE IMMUNE CELLS
The monoclonal mice were generated from nuclei of peripheral lymphocytes where the genetic rearrangements of the immunoglobulin (Ig) and T-cell receptor (TCR) genes could be used as stable markers revealing the identity and differentiation state of the donor nucleus of a given clone. Because previous attempts to generate monoclonal mice had been unsuccessful, two-step cloning was used to produce first ES cells from cloned blastocysts, and in a second step, monoclonal mice from the cloned ES cells (Fig. 6). Animals generated from a Bor T-cell donor nucleus were viable and carried fully rearranged immunoglobulin or TCR genes in all tissues (Hochedlinger and Jaenisch 2002). As expected, the immune cells of the monoclonal mice expressed only those alleles of the Ig and TCR locus that had been productively rearranged in the respective donor cells used for nuclear transfer, and the rearrangement of other Ig or TCR genes was inhibited. These results unequivocally demonstrated that nuclei from terminally differentiated donor cells can be reprogrammed to pluripotency by nuclear cloning. The frequency of directly deriving cloned embryos from mature Band T cells (instead of the two-step procedure used in our experiments), although difficult to estimate, is likely significantly lower than that
nuclear transfer
blastocyst Figure 6. Two-step Procedure for the Derivation of Monoclonal Mice from Mature Lymphoid Donor Cells
1 b. Derivation of ES cells
from cloned blastocyst explanted blastocyst
2. Derivation of cloned mouse
ES cell
(1) Nuclei from peripheral lymph node cells were transferred into enucleated eggs, and cloned blastocysts were derived. The blastocysts were explanted in vitro, and cloned ES cells were derived. (2) In a second step, monoclonal mice were derived by tetraploid complementation (Eggan et al. 2001; Hochedlinger and jaenisch 2002).
424
•
C HAP T E R 2 2
of deriving clones from fibroblasts or cumulus cells (possibly less than 1 in 2000 operated embryos; Table 2). More recently, terminally differentiated NKT cells were directly cloned (Inoue et al. 2005). NKT cells, like Band T cells, have genetic rearrangements that allow retrospective identification of the differentiation state of the cells. However, although T cells and NKT cells are part of the same cell lineage, their respective nuclear transfer efficiency was significantly different (Hochedlinger and Jaenisch 2002; Inoue et al. 2005). CLONED MICE FROM MATURE OLFACTORY NEURONS
In contrast to B or T cells, nuclei of postmitotic neurons have irreversibly exited the cell cycle as part of their program of differentiation. To assess whether the nucleus of a mature neuron could be reprogrammed to totipotency, fertile cloned adult mice were generated from postmitotic olfactory neurons using a similar approach as used for the generation of the monoclonal mice (d. Fig. 6) (Eggan et al. 2004; Li et al. 2004). As summarized in Table 2, the efficiency of deriving cloned ES cells from olfactory neurons was in the same range as that for nuclei from immune cells. These observations indicate that a postmitotic neuronal nucleus can reenter the cell cycle and can be reprogrammed to pluripotency. In the mouse, each of the two million cells in the olfactory epithelium expresses only one of approximately 1500 odorant receptor (OR) genes, such that the functional identity of a neuron is defined by the nature of the receptor it expresses (this is analogous to monoallelic expression of immune globulin or TCR genes in B or T cells discussed in Chapter 21). One mechanism to permit
olfactory neurons
the stochastic choice of a single olfactory receptor could involve DNA rearrangements. The generation of mice cloned from a mature olfactory neuron made it possible to investigate whether olfactory receptor choice involves irreversible DNA rearrangements. If olfactory receptor choice involved DNA rearrangements, the prediction would be, in analogy with monoclonal mice described above, that a mouse cloned from a P2-expressing neuron would express this receptor in all olfactory neurons and the repertoire of receptor expression might be altered (these would be monosmic mice that can detect only one odorant) (Fig. 7). Alternatively, if OR choice involved a reversible epigenetic mechanism, the cloned animals should have an identical P2 expression pattern to the donor mouse and a normal repertoire of receptor expression. The analysis of olfactory receptor expression showed that the mechanism of receptor choice is fully reversible and does not involve genetic alterations as seen in the maturation of Band T cells (Eggan et al. 2004). CANCER AND THE REVERSION OF THE MALIGNANT STATE BY NUCLEAR TRANSPLANTATION
The cloning of mice from terminally differentiated lymphocytes and postmitotic neurons demonstrated that nuclear transfer provides a tool to selectively reprogram the epigenetic state of a cellular genome without altering its genetic constitution. Cancer is caused by genetic as well as epigenetic alterations, but the impact of epigenetics on the malignant phenotype of a cancer cell has not been defined. Nuclear transplantation of cancer donor cells was used as an unbiased approach to assess the reversibility of the transformed state. Indeed, previous
cloned ES cell
cloned mouse
ALLaN
NT
Prospective (transient/reversible) 0.1 % ON
labeled (P2 neurons)
o
genetic
(monosmic)
0.1% ON
Figure 7. Nuclear Cloning of Mature Olfactory Neurons Mice cloned from mature olfactory neurons had the normal repertoire of olfactory receptor (OR) expression with only 0.1 % of neurons expressing the P2 receptor, as in the donor mice. P2 receptor expression was determined by using donor mice that had a GFP marker gene inserted in the P2 receptor gene. Results demonstrated that the choice of receptor expression is not determined by genetic alteration, but by a reversible epigenetic mechanism.
N U C LEA R
experiments with amphibians showed that nuclei from a kidney carcinoma cell could be reprogrammed to support early development to the tadpole stage (McKinnell 1962). A similar result was obtained in mice where nuclei from a medulloblastoma cell line were able to direct early development, albeit with low efficiency, resulting in arrested embryos (Li et al. 2003). However, these experiments did not unequivocally demonstrate that the clones were derived from cancer cells as opposed to contaminating nontransformed cells. When the nuclei of a variety of tumor cells, including leukemia, lymphoma, breast cancer, and melanoma cells, were transferred into enucleated mouse eggs, most were able to support preimplantation development into normal-appearing blastocysts (Hochedlinger et al. 2004). Therefore, the malignant phenotype of these tumor types could be suppressed by the oocyte environment and permitted apparently normal early development. However, only the genome from a RAS-induced melanoma model gave rise to a cloned ES cell line that was able to differentiate into most, if not all, somatic cell lineages in chimeric mice. However, because of genetic alterations present in the donor cells, all chimeras developed cancer. These findings demonstrated that the cancer nucleus after exposure to the egg cytoplasm directed differentiation of all lineages, indicating that the malignant phenotype of this cancer was largely determined by epigenetic alterations. A different conclusion was derived from the cloning of EC cell donor nuclei. In contrast to the somatic cancer nucleus, the malignant phenotype of the embryonal tumors was caused by genetic alterations, because it was not reversible by exposure to the egg cytoplasm (Blelloch et al. 2004). 4 Changes Associated with Nuclear Reprogramming
The strategies used for early development are very different in amphibians and mammals. For example, cleavage of the frog embryo is rapid, with about 30 minutes per cell cycle, in contrast to the mammalian embryo that has only cleaved once within 24 hours after fertilization. Additionally, the zygotic genome of the frog becomes expressed only after 12 mitotic cycles at the mid-blastula transition, in contrast to the genome of the mouse embryo that is activated at the 2-cell stage. Thus, it may not be surprising that the different developmental strategies used in amphibians and mammals affect reprogramming of the somatic donor nucleus. This chapter contrasts epigenetic reprogramming that takes place in normal development with reprogramming in
T RAN 5 P LAN TAT ION
425
frog and mammalian clones. An interesting question is whether the epigenetic state of the somatic donor nucleus influences gene expression patterns in cloned embryos. This is designated as "epigenetic memory," as discussed later in the chapter. 4.1 Amphibians REPROGRAMMING IN NORMAL DEVELOPMENT
In amphibians, the nuclei and chromosomes of oocytes and eggs are in a state entirely different from those of somatic cells. The germinal vesicle of an oocyte contains the maximally expanded lampbrush chromosomes that are intensely active in transcription (Callan and Lloyd 1960), apparently reflecting not only the high proportion of genes being transcribed, but also the dense packing of RNA polymerases on the DNA of most genes. This exceptional state of transcription is reached during early oogenesis and probably continues in the ovary throughout the life of an adult female. Mature sperm, conversely, are maximally condensed and entirely inactive in transcription. The usual chromosomal histones are replaced in sperm by protamines, which are exchanged in sperm nuclei that have entered an egg at fertilization, and sperm nuclei undergo immensely rapid decondensation within about 20 minutes. In amphibians, there are no equivalent processes to X-chromosome inactivation and imprinting that take place in mammals. A decrease in DNA methylation takes place from fertilization to the mid-blastula transition (5 hours), after which it gradually increases as development proceeds (Meehan 2003). In summary, substantial nuclear reprogramming events take place in normal development during gametogenesis and for a few hours immediately after fertilization. The most obvious change undergone by transplanted nuclei in amphibians is a volume increase and dispersion of chromatin. This takes place more rapidly in the nuclei of embryonic cells compared to those of differentiated or adult cells. In each respect, the transplanted nuclei come to adopt the condition of nuclei normally resident in eggs or oocytes. Changes in nucleic acid synthesis also follow nuclear transplantation. DNA synthesis is rapidly induced by eggs in the nuclei of nondividing cells such as those of adult brain. Nuclear transplant embryos, derived from single nuclear transfers to eggs, synthesize ribosomal RNA and tRNA to the same extent as endogenous nuclei of embryos grown from fertilized eggs. The pattern of gene transcription is changed from that characteristic of donor cells to that of early embryos; for example, all gene transcription is
426 • C HAP T E R 2 2
switched off during cleavage of nuclear-transplant embryos, and is then reactivated in surviving nucleartransplant embryos according to cell type. Muscle genes are expressed in the muscle of nuclear-transplant embryos, even when they were derived from intestine nuclei (Gurdon et al. 1984). In the case of nuclear transfers to oocytes, extensive changes in transcription take place by transplanted nuclei in the absence of any DNA replication. For example, Xenopus kidney-derived nuclei extinguish kidney-specific genes and activate oocyte-specific genes. Some of the newly activated genes are embryo-specific, as is the case for mouse thymus nuclei, which express the stem-cell marker gene Oct-4 but extinguish the thymus-specific gene Thy-l (Byrne et al. 2003). In conclusion, amphibian nuclear transfers to eggs or oocytes show an extensive reprogramming of gene transcription so that somatic cell nuclei (and in the case of eggs, their mitotic progeny) change their transcription to accord with that of the recipient cells.
from partially cleaved first-transfer embryos often yields normal tadpole development (see above). A good explanation for this is that the incubation of somatic cell nuclei in a mitosis-phase extract of eggs greatly increases the abundance of sites of the origin of DNA replication, thereby enabling such nuclei to complete chromosome replication more rapidly than can nuclei from terminally differentiated cells such as erythrocytes (Lemaitre et al. 2005). Two other explanations may help to account for nuclear transplant abnormalities that arise after zygotic transcription starts at the mid-blastula transition. One is the quantitative irregularity of early zygotic gene activation (Byrne et al. 2003), and the other is the persistence of donor-specific gene expression in the incorrect germ line of nuclear transplant embryos (see below). However, it has not been demonstrated that these differences from normal gene expression are directly responsible for the observed developmental abnormalities.
REPROGRAMMING IN CLONES
MECHANISMS OF REPROGRAMMING
It has long been thought that the most likely explanation for the increasing proportion of developmental abnormalities that are seen in amphibian nuclear transfer experiments with more differentiated donor cells relates to incomplete DNA replication. In normal Xenopus development, egg and sperm pronuclei commence chromosome duplication 20 minutes after fertilization, and it is complete 20 minutes later. In contrast, the nuclei of dividing cultured cells take about 6 hours to complete one round of DNA replication. It is not surprising, therefore, that transplanted somatic cell nuclei have often been seen to continue DNA synthesis for much longer than 40 minutes after nuclear injection, and to do so right up to the time when chromosomes condense for the first mitosis. As a result, chromosome replication can be incomplete, and incompletely replicated chromosomes torn apart, as transplanted nuclei are forced into their first mitosis. Broken chromosome fragments have been seen in nuclear transplant embryos (Di Berardino and Hoffner 1970), and this incompatibility between the rate of DNA replication and cell division in zygote as compared to somatic nuclei, resulting in aneuploidy, seems likely to account for many abnormalities of nuclear transplant embryo development, and especially for the high proportion of eggs that fail to undergo any regular cleavage at all; these can constitute up to 75% of all eggs receiving nuclei from nondividing differentiated cells. It has been noticed that the serial transfer of nuclei
The abundance and large size of amphibian eggs and oocytes encourage attempts to understand the molecular basis of reprogramming. A preferred route is to obtain cell-free extracts that can reproduce in vitro the events that follow nuclear transfer to living eggs and oocytes. Depletion of extracts could identify necessary components. This approach has been particularly successful in identifying egg components that initiate DNA synthesis. Notable is the identification of nucleoplasmin , (Laskey et al. 1978; Philpott et al. 1991), an abundant component of Xenopus eggs that can decondense sperm and promote histone protein exchange. These same processes take place when somatic nuclei are added to egg extracts (Dimitrov and Wolffe 1996; Tamada et al. 2006). Other egg extract components that may contribute to the nuclear reprogramming process include the remodeling complex ISWI (Kikyo et al. 2000) and the germ-cell proteins FRGY2 that function to reversibly disassemble nucleoli (Gonda et al. 2003). It has been suggested that by permeabilizing and resealing nuclei in extracts, the remodeling complex BRG-1 may have a role in eggs and early embryos (Hansis et al. 2004). These experiments are not easy to interpret because cell-free extracts are not yet known to be able to initiate transcription of nuclei. Therefore, the treatment of nuclei in vitro, followed by transfer to the living oocyte to test transcription (Byrne et al. 2003; Tamada et al. 2006), is the best that can be done.
N U C LEA R T RAN 5 P LAN TAT ION
At present, it seems that three steps are necessary for successful nuclear reprogramming: (1) the removal of epigenetic marks on DNA or protein that characterize the differentiated state; (2) the provision of necessary transcription factors for those genes that need to be newly expressed; and (3) the decondensation of chromatin, to give transcription factors access to the genes on which they act. 4.2 Mammals
Successive epigenetic reprogramming is an important aspect of normal development (Rideout et al. 2001). Changes of DNA methylation as well as of histones are imposed on the two parental genomes successively during gametogenesis. Following fertilization, the embryo's genome is further modified during cleavage and after implantation. Table 3 summarizes some of the epigenetic differences that distinguish cloned from normal animals as a result of faulty reprogramming. For the following discussion, we highlight the epigenetic differences between fertilized and cloned embryos at different stages of development. The stages of development that are depicted in Table 3 and that are discussed in sequence are (1) gametogenesis, (2) cleavage, (3) postimplantation, and (4) postnatal development. GAMETOGENESIS
The most important epigenetic reprogramming in normal development occurs during gametogenesis, a process that renders both sperm and oocyte genomes "epigenetically competent" for subsequent fertilization and for faithful activation of the genes that are crucial for early development (Latham 1999). In cloning, this process is
•
427
cut short, and most problems affecting the "normalcy" of cloned animals may be due to the inadequate reprogramming of the somatic nucleus following transplantation into the egg. Because the placenta is derived from the trophectoderm lineage that constitutes the first differentiated cell type of the embryo, one might speculate that reprogramming and differentiation into this early lineage are compromised in most cloned animals. Indeed, as summarized below, the fraction of abnormally expressed genes in cloned newborns is substantially higher in the placenta as compared to somatic tissues. CLEAVAGE
During cleavage, a wave of genome-wide demethylation removes the epigenetic modification present in the zygote so that the DNA of the blastocyst is largely devoid of methylation. Between implantation and gastrulation, a wave of global de novo methylation reestablishes the overall methylation pattern, which is then maintained throughout life in the somatic cells of the animal. In cloned embryos, methylation of repetitive sequences is abnormal (Bourc'his et al. 2001; Dean et al. 2001; Kang et al. 2003; Mann et al. 2003). To investigate gene expression, the activity of "pluripotency genes" such as Oct-4 that are silent in somatic cells but active in embryonic cells was examined in cloned embryos. Strikingly, the reactivation of Oct-4 and of "Oct-4-like" genes was shown to be faulty and random in a large fraction of somatic clones (Boiani et al. 2002; Bortvin et al. 2003). Because embryos lacking Oct-4 arrest early in development, incomplete reactivation of Oct-4-like genes in clones might be causal to the frequent failure of the great majority of nuclear transfer embryos to survive the postimplantation period. Moreover, a number of studies have detected abnormal DNA methylation in cloned
Table 3. Normal versus cloned embryos Stage
Normal embryos
Cloned embryos
Gametogenesis
genome "competent" for activation of "early" genes, establishment of imprints
none
Cleavage
global demethylation of DNA
abnormal methylation of DNA
activation of embryonic ("Oct4-like") genes
stochastic / faulty activation of "Oct4-like" genes
Postimplantation
Postnatal
telomere length adjustment
normal
global de novo DNA methylation, X inactivation
abnormal in some cloned animals
normal imprinting and gene expression
abnormal imprinting, global gene dysregulation
normal animal
large offspring syndrome, premature death, etc.
428 • C HAP T E R 2 2
embryos. Although it is still an unresolved question to what extent the epigenetic modification of chromatin structure and DNA methylation, which occurs in normal development, needs to be mimicked for nuclear cloning to succeed, the available evidence is entirely consistent with faulty epigenetic reprogramming causing abnormal gene expression in cloned animals. POSTIMPLANTATION DEVELOPMENT
Following implantation and prior to gastrulation, three key events shape the epigenetic state of the embryo's genome: (1) A wave of global de novo methylation reestablishes the overall methylation pattern that is characteristic of the adult and that is then maintained throughout life in the somatic cells (Dean et al. 2003); (2) dosage compensation in female embryos is accomplished by the random inactivation of one of the two X chromosomes; (3) the telomeres are adjusted to a length that is characteristic of the somatic cells. Because all these events are only initiated in the postzygotic embryo, little disturbance in the regulation of these epigenetic events might be expected in cloned animals. However, lower global methylation levels were seen in cloned bovine fetuses but not in postnatal cows (Cezar et al. 2003). X inactivation was random and undisturbed in healthy but not in abnormal cloned mouse fetuses (Eggan et al. 2000; Senda et al. 2004; Nolen et al. 2005). However, it is not clear whether these disturbances are causally involved in abnormal clone development rather than being a consequence of abnormal reprogramming during preimplantation development. In contrast, telomere length adjustment is faithfully accomplished in cloned cows and mice (Lanza et al. 2000; Tian et al. 2000; Wakayama et al. 2000; Betts et al. 2001) and thus would not be expected to impair survival of cloned animals. POSTNATAL DEVELOPMENT
The most extensive analysis of gene expression has been performed in newborn cloned mice. Expression profiling showed that 4-5% of the genome and 30-50% of imprinted genes are abnormally expressed in placentas of newborn cloned mice (Humpherys et al. 2002; Kohda et al. 2005). This argues that mammalian development is surprisingly tolerant to widespread gene dysregulation and that compensatory mechanisms assure survival of some clones to birth. However, the results suggest that even surviving clones may have subtle defects that, although not severe enough to jeopardize immediate survival, will cause an abnormal phenotype at a later age.
5 Epigenetic Memory
Two kinds of epigenetic modifications to the genome are known to take place in vertebrate development. These include a methylated cytosine in many regions of DNA where a CpG is present, and various modifications of histone tails. These changes are acquired during gametogenesis and early development and are closely associated with the activity or inactivity of genes. It would therefore be expected that these epigenetic modifications would be reversed by nuclear transfer, or if not, that they may help to account for some of the failures of nuclear transplant embryo development. In amphibians, some insight into DNA demethylation has been achieved in experiments where mammalian somatic cell nuclei were injected into Xenopus oocytes (Simonsson and Gurdon 2004). The mouse Oct-4 promoter is methylated in adult thymus cells where Oct-4 is not expressed. However, the promoter region but not the enhancer region of the regulatory part of this gene was demethylated when thymus nuclei were injected into oocytes, a result that shows the selectivity of the demethylation process. When complete nuclei were injected, the demethylation of DNA seemed to precede induced Oct-4 transcription. It is likely that a DNA demethylase activity is a special property of oocytes (see above), and that the demethylation of the promoter DNA of developmentally repressed genes may also be an important and necessary step when somatic cell nuclei are reprogrammed in egg nuclear transfer experiments. Changes in histone modifications have not yet been examined in amphibian nuclear· transfer experiments. Another design of amphibian nuclear transfer experiment has shown that the epigenetic state of somatic cells is by no means always reversed. In view of the failure of nuclear transplants, even from early tail-bud endoderm donors of R. pipiens (above), Briggs and King (1957) asked whether the morphology of abnormal embryos reflected their origin; they described a preferential survival of endoderm tissues in embryos of endodermal nuclear origin and called this an "endoderm syndrome." It was pointed out, however, that the endoderm differentiates later than other germ layers, and that this might account for its better survival (Gurdon 1963). Indeed, Simnett (1964) reported that nuclear transplant embryos of neural origin also showed the same preferential survival of their endoderm. The same question has recently been approached again, using cell-type-specific gene markers. In these experiments (Ng and Gurdon 2005), nuclei from the neurectoderm or endoderm, already expressing the cell-
N U C LEA R
donor embryo Stage 21
Endoderm
~:;~~
~'.
donor region
•.•••••
region analyzed
--.--8--.
•
~
~~ - - • • • - - ~
Stage 26
•
-- •
Neurectoderm Sox2 +
429
Cell fate
Neurectoderm
Endoderm
Endoderm Edd+
~.
Neurectoderm (
NT embryo
nuclear transfer
T RAN 5 P LAN TAT ION
Neurectoderm
Endoderm
Figure 8. Experimental Design to Test Epigenetic Memory of Cell-type-specific Gene Expression Donor nuclei are taken from stage-21 endoderm or stage-26 neurectoderm cells. The resulting nuclear transfer embryos usually form partial blastulae, the normally cleaved parts of which are divided into the future neurectodermal or endodermal regions, and analyzed for gene expression. See Table 4. (Reprinted, with permission, from Ng and Gurdon 2005.)
type-specific markers Sox2 or endodermin, respectively, were transplanted to enucleated eggs; the resulting nuclear transplant embryos were divided into neurectodermal or endodermal parts, and these parts were tested for the same Sox2 or endodermin markers (Fig. 8 and Table 4). It was found that both genes were preferentially expressed in the inappropriate cell type. For example, over half of the embryos of neurectodermal origin overexpressed the neural marker Sox2 in their endoderm cells. In some cloned embryos, there appeared to have been no reduction in the level of Sox2 gene expression compared to that of the donor cells. Transplanted nuclei, like those of normal embryos, are wholly inactive in transcription until the blastula stage. Therefore, remarkably, the active state of gene transcription established in the course of cell differentiation can be maintained in cloned embryos, in the
complete absence of the conditions that induced that gene for more than 12 mitotic cell divisions (from egg to blastula). This striking example of epigenetic memory is seen in some nuclear transplant embryos but is wholly absent in others, in which gene expression has been successfully reprogrammed. 6 Medical Implications of Nuclear Transplantation It is important to distinguish between "reproductive
cloning" and "nuclear transplantation therapy" (also referred to as seNT or therapeutic cloning). In reproductive cloning, an embryo is generated by transfer of a somatic nucleus into an enucleated egg with the goal to create a cloned individual. In contrast, the purpose of nuclear transplantation therapy is to generate an embry-
Table 4. Epigenetic memory of cell-type-specific gene expression Gene expression (%)
Endoderm nuclei (eder)
.-
~ neurectoderm
.-
Sox2
45
5
12
6
22
81
NT embryo endoderm
Neurectoderm nuclei (Sox2")
Edd
~ neurectoderm NT embryo endoderm
The percent values represent the proportion of nuclear transplant embryos (assayed individually by RT-PCR) that express the genes Edd or 50x2 at two or more times greater than the normal (or background) level. (Data from Ng and Gurdon 2005.)
430 •
C HAP T E R 2 2
individual by nuclear cloning. The available evidence suggests that it may be difficult if not impossible to produce normal clones for the following reasons: (1) As summarized above, all analyzed clones at birth showed dysregulation of hundreds of genes. Nevertheless, the development of clones to birth and beyond despite widespread epigenetic abnormalities suggests that mammalian development can tolerate dysregulation of many genes. (2) Some clones survive to adulthood by compensating for gene dysregulation. Although this "compensation" assures survival, it may not prevent maladies that become manifest at later ages. Therefore, most if not all clones are expected to have at least subtle abnormalities that may not be severe enough to result in an obvious phenotype at birth but will cause serious problems later, as seen in aged mice. Different clones may just differ in the extent of abnormal gene
onic stem cell line (referred to as ntES cells) that is "tailored" to the needs of a patient who served as the nuclear donor (Hochedlinger and Jaenisch 2003). The ntES cells could be used as a source of functional cells that would be suitable for treating an underlying disease by transplantation. Figure 9 juxtaposes normal development from a fertilized embryo, reproductive cloning, and therapeutic cloning. 6.1 Reproductive Cloning
As outlined above, all evidence obtained from the cloning of eight different mammalian species indicates that the production of normal individuals by nuclear transfer faces major hurdles. It is a key question in the public debate whether it would ever be possible to produce a normal
Reproductive Cloning
Normal DevelopmentllVF
o
oocyte (1 n)
adult cell (2n)
/
\
I I
ferti lization/IVF
I
I
Q " I
zygote (2n)
I
I I
I I I I I
~
blastocyst
I I
I
0 !
I
I I
patient's cell (2n)
0········ \ I
~
Q
..
enucleated oocyte
•
~
nuclear transfer
NT embryo (2n)
\ I Q ~
~
I I
NT blastocyst
I
I I I I I I I I
implantation in uterus
I
I I
I I
I I
I I
adult mouse
~
enucleated oocyte
Therapeutic Cloning
! !
uterine transfer into surrogate mother
~ cloned mouse
+
@@@ III @
ntES cells
/ I \
in vitro differentiation
t t •• •• ~ ~~
blood cell
~ muscle cells
neurons
Figure 9. Comparison of Normal Development with "Reproductive Cloning" and "Therapeutic Cloning" During normal development (left), a haploid (1 n) sperm cell fertilizes a haploid oocyte to form a diploid (2n) zygote that undergoes cleavage to become a blastocyst embryo. Blastocysts implant in the uterus and ultimately give rise to a newborn animal. During "reproductive cloning" (center), the diploid nucleus of an adult donor cell is introduced into an enucleated oocyte recipient which, after artificial activation, divides into a cloned blastocyst. Upon transfer into surrogate mothers, a few of the cloned blastocysts will give rise to a newborn clone. In contrast, the derivation of ntES cells by nuclear transfer (right) requires the explantation of cloned blastocysts in culture to derive an ES cell line that can be differentiated in vitro into potentially any cell type of the body for research or therapeutic purposes. (Reprinted, with permission, from Hochedlinger and jaenisch 2003 [© Massachusetts Medical Society].)
NUCLEAR
expression: If the key "Oct-4 like" genes are not activated, clones die immediately after implantation. If those genes are activated, the clone may survive to birth and beyond. These considerations argue that cloned animals, even if appearing "normal" at superficial inspection, may not be so but may harbor subtle abnormalities that become phenotypically manifest only at later ages (Jaenisch 2004). These considerations preclude the application of this approach as a potential human reproductive technology. 6.2 Therapeutic Application of Nuclear Transplantation
Immune rejection is a frequent complication of allogeneic organ transplantation due to immunological incompatibility. To treat this "host versus graft" disease, immunosuppressive drugs are routinely given to transplant recipients, a treatment that has serious side effects. Embryonic stem cells derived by nuclear transplantation are genetically identical to the patient's cells, thus eliminating the risk of immune rejection and the requirement for immunosuppression. Most importantly, protocols are being developed that allow the generation of functional cells such as neurons, muscle cells, and islet cells that can be used for therapy of patients afflicted with serious disorders such as Parkinson's, heart failure, or diabetes. Moreover, embryonic stem cells provide a renewable source of replacement tissue, allowing repeated therapy whenever needed. Indeed, the feasibility of therapeutic cloning has been demonstrated in an animal model of disease. For this, a
TRANSPLANTATION.
431
mouse strain carrying a deletion of the Rag2 gene was used as a "patient" (Rideout et al. 2002). Rag2 mutant mice suffer from severe combined immune deficiency (SCID) due to a mutation in the gene catalyzing immune receptor rearrangements in lymphocytes. These mice are devoid of mature Band T cells, a condition resembling a human disorder ("bubble babies"). Figure 10 summarizes the steps involved in this experiment. In a first step, nuclei of somatic (fibroblast) donor cells from the tails of Rag2-deficient mice were injected into enucleated eggs. The resultant embryos were cultured to the blastocyst stage, and autologous ES cells were isolated. Subsequently, one of the mutant Rag2 alleles was targeted by hom*ologous recombination in ES cells to restore normal gene structure. To obtain somatic cells for treatment, these ES cells were differentiated into embryoid bodies (embryo-like structures that contain various somatic cell types) and further into hematopoietic precursors by expressing HoxB4. Resulting hematopoietic precursors were transplanted into irradiated Rag2-deficient animals to treat the disease. The cells generated functional Band T cells which had undergone proper rearrangements of their immunoglobulin and T-cell-receptor alleles as well as serum immunoglobulins in the transplanted Rag2 mice. This experiment demonstrated that nuclear transfer, in combination with gene therapy, can be used to treat a genetic disorder. Consequently, therapeutic cloning should be applicable to other diseases where the genetic lesion is known, such as sickle cell anemia or
Rag2-1-
~ :t-~
8. Expand HSC culture and transPi;nt~
~etailtiPcelis
/
'~~~~)~!;:L!'" fD
-\ /
cif0
2. Nuclear transfer into enucleated
6. Differentiate into EBs \
oocyte
5. Repair Rag2 gene i~n,~ '" ES cells by hom*ologous ~ ..._ recombination
e>
4. Isolate isogenic Rag2-1- ES cells
culture to cloned /,." " " ,"d blastocyst
Figure 10. Scheme for Therapeutic Cloning Combined with Gene and Cell Therapy A piece of tail from a mouse hom*ozygous for the recombination activating gene 2 (Rag2) mutation was removed and cultured. After fibroblast-like cells grew out, they were used as donors for nuclear transfer by direct injection into enucleated Mil oocytes using a Piezoelectric driven micromanipulator. Embryonic stem (ES) cells isolated from the nuclear transfer-derived blastocysts were genetically repaired by hom*ologous recombination. After repair, the ntES cells were differentiated in vitro into embryoid bodies (EBs), infected with the HoxB4iGFP retrovirus, expanded, and injected into the tail vein of irradiated, Rag2-deficient mice. (Reprinted, with permission, from Hochedlinger and jaenisch 2003 [© Massachusetts Medical Society].)
432 • C HAP
T ER 2 2
beta-thalassemia. It is an unresolved issue whether nuclear transplantation in humans will be as efficient as bovine, or as inefficient as murine, cloning. The use of nuclear cloning to generate "customized" ES cells for tissue repair is controversial because the very generation of the ES cells would, so goes the argument, necessarily involve the destruction of potential human life. As a possible solution to this ethical dilemma, altered nuclear transfer (ANT) was suggested as a modification of the nuclear transfer procedure (Hurlbut 2005). ANT involves the disabling of a gene in the somatic donor cells that is essential for placental development and thus prevents the formation of a fetus because the ANT blastocyst would be unable to implant (and thus no potential human life would be destroyed) but would still allow the generation of "customized" ES cells. Using Cdx2 as a target gene, a proof-of-principle experiment in mice verified the ANT approach (Meissner and Jaenisch 2006). It remains to be seen whether the ANT modification will satisfy those who are opposed to the generation of ES cells from cloned human blastocysts.
6.3 Reproductive Versus Therapeutic Cloning: What Is the Difference?
Why is faulty reprogramming problematic for reproductive cloning but not for therapeutic applications? The most important reason for this seeming paradox is that, in contrast to reproductive cloning, the therapeutic use of nuclear transfer does not require the formation of a fetus, relying instead on the direct differentiation of functional cells in culture. Because there is no requirement for the development of a fetus, the functionality of the differentiated cells that result from this process would not be expected to be affected by the disturbed imprinting that contributes substantially to the developmental failure of clones (Jaenisch 2004). Because ES cells derived from fertilized embryos are able to participate in the generation of all normal embryonic tissues, ES cells generated through nuclear transfer should have a similar potential to generate the full range of normal tissues. Indeed, all the available evidence is consistent with the conclusion that ES cells derived from cloned embryos are biologically and molecularly indistinguishable from ES cells derived from fertilized embryos (Brambrink et al. 2006). Thus, if human ES cells derived from IVF embryos are appropriate to treat diseases, so are "customized" ES cells derived by nuclear cloning from the cells of a patient.
References Betts D., Bordignon v., Hill J., Winger Q., Westhusin M., Smith L., and King W. 2001. Reprogramming of telomerase activity and rebuilding of telomere length in cloned cattle. Proc. Natl. Acad. Sci. 98: 1077-1082. Blelloch R.H., Hochedlinger K., Yamada Y., Brennan c., Kim M., Mintz B., Chin L., and Jaenisch R. 2004. Nuclear cloning of embryonal carcinoma cells. Froc. Natl. Acad. Sci. 101: 13985-13990. Boiani M., Eckardt S., Scholer H.R., and McLaugWin K.J. 2002. Oct4 distribution and level in mouse clones: Consequences for pi uri potency. Genes Dev. 16: 1209-1219. Bolton V.N., Oades P.)., and Johnson M.H. 1984. The relationship between cleavage, DNA replication, and gene expression in the mouse 2-cell embryo. J. Embryol. Exp. Morphol. 79: 139-163. Bortvin A., Eggan K., Skaletsky H., Akutsu H., Berry D.L., Yanagimachi R., Page D.C., and )aenisch R. 2003. Incomplete reactivation of Oct4-related genes in mouse embryos cloned from somatic nuclei. Development 130: 1673-1680. Bourc'his D., Le Bourhis D., Patin D., Niveleau A., Comizzoli P., Renard J.P., and Viegas-Pequignot E. 2001. Delayed and incomplete reprogramming of chromosome methylation patterns in bovine cloned embryos. Curro BioI. 11: 1542-1546. Brambrink T., Hochedlinger K., Bell G., and Jaenisch R. 2006. ES cells derived from cloned and fertilized blastocysts are transcriptionally and functionally indistinguishable. Proc. Natl. Acad. Sci. 103: 933-938. Briggs R. and King T.J. 1952. Transplantation ofliving nuclei from blastula cells into enucleated frogs' eggs. Proc. Natl. Acad. Sci. 38: 455--463. - - - . 1957. Changes in the nuclei of differentiating endoderm cells as revealed by nuclear transplantation. f. Morphol. 100: 269-312. Bromhall ).0. 1975. Nuclear transplantation in the rabbit egg. Nature 258: 719-722. Byrne J.A., Simonsson 5., Western P.S., and Gurdon J.B. 2003. Nuclei of adult mammalian somatic cells are directly reprogrammed to oct4 stern cell gene expression by amphibian oocytes. Curro BioI. 13:. 1206-1213. Calarco P.G. and McLaren A. 1976. Ultrastructural observations of preimplantation stages of the sheep. J. Embryol. Exp. Morphol. 36: 609-622. Callan H.G. and Lloyd L. 1960. Lampbrush chromosomes of crested newts Triturus cristatus (Laurenti). Philos. Trans. R. Soc. Lond. B BioI. Sci. 243: 135-219. Camous 5., KopecnyV., and Flechon J.E. 1986. Autoradiographic detection of the earliest stage of [3H]-uridine incorporation into the cow embryo. BioI. Cel/58: 195-200. Campbell K.H., Loi P., Otaegui P.J., and Wilmut I. 1996a. Cell cycle coordination in embryo cloning by nuclear transfer. Rev. Reprod. 1: 40--46. Campbell K.H., McWhir J., Ritchie W.A., and Wilmut I. 1996b. Sheep cloned by nuclear transfer from a cultured cell line. Nature 380: 64-66. Campbell K.H., Alberio R., Choi I., Fisher P., Kelly R.D., Lee J.H., and Maalouf W. 2005. Cloning: Eight years after Dolly. Reprod. Domest. Anim. 40: 256-268. Cezar G.G., Bartolomei M.S., Forsberg E.J., First N.L., Bishop M.D., and Eilertsen K.J. 2003. Genome-wide epigenetic alterations in cloned bovine fetuses. Bioi. Reprod. 68: 1009-1014. Cibelli J.B., Stice S.L., Golueke P.J., Kane J.J., Jerry J., Blackwell c., Ponce de Leon F. A., and Robl J.M. 1998. Cloned transgenic calves produced from nonquiescent fetal fibroblasts. Science 280: 1256-1258.
N U C LEA R
Dean W., Santos E, and Reik W. 2003. Epigenetic reprogramming in early mammalian development and following somatic nuclear transfer. Semin. Cell Dev. BioI. 14: 93-100. Dean W., Santos E, Stojkovic M., Zakhartchenko v., Walter J., Wolf E., and Reik W. 2001. Conservation of methylation reprogramming in mammalian development: Aberrant reprogramming in cloned embryos. Proc. Nati. Acad. Sci. 98: 13734-13738. Di Berardino M.A. and Hoffner N. 1970. Origin of chromosomal abnormalities in nuclear transplants-A reevaluation of nuclear differentiation and nuclear equivalence in amphibians. Dev. BioI. 23: 185-209. Dimitrov S. and Wolffe A.P. 1996. Remodeling somatic nuclei in Xenopus laevis egg extracts: Molecular mechanisms for the selective release of histones HI and H1° from chromatin and the acquisition of transcriptional competence. EMBO f. 15: 5897-5906. Eggan K., Akutsu H., Hochedlinger K., Rideout W., Yanagimachi R., and Jaenisch R. 2000. X-Chromosome inactivation in cloned mouse embryos. Science 290: 1578-1581. Eggan K., Akutsu H., Loring J., Jackson-Grusby 1., Klemm M., Rideout W.M., III, Yanagimachi R., and Jaenisch R. 2001. Hybrid vigor, fetal overgrowth, and viability of mice derived by nuclear cloning and tetraploid embryo complementation. Froc. Nati. Acad. Sci. 98: 6209-6214. Eggan K., Baldwin K., Tackett M., Osborne J., Gogos J., Chess A., Axel R., and Jaenisch R. 2004. Mice cloned from olfactory sensory neurons. Nature 428: 44-49. Fischberg M., Gurdon J.B., and Elsdale T.R. 1958. Nuclear transplantation in Xenopus laevis. Nature 181: 424. Gonda K., Fowler J., Katoku-Kikyo N., Haroldson J., Wudel J., and Kikyo N. 2003. Reversible disassembly of somatic nucleoli by the germ cell proteins FRGY2a and FRGY2b. Nat. Cell BioI. 5: 205-210. Gurdon J.B. 1960. The developmental capacity of nuclei taken from differentiating endoderm cells of Xenopus laevis. f. Embryoi. Exp. Morphoi. 8: 505-526. - - - . 1962. The developmental capacity of nuclei taken from intestinal epithelium cells of feeding tadpoles. J. Embryol Exp. Morphoi. 10: 622-640. - - - . 1963. Nuclear transplantation in Amphibia and the importance of stable nuclear changes in cellular differentiation. Q. Rev. BioI. 38: 54-78. - - - . 1964. The transplantation of living cell nuclei. Adv. Morphog. 4: 1-43. Gurdon J.B. and Uehlinger V. 1966. "Fertile" intestine nuclei. Nature 210: 1240-1241. Gurdon J.B., Elsdale T.R., and Fischberg M. 1958. Sexually mature individuals of Xenopus laevis from the transplantation of single somatic nuclei. Nature 182: 64-65. Gurdon J.B., Brennan S., Fairman S., and Mohun T.J. 1984. Transcription of muscle-specific actin genes in early Xenopus development: Nuclear transplantation and cell dissociation. Cell 38: 691-700. Hansis C, Barreto G., Maltry N., and Niehrs C 2004. Nuclear reprogramming of human somatic cells by Xenopus egg extract requires BRG1. Curl'. BioI. 14: 1475-1480. Hill J.R., Burghardt R.C., Jones K., Long CR., Looney CR., Shin T., Spencer T.E., Thompson J.A., Winger Q.A., and Westhusin M.E. 2000. Evidence for placental abnormality as the major cause of mortality in first-trimester somatic cell cloned bovine fetuses. BioI. Reprod. 63: 1787-1794. Hochedlinger K. and Jaenisch R. 2002. Monoclonal mice generated by nuclear transfer from mature Band T donor cells. Nature 415: 1035-1038.
T RAN S P LAN TAT ION
•
433
- - - . 2003. Nuclear transplantation, embryonic stem cells, and the potential for cell therapy. N. Engl. J. Med. 349: 275-286. Hochedlinger K., Blelloch R., Brennan C, Yamada Y., Kim M., Chin 1., and Jaenisch R. 2004. Reprogramming of a melanoma genome by nuclear transplantation. Genes Dev. 18: 1875-1885. Humpherys D., Eggan K., Akutsu H., Friedman A., Hochedlinger K., Yanagimachi R., Lander E., Golub T.R., and Jaenisch R. 2002. Abnormal gene expression in cloned mice derived from embryonic stem cell and cumulus cell nuclei. Proc. Nati. Acad. Sci. 99: 12889-12894. Humpherys D., Eggan K., Akutsu H., Hochedlinger K., Rideout W., Biniszkiewicz D., Yanagimachi R., and Jaenisch R. 2001. Epigenetic instability in ES cells and cloned mice. Science 293: 95-97. Hurlbut W.B. 2005. Altered nuclear transfer as a morally acceptable means for the procurement of human embryonic stem cells. Perspect. BioI. Med. 48: 211-228. Inoue K., Wakao H., Ogonuki N., Miki H., Seino K., Nambu-Wakao R., Noda S., Miyoshi H., Koseki H., Taniguchi M., and Ogura A. 2005. Generation of cloned mice by direct nuclear transfer from natural killer T cells. Curl'. BioI. 15: 1114-1118. Jaenisch R. 2004. Human cloning-The science and ethics of nuclear transplantation. N. Engi. f. Med. 351: 2787-2791. Kang Y.K., Lee K.K., and Han Y.M. 2003. Reprogramming DNA methylation in the preimplantation stage: Peeping with Dolly's eyes. Curl'. Opin. Cell BioI. 15: 290-295. Kikyo N., Wade P.A., Guschin D., Ge H., and Wolffe A.P. 2000. Active remodeling of somatic nuclei in egg cytoplasm by the nucleosomal ATPase ISWI. Science 289: 2360-2362. Kohda T., Inoue K., Ogonuki N., Miki H., Naruse M., Kaneko-Ishino T., Ogura A., and Ishino E 2005. Variation in gene expression and aberrantly regulated chromosome regions in cloned mice. BioI. Reprod. 73: 1302-1311. Lanza R., Cibelli J., Blackwell C, Cristofalo v., Francis M., Baerlocher G., Mak J., Schertzer M., Chavez E., Sawyer N., et al. 2000. Extension of cell life-span and telomere length in animals cloned from senescent somatic cells. Science 288: 665-669. Laskey R.A. and Gurdon J.B. 1970. Genetic content of adult somatic cells tested by nuclear transplantation from cultured cells. Nature 228: 1332-1334. Laskey R.A., Honda B.M., Mills A.D., and Finch J.T. 1978. Nucleosomes are assembled by an acidic protein which binds histones and transfers them to DNA. Nature 275: 416-420. Latham K.E. 1999. Mechanisms and control of embryonic genome activation in mammalian embryos. Int. Rev. Cytol. 193: 71-124. Lemaitre J.M., Danis E., Pasero P., Vassetzky Y., and Mechali M. 2005. Mitotic remodeling of the replicon and chromosome structure. Cell 123: 787-801. Li J., Ishii T., Feinstein P., and Mombaerts P. 2004. Odorant receptor gene choice is reset by nuclear transfer from mouse olfactory sensory neurons. Nature 428: 393-399. Li 1., Connelly M.C, Wetmore C, Curran T., and Morgan J.I. 2003. Mouse embryos cloned from brain tumors. Cancer Res. 63: 2733-2736. Mann M.R., Chung Y.G., Nolen L.D., Verona R.I., Latham K.E., and Bartolomei M.S. 2003. Disruption of imprinted gene methylation and expression in cloned preimplantation stage mouse embryos. BioI. Reprod. 69: 902-914. McGrath J. and Solter 0. 1984a. Completion of mouse embryogenesis requires both the maternal and paternal genomes. Cell 37: 179-187. McGrath J. and Solter D. 1984b. Inability of mouse blastomere nuclei transferred to enucleated zygotes to support development in vitro. Science 226: 1317-1319.
434 •
C HAP T E R 2 2
MeKinnell R.G. 1962. Development of Rana pipiens eggs transplanted with Lucke tumor cells. Am. Zool. 2: 430--431. Meehan R.R. 2003. DNA methylation in animal development. Semin. Cell Dey. BioI. 14: 53-65. Meissner A. 2006. "Conditional RNA interference, altered nuclear transfer and genome-wide DNA methylation analysis." Ph.D. thesis. University Saarland, Saarbriicken, Germany. Meissner A. and Jaenisch R. 2006. Generation of nuclear transferderived pluripotent ES cells from cloned Cdx2-deficient blastocysts. Nature 439: 212-215. Ng R.K. and Gurdon J.B. 2005. Epigenetic memory of active gene transcription is inherited through somatic cell nuclear transfer. Proc. Natl. Acad. Sci. 102: 1957-1962. Nolen L.D., Gao S., Han Z., Mann M.R., Gie Chung Y., Otte A.P., Bartolomei M.S., and Latham K.E. 2005. X chromosome reactivation and regulation in cloned embryos. Dey. BioI. 279: 525-540. Ogonuki N., Inoue K., Yamamoto Y., Noguchi Y., Tanemura K., Suzuki 0., Nakayama H., Doi K., Ohtomo Y, Satoh M., et al. 2002. Early death of mice cloned from somatic cells. Nat. Genet. 30: 253-254. Ogura A., Inoue K., Ogonuki N., Noguchi A., Takano K., Nagano R., Suzuki 0., Lee J., Ishino E, and Matsuda J. 2000. Production of male cloned mice from fresh, cultured, and cryopreserved immature Sertoli cells. BioI. Reprod. 62: 1579-1584. Philpott A., Leno G.H., and Laskey R.A. 1991. Sperm decondensation in Xenopus egg cytoplasm is mediated by nucleoplasmin. Cell 65: 569-578. Prather R.S., Sims M.M., and First N.L. 1989. Nuclear transplantation in early pig embryos. BioI. Reprod. 41: 414--418. Rideout W.M., III, Eggan K., and Jaenisch R. 2001. Nuclear cloning and epigenetic reprogramming of the genome. Science 293: 1093-1098. Rideout W.M., III, Hochedlinger K., Kyba M., Daley G.Q., and Jaenisch R. 2002. Correction of a genetic defect by nuclear transplantation and combined cell and gene therapy. Cell 109: 17-27. Rideout W.M., III, Wakayama T., Wutz A., Eggan K., Jackson-Grusby L., Dausman J., Yanagimachi R., and Jaenisch R. 2000. Generation of mice from wild-type and targeted ES cells by nuclear cloning. Nat. Genet. 24: 109-110. Robi J.M., Prather R., Barnes E, Eyestone W., Northey D., Gilligan B., and First N.L. 1987. Nuclear transplantation in bovine embryos. f. Anim. Sci. 64: 642-647. Senda S., Wakayama T., Yamazaki Y, Ohgane J., Hattori N., Tanaka S., Yanagimachi R., and Shiota K. 2004. Skewed X-inactivation in cloned mice. Biochem. Biophys. Res. Commun. 321: 38--44. Simnett J.D. 1964. The development of embryos derived from the
transplantation of neural ectoderm cell nuclei in Xenopus laevis. Dev. BioI. 10: 467--486.
Simonsson S. and Gurdon J. 2004. DNA demethylation is necessary for the epigenetic reprogramming of somatic cell nuclei. Nat. Cell BioI. 6: 984-990. Surani M.A., Barton S.C., and Norris M.L. 1984. Development of reconstituted mouse eggs suggests imprinting of the genome during gametogenesis. Nature 308: 548-550. Tamada H., Van Thuan ., Reed P., Nelson D., Katoku-Kikyo N., Wudel J., Wakayama T, and Kikyo N. 2006. Chromatin decondensation and nuclear reprogramming by nucleoplasmin. Mol. Cell. BioI. 26: 1259-1271. Tamashiro K.L., Wakayama T., Akutsu H., Yamazaki Y., Lachey J.L., Wortman M.D., Seeley R.J., D'Alessio D.A., Woods S.c., Yanagimachi R., and Sakai R.R. 2002. Cloned mice have an obese phenotype not transmitted to their offspring. Nat. Med. 8: 262-267. Tanaka 5., Oda M., Toyoshima Y, Wakayama T, Tanaka M., Yoshida N., Hattori N., Ohgane J., Yanagimachi R., and Shiota K. 2001. Placentomegaly in cloned mouse concepti caused by expansion of the spongiotrophoblast layer. BioI. Reprod. 65: 1813-1821. Tian X.c., Xu J., and Yang X. 2000. Normal telomere lengths found in cloned cattle. Nat. Genet. 26: 272-273. Wakayama 5., Ohta H., Kishigami S., Van Thuan N., Hikichi T., Mizutani E., Miyake M., and Wakayama T 2005. Establishment of male and female nuclear transfer embryonic stem cell lines from different mouse strains and tissues. BioI. Reprod. 72: 932-936. Wakayama T and Yanagimachi R. 1999. Cloning of male mice from adult tail-tip cells. Nat. Genet. 22: 127-128. ---.2001. Mouse cloning with nucleus donor cells of different age and type. Mol. Reprod. Dey. 58: 376---383. Wakayama T, Perry A.C., Zuccotti M., Johnson K.R., and Yanagimachi R. 1998. Full-term development of mice from enucleated oocytes injected with cumulus cell nuclei. Nature 394: 369-374. Wakayama T, Rodriguez 1., Perry A.C., Yanagimachi R., and Mombaerts P. 1999. Mice cloned from embryonic stem cells. Proc. Natl. Acad. Sci. 96: 14984-14989. Wakayama T., Shinkai Y., Tamashiro K.L., Niida H., Blanchard D.C.,. Blanchard R.J., Ogura A., Tanemura K., Tachibana M., Perry A.C., et al. 2000. Cloning of mice to six generations. Nature 407: 318-319. Wilmut 1., Schnieke A.E., McWhir J., Kind A.J., and Campbell K.H. 1997. Viable offspring derived from fetal and adult mammalian cells. Nature 385: 810-813. Young L.E., Sinclair K.D., and Wilmut 1. 1998. Large offspring syndrome in cattle and sheep. Rev. Reprod. 3: 155-163.
c
H
A
p
T
E
23
R
Epigenetics and Human Disease Huda Y. Zoghbi' and Arthur L. Beaudee 'Howard Hughes Medical Institute, and 2Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030
CONTENTS 1. Introduction, 437
3.3
2. Studies of Human Cases Uncover the Role of Epigenetics in Biology, 438
Disorders Affecting Chromatin Structure in cis, 447
3.4
Epigenetics-Environment Interactions, 449
3. Human Diseases, 439 3.1
Disorders of Genomic Imprinting, 439
3.2
Disorders Affecting Chromatin Structure in trans, 443
4. Looking into the Future, 451 Acknowledgments, 451 References, 451
435
GENERAL SUMMARY The last two decades have witnessed unparalleled success in identifying the genetic bases for hundreds of human disorders. Studies of genotype-phenotype relationships challenged clinicians and researchers, because some observations could not be easily explained. For example, monozygotic twins carrying the same disease mutation can be quite different clinically. A mutation passed on in a multigeneration family can cause vastly different diseases depending on the sex of the transmitting parent. The study of such unusual cases uncovered the role of the epigenome (altered genetic information without change in DNA sequence) in health and disease. These studies showed that some regions of the mammalian genome are not functionally equivalent on the maternal and paternal alleles. Patients who inherit both hom*ologous chromosomes (or segments thereof) from the same parent-uniparental disomy (UPD)-have loss of expression of some genes that are only expressed on maternal alleles (in case of paternal UPD) and increased levels for paternally expressed genes. UPD as well as altered DNA modifications (epigenetic mutations that might alter DNA methylation) quickly became recognized as the molecular bases for a variety of developmental and neurological disorders. It is interesting that for
many of these disorders, either epigenetic or genetic mutations can lead to the same phenotype. This is often because the genetic mutations disrupt the function of a gene that is typically misregulated when epigenetic defects affect the locus. In another class of diseases where genetic mutations cause loss of function of proteins involved in DNA methylation or chromatin remodeling, the phenotypes result from altered epigenetic states at one or more loci. The relationships between the genome and epigenome have broadened the types of molecular events that cause human diseases. These could be de novo or inherited, genetic or epigenetic, and most interestingly, some might be influenced by environmental factors. The finding that environmental factors such as diet and experience alter the epigenome (specifically DNA methylation) is likely to provide mechanistic insight about disorders with genetic predisposition and which are highly influenced by the environment. Such disorders include neural tube defects and psychiatric illnesses. Identifying environmental factors that can affect the epigenome provides hope for developing interventions that might decrease the risk or the burden of developmental abnormalities, cancer, and neuropsychiatric disorders.
E P f G ENE TIC SAN 0
1 Introduction
Two genetically identical male monozygotic twins, raised in the same environment, manifested very different neurological functions. Both twins carried the same mutation in the X-linked adrenoleukodystrophy (AiD) gene, yet one developed blindness, balance problems, and loss of myelin in the brain-features typical of the progressive and lethal neurological disease-while the other remained healthy. The conclusion of the investigators reporting the unusual occurrence was "some nongenetic factors may be important for different adrenoleukodystrophy phenotypes" (Korenke et al. 1996). That indeed was a valid conclusion in 1996, given the focus of medical genetics on DNA sequence. If the DNA sequence could not explain a phenotypic variation, then environmental factors did. Similar to the case of the ALD-discordant monozygotic twins, many monozygotic twins have been found to be discordant for schizophrenia despite similar environmental rearing conditions (Petronis 2004). Thankfully, research during the past decade has finally focused attention on epigenetic changes, which are modifications of the genetic information that do not alter DNA sequence, as a potential explanation for discordant phenotypes in monozygotic twins and in individuals who otherwise share similar DNA sequence alterations (Dennis 2003; Fraga et al. 2005). Epigenetic modifications control gene expression patterns in a cell. These modifications are stable and heritable such that a mother liver cell will indeed give rise to more liver cells after it divides. In the case of nondividing cells such as neurons, adaptation of chromosomal regions through chromatin modifications offers a mechanism for maintaining epigenetic information and possibly mediating the reproducible response of neurons to specific stimuli. An epigenotype (the epigenetic state of a genomic locus) is established based on the methylation state of the DNA, chromatin modifications, and the yetto-be elucidated various activities of noncoding RNAs. In mammals, DNA methylation, which is the beststudied epigenetic signal, occurs predominantly at the carbon-s position of symmetrical CpG dinucleotides. The state of DNA methylation is maintained after cell division through the activity of DNA-methyltransferase I, which methylates hemimethylated CpG dinucleotides in daughter cells. Chromatin modifications involve covalent posttranslational modifications of the protruding amino-terminal histone tails by the addition of acetyl,
HUM AND f SEA S E
•
437
methyl, phosphate, ubiquitin, or other groups. Methyl modifications can be mono-, di-, or tri-methylation. These modifications constitute the potential "histone code" that underlies a specific chromatin structure, which in turn affects the expression of adjacent genes. Because chromatin consists of densely packed DNA strands wrapped around histones, the folding pattern of DNA into chromatin is clearly at the root of gene activity changes. Although histone codes and chromatin structures can be stably transmitted from a parent cell to daughter cells, the mechanisms underlying the replication of such structures are not fully understood. The epigenotype shows plasticity during development and postnatally, depending on environmental factors and experiences (see Section 3.4); thus, it is not surprising that epigenotypes could contribute not only to developmental human disorders, but also to postnatal and even adult diseases. The most recent class of molecules contributing to the epigenetic signal is that of noncoding RNAs. For years the class of non-protein-coding RNA (ncRNA) included tRNA, rRNA, and spliceosomal RNA. More recently, because of the availability of genome sequence from multiple organisms, together with cross-species molecular genetic studies (from Escherichia coli to humans), the list of ncRNAs has expanded and resulted in the identification of hundreds of small ncRNAs, including small nucleolar RNA (snoRNA), microRNA (miRNA), short-interfering RNA (siRNA), and small double-stranded RNA. Some of these small RNA molecules regulate chromatin modifications, imprinting, DNA methylation, and transcriptional silenc~ ing, as discussed in detail in Chapter 8. The first definitive evidence of a role for epigenetics in human disease came about after the understanding of genomic imprinting and the finding that several genes are subject to regulation by this mechanism (Reik 1989). Genomic imprinting is a form of epigenetic regulation in which the expression of a gene depends on whether it is inherited from the mother or the father. Thus, at an imprinted diploid locus, there is unequal expression of the maternal and paternal alleles. In each generation, the parent-specific imprinting marks have to be erased, reset, and maintained, thus rendering imprinted loci vulnerable to any errors that may occur during this process. Such errors, as well as mutations in genes encoding proteins involved in DNA methylation, binding to methylated DNA, and histone modifications, all contribute to the fast-growing class of human disorders affecting the epigenome (Fig. 1).
438 •
C HAP T E R 2 3
CHROMATIN RELATED DISEASES
epigenetic
1
genomic imprinting defects
altered DNA methylation
Figure 1. Genetic and Epigenetic Mechanisms Underlying Chromatinrelated Disorders
~.-------------.. genetic
trans
c~
effects
effects
~
~
chromatin effector mutation
regulatory sequence (e.g. promoter) mutation
•
2 Studies of Human Cases Uncover the Role of Epigenetics in Biology
There is no doubt that the study of model organisms has been crucial for understanding many biological principles, especially in the fields of genetics, development, and neuroscience. It is often forgotten, however, that humans represent one of the most important model organisms when it comes to all aspects of biology. The characterization of thousands of human diseases represents the largest mutant screen for any species, and if carefully and systematically studied, these phenotypes are likely to reveal biological insights in addition to the medical benefits. It is therefore not surprising that the genotype-phenotype relationships that challenged Mendelian inheritance in the case of "dynamic mutations" were revealed through the study of patients with fragile X syndrome (Pieretti et al. 1991). Patients with unique features and the observant physicians who study them often break open a new field in biology, revealing novel genetic and molecular mechanisms. This indeed proved to be the case in revealing the role of epigenetics in human development and disease. A female patient made medical history for being reported twice by the physicians who saw her over the span of ten years. At the age of 7 years, she was reported in the medical literature because she suffered from cystic fibrosis (CF) and growth hormone deficiency, and was very short (Hubbard et al. 1980). During the race to find
Epigenetic mechanisms typically involve the alteration of DNA methylation or chromatin at imprinted loci, so disrupting monoallelic expression. Genetic mechanisms can be categorized into two classes. trans effects include the loss or dysfunction of chromatin-associated factors which can in turn alter chromatin structure and gene expression at certain genomic regions. cis effects represent mutations in noncoding regions that may be necessary for regulation. These mutations, which may include the expansion of DNA repeats, can lead to chromatin alterations which affect genome stability and gene expression.
the CF gene, Beaudet and colleagues sought unusual patients who had CF plus additional features in hope of identifying small deletions or chromosomal rearrangements that might facilitate the mapping and identification of the CF gene. Hence, this patient was brought to their attention. She was 16 years of age, measured 130 cm, had normal intelligence, but clearly had some body asymmetry (see right panel of title page figure). Analysis of her DNA revealed that she is hom*ozygous for multiple polyc morphic DNA markers on chromosome 7, including the centromeric alphoid repeats (Spence et al. 1988). After excluding non-paternity and hemizygosity, and after analyzing grand-maternal DNA (mother was deceased), Spence and colleagues concluded that this patient inherited two identical copies of the centromeric region of chromosome 7 from her maternal grandmother (Spence et al. 1988). Given Engel's theoretical proposal that uniparental disomy (UPD) is a possibility in humans (Engel 1980), Beaudet and colleagues immediately recognized that maternal UPD for chromosome 7 uncovered a recessive mutation in the CF gene and accounted for the additional somatic features. The constellation of clinical features in the patient, together with the laboratory evaluations, not only resulted in the identification of the first human case of UPD, but also illustrated that the maternal and paternal genomes are not equivalent for at least some portion of chromosome 7. This provided a novel mechanism of non-Mendelian inheritance to explain disease and developmental abnormalities (Fig. 2).
E PIG ENE TIC SAN 0
Although in 1988 it was thought by some that UPD of a chromosome was a rare event, today we know that UPD has been reported thus far for all human chromosomes except chromosomes 3 and 19. The study of unusual patients not only identified cases of UPD for additional chromosomes, but in 1989 also led to the proposal that UPD causes disease due to changes in epigenotype and disruption of genomic imprinting (Nicholls et al. 1989). Nicholls et al. studied a patient with Prader-Willi syndrome (PWS) who had a balanced Robertsonian translocation t(l3;15) that was also present in his asymptomatic mother and maternal relatives. The fact that the proband inherited his second free chromosome 15 from his mother (while all asymptomatic individuals inherited it from their fathers) led the authors to conclude that maternal UPD had led to the PWS phenotype. After confirming maternal UPD15 in a second PWS patient with an apparently normal karyotype, the authors proposed a role for genomic imprinting in the etiology of
gene A
Maternal
!J\'\O\'\() \a.I.ol~tW
!J\'\O\'\()
\-.I......iilioloIW
UPD
Paternal
•
439
PWS. Furthermore, they concluded that either paternal deletions or maternal UPD from 15q11-13 will lead to PWS, and they predicted that paternal UPD15 would lead to Angelman syndrome, just as maternal deletions of this region do. All of these predictions proved true (Fig. 3).
3 Human Diseases 3.1 Disorders of Genomic Imprinting
The discovery of UPD was the clinical entry point into disorders of genomic imprinting in humans. Whereas PWS and Angelman syndrome were the first genomic imprinting disorders to be studied, Beckwith-Wiedemann syndrome, pseudohypoparathyroidism, and SilverRussell syndrome expanded the list and introduced many intriguing questions about how epigenetic defects lead to the disease phenotype. In the following section, we give a brief review of the clinical features of each disorder, the various mechanisms leading to epigenotypic defects, and the phenotypes and biological insight gained from the study of this class of disorders (see Table 1).
geneS
Maternal!J\'\O\'\() .,............w Paternal
HUM AND I SEA S E
!J\'\O\'\() \J.j~""'W
UPD Figure 2. Consequences of Uniparental Disomy (UPD) The DNA methylation states of upstream CpG islands are indicated by pink circles when methylated and open circles when unmethylated. The DNA methylation state affects the expression of its downstream gene. Maternally inherited alleles are doubled (gene B); whereas those that are on the paternal alleles are lost (gene A).
SISTER SYNDROMES: PRADER-WllLl AND ANGElMAN
Prader-Willi syndrome (PWS; OMIM 176270) and Angelman syndrome (AS; OMIM 105830) are caused in the majority of cases by the same 5- to 6-Mb deletion in 15qll-q13, but their phenotypes are vastly different. Genomic imprinting in the region of 15qll-q13 accounts for the phenotypic differences, given that PWS is caused by paternally inherited deletions whereas in AS, the deletion is of maternal origin (Ledbetter et al. 1981; Magenis et al. 1987; Nicholls et al. 1989). PWS, which occurs in approximately 1110,000 births, was described almost 50 years ago and is characterized by infantile hypotonia, developmental delay, failure to thrive due to poor feeding, and lethargy, followed by hyperphagia, severe obesity, short stature, secondary hypogonadism with genital hypoplasia, and mild cognitive impairment. PWS patients also have distinct physical characteristics such as small hands and feet, almond-shaped eyes, and thin upper lip. Most PWS patients have mild to moderate mental retardation, and the vast majority display a variety of obsessive-compulsive behaviors, anxiety, and sometimes a withdrawn, unhappy disposition (Fig. 4a). In contrast, patients with AS have a "happy disposition," smile frequently, and have unexplained bouts of laughter. AS patients suffer from severe developmental delay, very minimal (if any) verbal skills, balance problems (ataxia), abnormal hand-
440 •
C HAP T E R 2 3
maternal deletion
ANGELMAN SYNDROME
cause:
paternal UPD
ul I
genetic
epigenetic
imprint defect
mutations in UBE3A
I 11
mixed
genetic
PRADER-WILLI SYNDROME Figure 3. Prader-Willi Syndrome and Angelman Syndrome paternal deletion
Both syndromes can be caused by genetic, epigenetic, or mixed defects.
maternal UPD
flapping movements, microcephaly, seizures, and some dysmorphic features such as prominent mandible and wide mouth (Fig. 4b). Hypotonia, hypopigmentation of the skin and irides, and strabismus can be seen in both disorders. The majority of PWS and AS (~70%) are caused by paternal and maternal deletions of I5q II-q 13, respectively. About 25% of PWS cases are caused by maternal UPD of I5q II-q 13, whereas paternal UPD of this region accounts for 2-5% of AS patients (Fig. 3). The difference in frequency of UPD between PWS and AS is usually initiated by maternal nondisjunction, as influenced by maternal age leading to a conception with trisomy or monosomy 15. These are then "rescued;' leading to maternal UPD and PWS or paternal UPD and AS, respectively. The difference in frequency of the two UPDs is presumably related to the frequency of the two abnormal eggs and the probability of rescue for the two circ*mstances. Translocations within the PWS/AS critical
region account for less than 10% of the cases, but it is of note that such translocations are associated with a high recurrence risk (up to 50%) depending on the sex of the transmitting parent. In fact, PWS and AS co-occurred in some families due to translocations or other structural abnormalities of I5qll-q13, and the phenotype was determined by the sex of the transmitting parent (Hasegawa et al. 1984; Smeets et al. 1992). Imprinting defects represent another class of mutations leading to PWS or AS phenotypes. These defects, which involve a bipartite imprinting center (IC) within I5qll-q13 (Ohta et a1.I999), cause a chromosome of one parental origin to have an altered epigenotype, typically that of the chromosome of an opposite parental origin: Imprinting defects often involve deletion of the IC, but there are instances when such defects appear to be due to an epigenetic mutation that does not involve the DNA sequence. The outcome of such diverse imprinting defects is the same and includes alterations in DNA methylation,
Table 1. Selected disorders of genomic imprinting Disorder
Gene
Comments
Gene(s) involved
Prader-Willi syndrome
deletion, UPD, imprint defect
15q11-q13
snoRNAs and other (7)
Angelman syndrome
deletion, UPD, imprint defect, point mutation, duplication'
15q11-q13
UBE3A
Beckwith-Wiedemann syndrome
imprint defect, UPD, 11 p15.5 duplication, translocation point mutation
11 p15.5
IGF2, co*kN1C
Silver-Russell syndrome
UPD, duplication translocation, inversion
7p11.2
several candidates in the region
epimutation
11p15.5
biallelic expression of H19 and decrease of IG F2
point mutation, imprint defect, UPD
20q13.2
GNASI
Pseudohypoparathyroidism
'Maternal duplications, trisomy, and tetrasomy for this region cause autism and other developmental abnormalities.
E PIG ENE TIC SAN 0
Figure 4. Images of a Prader-Willi Syndrome Patient (0) and Angelman Syndrome Patient (b) These pictures illustrate the dramatic differences in the clinical features of the disorders resulting from defects in an imprinted region. Images kindly provided by Drs. Daniel J. Driscoll and Carlos A. Bacino, respectively.
chromatin structure, and, ultimately, gene expression patterns. Imprinting defects account for 2-5% of PWS and AS cases, and the IC deletions are typically associated with 50% recurrence risk, depending on the sex of the transmitting parent, whereas the recurrence risk is low for families without IC deletions. The identification of imprinting defects in a handful of AS patients who were conceived after intracytoplasmic sperm injection (rCSI) raised the possibility that this approach of in vitro fertilization might cause imprinting defects (Cox et a1. 2002; Orstavik et a1. 2003). The finding of imprinting defects among AS cases born to sub-fertile couples who did not receive ICSI (but did receive hormonal stimulation) raises further questions about whether there are common mechanisms for infertility and imprinting defects or whether indeed assisted reproductive technology (hormones and/or ICSI) has epigenetic consequences (Ludwig et a1. 2005). Exactly which gene(s) is affected by genomic imprinting in 15q11-q13 is known for AS but not for PWS. About 10-15% of AS cases are caused by loss-of-function mutations in the ubiquitin E3 ligase gene (UBE3A) encoding the E6-associated protein (E6-AP) (Kishino et a1. 1997; Matsuura et a1. 1997). Expression studies demonstrated that Ube3a is expressed exclusively from the maternal allele in cerebellar Purkinje cells and hippocampal neurons. Furthermore, Ube3a+ l - mice lacking the maternal allele reproduce features of AS (Jiang et a1. 1998). These
HUM AND I SEA S E
441
results, together with human data, pinpoint the UBE3A gene as the causative gene in AS. Paternal UPD or maternal deletions of 15qll-q13 lead to loss of expression of UBE3A in Purkinje cells. In the case of IC imprinting defects, it appears that loss of silencing of an antisense transcript leads to suppression of UBE3A expression (Rougeulle et a1. 1998). It is intriguing that about 10% of AS cases remain without a molecular diagnosis. A subset of these patients appear to have mutations in a chromatin-remodeling protein, methyl-CpG-binding protein 2, as discussed below. In the case of PWS, there are several candidate imprinting genes that are only expressed from the paternal allele; however, it is not clear which of these genes is contributing to the PWS phenotype. The best candidate genes thus far are in a cluster of noncoding snoRNAs. The best protein-coding candidate genes are SNURF-SNRPN and Necdin (NDN). SNURF-SNRPN has its major transcriptional start site at the IC, and it encodes a small nuclear ribonucleoprotein (SNRPN) that functions in the regulation of splicing. Another gene, a "SNRPN upstream reading frame" or SNURF, along with upstream noncoding exons, is thought to be the major site of imprinting defects, because disruption of this gene leads to altered imprinting of SNRPN and other 15q11-q13 imprinted genes. Mice lacking Snrpn appear normal, but mice with deletions spanning Snrpn and other genes hom*ologous to those in 15qll-q13 are hypotonic, develop growth retardation, and die before weaning (Tsai et a1. 1999). Several small nucleolar RNA (snoRNA) genes are expressed from the paternal allele and are suspected to contribute to the PWS phenotype (Meguro et a1. 2001). A recent study showed that loss of the paternal allele from one cluster of these genes (HBIl-52) does not cause PWS (Runte et a1. 2005). However, a study in mice suggests loss of Pwcrl/MBIl-85 snoRNA is likely responsible for the neonatal lethality in PWS mouse models (Ding et a1. 2005). Therefore, PWS may be caused by loss of one or more snoRNA genes, possibly in combination with loss of other paternally expressed genes in 15qll-q13. Careful studies of rare translocation and deletion families support the interpretation that deficiency of PWCR1/HBII85 snoRNAs causes PWS (Schule et a1. 2005).
BECKWITH-WIEDEMANN SYNDROME
The story of Beckwith-Wiedemann syndrome (BWS; OMIM 130650) represents an excellent example of how a human disorder uncovered the importance of epigenetics not only in normal development, but in the regulation of
442
II
C HAP T E R 2 3
cell growth and tumorigenesis. BWS is characterized by somatic overgrowth, congenital abnormalities, and a predisposition to childhood embryonal malignancies (Weksberg et al. 2003). BWS patients typically manifest gigantism, macroglossia (large tongue), hemihypertrophy, variable degrees of ear and other organ anomalies, and omphalocele (protrusion of abdominal organs through the navel). In addition, many patients suffer from increased size of internal organs; embryonic tumors such as Wilms' tumor, hepatoblastoma, or rhabdomyosarcoma; and hyperplasia and hypertrophy of pancreatic islets, often leading to neonatal hypoglycemia. The majority of BWS cases are sporadic, but a small number of families with an autosomal dominant inheritance pattern (in retrospect, modified by genomic imprinting) suggested genetic etiology and linked the syndrome to 11 p15 (Ping et al. 1989). Preferential loss of maternal alleles in BWS-related tumors, an excess of transmitting females in the dominant form of the disease, and paternal UPD of 11 p 15.5 in some cases of BWS provided evidence that epigenetics and imprinting must play an important role in the etiology of BWS, and that the disease might result from a mixture of genetic and epigenetic abnormalities either de novo or inherited. The cluster of imprinted genes implicated in BWS maps to an approximately 1-Mb region in 11p 15.5 and includes at least 12 imprinted genes. These genes are thought to be regulated by two imprinting centers separated by a nonimprinted region (Weksberg et al. 2003). The reciprocally imprinted H19 and insulin-like growth factor (IGF2) and a differentially methylated region are thought to represent one imprinting control region (ICRl) (Joyce et al. 1997; Weksberg et al. 2003). H19 encodes a maternally expressed noncoding pol II RNA, and lGF2 encodes a paternally expressed growth factor. These two genes share a common set of enhancers, access to which is affected by the methylation state of ICRI and binding of CTCF, a zinc finger protein (Hark et al. 2000). The second imprinting control region (ICR2) contains several maternally expressed genes, including the cyclin-dependent kinase inhibitor (CDKNIC encoding p5i;P2), a component of the potassium channel (KCNQl), and a putative cation transporter (SLC22AlL). The differentially methylated region in ICR2 maps to an intron of KCNQl and is unmethylated on paternal alleles, leading to expression of KCNQIOTI in an antisense direction of KCNQl. Methylation of ICR2 on the maternal allele is believed to silence maternal expression of KCNQIOTl, allowing expression of the maternally expressed KCNQl and CDKNIC (Lee et al. 1999; Smilinich et al. 1999).
Various epigenetic as well as genetic molecular defects provided some insight about which genes contribute to the BWS phenotype. On unmethylated maternal alleles, CTCF binds ICRI and establishes a chromatin boundary whereby the lGF2 promoter is insulated from enhancers. These enhancers can then access the H19 promoter (proximal to the boundary), permitting transcription of H19. Methylation of ICRI on paternal alleles abrogates the binding of CTCF, permitting expression of lGF2 and silencing of H19. The findings that either duplications in 11pI5.5 that span the lGF2 locus or paternal UPD of this region (expected to lead to overexpression of lGF2) , coupled with data showing that transgenic mice overexpressing lGF2 develop overgrowth and large tongues, implicated lGF2 overexpression as one potential cause of the overgrowth phenotype in BWS (Henry et al. 1991; Weksberg et al. 1993; Sun et al. 1997). It is quite intriguing that loss-of-function mutations in CDKNI C give rise to BWS, similar to those caused by overexpression of lGF2. Mice lacking Cdknlc develop omphaloceles but not overgrowth. However, when loss of Cdknlc is coupled with increased expression of 19f2, the animals reproduce many features of BWS (Caspary et al. 1999). To date, the molecular lesions that cause BWS include (1) paternal duplications encompassing lGF2, (2) paternal UPD for 11pI5.5, (3) loss-of-function mutations in the maternal allele of CDKNI C, (4) translocations on the maternal chromosome disrupting KCNQl which affect imprinting of lGF2 but curiously not ICR2, and (5) most commonly, loss of imprinting for ICR21KCNQIOTl which again alters imprinting of lGF2 and suggests some regulatory interactions between ICR1 and ICR2 (Cooper et al. 2005). Some of the epigenetic changes identified in BWS, such as methylation defects at the H19 ICRl, have also been confirmed in individuals who develop Wilms' tumor but not BWS, suggesting that the timing of the epigenetic defect might dictate whether abnormal growth regulation will affect the whole organism or a specific organ. The fact that aberrant methylation at IRCI often leads to Wilms' tumor, and at ICR2 often leads to rhabdomyosarcoma and hepatoblastoma in BWS, suggests that there is more than one locus in 11p15.5 predisposing to tumorigenesis (Weksberg et al. 2001; DeBaun et al. 2003; Prawitt et al. 2005).
SILVER-RUSSELL SYNDROME
Silver-Russell syndrome (SRS; OMIM 180860) is a developmental disorder characterized by growth retardation, short stature often with asymmetry, and some
E PIG ENE T f C 5
dysmorphic facial and cranial features as well as digit abnormalities. The most prominent feature is the somatic growth abnormality, with other features being highly variable. SRS is genetically heterogeneous, but it is estimated that about 10% of the cases result from maternal UPD for chromosome 7 (Eggermann et al. 1997). It is proposed that loss of function of a paternally expressed gene, possibly one that promotes growth, causes SRS, but an alternate model of overexpression of a maternally expressed growth-suppressing gene cannot be excluded. It is interesting that an epigenetic mutation causing demethylation of the ICR1 on chromosome llp15 has been identified in several individuals with SRS. This epigenetic defect causes biallelic expression of H19 and decreased expression of IGF2 (Gicquel et al. 2005).
AND
HUM AND I 5 E A 5 E
443
The genotype-phenotype studies of these clinical disorders demonstrate that with the exception of SRS, all the other genomic imprinting disorders (PWS, AS, BWS, and PHP) can be caused by a mixture of genetic or epigenetic abnormalities, either de novo or inherited. It is hard to believe that such a mixed genetic model for disease would remain unique for this small subset of disorders. A little over a decade ago, UPD was only a theoretical possibility, but now it is established to occur in many chromosomal regions and to result in diverse diseases and developmental phenotypes. One challenge in human genetics research is to uncover which genes are responsible for which UPD-associated phenotypes in order to establish a list of diseases that are likely to result from mixed genetic/epigenetic mechanisms. 3.2 Disorders Affecting Chromatin Structure in trans
PSEUDOHYPOPARATHYROIDISM
Pseudohypoparathroidism (PHP) represents a group of phenotypes that result from functional hypoparathyroidism despite normal parathyroid hormone (PTH) levels. These patients are resistant to PTH. There are several clinical subtypes-la, Ib, Ie, II, and Albright hereditary osteodystrophy (OMIM 103580). In addition to the functional hypoparathyroidism and osteodystrophy, these clinical variants may exhibit a variety of developmental and somatic defects. The clinically heterogeneous phenotypes result from mutations in the GNASl gene encoding the a-stimulating activity polypeptide 1 (Gsa), a guanine nucleotide-binding protein. GNASl maps to chromosome 20q13.2. The GNASl locus has three upstream alternative first exons (exons lA, XL, and NESP55) that are spliced to exons 2-13 to produce different transcripts and, in the case of NESP55 and XL, this alternative splicing produces unique proteins. There are differentially methylated regions near these exons, causing NESP55 to be expressed exclusively from maternal alleles, whereas XL, exon lA, and an antisense transcript for NESP55 are paternally expressed. Although the transcript encoding the Gsa protein is biallelically expressed, the maternal allele is preferentially expressed in some tissues such as the proximal renal tubule. The combination of genomic and tissue-specific imprinting accounts for the variable phenotypes and parent-of-origin effect even for mutations that have a clear autosomal dominant inheritance pattern (Hayward et al. 1998). Of note is the finding that one patient with paternal uniparental disomy of the GNASl region developed PHP type Ib disease (Bastepe et al. 2003).
The importance of finely tuned chromatin structure for human health has been highlighted through the rapidly growing list of human diseases caused by mutations in genes encoding proteins essential for chromatin structure and remodeling. These disorders themselves do not have epigenetic mutations but alter chromatin states that are critical components of the epigenotype. The vast differences in phenotypes, as well as the fact that subtle changes in protein levels or even conserved amino acid substitution can lead to human disease, are beginning to provide clues about the tightly controlled regulation and interactions of chromatin-remodeling proteins. Disorders that affect chromatin in trans result either from disruption of function of proteins directly involved in chromatin remodeling, such as CREB-binding protein (CBP), EP300, or methyl-CpG-binding protein (MeCP2), or from loss of function of proteins involved in DNA methylation such as de novo DNA methyltransferase 3B (DNMT3B) or methylene tetrahydrofolate reductase (MTHFR) (see Table 2). Disruption of the function of any of these genes causes complex multisystem phenotypes or neoplasia owing to the downstream effects of misregulation of expression of a large number of target genes. Although yet to be discovered, there is an ample opportunity for diseases caused by mutation in noncoding RNAs acting in trans. RUBINSTEIN-TAYBI SYNDROME
Rubinstein-Taybi syndrome (RSTS; OMIM 180849) is characterized by mental retardation, broad thumbs and toes, facial abnormalities, congenital heart defects, and increased risk of tumor formation. The high concordance rate in monozygotic twins, together with a few cases of
444
C HAP T E R 2 3
Table 2. Selected genetic disorders affecting chromatin structure in trans Disorder
Gene
Rubinstein-Taybi syndrome
CREBBp, EP300
Rett syndrome
MECP2
loss of function as well as duplication causes a broad spectrum of phenotypes
a-Thalassemia and X-linked mental retardation
ATRX
somatic mutations cause a-thalassemia and myelodysplastic syndrome
ICF Syndrome
DNMT3B
Schimke immuno-osseous dysplasia
SMARCAL7
Mental retardation
MTHFR
mother-to-child transmission, suggested that this disease has a genetic basis and that an autosomal dominant inheritance was most likely. Cytogenetic abnormalities involving 16p13.3 were identified in several RSTS patients (Tommerup et al. 1992) and found to map to the region that contains the CREB-binding protein gene (CREBBP or CBP). Heterozygous mutations in CREBBP demonstrated that haploinsufficiency of CBP causes RSTS (Petrij et al. 1995). CBP was first described as a coactivator of the cAMP-responsive binding protein CREE. When cellular levels of cAMP increase, protein kinase A (PKA) translocates to the nucleus and phosphorylates CREB, which leads to its activation and binding to cAMPresponse elements (CREs) (Mayr and Montminy 2001). CBP is a large protein (~250 kD) with a bromodomain that has been shown to bind PKA-phosphorylated CREB (Chrivia et al. 1993). CBP in turn activates transcription from a CRE-containing promoter through the acetylation of all four core histones in the adjacent nucleosomes (Ogryzko et al. 1996). CBP also interacts through a region in its carboxyl terminus directly with the basal transcription factor TFIIIB (Arias et al. 1994; Kwok et al. 1994). in vitro functional analysis of one of the CBP missense mutations (Arg-1378 to proline) that cause RSTS revealed that this mutation abolishes the histone acetyltransferase (HAT) activity of CBP (Murata et al. 2001). These data, together with the finding that mice haploinsufficient for CBP have impaired learning and memory, altered synaptic plasticity, and abnormal chromatin acetylation, support the conclusion that decreased HAT activity of CBP is a key contributor to the RSTS phenotype (Alarcon et al. 2004). Consistent with the role of decreased HAT activity in disease is the recent discovery that mutations in a second gene, p300, encoding a potent HAT and transcriptional coactivator cause some cases of RSTS (Roelfsema et al. 2005). The finding that some of the synaptic plasticity defects, as well as learning and memory deficits of the CBP+'- mice, can be reversed by using histone deacetylase
Comments
(HDAC) inhibitors (Alarcon et al. 2004) raises the question whether pharmacologic therapy using such reagents can ameliorate some of the mental deficits in RSTS.
RETI SYNDROME
Rett syndrome (RTT, OMIM 312750) is a dominant Xlinked postnatal neurodevelopmental disorder characterized by motor abnormalities, ataxia, seizures, replacement of hand use by purposeless hand-wringing, and language regression (Hagberg et al. 1983). RTT is classified as one of the autistic spectrum disorders (ASD) in DSMIV and shares three main features with ASD: Both manifest postnatally, often after a period of apparent normal development; both disrupt social and language development, and both are accompanied by unusual stereotyped hand or arm movements (Fig. Sa). Although RTT is a sporadic disorder in the vast majority of cases (>99%), the discoyery of a handful of families in whom the gene was transmitted through maternal lines suggested a genetic basis for this disorder. Such families, together with findings that RTT was typically observed in females and that obligate carrier females can be asymptomatic, led to the hypothesis that RTT is an X-linked dominant disorder. An exclusion mapping strategy localized the RTT gene to Xq27-qter, and candidate gene analysis pinpointed the gene encoding methyl-CpG-binding protein 2 (MECP2) as the causative gene (Amir et al. 1999). The discovery of mutations in MECP2 as the major cause of RTT provided molecular evidence for a relationship between RTT and autism. Mutations in MECP2 are now known to cause a broad spectrum of phenotypes in females, including learning disabilities, isolated mental retardation, Angelman-like syndrome, and ASD. X-chromosome inactivation (XCI) patterns are the major molecular determinants for this clinical variability. Females with MECP2 mutations and balanced XCI patterns typically have classic RTT with the exception of a few hypomorphic
E PIG ENE TIC SAN 0
b Figure 5. Genetic Disorders Affecting Chromatin in cis (0) This photo of a Rett syndrome patient illustrates the unusual stereotyped hand movements, teeth grinding, and abnormal posture. Photo kindly provided by Dr. Daniel G. Glaze. (b) Micrograph of chromosomes from an ICF patient, courtesy of Drs. Timothy H. Bestor, Robert A. Rollins, and Deborah Bourc'his.
alleles. Females with unbalanced XCI patterns favoring the wild-type allele typically have the milder phenotypes (Wan et a1. 1999; Carney et a1. 2003). Males with MECP2 mutations display a broader phenotype than females, due to their hemizygosity for the locus. RTT-causing mutations typically cause neonatal lethality unless the male is mosaic for the mutations or has XXY karyotype, in which case, all the phenotypes seen in females are also seen in these males (Zeev et a1. 2002; Neul and Zoghbi 2004). On the other hand, males that have hypomorphic alleles which barely cause a phenotype in females develop any combination of features including mental retardation, seizures, tremors, enlarged testes, bipolar disease, or schizophrenia (Meloni et al. 2000; Couvert et a1. 2001). MeCP2 was identified on the basis of its ability to bind symmetrically methylated CpG dinucleotides (Lewis et a1. 1992). It localizes to heterochromatin and acts as a transcriptional repressor in a methylation-dependent manner (Nan et a1. 1997). MeCP2 binds methylated DNA
HUM AND I SEA S E
445
through its methyl-CpG-binding domain and interacts with corepressors Sin3A and HDACs through its transcription repression domain. MeCP2 also associates with Brahma, a component of the SWI-SNF chromatinremodeling complex (Harikrishnan et a1. 2005). An intriguing feature of RTT is the delayed postnatal onset of phenotypes in the absence of neurodegeneration. Studies on the distribution and abundance of MeCP2 revealed that it is detected in mature neurons, probably after synapse formation (Shahbazian et a1. 2002a; Kishi and Macklis 2004; Mullaney et a1. 2004). Such a distribution suggests that MeCP2's neuronal function is essential after neuronal maturation and activity have been established and that it plays a role in regulating neuronal activity. Some targets of MeCP2 are beginning to be identified, but exactly which of these targets mediate the diverse RTT phenotypes remains to be determined (Chen et a1. 2003; Martinowich et a1. 2003; Horike et a1. 2005; Nuber et a1. 2005). Studies using cell extracts from RTT patients or brain extracts from mouse models that lack functional MeCP2 have revealed altered histone acetylation (Wan et al. 2001; Shahbazian et a1. 2002b; Kaufmann et a1. 2005), consistent with a proposed role for this protein in deacetylation of histones based on its interactions with HDACs. It is interesting that doubling the dose of MeCP2 in mice and humans leads to progressive postnatal phenotypes that are in fact more severe than some of the loss-of-function phenotypes (Collins et a1. 2004; Meins et a1. 2005; Van Esch et a1. 2005). Whether increasing MeCP2 levels results in titration of key interactors and/or aberrant expression of its targets remains to be seen. In pursuit of revealing potentially novel functions for MeCP2, Young and colleagues discovered that MeCP2 interacts with Y box-binding protein 1 (YB-l), an RNAbinding protein that affects splicing (Young et a1. 2005). MeCP2 regulates RNA splicing of reporter minigenes, but most importantly, seems to affect RNA splicing in vivo based on altered RNA splicing patterns in brain tissue from a mouse model for RTT (Young et a1. 2005). The importance of MeCP2 in epigenetic regulation of neuronal gene expression and its effects on RNA splicing are likely to be at the root of loss of developmental milestones and abnormal neurological function in RTT and related disorders.
a-THALASSEMIA X-LINKED MENTAL RETARDATION
Males with a-thalassemia X-linked mental retardation syndrome (ATRX; OMIM 301040) display a-thalassemia, moderate to severe mental retardation, dysmorphic facial
446
C HAP T E R 2 3
features, microcephaly, skeletal and genital abnormalities, and, usually, inability to walk. Heterozygous females are typically asymptomatic. Mutations in the ATRX gene, which maps to Xq 13, cause this syndrome, as well as a host of additional phenotypes, including variable degrees of X-linked mental retardation (XLMR), severe MR with spastic paraplegia, and acquired a-thalassemia in myelodysplastic syndrome (ATMDS) owing to somatic mutations (Gibbons et al. 1995, 2003; Villard et al. 1996; Yntema et al. 2002). The ATRX protein contains a plant homeodomain (PHD-like) zinc finger motif as well as a DNA-dependent ATPase of the SNF2 family. This, together with its localization to pericentromeric heterochromatic domains and association with heterochromatinla (HPla) (McDowell et al. 1999), suggests a role as a chromatin-remodeling protein. Mutations in ATRX cause down-regulation of the a-globin locus and abnormal methylation of several highly repeated sequences, including subtelomeric repeats, Y-specific satellite, and ribosomal DNA arrays. A recent study demonstrated that ATRX is essential for the survival of cortical neurons, hinting that increased neuronal loss might contribute to the severe mental retardation and spasticity seen in patients with ATRX mutations (Berube et al. 2005). It is interesting that levels of ATRX are tightly regulated and that either decreases or increases cause major neurodevelopmental problems. For example, human patients with mutations that result in 10-30% of normal ATRX levels display the full ATRX phenotype despite having significant amounts of the normal ATRX protein (Picketts et al. 1996). Too much of ATRX seems to be equally devastating. Transgenic mice that overexpress ATRX develop neural tube defects, have growth retardation, and die during embryogenesis. Those that survive develop craniofacial abnormalities, compulsive facial scratching, and seizures. The features are reminiscent of clinical features of patients with loss-of-function mutations of ATRX, raising the possibility that levels of ATRX are tightly regulated for the functional integrity of the protein complex within which it resides.
IMMUNODEFICIENCY, CENTROMERIC REGION INSTABILITY, AND FACIAL ANOMALIES SYNDROME
The immunodeficiency, centromeric region instability, and facial anomalies syndrome (ICF, OMIM 242860) is a rare autosomal recessive chromosome breakage disorder. ICF patients display two invariant phenotypes, immunodeficiency and cytogenetic abnormalities. Highly variable and less penetrant phenotypes include craniofacial defects
such as a broad and flat nasal bridge, epicanthal folds, high forehead and low-set ears, psychom*otor retardation, and intestinal dysfunction (Smeets et al. 1994). The immunodeficiency is typically severe and is often the cause of premature death during childhood due to respiratory or gastrointestinal infections. A decrease in serum IgG levels is the most common immunological defect, but decreased numbers of B or T cells are also observed (Ehrlich 2003). Cytogenetic abnormalities primarily affecting chromosomes 1 and 16, and to a lesser degree 9, are seen on routine karyotype analysis of blood and in cultured cells of ICF patients (Fig. 5b) (Tuck-Muller et al. 2000). Hypomethylation of juxtacentromeric repeat sequences on chromosomes 1,9, and 16 had been discovered well before the identification of the ICF gene (Jeanpierre et al. 1993). These chromosomes contain the largest blocks of classic satellite (satellites 2 and 3) tandem repeats near their centromeres. The finding that ICF is caused by loss-of-function mutations in the de novo DNA methyltransferase gene (DNMT3B) provided insight into the decrease in methylation at centromeric satellites 2 and 3 (Hansen et al. 1999; Okano et al. 1999; Xu et al. 1999). However, it remains unclear why loss of function of a widely expressed de novo methyltransferase selectively affects specific repetitive sequences. One possible explanation entails the subcellular distribution and/or context-specific protein interaction of DNMT3B (Bachman et al. 2001). Another possibility is that the catalytic activity of DNMT3B is more essential for methylating sequences that have a high density of CpGs over large genomic regions, as in the case of satellite 2 (Gowher and Jeltsch 2002) or the D4Z4 repetitive sequence, implicated in facioscapulohumeral muscular dystrophy (Kondo et al. 2000). Whether additional specific sequences are hypomethylated remains to be determined, but it is predicted that DNA hypomethylation leads to altered expression of genes that play an important role in craniofacial, nervous system, and immunological development. Gene expression studies using RNA from lymphoblastoid cell lines of ICF patients and healthy controls revealed several alterations in genes involved in maturation, migration, activation, and homing of lymphocytes (Ehrlich et al. 2001). It is not clear, however, whether loss of DNMT3B causes dysregulation of such genes, because the methylation patterns at their promoter did not seem to be altered..Given that the only hypomethylation detected so far in ICF is at satellite DNA, it is hypothesized that some of the genes altered in ICF might associate with satellite DNA. Such sequences typically behave as heterochromatin when methylated; thus, in ICF there is
E PIG ENE TIC 5
dysregulated gene expression due to trans-effects of heterochromatic regions rich in satellite 2 and 3 domains (Bickmore and van der MaareI2003). SCHIMKE IMMUNO-OSSEUS DYSPLASIA
Schimke immuno-osseous dysplasia (SIOD, OMIM 242900) is an autosomal recessive multisystem disorder characterized by dysplasia of the spine and ends of long bones, growth deficiency, renal function abnormalities due to focal and segmental glomerulosclerosis, hypothyroidism, and defective T-cell-mediated immunity (Schimke et al. 1971; Spranger et al. 1991). SIOD is caused by mutations in SMARCALl (SW1/SNF2-related, matrix associated; actin-dependent regulator of chromatin, subfamily a-like 1), which encodes a protein proposed to regulate transcriptional activity through chromatin remodeling (Boerkoel et al. 2002). Nonsense and frameshift mutations cause severe phenotypes, whereas some of the missense mutations cause milder or partial phenotypes (Boerkoel et al. 2002). Recently, a patient with B-cell lymphoma and SIOD was found to have mutations in SMARCALl, suggesting that loss of function of this protein can cause a fatal lymphoproliferative disorder (Taha et al. 2004). The exact mechanism by which loss of SMARCAL1 causes the phenotypes of SIOD remains to be elucidated. METHYLENE TETRAHYDROFOLATE REDUCTASE DEFICIENCY
Methylene tetrahydrofolate reductase (MTHFR) is involved in the conversion of S,10-methylene tetrahydrofolate (S,10-MTHF) to S-methyl tetrahydrofolate (SMTHF). A methyl group is then acquired from SMTHF during the conversion of hom*ocysteine to methionine by methionine synthase. Methionine is further converted to S-adenosyl methionine (SAM), the major methyl donor for all methyl transferases. Deficiency of MTHFR causes a rare autosomal recessive disorder characterized by mental retardation (Rozen 1996). A common thermolabile polymorphism (677C>T, which changes alanine to valine) causes reduced activity of MTHFR and has been associated, especially in hom*ozy-
AND
HUM AND I 5 E A 5 E
•
447
gotes whose diets are low in folate, with hyperhom*ocysteinemia (Goyette et a1. 1994). This polymorphism has been investigated as a risk factor of atherosclerosis, neural tube defects, and cancer (Ma et a1. 1997; Brattstrom et a1. 1998; Chen et a1. 1999; Botto and Yang 2000; Schwahn and Rozen 2001). Mice heterozygous or hom*ozygous for a null allele of MTHFR have decreased levels of SAM and decreased global DNA methylation. Furthermore, the null mutants have aortic lipid deposition and neuronal degeneration (Chen et a1. 2001). The global alteration in DNA methylation associated with partial or complete loss of MTHFR suggests that some of the phenotypes associated with its dysfunction might result from disturbances of chromatin due to decreased DNA (and possibly histone) methylation. There is one report of MTHFR deficiency causing an Angelman syndrome phenotype (Am et a1. 1998), and there is considerable phenotypic overlap of severe MTHFR deficiency with AS and RTT (Fattal-Valevski et a1. 2000). 3.3 Disorders Affecting Chromatin Structure in cis
The genes for most Mendelian disorders are usually identified by fmding mutations in either exons or splice sites, whereby the gene products, RNA or protein, are altered or not produced. For many of these disorders, however, there is frequently a small group of patients in whom mutations cannot be identified after sequencing of coding and noncoding regions of the gene despite linkage to the specific locus. It is becoming increasingly clear that epigenetic or genetic abnormalities which affect gene expression in cis underlie some Mendelian disorders and cases lacking exonic mutations. The following three examples demonstrate how cis-linked alterations in chromatin structure can result in human disease (see Table 3). ao~- AND O~-THALASSEMIA
The thalassemias are the most common single-gene disorders in the world. They are a heterogeneous group of hemoglobin synthesis disorders caused by reduced levels of one or more of the globin chains of hemoglobin. The
Table 3. Selected genetic disorders affecting chromatin structure in cis Disorder
Gene
aop- and op-thalassemia
deletion of LCR causes decreased globin expression
Fragile X syndrome
expansion of CCG repeat leads to abnormal methylation and silencing of FMRI
FSH dystrophy
contraction of D4Z4 repeats causes less repressive chromatin
Multiple cancers
germ-line epimutation of MLHI
Comments
premutation alleles (60-200) cause a neurodegenerative disorder
448
C HAP T E R 2 3
imbalance in synthesis of various globin chains leads to abnormal erythropoiesis and profound anemIa (Weatherall et al. 2001). Hundreds of coding and splicing mutations have been identified, but it was the deletions of the regulatory sequences that pinpointed how changes in chromatin structure can explain some subtypes of thalassemia. The discovery that deletions of approximately 100 kb which removed the upstream part of the p-globin gene (while leaving the gene intact) caused aOp-thalassemia helped identify the locus control region (LCR) that regulates p-globin expression (Kioussis et al. 1983; Forrester et al. 1990). Smaller deletions involving part of the LCR caused Op-thalassemia (Curtin et al. 1985; Driscoll et al. 1989). These deletions resulted in an altered chromatin state at the p-globin locus despite being tens of kilobases upstream of the coding region (Grosveld 1999). FRAGILE
X
SYNDROME
Fragile X mental retardation (OMIM 309550) is one of the most common causes of inherited mental retardation. Over 60 years ago, Martin and Bell described a family which showed that mental retardation segregated as an Xlinked disorder (Martin and Bell 1943). In 1969, Lubs reported on the constriction on the long arm of the X chromosome in some mentally retarded males and one asymptomatic female (Lubs 1969). This chromosomal variant was mapped to Xq27.3 and dubbed the fragile X chromosome (Harrison et al. 1983). Cytogenetic studies, especially those using culture media deficient in folic acid and thymidine, revealed the fragile site in families with Xlinked mental retardation, and they were then diagnosed as having fragile X syndrome (Sutherland 1977; Richards et al. 1981). Affected males have moderate to severe mental retardation, macroorchidism, connective tissue abnormalities such as hyperextensibility of joints, and large ears (Fig. 6) (Hagerman et al. 1984). The gene responsible for fragile X syndrome is FMR1, which encodes FMRP protein. The most common mutational mechanism is an expansion of an unstable noncoding CGG repeat (Warren and Sherman 2001). Normal alleles contain 6-60 repeats, premutation alleles have 60-200, and the full mutation contains>200 repeats. The repeat expansion at the 5'UTR of the FMRl gene provides an excellent example of a genetic disorder that is mediated through altered chromatin structure in cis. A CpG island in the 5' regulatory region of FMRl becomes aberrantly methylated upon repeat expansion in the case of the full mutation (Verkerk et al. 1991). Decreased histone acetylation at the 5' end is documented in cells from fragile X patients com-
Figure 6. Example of a Genetic Disorder Affecting Chromatin in trans The photograph is of a patient with fragile X syndrome who, in addition to mental retardation, has the typical features of prominent forehead and large ears. Photograph kindly provided by Dr. Stephen 1. Warren.
pared to healthy controls (Coffee et al. 1999). In turn, the altered DNA methylation and histone acetylation patterns lead to loss of expression of FMRl and, therefore, loss of FMRP function in patients with fragile X syndrome. Thus, these patients have a primary genetic mutation and a secondary epigenetic mutation. An interesting epigenetic mechanism has been proposed to explain how the CGG FMR1 repeat gets methylated and subsequently silenced. The finding that a premutation CGG repeat forms a single and stable hairpin structure (Handa et al. 2003), together with findings that rCGG repeats can be cleaved by Dicer, raised the possibility that expanded CGG repeats (which are unmethylated during early development) can be transcribed and that the resulting RNA forms a hairpin structure that can be cleaved by Dicer to produce small noncoding RNAs. These small RNA molecules associate with RNA-induced initiator of transcriptional gene silencing (RITS) and recruit DNA de novo methyltransferases and/or histone methyltransferases to the 5'UTR of FMR1, leading to full methylation of the CGG repeat and transcriptional repression of FMRl as development progresses (Jin et al. 2004a). FMRP is a selective RNA-binding protein that contains 2 KH domains and an RGG box. It associates with polysomes in an RNA-dependent manner through messenger ribonucleoprotein particles and has been impli-
E PIG ENE TIC 5
cated in suppressing translation both in vitro and in vivo (Laggerbauer et al. 2001; Li et al. 2001). The localization of FMRP with mRNA and polyribosomes in dendritic spines provided evidence for its role in regulating local protein synthesis in response to synaptic stimulation (Feng et al. 1997; Weiler and Greenough 1999; Brown et aI. 2001; Darnell et al. 2001, 2005). Putative targets of FMRP have been identified that play a role in synaptic development and that could explain partially the neurodevelopmental phenotypes (Brown et al. 2001; Darnell et al. 2001). Several studies suggest that the RNA interference (RNAi) pathway is a major mechanism by which FMRP regulates translation. Drosophila fragile X hom*olog (Dfmr1) associates with Argonaute (ARG02) and the RNA-induced silencing complex (RISC), and mammalian FMRP interacts with EIF 2C2 and associates with Dicer activity (Caudy et al. 2002; Ishizuka et al. 2002; Jin et al. 2004b). The favored proposed mechanism for FMRP's role as a translational suppressor is that FMRP binds to specific mRNA ligands, recruits RISC along with miRNAs, and facilitates the recognition between the miRNAs and the mRNA ligands (Jin et al. 2004a). Carriers of the fragile X premutation (60-200 repeats) develop a distinct neurodegenerative syndrome characterized by tremor and ataxia (Hagerman and Hagerman 2004). Interestingly, these premutations may induce pathogenesis at the RNA level because the FMRl RNA and protein are present. Studies in animal models suggest that the RNA encoded by CGG repeats binds to and alters the function of some cellular proteins, causing them to accumulate (Jin et al. 2003; Willemsen et al. 2003). FACIOSCAPULOHUMERAL DYSTROPHY
Facioscapulohumeral dystrophy (FSHD; OMIM 158900) is an autosomal dominant muscular dystrophy characterized by progressive wasting of the muscles of the face, upper arm, and shoulder. The more severe cases have hearing loss, and a very small subset of severely affected children are mentally retarded and have seizures (Mathews 2003). The major locus for FSHD (FSHD1) maps to the subtelomeric region of chromosome 4q35 near D4Z4, a low-copy repeat that contains an array of 3.3-kb GC-rich units. This repeated array is polymorphic and contains 11-150 units on normal chromosomes whereas it is in the1-1O-unit range on FSHD chromosomes (Wijmenga et al. 1992; van Deutekom et al. 1993). A second variable satellite repeat sequence (~-68bp Sau3A) distal to D4Z4 appears to playa role in developing FSHD. The 4qA variant at the ~-satellite repeat, along with the contraction of
AND
HUM AND I 5 E A 5 E
•
449
the D4Z4 repeat, is necessary for the manifestation of FSHD (Lemmers et al. 2002). Exactly how contractions of D4Z4 together with 4qA ~-satellite variant cause disease is not quite understood. The 4q35 region containing D4Z4 shares similarities with other subtelomeres and displays features typical of heterochromatic regions (Flint et al. 1997; Tupler and Gabellini 2004). In vitro and in vivo studies identified a 27-bp sequence in D4Z4 that binds to a complex termed the D4Z4-repressing complex (DRC) which comprises the transcriptional repressor Ying Yang1 (YY1), high mobility group box 2 (HMGB2), and nucleolin (Gabellini et al. 2002). Bickmore and van der Maarel, and Gabellini and colleagues, proposed that contraction of the repeats causes a less repressive chromatin state leading to increased transcription of 4q35-qter genes (Bickmore and van der Maarel2003; Tupler and Gabellini 2004). The finding of increased expression of three genes-FRG1 and 2 (FSHD region genes 1 and 2), and adenine nucleotide transporter 1 (ANT1)-in FSHD muscle compared to normal muscle is consistent with this hypothesis (Gabellini et al. 2002).Whether these gene expression changes are a direct or indirect consequence of D4Z4 contractions and whether misregulation of additional genes contributes to the disease phenotype remains to be seen. EPIMUTATIONS AND HUMAN DISEASE
Epimutations in the DNA mismatch repair gene MLHl have been identified in two individuals who have had multiple cancers (Suter et al. 2004). Abnormal methylation of the promoter region of the MLHl gene was detected in all available normal tissues from these two individuals. Deletion or loss of heterozygosity of MLHl in tumor tissue led to the complete loss of MLH 1 protein. These patients both suffered from colorectal cancer; one of them had duodenal cancer and the other developed endometrial and breast cancer as well as melanoma. The extent of the role of epimutations in human disease will only become apparent when investigators begin to search for such mutations systemically. 3.4 Epigenetics-Environment Interactions
Data from human studies as well as animal models are providing evidence that the environment can affect epigenetic marks and, as a result, gene function. The finding that monozygotic twins have similar epigenotypes during early years of life, but exhibit remarkable differences in the content and distribution of 5-methylcytosines and acetylated histones, provides strong evidence that the
450 •
C HAP T E R 2 3
epigenotype is metastable and displays temporal variability (Fraga et al. 2005). It is likely that many environmental factors and stochastic events contribute to the variations in the epigenome (Fig. 7) (Anway et al. 2005), but diet and early experiences are emerging as potential key players. DIET AND EPIGENOTYPES IN AGING AND DISEASE
Several reports indicate that there is an age-dependent decrease of global DNA methylation while concurrently there might be site-specific hypermethylation (Hoal-van Helden and van Helden 1989; Cooney 1993; Rampersaud et al. 2000). Given the large body of data linking altered DNA methylation to cancer risk or progression (MaysHoopes 1989; Issa et al. 1994), such epigenetic changes might contribute to the age-related increase in cancer risk. The role of diet as a contributing factor in controlling global methylation status has been best illustrated in adult males suffering from uremia and undergoing hemodialysis. The presence of hyperhom*ocystinemia in these patients suggests low methionine content, presumably due to folate depletion. These males had reduced global and locus-specific DNA methylation that was reversed after the administration of high doses of folic acid (Ingrosso et al. 2003). Because several of the neuropsychiatric features resulting from folate and B12 deficiencies overlap with those seen with sporadic neuropsychiatric disorders, it was proposed that the latter might be caused by alterations in methylation patterns in the central nervous sys-
tem (Reynolds et al. 1984). Low levels of SAM were found in folate-responsive depression; furthermore, SAM supplementation is helpful as an adjunct therapeutic in some forms of depression (Bottiglieri et al. 1994). Last, although it is unclear how increased folic acid intake by childbearing women reduces the risk of neural tube defect, it is tempting to propose some epigeneticmediated effects on DNA or histone methylation. The finding that supplementing maternal diets with extra folic acid, B12, and betaine alters the epigenotype and phenotype of the offspring of agouti viable yellow mice is likely to be the first of many examples yet to be discovered in humans and other mammals (Wolff et al. 1998; Waterland and Jirtle 2003). EARLY EXPERIENCES AND EPIGENOTYPES
The best example of how early experiences and maternal behavior might alter the mammalian epigenotype has so far been described only in rats. Frequent licking and grooming by rat mothers altered the DNA methylation status in the promoter region of the glucocorticoid receptor (GR) gene in the hippocampus of their pups. The highly licked and groomed pups have decreased DNA methylation and increased histone acetylation at the GR promoter compared to pups that were raised by low-licking and grooming mothers (Weaver et al. 2004). The increased levels of GR, secondary to the epigenotype change, affect the regulation of stress hormone levels and the lifelong response to stress in the rat pups (Liu et al. 1997; Weaver et al. 2004). Although such data are not
EPIGENOME ENVIRONMENT
Figure 7. The Epigenotype Plays a Critical Role along with the Genotype and Environmental Factors in Determining Phenotypes Known epigenetic factors affecting gene expression and genome stability include DNA methylation, chromatin-remodeling complexes, covalent histone modifications, the presence of histone variants, or noncoding regulatory RNAs (ncRNA).
EPIGENETICS
available for humans yet, they certainly raise questions about the role of early experiences in modulating epigenotypes and risk for psychiatric disorders in humans. 4 Looking into the Future
During the next decade, we antIClpate that mutations which alter the epigenotype will become increasingly recognized as mutational mechanisms that cause a variety of human disorders. Traditionally, the identification of disease-causing genes has focused on disorders where familial cases or patients with chromosomal abnormalities facilitated the positional doning of the responsible gene. At this time, we are challenged as we attempt to discover the mutational bases for some of the most common and devastating disorders such as schizophrenia, autism, and mood disorders. Familial cases are not very common; genetic heterogeneity is very likely; and last but not least, genetic data-especially the rate of discordances in monozygotic twins-do not always support a straightforward Mendelian inheritance model. These findings, coupled with the strong environmental effects on the penetrance of some of these disorders, underscore the importance of investigating the epigenomes in such diseases. Even single-gene disorders such as Angelman syndrome, Beckwith-Wiedemann syndrome, and SilverRussell syndrome can be caused either by genomic mutations or by mutations that affect the epigenotype, and can be either inherited or de novo. Such molecular variations will undoubtedly be unearthed for other human disorders. Furthermore, data demonstrating that the levels of several proteins involved in epigenetic regulation are tightly regulated and that perturbations of such levels either through loss-of-function mutations or duplications cause human disorders, suggest that epigenetic mutations that will affect transcription, RNA splicing, or protein modifications are also likely to cause disease. Acknowledgments
We thank Drs. Timothy H. Bestor, Robert A. Rollins, and Deborah Bourc'his for the image of chromosomes from an ICF syndrome patient; Dr. Daniel J. Driscoll for the image of a Prader-Willi syndrome patient, Dr. Carlos A. Bacino for the image of an Angelman syndrome patient, Dr. Daniel G. Glaze for the image of a Rett syndrome patient, and Dr. Stephen T. Warren for the image of a fragile X syndrome patient. We gratefully acknowledge colleagues in the Zoghbi laboratory for reading the chapter and providing excellent suggestions. We also thank past
AND
HUMAN
DISEASE.
451
and current laboratory members who have contributed to our work on Rett syndrome, Prader-Willi syndrome, and Angelman syndrome. Last, but not least, we are grateful to our patients with Rett syndrome, UPD of chromosome 7, Prader-Willi syndrome, Angelman syndrome, and autism, and to their families for enlightening us about the role of epigenetics in human diseases. Our work has been supported by grants from the National Institutes of Health (5 POI HD040301-05; 5 P30 HD024064-17; 5 POI HD37283); International Rett Syndrome Association; Rett Syndrome Research Foundation; Cure Autism Now; The Simons Foundation; March of Dimes (l2-FY03-43); and the Blue Bird Clinic Rett Center. H.Y.Z. is an investigator with the Howard Hughes Medical Institute. We regret that due to space constraints, we had to eliminate many important and relevant citations. References Alarcon J.M., Malleret G., Touzani K., Vronskaya 5., Ishii 5., Kandel E.R., and Barco A. 2004. Chromatin acetylation, memory, and LTP are impaired in CBP+'- mice: A model for the cognitive deficit in Rubinstein-Taybi syndrome and its amelioration. Neuron 42: 947-959. Amir R.E., Van den Veyver LB., Wan M., Tran C.Q., Francke D., and Zoghbi H.Y. 1999. Rett syndrome is caused by mutations in Xlinked MECP2, encoding methyl-CpG-binding protein 2. Nat. Genet. 23: 185-188. Anway M.D., Cupp A.S., Dzumcu M., and Skinner M.K. 2005. Epigenetic transgenerational actions of endocrine disruptors and male fertility. Science 308: 1466-1469. Arias J., Alberts A.S., Brindle P., Claret EX., Smeal T., Karin M., Feramisco J., and Montminy M. 1994. Activation of cAMP and mitogen responsive genes relies on a common nuclear factor. Nature 370: 226-229. Am P.H., Williams C.A., Zori R.T., Driscoll D.J., and Rosenblatt D.S. 1998. Methylenetetrahydrofolate reductase deficiency in a patient with phenotypic findings of Angelman syndrome. Am. ]. Med. Genet. 77: 198-200. Bachman K.E., Rountree M.R., and Baylin S.B. 2001. Dnmt3a and Dnmt3b are transcriptional repressors that exhibit unique localization properties to heterochromatin. ]. BioI. Chern. 276: 32282-32287. Bastepe M., Frohlich L.E, Hendy G.N., Indridason 0.5., Josse R.G., Koshiyama H., Korkko J., Nakamoto J.M., Rosenbloom A.L., Slyper A.H., et al. 2003. Autosomal dominant pseudohypoparathyroidism type Ib is associated with a heterozygous microdeletion that likely disrupts a putative imprinting control element of GNAS. J. Clin. Invest. 112: 1255-1263. Berube N.G., Mangelsdorf M., Jagla M., Vanderluit J., Garrick D., Gibbons R.J., Higgs D.R., Slack R.S., and Picketts D.J. 2005. The chromatin-remodeling protein ATRX is critical for neuronal survival during corticogenesis.]. Clin. Invest. 115: 258-267. Bickmore W.A. and van der Maarel S.M. 2003. Perturbations of chromatin structure in human genetic disease: Recent advances. Hum. Mol. Genet. 12: R207-R213. Boerkoel C.E, Takashima H., John J., Yan J., Stankiewicz P., Rosenbarker 1., Andre J.L., Bogdanovic R., Burguet A., co*ckfield 5., et al. 2002.
452 •
C HAP T E R 2 3
Mutant chromatin remodeling protein SMARCALl causes Schimke immuno-osseous dysplasia. Nat. Genet. 30: 215-220. Bottiglieri T., Hyland K., and Reynolds E.H. 1994. The clinical potential of ademethionine (S-adenosylmethionine) in neurological disorders. Drugs 48: 137-152. Botto L.D. and Yang Q. 2000. 5,IO-Methylenetetrahydrofolate reductase gene variants and congenital anomalies: A HuGE review. Am. ]. Epidemiol. 151: 862-877. Brattstrom L., Wilcken D.E., Ohrvik J., and Brudin L. 1998. Common methylenetetrahydrofolate reductase gene mutation leads to hyperhom*ocysteinemia but not to vascular disease: The result of a meta-analysis. Circulation 98: 2520-2526. Brown v., Jin P., Ceman S., Darnell J.C, O'Donnell W.T., Tenenbaum SA, Jin X., Feng Y., Wilkinson K.D., Keene J.D., et al. 2001. Microarray identification of FMRP-associated brain mRNAs and altered mRNA translational profiles in fragile X syndrome. Cell 107: 477-487. Carney R.M., Wolpert CM., Ravan S.A., Shahbazian M., Ashley-Koch A., Cuccaro M.L., Vance J.M., and Pericak-Vance M.A. 2003. Identification of MeCP2 mutations in a series of females with autistic disorder. Pediatr. Neural. 28: 205-211. Caspary T., Cleary M.A., Perlman E.J., Zhang P., Elledge S.J., and Tilghman S.M. 1999. Oppositely imprinted genes p57K;P2 and Igf2 interact in a mouse model for Beckwith-Wiedemann syndrome. Genes Dey. 13: 3115-3124. Caudy A.A., Myers M., Hannon G.J., and Hammond S.M. 2002. Fragile X-related protein and VIG associate with the RNA interference machinery. Genes Dey. 16: 2491-2496. Chen J., Giovannucci E.L., and Hunter D.J. 1999. MTHFR polymorphism, methyl-replete diets and the risk of colorectal carcinoma and adenoma among U.S. men and women: An example of geneenvironment interactions in coIorectal tumorigenesis. f. Nutr. 129: S560-S564. Chen w.G., Chang Q., Lin Y., Meissner A., West A.E., Griffith E.C, Jaenisch R., and Greenberg M.E. 2003. Derepression of BDNF transcription involves calcium-dependent phosphorylation of MeCP2. Science 302: 885-889. Chen Z., Karaplis A.C, Ackerman S.L., Pogribny I.P., Melnyk S., Lussier-Cacan S., Chen M.E, Pai A., John S.W., Smith R.S., et al. 2001. Mice deficient in methylenetetrahydrofolate reductase exhibit hyperhom*ocysteinemia and decreased methylation capacity, with neuropathology and aortic lipid deposition. Hum. Mol. Genet. 10: 433-443. Chrivia J.C., Kwok R.P., Lamb N., Hagiwara M., Montminy M.R., and Goodman R.H. 1993. Phosphorylated CREB binds specifically to the nuclear protein CBP. Nature 365: 855-859. Coffee B., Zhang E, Warren S.T., and Reines D. 1999. Acetylated histones are associated with FMRI in normal but not fragile X-syndrome cells (erratum Nat. Genet. 22: 209 [1999]). Nat. Genet. 22: 98-101. Collins A.L., Levenson J.M., Vilaythong A.P., Richman R.D., Armstrong L., Noebels J.L., Sweatt J.D., and Zoghbi H.Y. 2004. Mild overexpression of MeCP2 causes a progressive neurological disorder in mice. Hum. Mol. Genet. 13: 2679-2689. Cooney CA. 1993. Are somatic cells inherently deficient in methylation metabolism? A proposed mechanism for DNA methylation loss, senescence and aging. Growth Dev. Aging 57: 261-273. Cooper W.N., Luharia A., Evans G.A., Raza H., Haire A.C, Grundy R., Bowdin S.C, Riccio A., Sebastio G., Bliek J., et al. 2005. Molecular subtypes and phenotypic expression of Beckwith-Wiedemann syndrome. Eur.]. Hum. Genet. 13: 1025-1032. Couvert P., Bienvenu T., Aquaviva C, Poirier K., Moraine C, Gendrot
C, Verloes A., Andres C, Le Fevre A.C, Souville I., et al. 2001. MECP2 is highly mutated in X-linked mental retardation. Hum. Mol. Genet. 10: 941-946. Cox G.E, Burger J., Lip v., Mau U.A., Sperling K., Wu B.L., and Horsthemke B. 2002. Intracytoplasmic sperm injection may increase the risk of imprinting defects. Am. f. Hum. Genet. 71: 162-164. Curtin P., Pirastu M., Kan Y.W., Gobert-Jones J.A., Stephens A.D., and Lehmann H. 1985. A distant gene deletion affects ~-globin gene function in an atypical yS~-thalassemia. f. Clin. Invest. 76: 1554-1558. Darnell J.C, Jensen K.B., Jin P., Brown v., Warren S.T., and Darnell R.B. 2001. Fragile X mental retardation protein targets G quartet mRNAs important for neuronal function. Cell 107: 489-499. Darnell J.C, Fraser CE., Mostovetsky 0., Stefani G., Jones T.A., Eddy S.R., and Darnell R.B. 2005. Kissing complex RNAs mediate interaction between the Fragile-X mental retardation protein KH2 domain and brain polyribosomes. Genes Dey. 19: 903-918. DeBaun M.R., Niemitz E.L., and Feinberg A.P. 2003. Association of in vitro fertilization with Beckwith-Wiedemann syndrome and epigenetic alterations of LITI and H19. Am. f. Hum. Genet. 72: 156-160. Dennis C 2003. Epigenetics and disease: Altered states. Nature 421: 686-688. Ding E, Prints Y., Dhar M.S., Johnson D.K., Garnacho-Montero C, Nicholls R.D., and Francke U. 2005. Lack of Pwcrl/MBII-85 snoRNA is critical for neonatal lethality in Prader-Willi syndrome mouse models. Mamm. Genome 16: 424-431. Driscoll M.C, Dobkin CS., and Alter B.P. 1989. YS~-thalassemia due to a de novo mutation deleting the 5' ~-globin gene activation-region hypersensitive sites. Prac. Natl. Acad. Sci. 86: 7470-7474. Eggermann T., Wollmann H.A., Kuner R., Eggermann K., Enders H., Kaiser P., and Ranke M.B. 1997. Molecular studies in 37 Silver-Russell syndrome patients: Frequency and etiology of uniparental disomy. Hum. Genet. 100: 415-419. Ehrlich M. 2003. The ICF syndrome, a DNA methyltransferase 3B deficiency and immunodeficiency disease. Clin. Immunol. 109: 17-28. Ehrlich M., Buchanan K.L., Tsien E, Jiang G., Sun B., Uicker w., Weemaes CM., Smeets D., Sperling K., Belohradsky B.H., et ai. 2001. DNA methyltransferase 3B mutations linked to the ICF syndrome cause dysregulation of lymphogenesis genes. Hum. Mol. Genet. 10: 2917-2931. Engel E. 1980. A new genetic concept: Uniparental disomy and its poten tial effect, isodisomy. Am. f. Med. Genet. 6: 137-143. Fattal-Valevski A., Bassan H., Korman S.H., Lerman-Sagie T., Gutman A., and Harel S. 2000. Methylenetetrahydrofolate reductase deficiency: Importance of early diagnosis.]. Child Neural. 15: 539-543. Feng Y., Absher D., Eberhart D.E., Brown v., Malter H.E., and Warren S.T. 1997. FMRP associates with polyribosomes as an mRNP, and the I304N mutation of severe fragile X syndrome abolishes this association. Mol CellI: 109-118. Flint J., Thomas K., Micklem G., Raynham H., Clark K., Doggett N.A., King A., and Higgs D.R. 1997. The relationship between chromosome structure and function at a human telomeric region. Nat. Genet. 15: 252-257. Forrester W.C, Epner E., Driscoll M.C, Enver T., Brice M., Papayannopoulou T., and Groudine M. 1990. A deletion of the human beta-globin locus activation region causes a major alteration in chromatin structure and replication across the entire betaglobin locus. Genes Dev. 4: 1637-1649. Fraga M.E, Ballestar E., Paz M.E, Ropero S., Setien E, Ballestar M.L., Heine-Suner D., Cigudosa J.C, Urioste M., Benitez J., et al. 2005. Epigenetic differences arise during the lifetime of monozygotic
E PIG ENE T f C 5
twins. Proc. Natl. Acad. Sci. 102: 10604-10609. Gabellini D., Green M.R., and Tupler R. 2002. Inappropriate gene activation in FSHD: A repressor complex binds a chromosomal repeat deleted in dystrophic muscle. Cell 110: 339-348. Gibbons R.l., Picketts D.l., Villard L., and Higgs D.R. 1995. Mutations in a putative global transcriptional regulator cause X-linked mental retardation with alpha-thalassemia (ATR-X syndrome). Cell 80: 837-845. Gibbons R.J., Pellagatti A., Garrick D., Wood WG., Malik N., Ayyub H., Langford c., Boultwood J., Wainscoat J.S., and Higgs D.R. 2003. Identification of acquired somatic mutations in the gene encoding chromatin-remodeling factor ATRX in the a-thalassemia myelodysplasia syndrome (ATMDS). Nat. Genet. 34: 446-449. Gicquel c., Rossignol S., Cabrol S., Houang M., Steunou v., Barbu v., Danton E, Thibaud N., Le Merrer M., Burglen L., et al. 2005. Epimutation of the telomeric imprinting center region on chromosome 11p15 in Silver-Russell syndrome. Nat. Genet. 37: 1003-1007. Gowher H. and Jeltsch A. 2002. Molecular enzymology of the catalytic domains of the Dnmt3a and Dnmt3b DNA methyltransferases. ]. BioI. Chem. 277: 20409-20414. Goyette P., Sumner J.S., Milos R., Duncan A.M., Rosenblatt D.S., Matthews R.G., and Rozen R. 1994. Human methylenetetrahydrofolate reductase: Isolation of cDNA, mapping and mutation identification. Nat. Genet. 7: 195-200. Grosveld E 1999. Activation by locus control regions? Curro Opin. Genet. Dev. 9: 152-157. Hagberg B., Aicardi l., Dias K, and Ramos O. 1983. A progressive syndrome of autism, dementia, ataxia, and loss of purposeful hand use in girls: Rett's syndrome: Report of 35 cases. Ann. Neurol. 14: 471-479. Hagerman EJ. and Hagerman R.T. 2004. The fragile-X premutation: A maturing perspective. Am.]. Hum. Genet. 74: 805-816. Hagerman R.J., Van Housen K, Smith A.C., and McGavran L. 1984. Consideration of connective tissue dysfunction in the fragile X syndrome. Am. J. Med. Genet. 17: 111-121. Handa v., Saha T, and Usdin K 2003. The fragile X syndrome repeats form RN;\ hairpins that do not activate the interferon-inducible protein kinase, PKR, but are cut by Dicer. Nucleic Acids Res. 31: 6243-6248. Hansen R.S., Wijmenga c., Luo P., Stanek A.M., Canfield TK., Weemaes C.M., and Gartler S.M. 1999. The DNMT3B DNA methyltransferase gene is mutated in the ICF immunodeficiency syndrome.Proc. Natl.Acad. Sci. 96: 14412-14417. Harikrishnan K.N., Chow M.Z., Baker E.K., Pal S., Bassal S., Brasacchio D., Wang L., Craig J.M., Jones P.L., Sif S., and EI-Osta A. 2005. Brahma links the SWI/SNF chromatin-remodeling complex with MeCP2-dependent transcriptional silencing. Nat. Genet. 37: 254-264. Hark A.T, Schoenherr c.J., Katz D.J., Ingram R.S., Levorse J.M., and Tilghman S.M. 2000. CTCF mediates methylation-sensitive enhancer-blocking activity at the H19/Igf2 locus. Nature 405: 486-489. Harrison c.J., Jack E.M., Allen TD., and Harris R. 1983. The fragile X: A scanning electron microscope study. ]. Med. Genet. 20: 280-285. Hasegawa T, Hara M., Ando M., Osawa M., f*ckuyama Y., Takahashi M., and Yamada K 1984. Cytogenetic studies of familial PraderWilli syndrome. Hum. Genet. 65: 325-330. Hayward B.E., Kamiya M., Strain L., Moran v., Campbell R., Hayashizaki Y., and Bonthron D.T 1998. The human GNASI gene is imprinted and encodes distinct paternally and biallelically expressed G proteins. Proc. Natl. Acad. Sci. 95: 10038-10043.
AND
HUM AND f 5 E AS E
•
453
Henry I., Bonaiti-Pellie c., Chehensse v., Beldjord c., Schwartz c., Utermann G., and Junien c. 1991. Uniparental paternal disomy in a genetic cancer-predisposing syndrome. Nature 351: 665-667. Hoal-van HeIden E.G. and van Heiden ED. 1989. Age-related methylation changes in DNA may reflect the proliferative potential of organs. Mutat. Res. 219: 263-266. Horike S., Cai S., Miyano M., Cheng J.E, and Kohwi-Shigematsu T 2005. Loss of silent-chromatin looping and impaired imprinting of DLX5 in Rett syndrome. Nat. Genet. 37: 31-40. Hubbard V.S., Davis P.B., di Sant'Agnese EA., Gorden E, and Schwartz R.H. 1980. Isolated growth hormone deficiency and cystic fibrosis: A report of two cases. Am. J. Dis. Child 134: 317-319. Ingrosso D., Cimmino A., Perna A.E, Masella L., De Santo N.G., De Bonis M.L., Vacca M., D'Esposito M., D'Urso M., Galletti E, and Zappia V. 2003. Folate treatment and unbalanced methylation and changes of allelic expression induced by hyperhom*ocysteinaemia in patients with uraemia. Lancet 361: 1693-1699. Ishizuka A., Siomi M.C., and Siomi H. 2002. A Drosophila fragile X protein interacts with components of RNAi and ribosomal proteins. Genes Dev. 16: 2497-2508. Issa J.E, Ottaviano Y.L., Celano E, Hamilton S.R., Davidson N.E., and Baylin S.B. 1994. Methylation of the oestrogen receptor CpG island links ageing and neoplasia in human colon. Nat. Genet. 7: 536-540. Jeanpierre M., Turleau c., Aurias A., Prieur M., Ledeist E, Fischer A., and Viegas-Pequignot E. 1993. An embryonic-like methylation pattern of classical satellite DNA is observed in ICF syndrome. Hum. Mol. Genet. 2: 731-735. Jiang Y.H., Armstrong D., Albrecht u., Atkins C.M., Noebels J.L., Eichele G., Sweatt J.D., and Beaudet A.L. 1998. Mutation of the Angelman ubiquitin ligase in mice causes increased cytoplasmic p53 and deficits of contextual learning and long-term potentiation (see comments). Neuron 21: 799-811. Jin E, Alisch R.S., and Warren S.T 2004a. RNA and microRNAs in fragile X mental retardation. Nat. Cell BioI. 6: 1048-1053. Jin P., Zarnescu D.C., Zhang E, Pearson C.E., Lucchesi J.c., Moses K., and Warren S.T. 2003. RNA-mediated neurodegeneration caused by the fragile X premutation rCGG repeats in Drosophila. Neuron 39: 739-747. Jin E, Zarnescu D.C., Ceman S., Nakamoto M., Mowrey J., Jongens TA., Nelson D.L., Moses K., and Warren S.T. 2004b. Biochemical and genetic interaction between the fragile X mental retardation protein and the microRNA pathway. Nat. Neurosci. 7: 113-117. Joyce J.A., Lam WK., Catchpoole D.l., Jenks P., Reik W, Maher E.R., and Schofield P.N. 1997. Imprinting of IGF2 and H19: Lack of reciprocity in sporadic Beckwith-Wiedemann syndrome. Hum. Mol. Genet. 6: 1543-1548. Kaufmann W.E., Jarrar M.H., Wang J.S., Lee Y.J., Reddy S., Bibat G., and Naidu S. 2005. Histone modifications in Rett syndrome lymphocytes: A preliminary evaluation. Brain Dev. 27: 331-339. Kioussis D., Vanin E., deLange T, Flavell R.A., and Grosveld EG. 1983. Beta-globin gene inactivation by DNA translocation in gamma beta-thalassaemia. Nature 306: 662-666. Kishi N. and MackJis J.D. 2004. MECP2 is progressively expressed in post-migratory neurons and is involved in neuronal maturation rather than cell fate decisions. Mol. Cell. Neurosci. 27: 306-321. Kishino T, Lalande M., and WagstaffJ. 1997. UBE3A/E6-AP mutations cause Angelman syndrome. Nat. Genet. 15: 70-73. Kondo T., Bobek M.P., Kuick R., Lamb B., Zhu X., Narayan A., Bourc'his D., Viegas-Pequignot E., Ehrlich M., and Hanash S.M. 2000. Whole-genome methylation scan in ICF syndrome: Hypomethylation of non-satellite DNA repeats D4Z4 and NBL2. Hum. Mol. Genet. 9: 597-604.
454
C HAP T E R 2 3
Korenke G.e., Fuchs S., Krasemann E., Doerr H.G., Wilichowski E., Hunneman D.H., and Hanefeld E 1996. Cerebral adrenoleukodystrophy (ALD) in only one of monozygotic twins with an identical ALD genotype. Ann. Neurol. 40: 254-257. Kwok R.P., Lundblad J.R., Chrivia J.e., Richards J.P., Bachinger H.P., Brennan R.G., Roberts S.G., Green M.R., and Goodman R.H. 1994. Nuclear protein CBP is a coactivator for the transcription factor CREB. Nature 370: 223-226. Laggerbauer B., Ostareck D., Keidel E.M., Ostareck-Lederer A., and Fischer U. 2001. Evidence that fragile X mental retardation protein is a negative regulator of translation. Hum. Mol. Genet. 10: 329-338. Ledbetter D.H., Riccardi VM., Airhart S.D., Strobel R.J., Keenan B.S., and Crawford J.D. 1981. Deletions of chromosome 15 as a cause of the Prader-Willi syndrome. N. Engl. f. Med. 304: 325-329. Lee M.P., DeBaun M.R., Mitsuya K., Galonek H.L., Brandenburg S., Oshimura M., and Feinberg A.P. 1999. Loss of imprinting of a paternally expressed transcript, with antisense orientation to Kv LQTl, occurs frequently in Beckwith-Wiedemann syndrome and is independent of insulin-like growth factor II imprinting. Proc. Natl. Acad. Sci. 96: 5203-5208. Lemmers R.J., de Kievit P., Sandkuijl 1., Padberg G.W., van Ommen G.J., Frants R.R., and van der Maarel S.M. 2002. Facioscapulohumeral muscular dystrophy is uniquely associated with one of the two variants of the 4q subtelomere. Nat. Genet. 32: 235-236. Lewis J.D., Meehan R.R., Henzel W.J., Maurer-Fogy I., Jeppesen P., Klein E, and Bird A. 1992. Purification, sequence, and cellular localization of a novel chromosomal protein that binds to methylated DNA. Cell 69: 905-914. Li Z., Zhang Y., Ku 1., Wilkinson K.D., Warren S.T., and Feng Y. 2001. The fragile X mental retardation protein inhibits translation via interacting with mRNA. Nucleic Acids Res. 29: 2276-2283. Liu D., Diorio J., Tannenbaum B., Caldji e., Francis D., Freedman A., Sharma S., Pearson D., Plotsky P.M., and Meaney M.J. 1997. Maternal care, hippocampal glucocorticoid receptors, and hypothalamic-pituitary-adrenal responses to stress. Science 277: 1659-1662. Lubs H.A. 1969. A marker X chromosome. Am. f. Hum. Genet. 21: 231-244. Ludwig M., Katalinic A., Gross S., Sutcliffe A., Varon R., and Horsthemke B. 2005. Increased prevalence of imprinting defects in patients with Angelman syndrome born to subfertile couples. f. Med. Genet. 42: 289-291. Ma J., Stampfer M.J., Giovannucci E., Artigas e., Hunter D.J., Fuchs e., Willett WC., Selhub J., Hennekens e.H., and Rozen R. 1997. Methylenetetrahydrofolate reductase polymorphism, dietary interactions, and risk of colorectal cancer. Cancer Res. 57: 1098-1102. Magenis R.E., Brown M.G., Lacy D.A., Budden S., and LaFranchi S. 1987. Is Angelman syndrome an alternate result of del(l5)(qllqI3)? Am. J. Med. Genet. 28: 829-838. Martin J. and Bell J. 1943. A pedigree of mental defect showing sexlinkage. Arch. Neurol. Psychiat. 6: 154-157. Martinowich K., Hattori D., Wu H., Fouse S., He E, Hu Y., Fan G., and Sun Y.E. 2003. DNA methylation-related chromatin remodeling in activity-dependent Bdnf gene regulation. Science 302: 890-893. Mathews K.D. 2003. Muscular dystrophy overview: Genetics and diagnosis. Neurol. Clin. 21: 795-816. Matsuura T., Sutcliffe J.S., Fang E, Galjaard R.J., Jiang Y.H., Benton e.S., Rommens J.M., and Beaudet A.L. 1997. De novo truncating mutations in E6-AP ubiquitin-protein ligase gene (UBE3A) in Angelman syndrome. Nat. Genet. 15: 74-77. Mayr B. and Montminy M. 2001. Transcriptional regulation by the phosphorylation-dependent factor CREB. Nat. Rev. Mol. Cell. BioI. 2: 599-609.
Mays-Hoopes 1.1. 1989. Age-related changes in DNA methylation: Do they represent continued developmental changes? Int. Rev. Cytol. 114: 181-220. McDowell T.L., Gibbons R.J., Sutherland H., O'Rourke D.M., Bickmore WA., Pombo A., Turley H., Gatter K., Picketts D.]., Buckle VJ., et al. 1999. Localization of a putative transcriptional regulator (ATRX) at pericentromeric heterochromatin and the short arms of acrocentric chromosomes. Proc. Natl. Acad. Sci. 96: 13983-13988. Meguro M., Mitsuya K., Nomura N., Kohda M., Kashiwagi A., Nishigaki R., Yoshioka H., Nakao M., Oishi M., and Oshimura M. 2001. Largescale evaluation of imprinting status in the Prader-Willi syndrome region: An imprinted direct repeat cluster resembling small nucleolar RNA genes. Hum. Mol. Genet. 10: 383-394. Meins M., Lehmann J., Gerresheim E, Herchenbach J., Hagedorn M., Hameister K., and Epplen J.T. 2005. Submicroscopic duplication in Xq28 causes increased expression of the MECP2 gene in a boy with severe mental retardation and features of Rett syndrome. J. Med. Genet. 42: e12. Meloni I., Bruttini M., Longo I., Mari E, Rizzolio E, D'Adamo E, Denvriendt K., Fryns J.P., Toniolo D., and Renieri A. 2000. A mutation in the Rett syndrome gene,MECP2, causes X-linked mental retardation and progressive spasticity in males. Am. f. Hum. Genet. 67: 982-985. Mullaney B.e., Johnston M.Y., and Blue M.E. 2004. Developmental expression of methyl-CpG binding protein 2 is dynamically regulated in the rodent brain. Neuroscience 123: 939-949. Murata T., Kurokawa R., Krones A., Tatsumi K., Ishii M., Taki T., Masuno M., Ohashi H., Yanagisawa M., Rosenfeld M.G., et al. 2001. Defect of histone acetyltransferase activity of the nuclear transcriptional coactivator CBP in Rubinstein-Taybi syndrome. Hum. Mol. Genet. 10: 1071-1076. Nan X., Campoy EJ., and Bird A. 1997. MeCP2 is a transcriptional repressor with abundant binding sites in genomic chromatin. Cell 88: 471--481. Neul J.L. and Zoghbi H.Y. 2004. Rett syndrome: A prototypical neurodevelopmental disorder. Neuroscientist 10: 118-128. Nicholls R.D., Knoll J.H.M., Butler M.G., Karam S., and Lalande M.1989. Genetic imprinting suggested by maternal heterodisomy in nondeletion Prader-Willi syndrome. Nature 342: 281-285. Nuber U.A., Kriaucionis S., Roloff T.e., Guy J., Selfridge J., Steinhoff e., Schulz R., Lipkowitz B., Ropers H.H., Holmes M.e., and Bird A. 2005. Up-regulation of glucocorticoid-regulated genes in a mouse model of Rett syndrome. Hum. Mol. Genet. 14: 2247-2256. Ogryzko VV, Schiltz R.L., Russanova V., Howard B.H., and Nakatani Y. 1996. The transcriptional coactivators p300 and CBP are histone acetyltransferases. Cell 87: 953-959. Ohta T., Gray T.A., Rogan P.K., Buiting K., Gabriel J.M., Saitoh S., Muralidhar B., Bilienska B., Krajewska-Walasek M., Driscoll D.J., et al. 1999. Imprinting-mutation mechanisms in Prader-Willi syndrome. Am. f. Hum. Genet. 64: 397--413. Okano M., Bell D.W, Haber D.A., and Li E. 1999. DNA methyltransferases Dnmt3a and Dnmt3b are essential for de novo methylation and mammalian development. Cell 99: 247-257. Orstavik K.H., EikIid K., van der Hagen e.B., Spetalen S., Kierulf K., Skjeldal 0., and Buiting K. 2003. Another case of imprinting defect in a girl with Angelman syndrome who was conceived by intracytoplasmic sem*n injection. Am. J. Hum. Genet. 72: 218-219. Petrij E, Giles R.H., Dauwerse H.G., Saris J.J., Hennekam R.e., Masuno M., Tommerup N., van Ommen G.J., Goodman R.H., Peters D.J., et al. 1995. Rubinstein-Taybi syndrome caused by mutations in the transcriptional co-activator CBP. Nature 376: 348-351. Petronis A. 2004. The origin of schizophrenia: Genetic thesis, epigenetic antithesis, and resolving synthesis. BioI. Psychiatry 55: 965-970.
E PIG ENE TIC SAN 0
Picketts D.t, Higgs D.R., Bachoo S., Blake D.J., Quarrell O.W, and Gibbons R.J. 1996. ATRX encodes a novel member of the SNF2 family of proteins: Mutations point to a common mechanism underlying the ATR-X syndrome. Hum. Mol. Genet. 5: 1899-1907. Pieretti M., Zhang E, Fu Y-H., Warren ST, Oostra B.A., Caskey C.T, and Nelson D.L. 1991. Absence of expression of the FMR-I gene in fragile X syndrome. Cell 66: 817-822. Ping A.J., Reeve A.E., Law D.J., Young M.R., Boehnke M., and Feinberg A.P. 1989. Genetic linkage of Beckwith-Wiedemann syndrome to 11p15. Am. J. Hum. Genet. 44: 720-773. Prawitt D., Enklaar T, Gartner-Rupprecht B., Spangenberg c., Oswald M., Lausch E., Schmidtke P., Reutzel D., Fees S., Lucito R., et al. 2005. Microdeletion of target sites for insulator protein CTCF in a chromosome 11p15 imprinting center in Beckwith-Wiedemann syndrome and Wilms' tumor. Proc. Natl. Acad. Sci. 102: 4085--4090. Rampersaud G.c., Kauwell G.P., Hutson A.D., Cerda J.J., and Bailey L.B. 2000. Genomic DNA methylation decreases in response to moderate folate depletion in elderly women. Am. f. Clin. Nutr. 72: 998-1003. Reik W. 1989. Genomic imprinting and genetic disorders in man. Trends Genet. 5: 331-336. Reynolds E.H., Carney M.W, and Toone B.K. 1984. Methylation and mood. Lancet 2: 196-198. Richards B.W., Sylvester EE., and Brooker C. 1981. Fragile X-linked mental retardation: The Martin-Bell syndrome. f. Ment. Defic. Res. 25: 253-256. Roelfsema J.H., White S.J., Ariyurek Y, Bartholdi D., Niedrist D., Papadia E, Bacino C.A., den Dunnen J.T, van Ommen G.J., Breuning M.H., et al. 2005. Genetic heterogeneity in Rubinstein-Taybi syndrome: Mutations in both the CBP and EP300 genes cause disease. Am. f. Hum. Genet. 76: 572-580. Rougeulle c., Cardoso C., Fontes M., Colleaux L., and Lalande M. 1998. An imprinted antisense RNA overlaps UBE3A and a second maternally expressed transcript. Nat. Gen~t. 19: 15-16. Rozen R. 1996. Molecular genetics of methylenetetrahydrofolate reductase deficiency. J. Inherit. Metab. Dis. 19: 589-594. Runte M., Varon R., Horn D., Horsthemke B., and Buiting K. 2005. Exclusion of the C/D box snoRNA gene cluster HBII-52 from a major role in Prader-Willi syndrome. Hum. Genet. 116: 228-230. Schimke R.N., Horton WA., and King C.R. 1971. Chondroitin-6-sulphaturia, defective cellular immunity, and nephrotic syndrome. Lancet 2: 1088-1089. Schule B., Albalwi M., Northrop E., Francis D.!., Rowell M., Slater H.R, Gardner R,J., and Francke U. 2005. Molecular breakpoint cloning and gene expression studies of a novel translocation t(4;15)(q27;q11.2) associated with Prader-Willi syndrome. BMC Med. Genet. 6: 18. Schwahn B. and Rozen R. 2001. Polymorphisms in the methylenetetrahydrofolate reductase gene: Clinical consequences. Am. f. Pharmacogenomics 1: 189-201. Shahbazian M.D., Antalffy B., Armstrong D.L., and Zoghbi H.Y 2002a. Insight into Rett syndrome: MeCP2 levels display tissue- and cellspecific differences and correlate with neuronal maturation. Hum. Mol. Genet. 11: 115-124. Shahbazian M., Young J., Yuva-Paylor L., Spencer c., Antalffy B., Noebels J., Armstrong D., Paylor R., and Zoghbi H. 2002b. Mice with truncated MeCP2 recapitulate many Rett syndrome features and display hyperacetylation of histone H3. Neuron 35: 243-254. Smeets D.E, Moog u., Weemaes C.M., Vaes-Peeters G., Merkx G.P., Niehof J.E, and Hamers G. 1994. ICF syndrome: A new case and review of the literature. Hum. Genet. 94: 240-246. Smeets D.P., Hamel B.C., Nelen M.R., Smeets H.J., Bollen J,H., Smits
HUM AND I SEA S E
455
A.P., Ropers H.H., and van Oost B.A. 1992. Prader-Willi syndrome and Angelman syndrome in cousins from a family with a translocation between chromosomes 6 and 15. N. Engl. f. Med. 326: 807-811. Smilinich N.J., Day C.D., Fitzpatrick G.v., Caldwell G.M., Lossie A.C., Cooper P.R, Smallwood A.C., Joyce J.A., Schofield EN., Reik W, et al. 1999. A maternally methylated CpG island in KvLQTl is associated with an antisense paternal transcript and loss of imprinting in Beckwith-Wiedemann syndrome. Proc. Natl. Acad. Sci. 96: 8064-8069. Spence J.E., Perciaccante RG., Greig G.M., Willard H.E, Ledbetter D.H., Hejtmancik J.E, Pollack M.S., O'Brien WE., and Beaudet A.L. 1988. Uniparental disomy as a mechanism for human genetic disease. Am. f. Hum. Genet. 42: 217-226. Spranger J., Hinkel G.K., Stoss H., Thoenes W., Wargowski D., and Zepp E 1991. Schimke immuno-osseous dysplasia: A newly recognized multisystem disease. f. Pediatr. 119: 64-72. Sun EL., Dean WL., Kelsey G., Allen N.D., and Reik W. 1997. Transactivation of Igf2 in a mouse model of Beckwith-Wiedemann syndrome. Nature 389: 809-815. Suter C.M., Martin D.!., and Ward R.L. 2004. Germline epimutation of MLH1 in individuals with multiple cancers. Nat. Genet. 36: 497-501. Sutherland G.R. 1977. Fragile sites on hwnan chromosomes: Demonstration of their dependence on the type of tissue culture medium. Science 197: 265-266. Taha D., Boerkoel C.E, Balfe J.W, Khalifah M., Sloan E.A., Barbar M., Haider A., and Kanaan H. 2004. Fatallymphoproliferative disorder in a child with Schimke immuno-osseous dysplasia. Am. f. Med. Genet. A 131: 194-199. Tommerup N., van der Hagen C.B., and Heiberg A. 1992. Tentative assignment of a locus for Rubinstein-Taybi syndrome to 16p 13.3 by a de novo reciprocal translocation, t(7;16)(q34;p13.3). Am. f. Med. Genet. 44: 237-241. Tsai TE, Jiang YH., Bressler J., Armstrong D., and Beaudet A.L. 1999. Paternal deletion from Snrpn to Ube3a in the mouse causes hypotonia, growth retardation and partial lethality and provides evidence for a gene contributing to Prader-Willi syndrome. Hum. Mol. . Genet. 8: 1357-1364. Tuck-Muller C.M., Narayan A., Tsien E, Smeets D.E, Sawyer J., Fiala E.S., Sohn O.S., and Ehrlich M. 2000. DNA hypomethylation and unusual chromosome instability in cell lines from ICF syndrome patients. Cytogenet. Cell Genet. 89: 121-128. Tupler R. and Gabellini D. 2004. Molecular basis of facioscapulohumeral muscular dystrophy. Cell. Mol. Life Sci. 61: 557-566. van Deutekom J.c., Wijmenga c., van Tienhoven E.A., Gruter A.M., Hewitt J.E., Padberg G.W., van Ommen G.J., Hofker M.H., and Frants R.R 1993. FSHD associated DNA rearrangements are due to deletions of integral copies of a 3.2 kb tandemly repeated unit. Hum. Mol. Genet. 2: 2037-2042. Van Esch H., Bauters M., Ignatius J., Jansen M., Raynaud M., Hollanders K., Lugtenberg D., Bienvenu T, Jensen L.R., Gecz J., et al. 2005. Duplication of the MECP2 region is a frequent cause of severe mental retardation and progressive neurological symptoms in males. Am. J. Hum. Genet. 77: 442--453. Verkerk A.J.M.H., Pieretti M., Sutcliffe J.S., Fu Y.-H., Kuhl D.P.A., Pizutti A., Reiner 0., Richards S., Victoria M.E, Zhang R., et al. 1991. Identification of a gene (FMR-l) containing a CGG repeat coincident with a breakpoint cluster region exhibiting length variation in fragile X syndrome. Cell 65: 905-914. Villard L., Gecz J., Mattei J.E, Fontes M., Saugier-Veber E, Munnich A., and Lyonnet S. 1996. XNP mutation in a large family with JubergMarsidi syndrome. Nat. Genet. 12: 359-360.
456 • C HAP T E R 2 3
Wan M., Zhao K., Lee S.S., and Francke U. 2001. MECP2 truncating mutations cause histone H4 hyperacetylation in Rett syndrome. Hum. Mol. Genet. 10: 1085-1092. Wan M., Lee S.S., Zhang X., Houwink-Manville I., Song H.R., Arnir R.E., Budden S., Naidu S., Pereira J.L., Lo I.E, et al. 1999. Rett syndrome and beyond: Recurrent spontaneous and familial MECP2 mutations at CpG hotspots. Am. J. Hum. Genet. 65: 1520-1529. Warren S.T. and Sherman S.L. 2001. The fragile X syndrome. In The metabolic and molecular bases of inherited disease, 8th edition (ed. CR. Scriver et al.), vol. 1, pp. 1257-1289. McGraw-Hill, New York. Waterland R.A. and Jirtle R.L. 2003. Transposable elements: Targets for early nutritional effects on epigenetic gene regulation. Mol. Cel/. BioI. 23: 5293-5300. Weatherall D.l., Clegg M.B., Higgs D.R., and Wood w.G. 2001. The hemoglobinopathies. In The metabolic & molecular bases of inherited disease, 8th edition (ed. CR. Scriver et aI.), pp. 4571--4636. McGraw-Hill, New York. Weaver I.C, Cervoni N., Champagne EA., D'Alessio A.C, Sharma S., Seck! J.R., Dymov S., Szyf M., and Meaney M.J. 2004. Epigenetic programming by maternal behavior. Nat. Neurasci. 7: 847-854. Weiler I.J. and Greenough W.T. 1999. Synaptic synthesis of the Fragile X protein: Possible involvement in synapse maturation and elimination. Am. f. Med. Genet. 83: 248-252. Weksberg R., Smith A.C, Squire J., and Sadowski P. 2003. BeckwithWiedemann syndrome demonstrates a role for epigenetic control of normal development. Hum. Mol. Genet. 12: R61-R68. Weksberg R., Nishikawa J., Caluseriu 0., Fei YL., Shuman C, Wei C, Steele L., Cameron J., Smith A., Arnbus I., et al. 2001. Tumor development in the Beckwith-Wiedemann syndrome is associated with a variety of constitutional molecular llp15 alterations including imprinting defects of KCNQIOTl. Hum. Mol. Genet. 10: 2989-3000. Weksberg R., Teshima I., Williams B.R., Greenberg CR., Pueschel S.M., Chemos J.E., Fowlow S.B., Hoyme E., Anderson I.J., Whiteman D.A., et aI. 1993. Molecular characte;ization of cytogenetic alter-
ations associated with the Beckwith-Wiedemann syndrome (BWS) phenotype refines the localization and suggests the gene for BWS is imprinted. Hum. Mol. Genet. 2: 549-556. Wijmenga C, Hewitt J.E., Sandkuijl L.A., Clark L.N., Wright T.J., Dauwerse H.G., Gruter A.M., Hofker M.H., Moerer P., Williamson R., et al. 1992. Chromosome 4q DNA rearrangements associated with facioscapulohumeral muscular dystrophy. Nat. Genet. 2: 26-30. Willemsen R., Hoogeveen-Westerveld M., Reis S., Holstege J., Severijnen L.A., Nieuwenhuizen I.M., Schrier M., Van Unen L., Tassone E, Hoogeveen A.T., et aI. 2003. The FMRI CGG repeat mouse displays ubiquitin-positive intranuclear neuronal inclusions; implications for the cerebellar tremor/ataxia syndrome. Hum. Mol. Genet. 12: 949-959. Wolff G.L., Kodell R.L., Moore S.R., and Cooney CA. 1998. Maternal epigenetics and methyl supplements affect agouti gene expression in AVY/a mice. Faseb f. 12: 949-957. Xu G.L., Bestor T.H., Bourc'his D., Hsieh CL., Tommerup N., Bugge M., Hulten M., Qu X., Russo J.J., and Viegas-Pequignot E. 1999. Chromosome instability and immunodeficiency syndrome caused by mutations in a DNA methyltransferase gene. Nature 402: 187-191. Yntema H.G., Poppelaars EA., Derksen E., Oudakker A.R., van Roosmalen T., Jacobs A., Obbema H., Brunner H.G., Hamel B.C, and van Bokhoven H. 2002. Expanding phenotype of XNP mutations: Mild to moderate mental retardation. Am. f. Med. Genet. 110: 243-247. Young J.I., Hong E.P., Castle J., Crespo-Barreto J., Bowman A.B., Rose M.E, Kang D., Richman R., Johnson J.M., Berget S., and Zoghbi H.Y. 2005. Inaugural article: Regulation of RNA splicing by the methylation-dependent transcriptional repressor methyl-CpG binding protein 2. Proc. Natl. Acad. Sci. 102: 17551-17558. Zeev B.B., Yaron Y, Schanen N.C, Wolf H., Brandt ., Ginot N., Shomrat R., and Orr-Urtreger A. 2002. Rett syndrome: Clinical manifestations in males with MECP2 mutations. J. Child Neural. 17: 20-24.
c
H
A
p
T
E
R
24
Epigenetic Determinants of Cancer Stephen B. Baylin 1 and Peter A. Jones 2 'Cancer Biology Program, The Sidney Kimmel Cancer Center, Johns Hopkins Medical Institutions, Baltimore, Maryland 21231 2Department of Urology, Biochemistry and Molecular Biology, USC/Norris Comprehensive Cancer Center, Keck School of Medicine, University of Southern California, Los Angeles, California 90089-9181
CONTENTS 1. The Biological Basis of Cancer, 459
2. The Importance of Chromatin to Cancer, 459 3. The Role of DNA Methylation in Cancer, 461 4. Hypermethylated Gene Promoters in Cancer, 462
6. The Molecular Anatomy of Epigenetically Silenced Cancer Genes, 467
7. Summary of Major Research Issues for Understanding Epigenetic Gene Silencing in Cancer, 470
8. Detection of Cancer by DNA Methylation, 471
4.1 The Genes Involved, 462 4.2 Searching for New Genes Epigenetically Silenced in Cancer, 463
9. Epigenetic Therapy, 471 References, 473
4.3 Determining the Functional Importance of Genes Hypermethylated in Cancer, 464
5. Epigenetic Gene Silencing and Its Role in the Evolution of Cancer-Importance for Early Tumor Progression Stages, 465
457
GENERAL SUMMARY Cancer is caused by the heritable deregulation of genes, which control when cells divide, die, and move from one part of the body to another. During the process of carcinogenesis, genes can become activated in ways that enhance division or prevent cell death, or alternatively, they can become inactivated so that they no longer are available to apply the brakes to these processes. The first class of genes is called "oncogenes" and the second "tumor suppressor genes." It is the interplay between these two gene classes that results in the formation of cancer. Genes can become inactivated by at least three pathways, including (1) a gene can be mutated so that its function becomes disabled; (2) a gene can be completely lost and thus not be available to work appropriately; and (3) a gene, which has not been mutated or lost, can be switched off in a heritable fashion by epigenetic changes. This epigenetic silencing can involve his-
tone modifications, the binding of repressive proteins, and inappropriate methylation of cytosine (C) residues in CpG sequence motifs that reside within control regions which govern gene expression. This chapter focuses on this third pathway. The basic molecular mechanisms responsible for maintaining the silenced state are quite well understood, as outlined in this book. Consequently, we also know that epigenetic silencing has profound implications for cancer prevention, detection, and therapies. We now have drugs available approved by the American FDA which can reverse epigenetic changes and restore gene activity to cancer cells. Additionally, because the changes in DNA methylation can be analyzed with a high degree of sensitivity, many strategies to detect cancer early rely on finding DNA methylation changes. The translational opportunities for epigenetics in human cancer research, detection, prevention, and treatment are therefore quite extraordinary.
E PIG ENE TIC
1 The Biological Basis of Cancer
Cancer is ultimately a disease of gene expression in which the complex networks governing homeostasis in multicellular organisms become deranged, allowing cells to grow without reference to the needs of the organism as a whole. Great advances have been made in the delineation of the subset of cellular control pathways subject to derangement in human cancer (Table 1). That this limited number of cellular control pathways are affected and heritably disabled in almost all cancers is a key concept that has advanced the field (Hanahan and Weinberg 2000). The focus, until the last several years, has largely been on the genetic basis of cancer, particularly on the mutational activation of oncogenes or inactivation of tumor suppressor genes. However, a growing body of data has appeared since the mid-1990s to indicate that heritable changes, regulated by epigenetic alterations, may also be critical for the evolution of all human cancer types (Fig. 1) (Jones and Laird 1999; Jones and Baylin 2002; Herman and Baylin 2003). These data, particularly DNA and chromatin methylation patterns that are fundamentally altered in cancers, have led to new opportunities for the understanding, detection, treatment, and prevention of cancer. Genetic and epigenetic abnormalities can cause heritable disruptions to homeostatic pathways by two different mechanisms. Either the activation of an oncogene can occur, generally through activating point mutations, or tumor suppressor genes can be inactivated (Jones and Laird 1999; Hanahan and Weinberg 2000; Jones and Baylin 2002; Herman and Baylin 2003). For example, mutations in a signaling gene (oncogene) such as RAS, which enhance the activity of the gene product to stimulate growth, are often found in human cancers. These mutations are often dominant and drive the formation of cancers. Genetic mutations and epigenetic silencing of
0 E T E R MIN ANT S 0 F CAN C E R
459
tumor suppressor genes, on the other hand, are often recessive, requiring disruptive events in both allelic copies of a gene for the full expression of the transformed phenotype. The idea that both copies of a tumor suppressor gene had to be incapacitated in a malignant cell line was proposed by Knudson (2001) and has found wide acceptance. It is now realized that three classes of "hits" can participate in different combinations to cause a complete loss of activity of tumor suppressor genes. Direct mutations in the coding sequence, loss of parts or entire copies of genes, or epigenetic silencing can cooperate with each other to result in the disablement of key control genes (Fig. 2). 2 The Importance of Chromatin to Cancer
Despite the major advances in understanding the key molecular lesions in cellular control pathways that contribute to cancer, it remains true that microscopic examination of nuclear structure by a pathologist is a gold standard in cancer diagnosis. The human eye can accurately discern changes in nuclear architecture, which largely involve the state of chromatin configuration, and definitively diagnose the cancer phenotype in a single cell. Foremost in the cues used by pathologists are the size of the nucleus, nuclear outline, a condensed nuclear membrane, prominent nucleoli, dense "hyperchromatic" chromatin, and a high nuclear/cytoplasmic ratio. These structural features, visible under a microscope (Fig. 3), likely correlate with profound alterations in chromatin function and resultant changes in gene expression states and/or chromosome stability. Linking changes observable at a microscopic level with the molecular marks discussed throughout this book remains one of the great challenges in cancer research. In this chapter, we review epigenetic marks, typified by changes in DNA cytosine methylation
Table 1. Key cellular pathways disrupted in human cancers by genetic and'epigenetic mechanisms Pathway
Example of genetic alteration
Example of epigenetic alteration
Self-sufficiency in growth signals
mutations in RAS gene
methylation of RASSFIA gene
Insensitivity to antigrowth signals
mutation in TCf13 receptor genes
down-regulation of TGF~ receptors
Tissue invasion and metastasis
mutation in E-Cadherin gene
methylation of E-Cadherin promoter
Limitless replicative potential
mutations in p 76 and Rb genes
silencing of p 76 or Rb genes by promoter methylation
Sustained angiogenesis
silencing of thrombospondin-l
Evading apoptosis
mutation in p53
methylation of DAP-kinase, ASC/TMS7, and HIC7
DNA repair capacity
mutations in MLH7, MSH2
methylation of CST Pi, 06-MCMT, MLH7
Monitoring genomic stability
mutations in Chfr
methylation of Chfr
Protein ubiquination functions
mutations in Chfr
methylation of Chfr
460 • C HAP T E R 2 4
DNA methylation
Figure 1. Epigenetic Alterations Involving DNA Methylation Can Lead to Cancer by Various Mechanisms
T
hypo
hyper
deamination
T UV
carcinogen
meCpG
9999
~
ene
TpG genome instability
promoter silencing
carcinogen-induced increased UV-induced mutations mutations
mutation
at CpG dinucleotides and histone modifications, which are abnormally distributed in cancer cells. They are increasingly being linked to heritable events that affect the stability and function of the genome and, thus, contribute very significantly to the cancer phenotype. Several examples of the roles of chromatin-modifying activities in human cancer are known (Wolffe 2001). For example, acute myeloid leukemia (AML) and acute promyelocytic leukemia (PML) are both caused by chromosomal translocations that alter the use of histone deacetylases (HDACs). In PML, the PML gene is fused to the retinoic acid receptor (RAR). This receptor recruits HDAC activity and DNA methylation, and causes a state
Loss of DNA cytosine methylation '(hypo) results in genome instability. Focal hypermethylation in gene promoters (hyper) causes heritable silencing and therefore inactivation of tumor suppressor genes. Additionally, methylated CpG sites are hotspots for C---7T transition mutations caused by spontaneous hydrolytic deamination. Methylation of CpG sites also increases the binding of some chemical carcinogens to DNA and increases the rate of UVinduced mutations.
of transcriptional silencing, as shown with experimental promoter constructs. The data suggest that this targeting of chromatin change can potentially lead to tumor suppressor gene silencing, which participates in a cellular differentiation block (Di Croce et al. 2002). In AML, the DNA-binding domain of the transcription factor AML-1 is fused to a protein called ETO, which interacts with a HDAC. Repression of cellular differentiation by the mistargeted HDAC contributes to aberrant gene repression and, ultimately, leukemia (Amann et al. 2001). These are just two examples of the direct involvement of chromatin modifications in the oncogenic phenotype. It has, however, become clear that chromatin modifications can
e
e mut;/tion
e
mut
e
~ethYlation
FIRST HIT
...l9:.Je ...
I.
e
LO~ ~ethYlation SECOND
L~H ~ethYlation SECOND HIT
HIT
e
mut
I.
U 'it;
mut
I.
mutation
mutation
&
&
LOH
methylation
_
ge methylation & LOH
'ie ge biallelic methylation
Figure 2. How DNA Methylation Can Contribute to the Inactivation of Tumor Suppressor Genes Two active alleles of a tumor suppressor gene are shown as the two blue boxes at the top. The first step of gene inactivation is shown as a localized mutation (left) or gene silencing by DNA methylation (right). The second hit is shown as either a loss of heterozygosity (LOH) or transcriptional silencing by additional epigenetic events. In this way, DNA methylation can contribute as one of the pathways to satisfy Knudson's hypothesis.
E P f G ENE T feD E T E R MIN ANT 5
Normal Skin
Squamous Cell Carcinoma
Figure 3. Chromatin Structural Changes in Cancer Cells These two photomicrographs were taken from a patient with a squamous cell carcinoma of the skin. The left panel shows normal epidermal cells within one millimeter of the contiguous tumor, shown at the same magnification on the right. The chromatin, which stains purple due to its affinity to hematoxylin, appears much more coarse and granular in the cancer cells than in normal epidermis. Such changes in the staining characteristics of chromatin are used by pathologists as diagnostic criteria for cancer.
directly and indirectly alter the patterns of cytosine methylation, an epigenetic change of the DNA which can either initiate or "lock in" silencing of key genes leading to heritable perturbations in key cellular pathways. 3 The Role of DNA Methylation in Cancer
The initial discovery that DNA contained 5-methylcytosine, in addition to the four bases directly incorporated into DNA, soon led to the proposal that alterations in DNA methylation may contribute to oncogenesis (Table 2). Over the last 40 years, there have been many studies
0 F CAN C E R
461
that have shown alterations in the patterns of distribution of 5-methylcytosine between cancer and normal cells in human DNA. Among these, there are at least three major routes by which CpG methylation can contribute to the oncogenic phenotype. These include hypomethylation of the cancer genome, focal hypermethylation of the promoters of tumor suppressor genes, and direct mutagenesis (Fig. 1) (Jones and Laird 1999; Jones and Baylin 2002; Herman and Baylin 2003). Although each of these alterations individually could contribute to cancer causation in humans, it is, perhaps, most significant that all three occur simultaneously, thus indicating that alterations in the homeostasis of epigenetic mechanisms are central contributors to human cancer. The most prominent, and the earliest recognized, change in DNA methylation patterns in cancer cells is an overall decrease in this modification, which could contribute to genomic instability (for further discussion, see Chapter 18). This is well known to be a hallmark of human cancer (Feinberg and Vogelstein 1983; Feinberg et al. 1988; Jones and Laird 1999; Jones and Baylin 2002; Herman and Baylin 2003). More recently, a mounting body of data has illustrated that the abnormal methylation of CpG islands in the 5' regions of cancer-related genes is integral to their transcriptional. silencing, providing an alternative mechanism to mutation for the inactivation of genes with tumor suppressor function (Jones and Laird 1999; Jones and Baylin 2002; Herman and Baylin 2003). Finally, in addition to the above roles of cytosine methyla-
Table 2. Time line for elucidating the role of DNA methylation in cancer Observation
Reference
Hypothesis of "methylases as oncogenic agents"
Srinivasan and Borek (1964)
Decreased levels of 5-methylcytosine in animal tumors
Lapeyre and Becker (1 979)
5-Azacytidine and 5-aza-2'-deoxycytidine inhibit methylation and activate genes
Jones and Taylor (1980)
Decreased genomic and gene-specific methylation in human tumors
Ehrlich et al. (1982); Feinberg and Vogelstein (1983); Flatau et al. (1983)
Inhibitors of DNA methylation alter tumorigenic phenotype
Frost et al. (1984)
Methylation of a CpC island in cancer
Baylin et al. (1987)
Hot spots for p53 mutations are methylated CpC sites
Rideout et al. (1 990)
Allele-specific methylation of the retinoblastoma tumor suppressor gene
Sakai et al. (1991)
Loss of imprinting in cancer
Rainier et al. (1993)
Hypermethylation of CpC islands is associated with aging
Issa et al. (1994)
Mice with decreased methylation develop fewer tumors
Laird et al. (1995)
Coupling DNA methylation and HDAC inhibitors leads to rapid isolation of tumor suppressor genes
Suzuki et al. (2002); Yamash*ta et al. (2002)
DNA repair gene (MLH7) is methylated in somatic cells
Cazzoli et al. (2002)
5-Azacytidine is FDA-approved for treatment of myelodysplastic syndrome
Kaminskas et al. (2005)
462
C HAP T E R 2 4
tion in genomic instability and gene silencing, 5-methylcytosine is itself a highly unstable base, and hence mutagenic. This can contribute directly to cancer by causing transition mutations in which meCpG is converted to TpG (Rideout et al. 1990). The fact that these modifications are so prevalent in cancers and are now known to contribute directly to carcinogenesis has also led to new possibilities in which epigenetic changes are targeted for therapeutic reversal (Egger et al. 2004). DNA cytosine methylation is therefore now acknowledged to play a critical role in human carcinogenesis. Almost all human genes contain methylated cytosine residues in their coding regions, known for some time to contribute disproportionately to the formation of disease-causing mutations. The methylation of the carbon 5 of the cytosine ring increases the rate of hydrolytic deamination of the base in double-stranded DNA. However, the deamination product of 5-methyleytosine is thymine rather than uracil (see Fig. 12 in Chapter 3). DNA repair mechanisms are subsequently less efficient at repairing deamination-induced mismatches in DNA. Methylated CpG sites are known to contribute to more than 1/3 of all transition mutations in the human germ line (Rideout et al. 1990). This is also true for cancer-causing genes such as p53 (Rideout et al. 1990). More surprising is the observation that this mechanism also contributes significantly to the formation of inactivating mutations in tumor suppressor genes in somatic tissues. For example, more than 50% of all of the p53 mutations which are acquired in sporadic colorectal cancers occur at sites of cytosine methylation (Greenblatt et al. 1994). Thus, the modification of DNA by the DNA methyltransferases (DNMTs) substantially increases the risk of getting cancer by this endogenous mechanism. Methylation of cytosine residues has also been shown to favor the formation of carcinogenic adducts between DNA and carcinogens such as benzo(a)pyrene in cigarette smoke. In this case, methylation of the cytosine residue increases the formation of carcinogenic adducts between an adjacent guanine residue and benzo(a)pyrene diol epoxide, resulting in increased mutations at CpG sites in the lungs of cigarette smokers (Greenblatt et al. 1994; Pfeifer et al. 2000). Interestingly, methylation can also alter the rate of mutations in the p53 gene in sunlight-exposed skin (Greenblatt et al. 1994; Pfeifer et al. 2000). This is because the methyl group changes the absorption spectrum for cytosine into the range of incident sunlight, thereby increasing the formation of pyrimidine dimers in the DNA of skin cells. Thus, the epigenetic modification of DNA not only
increases spontaneous mutagenesis, but also can influence the way DNA interacts with carcinogens and ultraviolet light (Pfeifer et al. 2000). Hypomethylation of DNA, which has long been known to occur in animal and human tumors (Table 2), affects chromosomal stability and increases aneuploidy. Genomic instability is a hallmark of cancer, and the increased chromosomal fragility caused by hypomethylation of satellite and other sequences could conceivably contribute to cancer formation by decreasing the stability of the genome (Narayan et al. 1998; Gaudet et al. 2003). The exact mechanisms by which this instability is mediated are not yet fully understood but could easily be the result of altered DN4-protein interactions caused by hypomethylation. 4 Hypermethylated Gene Promoters in Cancer 4.1 The Genes Involved
The best-understood mechanism by which DNA methylation contributes to cancer is through the focal hypermethylation of promoters of tumor suppressOl' genes. Exact mechanisms by which this hypermethylation occurs are discussed in detail below. However, this dearly is a significant pathway resulting in the heritable silencing of genes that suppress cancer development (Jones and Laird 1999; Jones and Baylin 2002; Herman and Baylin 2003). Usually, DNA hypermethylation occurs at CpG-rich regions, or CpG islands, which are located in and around the transcriptional start site of abnormally silenced genes in can-, cer. It is important to recognize that cytosine methylation in CpG islands in the vicinity of the gene start-site position is most critical since this same DNA modification occurring within bodies of genes generally bears no correlation to transcription status (Jones 1999). The list of cancer-related genes affected by the above transcription disruption is growing steadily. As previously reviewed, this involves genes in all chromosome locations (Jones and Baylin 2002). Indeed, this epigenetic change may now outnumber those genes which are frequently mutated in human tumors. As mentioned earlier (Table 1), loss of tumor suppressor gene function through CpG methylation, which causes gene silencing, affects virtually every pathway known. To understand the significance of the genes for the process of tumorigenesis, and the challenges for the future in this field, the genes may, perhaps, be divided into three groups. The first group of genes comprises those which were instrumental in defining promoter hypermethylation and gene silencing as an important mechanism for loss of
E PIG ENE TIC
tumor suppressor gene function in cancer (Table 3). These were already recognized as classic tumor suppressor genes which, when mutated in the germ line of families, cause inherited forms of cancer (Jones and Laird 1999; Jones and Baylin 2002; Herman and Baylin 2003). They are also often mutated in sporadic forms of cancers but can frequently be hypermethylated on one or both alleles in such tumors (Jones and Laird 1999; Jones and Baylin 2002; Herman and Baylin 2003). In addition, for these genes, promoter hypermethylation can sometimes constitute the "second hit" in Knudson's hypothesis by being associated with loss of function of the second copy of the gene in familial tumors where the first hit is a germ-line mutation (Grady et al. 2000; Esteller et al. 2001b). In some instances, 5-azacytidine-induced reactivation of these genes in cultured tumor cells has been shown to restore the key tumor suppressor gene function lost during tumor progression. An example of such is mismatch repair function in colon cancer cells where the MLHl gene is silenced (Herman et al. 1998). The second group of epigenetically silenced genes are those previously identified as candidate tumor suppressor genes by virtue of their function, but they have not been found to have an appreciable frequency of mutational inactivation. These genes may be those emerging as candidate suppressors because they reside in chromosome positions that frequently suffer deletions in cancers (Table 3). Examples include RASFFIA and FHIT on chromosome 3p in lung and other types of tumors (Dammann et al. 2000; Burbee et al. 2001). Others are those known to encode proteins which subserve functions critical to prevention of tumor progression, such as the pro-apoptotic gene, DAP-kinase (Katzenellenbogen et al. 1999). These genes present an important challenge for the field of cancer epigenetics in that despite their having been identified as having frequent promoter hypermethylation in tumors, it must be proven, since many of these genes are not frequently, or at all, mutated, how the genes actually
Table 3. Discovery classes of hypermethylated genes 1. Classic tumor suppressor genes known to be mutated in the germ line of families with hereditary cancer syndromes: Some examples = VHL, E-cadherin, p76lnk4a, MLH7, APC, Stk4, Rb 2. Candidate tumor suppressor genes: Some examples = FHIT, RASSF7A, 06-MGMT, Gst-Pi, GATAs 4 and 5, DAP-kinase
3. Genes discovered through random screens for hypermethylated genes: Some examples = HIC-7, SFRPs 7,2,4,5, BMP-3, SLC5A8, 5517
0 E T E R MIN ANT S
a
F CAN C E R
•
463
contribute to tumorigenesis. We return to this issue, and the steps being taken to address it, in a later section. The third group of genes is being identified through strategies employed to randomly identify aberrantly silenced genes associated with promoter hypermethylation (Suzuki et al. 2002; Yamash*ta et al. 2002; Ushijima 2005). As compared to those genes in the second group, it is a challenge to place these genes into a functional context for cancer progression because their functions may be totally unknown.
4.2 Searching for New Genes Epigenetically Silenced in Cancer
The most commonly used approach for identifying new genes that are epigenetically silenced in cancer is to consider any potential tumor suppressor gene a candidate if mutations are not found, or if gene expression is low or absent in tumors of interest. Another approach being utilized is to employ techniques that randomly screen cancer genomes for hypermethylated genes (Toyota et al. 1999; Suzuki et al. 2002; Yamash*ta et al. 2002; Ushijima 2005). This avenue presents great opportunities for enriching our knowledge of cancer biology but also generates many challenges. As recently reviewed (Ushijima 2005), each approach has strengths and weaknesses. Several approaches rely on an initial step in which DNA is digested with one or more restriction enzymes that differentially cleave CpG sites according to whether they are methylated. To identify potentially hypermethylated genes, products are then analyzed on two-dimensional gels (Restriction Landmark Genomic Sequencing, RLGS), randomly amplified using arbitrary primers or subtraction techniques to differentiate the methylated sequences between normal and tumor DNA. These analyses have the power to identify large numbers of hypermethylated genes, and each of them has contributed to new knowledge about important candidate tumor suppressor genes. However, the sequences identified mayor may not be associated with CpG islands, which are strategically located to participate in gene silencing. Repetitive sequences, which are often highly methylated, are sometimes included in the final products of the procedures. Consequently, the efficiency of identifying hypermethylated tumor suppressor genes is often not high, and genome-wide coverage may thus be difficult. Other approaches are relying on spotting CpG island sequences contained in the genome onto a microarray (C.M. Chen et al. 2003), often taking into account their relationships to gene start sites, and probing these arrays
464 • C HAP T E R 2 4
with either genomic DNA which has been digested with methylation-sensitive enzymes, or cDNAs to take into account gene expression status. This approach has a powerful potential for identifying hypermethylated tumor suppressors but is limited by the number of candidate CpG islands that can be arrayed. Another approach is to manipulate cultured tumor cells with agents that cause DNA demethylation, such as 5azacytidine or 5-aza-2'-deoxycytidine, and hybridizing RNA from before and after drug treatment to gene microarrays to detect up-regulated genes (Suzuki et al. 2002; Yamash*ta et al. 2002). This approach has the potential to identify all genes hypermethylated in cell cultures of all human cancer types. However, gene changes caused by effects other than demethylation activities of the treatment agents can decrease the efficiency for hypermethylated gene identification. What is less recognized is that the very low expression levels of the genes which are being sought, both before and after drug treatment, severely challenge the sensitivity of most gene microarray platforms and markedly reduce the efficiency of this approach (Suzuki et al. 2002). Use of subtraction techniques after drug treatment to enrich for gene transcripts which are increased can improve the sensitivity of the gene microarray approach (Suzuki et al. 2002) but must be adapted to fit fluorescent probe-labeling procedures for microarrays that readily provide full-genome coverage. Simultaneously employing drugs that alter chromatin changes which collaborate with promoter DNA methylation, such as HDAC inhibitors, can help with gene microarray approaches. This maneuver helps to more specifically identify the genes being sought by taking into account the roles chromatin changes have in silencing hypermethylated genes, as discussed in detail in a later section. This latter approach has recently identified important genes silenced in colon cancer (Suzuki et al. 2002). 4.3 Determining the Functional Importance of Genes Hypermethylated in Cancer
The rapidity with which hypermethylated genes are being discovered in cancer has presented a formidable research challenge. Frequent promoter hypermethylation in a given gene does not in and of itself guarantee functional significance for the attendant gene silencing as would loss of function due to a genetic mutation. This is especially the case when the hypermethylated gene is not a known classic tumor suppressor and when there is no evidence that the gene may also be frequently mutated in cancers. Thus, it is obligatory that the gene in question be
studied in such a way that the significance of loss of function is determined in terms of both the processes controlled by the encoded protein and the implications for tumor progression. There are several stages for such investigations, each of increasing importance for firmly documenting the role in cancer formation, which are outlined in Table 4. First, of course, is the documentation of the hypermethylation and its consequences for the expression state of the gene, including the ability of the gene to undergo reexpression with promoter demethylation. Second, the incidence for hypermethylation and silencing of the gene must be well established in primary as well as cultured tumor samples. Third, as further explained below, it is often essential to know at what point the silencing of the gene occurs in tumor progression (Fig. 4). Fourth, the contribution of loss of function of the gene to tumorigenicity must be· directly assessed. This can begin with routine studies of cultured cells through assessment of gene reinsertion effects on cellular properties such as induction of apoptosis, effects on soft agar cloning, and effects on tumorigenicity of the cells when grown as heterotransplants in athymic mice. Fifth, the function of the encoded protein must be established either through having previous knowledge of the type of protein involved, through recognition of suggested functions by nature of the protein structure, or through studies of the biology of the protein in cell culture models. Ultimately, however, step six must be taken, which may often involve trans-
Table 4. Steps in documenting the importance of a hypermethylated gene for tumorigenesis 1. Document epG island promoter methylation and correlate with transcriptional silencing of the gene and ability to reverse the silencing with demethylating drugs in cell culture. 2. Document correlation of promoter hypermethylation with specificity for this change in tumor cells (cell culture and primary tumors) versus normal cell counterparts and incidence for the hypermethylation change in primary tumors. 3. Document the position of the hypermethylation change for tumor progression of given cancer types. 4. Document the potential significance for the gene silencing in tumorigenesis through gene reinsertion studies in cell culture and effects on soft agar cloning, growth of tumor cells in nude mouse explants, etc. 5. Establish function of the protein encoded by the silenced geneeither through known characteristics or testing for activity of recognized protein motifs in culture systems, etc. 6. Document tumor suppressor activity and functions of the gene for cell renewal, etc., especially for totally unknown genes, through mouse knock-out studies.
E PIG ENE TIC
altered DNA methylation
APe mutation chrom 5 deletion
0 E T E R MIN ANT S
0 F CAN C E R
•
465
~
RAS gene mutation
chrom 18 loss
~
'~-=---,/ stem celis, prevention, risk assessment, and early diagnosis
chrom 17 loss (p53) MSH2 & MLHI mutation
tumor progression, treatmen1 and markers of prognosis
Figure 4. The Early Role for Abnormal DNA Methylation in Tumor Progression This is depicted in the classic model (Kinzler and Vogelstein 1997) for genetic alterations during the evolution of colon cancer. The altered DNA methylation is shown to occur very early (red arrow), as discussed in the text, during conversion of normal to hyperplastic epithelium. This places it in a strategic position (left bottom black arrow and left bottom box) for channeling stem cells into abnormal clonal expansion (see Fig. 5) by cooperating with key genetic alterations. These epigenetic abnormalities also have marker connotations, as shown in the bottom left black box. The abnormal DNA methylation continues to accrue during progression from noninvasive to invasive and ultimately, metastatic tumors (right bottom arrow and right box). This has connotations for cancer treatment and for markers of prognosis.
genic knock-out approaches to establish the role of the gene as a tumor suppressor gene and to understand the functions of the encoded protein in development, adult cell renewal, etc. Mouse knock-out studies have proven extremely rewarding in documenting the function of HIC-l as a tumor suppressor gene after it was identified by screening genomic regions that have undergone loss of heterozygosity (LOH) in cancerous cells (W.Y. Chen et al. 2003, 2004). These challenges, and especially step six, reveal the value of discovering genes epigenetically silenced in cancer, but create a major scope of work to be considered by investigators in the field. 5 Epigenetic Gene Silencing and Its Role in the Evolution of Cancer-Importance for Early Tumor Progression Stages
In the classic view of cancer evolution, as articulated by Vogelstein and colleagues (Kinzler and Vogelstein 1997), a series of genetic changes drives progression from early premalignant stages, through the appearance of invasive cancer, to onset of metastatic disease (Fig. 4). This progression does not necessarily occur in the same exact linear order from tumor to tumor. We know that throughout this course of events, epigenetic changes are occurring as well: There is early appearance of both the widespread loss of normal DNA methylation and more focal gains in gene promoters that we have been dis-
cussing. Thus, there is the potential for interaction of epigenetic and genetic events to drive progressive cellular abnormalities throughout the entire course of neoplastic progression. In this scenario, data for two epigenetic aspects-loss of imprinting (LOI) (as discussed in Chapter 23) and gene silencing-are proving to be extremely important for very early stages of cancer development. LOI involves a process wherein the silenced allele of imprinted genes becomes activated during tumorigenesis such that biallelic expression of the gene, and excess gene product, are established (Rainier et al. 1993). The most studied example is for IGF2 in tumors such as colon cancer (Kaneda and Feinberg 2005). In this case, the promoter hypermethylation event in the imprinted H19 gene on chromosome 11 p is the result of a complicated chromatin control process (see Chapter 19), to abnormally activate the silenced IGF2 allele (Kaneda and Feinberg 2005). The resultant biallelic IGF2 expression leads to excess production of the growth-promoting IGF2 protein. Experimental evidence suggests that this could play a role in very early progression steps of colon cancer (Kaneda and Feinberg 2005; Sakatani et al. 2005). In fact, recent mouse model studies suggest that LOI events alone may be sufficient to initiate the tumorigenesis process (Holm et al. 2005). A second common neoplastic transition, the epigenetic silencing of genes, occurs in early phases of neoplastic development. This relates heavily to the questions
466
•
C HAP T E R 2 4
posed about the roles of cellular stress and exposure in the development of disease states. The genes involved often appear to set the stage for stressed cells to survive DNA damage events and/or chronic injury settings, to clonally expand as stem/progenitor-type cells, and to then be predisposed to later genetic and epigenetic events to drive tumor progression (Fig. 5). The first evidence for such involvement comes from data for several classic tumor suppressor genes that can be either mutated or epigenetically silenced in human cancers. For example, the epigenetic silencing of p16/1k4/1 occurs very early in populations of premalignant cells, during the early changes that precede tumors such as lung cancer (Belinsky et al. 1998) and in small populations of hyperplastic epithelial cells in otherwise normal breast in some women (Holst et al. 2003). In experimental settings where normal human mammary epithelial cells are grown in cell culture (on plastic), this type of p16 silencing is a prerequisite for very early steps toward cell trans-
formation (Kiyono et al. 1998; Romanov et al. 2001). This loss of gene function accompanies a failure of subsets of the mammary cells to reach a mortality checkpoint, and these cells then develop progressive chromosomal abnormalities and telomerase expression as they continue to proliferate. A second example concerns the mismatch repair gene, MLH1. This gene is mutated in the germ line of families in which members are predisposed to a type of colon cancer with multiple genetic alterations and termed the "micro-satellite" instability phenotype (Liu et al. 1995). However, 10-15% of patients with nonfamilial colon cancers also have tumors with this phenotype, and the majority of these cancers harbor epigenetic silencing of a non-mutated MLHl gene (Herman et al. 1998; Veigl et al. 1998). In cell culture, reexpression of this silenced MLHl gene produces reappearance of a functional protein that restores a considerable portion of the damage mismatch repair (Herman et al. 1998).
normal differentiation
•••
stem/progenitor
000 cell compartment
abnormal clonal expansion
tumor progression
Figure 5. Epigenetic Gene Silencing Events and Tumorigenesis The earliest steps in tumorigenesis are depicted as abnormal clonal expansion, which evolves during the stress of cell renewal. This is caused by factors such as aging and chronic injury, from, e.g., inflammation. These cell clones are those at risk of subsequent genetic and epigenetic events that would drive tumor progression. Abnormal epigenetic events, such as the aberrant gene silencing focused upon in this chapter, could be the earliest heritable causes, in many instances, for inducing the abnormal clonal expansion from within stem/progenitor cell compartments in a renewing adult cell system. The gene silencing is triggered by chromatin modifications that repress transcription, and the DNA hypermethylation of this chromatin serves as the tight lock, as discussed in the text, to stabilize the heritable silencing. The gene silencing, in turn, disrupts normal homeostasis, which prevents stem and progenitor cells from moving properly along the differentiation pathway for a given epithelial cell system (top cells with deepening blue colors) and channels them (large red arrows) into the abnormal clonal expansion.
E PIG ENE TIC
Most recently, Chfr, a checkpoint-regulating gene that also controls another type of genomic integrity, chromosomal stability and ploidy, has been shown to be mutated in tumors but is more often silenced epigenetically in lung and other cancers and, importantly, early in progression stages of colon cancer (Mizuno et al. 2002). Mouse knock-out studies reveal a tumor suppressor role for this gene based on its function as an E3 ubiquitin ligase that regulates Aurora A, a control gene for mitosis. Embryonic cells from the mice have chromosomal instability and a predisposition to transformation. As the list of hypermethylated genes in cancer has expanded, key silencing events in early tumor progression are now being defined for candidate tumor suppressor genes that only have a history of epigenetic change and not mutations. For example, the DNA repair gene, 06MGMT, is silenced early in colon cancer progression (Esteller et al. 2001a), and this loss of function can predispose cells to persistence of alkylation damage at guanosines and, thus, G to A point mutations. Indeed, silencing of this gene occurs in premalignant colon polyps, prior to the appearance of a high rate of these mutations in both the p53 and RAS genes in later colon tumor progression phases (Esteller et al. 2001a; Wolf et al. 2001). Similarly, the GST-Pi gene is silenced in virtually all premalignant lesions that are predisposing to prostate cancer, putting cells at risk of oxidative damage at adenines (Lee et al. 1994). The third type of silenced genes-those discovered by approaches to randomly screening cancer genomes for epigenetically silenced genes-is also beginning to contribute significantly to our understanding of the early role of gene silencing in cancer. A particularly intriguing scenario has emerged in the progression of colon cancer: Epigenetic loss of function occurs in a family of genes, discovered through the microarray approach outlined earlier (Suzuki et al. 2002), which may allow early abnormal activation of a developmental pathway that is universally involved with the initiation and progression of this disease. Transcriptional silencing of the secreted frizzled related protein genes (SFRPs) (Suzuki et al. 2004) removes an antagonistic signal for interaction ofWnt ligands with their membrane receptors (Finch et al. 1997). This silencing correlates with Wnt-driven up-regulation of overall cellular levels of ~-catenin, due especially to increased presence and activity of this transcription factor in the nucleus (Suzuki et al. 2004). Such transcription is the canonical readout for increased Wnt pathway activity (Morin et al. 1997; Gregorieff and Clevers 2005). Most important, SFRP silencing occurs in very early lesions
0 E T E R MIN ANT S
0 F CAN C E R
•
467
predisposing to colon cancer, before common mutations in downstream Wnt pathway proteins occur, which also result in activated ~-catenin in the nucleus (Morin et al. 1997; Gregorieff and Clevers 2005). Thus, early activation of the Wnt pathway by epigenetic events appears poised to allow early expansion of cells, predisposed to activate the pathway further through mutational events. Persistence of both the epigenetic, through Wnt-driven increases in cellular ~-catenin, and genetic alterations, through crippling of the protein complex that degrades ~-catenin or activating Wnt mutations then seem to complement one another in driving progression of the disease (Suzuki et al. 2004). Another example of this group of genes involves HIC-l (hypermethylated-in-cancer 1), which encodes a zinc finger transcriptional repressor. HIC-1 was discovered by random screening for hypermethylated CpG islands in a hot spot for chromosomal loss in cancer cells (Wales et al. 1995). This gene, which is silenced early in cancer progression but is not mutated, has proven to be a tumor suppressor in a mouse knock-out model (W.Y. Chen et al. 2003, 2004). It complements p53 mutations, partially through loss of function, which leads to up-regulation of SIRTl (Chen et al. 2005), a key protein for sensing cell stress and contributing to stem/progenitor cell growth (Howitz et al. 2003; Nemoto et al. 2004; Kuzmichev et al. 2005). Thus, the data discussed above contribute to the thematic hypotheses outlined in Figure 4. This suggests that some of the earliest heritable changes in the evolution of tumors may be epigenetic changes, which often involve the tight transcriptional silencing of genes, maintained by promoter DNA methylation. The challenges to understand these scenarios further are integrally linked to key challenges for the study of epigenetic changes in cancer, which are outlined in Table 5 and discussed more fully below. The meeting of such challenges, particularly for understanding the contribution of epigenetic changes in the very earliest steps in neoplastic progression, may strikingly enrich molecular strategies aimed at the prevention of, and early intervention for, cancer. 6 The Molecular Anatomy of Epigenetically Silenced Cancer Genes
Genes that are silenced in neoplastic cells are important for understanding the initiation and maintenance of cancer. They also serve as excellent models for understanding how gene silencing may be initiated and maintained, and how the mammalian genome is packaged to facilitate regions of
468 •
C HAP T E R
2 4
Table 5. Major research challenges for understanding the molecular events mediating epigenetic gene silencing in cancer 1. Elucidate links between simultaneous losses and gains of DNA methylation in the same cancer cells. 2. Determine the molecular nature of boundaries, and how they change during tumorigenesis, that separate areas of transcriptionally active zones encompassing gene promoters from the transcriptionally repressive areas that surround them and which may prevent the repressive chromatin from spreading through the active zone. Among the candidate mechanisms are roles that may be played by key histone modifications, by insulator proteins, by chromatin-remodeling proteins, etc. 3. What is the order of events for the evolution of gene silencing in cancer with respect to histone modifications, DNA methylation, etc.? Which comes first, and what are the key protein complexes that target the processes (DNA-methylating enzymes, histonedeacetylating and methylation enzymes, epG methyl-binding proteins, Polycomb-silencing complexes, etc.) that determine the events? 4. Which specific DNA-methylating enzymes are required for initiating and/or maintaining the most stable gene silencing, and what protein complexes contain them, including their interaction with key components of the histone code? 5. Once established, what are all of the components of chromatin and DNA methylation machinery, and the hierarchy of their involvement, required to maintain the gene silencing, and how are they reversible?
transcription and repression of transcription. In turn, the understanding of chromatin function, which is a major emphasis of many of the chapters in this book, is facilitating our understanding of what may trigger aberrant gene silencing in cancer and how the components of this silencing maintain the attendant transcriptional repression. Work of several laboratories has contributed to the current understanding of the chromatin configuration that surrounds hypermethylated CpG islands in promoters of multiple genes aberrantly silenced in cancer cells. These studies have also highlighted how this chromatin differs from those surrounding the same genes when they are basally expressed. In normal cells, or in cancer cells where the genes are not transcriptionally repressed, these genes are characterized by having a zone of open chromatin wherein the CpG islands are not DNA methylated, the nucleosomes are irregularly spaced such that hypersensitive sites can be detected, and key histone residues are marked by posttranslational modifications typical for active genes. Active covalent histone marks include acetylation of H3 at lysines 9 and 14 (H3K9ac and H3K14ac) and methylation ofH3K4 (Nguyen et al. 2001; Fahrner et al. 2002). At both the 5' and 3' borders of the above open chromatin region, there appears to be a stark transition in
chromatin structure, with characteristics of transcriptionally repressed genomic regions flanking the CpG island (Fig. 6). In these border regions, there is methylation of the less frequent CpG sites, and recruitment of methyl cytosine-binding proteins (MBDs) and their partners (e.g., histone deactylases or HDACs) to the methylated CpGs (Chapter 18). The regions outside the CpG islands thus appear to be accessible to enzymes that catalyze histone methylation marks correlating with gene silencing. As a result of all of these factors, deacetylation of key histone residues, and presence of repressive histone methylation marks associated with transcriptional repression occur, most especially, H3K9me2 (Nguyen et al. 2001; Fahrner et al. 2002; Kondo et al. 2003). These juxtaposed regions of active and repressive chromatin patterns (Fig. 6) suggest that the CpG islandcontaining promoters of active genes reside in a zone which is "protected;' or alternatively, not targeted for repressive chromatin marks and DNA methylation (Nguyen et al. 2001; Fahrner et al. 2002; Kondo et al. 2003). Inherent to these concepts is the likelihood that molecular "boundaries" exist at the 5' and 3' borders of the promoter CpG islands in expressed genes. One major challenge is to define the precise nature of these boundaries. At present, candidates are the histone modifications themselves which mark the protected region of the promoter, the transcriptional activator and coactivator complexes which directly underpin active transcription, complexes of proteins which accomplish nucleosome placement and/or movement (i.e., nucleosome remodel~ ers) that may mark genes for active transcription. These may promote access of transcriptional activating complexes, replacement of classic histones by variant histones such as H3.3 (Chapter 13), which appear to support active transcription, and action of insulator protein complexes and their recognition sequences. It is in the context of defining how one or more of these candidate processes maintain zones of transcriptionally permissive chromatin around the non-DNA-methylated CpG island containing promoters of active genes that genes silenced in cancers are superb research models for understanding modulation of gene expression in mammalian genomes. The way in which the above transcriptionally active chromatin organization of CpG island containing promoters becomes converted during tum