Cap5-BiolMol-Farm13-14

La trascrizione: da DNA ad RNA
An organism may contain many types of somatic cells, each with distinct shape and
function. However, they all have the same genome. The genes in a genome do not have any
effect on cellular functions until they are "expressed". Different types of cells express
different sets of genes, thereby exhibiting various shapes and functions.
Gene expression" means the production of a protein or a functional
RNA from its gene. Several steps are required:
Transcription: A DNA strand is used as the template to synthesize a
RNA strand, which is called the primary transcript.
RNA processing: This step involves modifications of the primary
transcript to generate a mature mRNA (for protein genes) or a
functional tRNA or rRNA.
For RNA genes (tRNA and rRNA), the expression is complete after a
functional tRNA or rRNA is generated. However, protein genes
require additional steps:
Nuclear transport: mRNA has to be transported from the nucleus to
the cytoplasm for protein synthesis.
Protein synthesis: In the cytoplasm, mRNA binds to ribosomes,
which can synthesize a polypeptide based on the sequence of mRNA.
The central dogma
According to the above process, the flow of genetic information is in
the following direction:
DNA > RNA > Protein.
This rule was dubbed the "central dogma", because it was thought
that the same principle would apply to all organisms. However, we
now know that for RNA viruses, the flow of genetic information starts
from RNA.
Schematic illustration of transcription. (a) DNA before transcription. (b)
During transcription, the DNA should unwind so that one of its strand can
be used as template to synthesize a complementary RNA.
Growth of a nucleic acid strand is always in the 5' to 3' direction. This is true not only
for the synthesis of RNA during transcription, but also for the synthesis of DNA during
replication. The enzymes, called polymerases, are used to catalyze the synthesis of nucleic
acid strands. RNA strands are synthesized by RNA polymerases. DNA strands are
synthesized by DNA polymerases.
The entire transcription process should involve the following essential steps:
(i) Binding of polymerases to the initiation site. The DNA sequence which signals the
initiation of transcription is called the promoter. Prokaryotic polymerases can recognize
the promoter and bind to it directly, but eukaryotic polymerases have to rely on other
proteins called transcription factors.
(ii) Unwinding (melting) of the DNA double helix. The enzyme which can unwind the
double helix is called helicase. Prokaryotic polymerases have the helicase activity, but
eukaryotic polymerases do not. Unwinding of eukaryotic DNA is carried out by a specific
transcription factor.
(iii) Synthesis of RNA based on the sequence of the DNA template strand. RNA
polymerases use nucleoside triphosphates (NTPs) to construct a RNA strand.
(iv) Termination of synthesis. Prokaryotes and eukaryotes use different signals to terminate
transcription. [Note: the "stop" codon in the genetic code is a signal for the end of peptide
synthesis, not the end of transcription.]
Transcription in eukaryotes is much more complicated than in prokaryotes, partly because
eukaryotic DNA is associated with histones, which could hinder the access of polymerases
to the promotor.
Transcription is a process in which one DNA strand is used as template to
synthesize a complementary RNA. The following is an example:
Note that uracil (U) of RNA is paired with adenine (A) of DNA. There are a few
different names for these nucleic acid strands. The DNA strand which serves as
the template may be called "template strand", "minus strand", or "antisense
strand". The other DNA strand may be termed "non-template strand", "coding
strand", "plus strand", or "sense strand". Since both DNA coding strand and RNA
strand are complementary to the template strand, they have the same sequences
except that T in the DNA coding strand is replaced by U in the RNA strand.
The function of RNA polymerases
Both RNA and DNA polymerases can add nucleotides to an existing strand, extending its
length. However, there is a major difference between the two classes of enzymes: RNA
polymerases can initiate a new strand but DNA polymerases cannot. Therefore, during DNA
replication, an oligonucleotide (called primer) should first be synthesized by a different
enzyme.
The chemical reaction catalyzed by RNA polymerases is shown. The nucleotides used to
extend a growing RNA chain are ribonucleoside triphosphates (NTPs). Two phosphate groups
are released as pyrophosphate (PPi) during the reaction. Strand growth is always in the 5' to 3'
direction. The first nucleotide at the 5' end retains its triphosphate group.
Simplified presentation for the
chain elongation. The vertical
line represents the pentose and
the slanting line denotes the
phosphodiester bond. Bases
are designated as N1, N2, etc.
The chemical reaction catalyzed
by RNA polymerases.
Classes of RNA polymerases
E. coli
An E. coli RNA polymerase is composed of five subunits: two  subunits, and one for each
, ', and  subunit.  (151 kD) and ' (156 kD) are significantly larger than  (37
kD). Several different forms of  subunits have been identified, with molecular weights
ranging from 28 kD to 70 98ik98ik4kD. The  subunit is also known as the  factor. It
plays an important role in recognizing the transcriptional initiation site, and also possesses
the helicase activity to unwind the DNA double helix. Nucleotide synthesis is carried out
by other four subunits, which together are called the core polymerase. The term
"holoenzyme" refers to a complete and fully functional enzyme. In this case, the
holoenzyme includes the core polymerase and the  factor.
Eukaryotes
There are three classes of eukaryotic RNA polymerases: I, II and III, each comprising two
large subunits and 12-15 smaller subunits. The two large subunits are homologous to the E.
coli  and ' subunits. Two smaller subunits are similar to the E. coli  subunit. However,
the eukaryotic RNA polymerase does not contain any subunit similar to the E. coli 
factor. Therefore, in eukaryotes, transcriptional initiation should be mediated by other
proteins.
RNA polymerase II is involved in the transcription of all protein genes and most snRNA
genes. It is undoubtedly the most important among the three classes of RNA
polymerases. The other two classes transcribe only RNA genes. RNA polymerase I is
located in the nucleolus, transcribing rRNA genes except 5S rRNA. RNA polymerase III is
located outside the nucleolus, transcribing 5S rRNA, tRNA, U6 snRNA and some small
RNA genes.
Transcription Mechanisms in Prokaryotes
In prokaryotes, binding of the polymerase's s factor to promoter can catalyze unwinding of
the DNA double helix. The most important s factor is Sigma 70, whose structure has been
determined by x-ray crystallography, but its complex with DNA has not been solved.
(b)
(a)
The structure of Sigma 70 and its DNA binding site. (a) Structure of Sigma 70, residues 114 to
448. PDB ID = 1SIG. (b) A model for the binding between Sigma 70 and the promoter, based on
biochemical studies. Residues Y425, Y430, W433 and W434 are directly involved in the unwinding
(melting) of the double helix.
The promoter is rich in A and T. The AT pair involves two hydrogen bonds whereas the
CG pair involves three hydrogen bonds. Therefore, AT pairs are easier to separate. The
DNA replication origin is also rich in A and T.
After the DNA strands have been separated at the promoter region, the core polymerase
(') can then start to synthesize RNA based on the sequence of the DNA template
strand. Since the role of the  factor is mainly to initiate transcription, it will be released
after about 10 ribonucleotides have been polymerized.
Elongation of the RNA strand continues until the core polymerase reaches the termination
site.
Tipi di RNA polimerasi DNA-dipendenti
Polimerasi
Geni trascritti
Batteri
RNA polimerasi
Tutti i geni batterici
Nuclei eucariotici
RNA polimerasi I
Geni dell’RNA ribosomiale (rRNA)
RNA polimerasi II
mRNA, snoRNA, alcuni snRNA,
miRNA
RNA polimerasi III
tRNA, 5S rRNA, alcuni snRNA
RNA polimerasi IV (soltanto nei
vegetali?)
siRNA coinvolti nella formazione di
eterocromatina
Organelli eucariotici
RNA polimerasi mitocondriale
Geni mitocondriali
RNA polimerasi del cloroplasto
(multiple; codificate sia nel
cloroplasto che nel nucleo)
Geni del cloroplasto
I promotori batterici hanno due sequenza consenso distinte
Struttura del fattore  batterico e dell’oloenzima della RNA polimerasi
Cambiamenti
conformazionali
durante i passaggi di
inizio della trascrizione
Terminazione della
trascrizione
Transcriptional Termination in Prokaryotes
In prokaryotes, the transcription is terminated by two major mechanisms: Rho-independent
(intrinsic) and Rho-dependent.
The Rho-independent termination signal is a stretch of 30-40 bp sequence, consisting of
many GC residues followed by a series of T ( "U" in the transcribed RNA). The resulting
RNA transcript will form a stem-loop structure to terminate transcription.
The stem-loop structure of the RNA transcript as a termination signal for the
transcription of the trp operon.
Rho-dependent mechanism
Rho is a ~ 50 kD protein, involved in bout half of E. coli transcriptional terminations. It
has been well established that six Rho proteins form a hexamer to terminate
transcription, but the precise mechanism is not clear. Experiments suggest that two
components are essential: (i) the upstream Rho loading site and (ii) the downstream
termination site. The Rho hexamer first binds to the RNA transcript at the upstream site
which is 70-80 nucleotides long and rich in C residues. Upon binding, the Rho hexamer
moves along the RNA in the 3' direction. If movement of the polymerase is slow, the
Rho hexamer will catch up and terminate the transcription at the downstream termination
site. Rho has ATPase activity which can induce release of the polymerase from DNA.
Correzione delle bozze da parte della RNA polimerasi
Regolazione della trascrizione
nei procarioti e negli eucarioti
Gene's Regulatory Elements
Transcriptional regulation is mediated by the interaction between transcription factors and
their DNA binding sites which are the cis-acting elements, whereas the sequences encoding
transcription factors are trans-acting elements. The cis-acting elements may be divided
into the following four types:
Promoters
Enhancers
Silencers
Response elements
Gene organization. The
transcription region consists
of exons and introns. The
regulatory elements include
promoter, response element,
enhancer and silencer (not
shown). Downstream
refers to the direction of
transcription and upstream
is opposite to the
transcription direction. The
numbering of base pairs in
the promoter region is as
follows. The number
increases along the direction
of transcription, with "+1"
assigned for the initiation
site. There is no "0"
position. The base pair just
upstream of +1 is numbered
"-1", not "0".
Promoter is the DNA region where the transcription initiation takes place. In prokaryotes,
the sequence of a promoter is recognized by the Sigma (s) factor of the RNA
polymerase. In eukaryotes, it is recognized by specific transcription factors.
E. Coli
E. coli has five sigma factors:
Sigma 70: Regulate expression of most genes.
Sigma 32: Regulate expression of heat shock proteins.
Sigma 28: Regulate expression of flagellar operon (involved in cell motion).
Sigma 38: Regulate gene expression against external stresses.
Sigma 54: Regulate gene expression for nitrogen metabolism.
E. coli  factors and the consensus sequences
of their recognition sites (promoters).
Table shows the consensus sequences of the promoters recognized by E. coli  factors
(except for Sigma 38 which is not clear). The consensus sequence is an ideal sequence for
the interaction with its regulatory protein. A promoter should contain an element which is
identical to or very close to the consensus sequence.
Direzione della
trascrizione
intorno al genoma
circolare di E. coli
Modello dell’operone di Jacob e Monod
The lac Operon Promoter
The lac operon of E. coli was used by Francois Jacob, Andre Lwoff and Jacques Monod to
investigate the transcription mechanism in 1960s, providing the first molecular details. Its
promoter sequence is shown in the following figure.
The lac operon promoter (
which regulates the
transcription of lacZ, not
lacI). (a) Presented so that
the genome sequence
numbers increase from left
to right. Its 5' to 3'
sequence is the same as in
the database (ACCESSION
AE000141). (b) Presented
so that the promoter
sequence numbers increase
from left to right. Its 5' to
3' sequence is comparable
to the consensus sequence.
(a)
(b)
(a)
(b)
E. coli operons. An operon consists of several genes which are transcribed together.
(a) The lac operon comprises lacA, lacY and lacZ. The lacI gene is not part of the operon,
but encodes a protein which could repress transcription of the lac operon.
(b) The trp operon consists of trpA, trpB, trpC, trpD, and trpE.
(Numbers represent the genome sequence
The DNA's two strands are complementary to each other. Only one strand's sequence is
sufficient to represent both strands' information. By convention, the sequence of the 5' to 3'
strand (from left to right) is used. From the above figure, the promoter sequence of the lac
operon is TATGTT at the -10 region, and TTTACA at the -35 region. The consensus
sequence of s70 which regulates the lac operon is TATAAT at the -10 region, and
TTGACA at the -35 region. They differ at three positions (two at the -10 region and one
at the -35 region). Experimentally, it has been found that binding of the RNA polymerase
to the lac promoter is relatively weak. Transcription of the lac operon often requires an
activation such as catabolite activator protein (CAP).
Transcriptional activation by CAP
In the absence of any transcriptional activator, binding of the polymerase to the promoter
of the lac operon is weak. The binding site of the catabolite activator protein (CAP) is
located just upstream of the promoter. In such case, the activator can enhance transcription
by direct contact with the polymerase, making it bind more efficiently to the promoter.
Transcriptional inhibition by lac repressor
The silencer (also known as operator) of the lac operon is located at the transcriptional
start site. When the lac repressor binds to this region, it will block the movement of the
polymerase, thereby inhibiting transcription.
Dimostrazione
sperimentale
del legame del
repressore lac
all’operatore
Regulation of lac Operon Transcription
Regulation of the lac operon transcription. (a) by CAP. (b) by lac repressor
Structure of the lac repressor/operator complex. The repressor is a homodimer. Each
subunit is represented by a different color. PDB ID = 1LBG.
Regolazione dell’operone lac
da parte di glucosio e lattosio
Complesso
CAP-DNA
Dimostrazione sperimentale della
formazione di un’ansa di DNA
L’operone del Triptofano (trp)
• La regolazione dell’operone del trp nei batteri è un
classico esempio di attenuazione trascrizionale.
• Due meccanismi di attenuazione:
• Controllo post-trascrizionale mediante ripiegamento
differenziale dell’mRNA.
• Controllo dell’inizio della trascrizione dell’operone trp
mediante il legame di un repressore, attivato dal
legame con il triptofano, a siti operatori, con
conseguente blocco dell’accesso della RNA
polimerasi al promotore trp.
Attenuazione
trascrizionale del
operone trp di E. coli
Meccanismi d’azione dei regolatori trascrizionali
• Legame cooperativo delle proteine al DNA: importante sia
nei procarioti che negli eucarioti; es. della proteina CAP che ha
un dominio di legame al DNA ed un dominio detto “regione di
attivazione” che entra in contatto con la RNA polimerasi che si
lega al promotore.
• Modificazioni allosteriche e legame al DNA: ci sono due
esempi, 1) il legame del cAMP alla proteine CAP che induce un
cambiamento allosterico in quest’ultima per cui si lega più
saldamente al DNA; 2) il legame dell’allolattosio o dell’IPTG alla
proteina repressore dell’operone lac che induce un
cambiamento allosterico nel repressore che fa diminuire la sua
capacità di legarsi al DNA.
• Formazione di anse da parte del DNA: la formazione di anse
di DNA permette a proteine, anche distanti, legate sul DNA di
interagire con la DNA polimerasi legata al promotore.
La trascrizione negli eucarioti e sua
regolazione
Transcription Mechanisms in Eukaryotes
In eukaryotes, there are three classes of RNA polymerases: I, II and III. This section will
focus on the RNA polymerase II (Pol II), which is involved in the transcription of all
protein genes.
Structure of the human TBP core domain complexed with DNA as determined by x-ray
crystallography. The DNA includes the TATA element. PDB ID = 1CDW.
Formazione del complesso
di preinizio ed inizio della
trascrizione
Assembly of the Pre-Initiation Complex (PIC)
TBP first binds to the promoter and then
recruits TFIIB to join TFIID (and TFIIA if
present). Before entering PIC, RNA Pol II and
TFIIF are bound together, which are recruited
by TFIIB. Finally, RNA Pol II recruits TFIIE,
which further recruits TFIIH to complete the
PIC assembly.
Struttura della
RNA Pol II
General transcription factors associated with RNA Pol II in human cells.
Fosforilazione della RNA pol II
Attività elicasica di TFIIH
Una regione contenente la TATA
box promuove l’inizio specifico
della trascrizione in vitro
Eukaryotes
In eukaryotes, there is a significant difference between the transcription of protein
genes and RNA genes.
The most common promoter element in eukaryotic protein genes is the TATA box,
located at -35 to -20. Its consensus sequence, TATAAA, is quite similar to the -10
region of the Sigma 70 recognition site. Another promoter element is called the
initiator (Inr). It has the consensus sequence PyPyAN(T/A)PyPy, where Py
denotes pyrimidine (C or T), N = any, and (T/A) means T or A. The base A at the
third position is located at +1 (the transcriptional start site).
TATA box and initiator are the core promoter elements. There are other elements
often located within 200 bp of the transcriptional start site, such as CAAT box and
GC box which may be referred to as promoter-proximal elements.
Eukaryotic promoter elements.
The protein which interacts with the initiator and TATA box is known as the TATA-box
binding protein (TBP), because the TATA box was discovered earlier than the
initiator. TBP recognizes not only the core promoter of protein genes, but also RNA
promoter. It is a subunit of the general transcription factor TFIID. In eukaryotes,
transcription requires several different general transcription factors and, in most cases, the
regulatory transcription factors.
Confronto fra un’unità trascrizionale semplice ed una complessa della RNA pol II
I motivi del nucleo del promotore della RNA pol II
Promotore
Posizione
Fattore di trascizione
Sequenza consenso
Elementi a monte del nucleo del promotore
BRE
TATA Box
Iniziatore
-37 a -32
-31 a -26
-2 a +4
TFHB
TBP
TAF1 e TAF2
(G/C)(G/C)(G/A)CGCC
TATA(A/T)AA(G/A)
PyPyA+1N(T/A)PyPy
Elementi a valle del nucleo del promotore
MTE
+18 a + 27
TFIID
DPE
+28 a + 32
TAF9 e TAF6
C(G/A)A(A/G)C(G/C)
(C/A/G)AACG(G/C)
(A/G)G(A/T)(C/T)(G/A/
C)
Elementi prossimali del promotore
CAAT box
GC box
-200 a -70
-200 a -70
CBF, NF1, C/EBP
Sp1
CCAAT
GGGCGG
CBF: fattore che lega CAAT; C/EBP: proteina che lega CAAT/enhancer.
Initiation
RNA Pol II does not contain a subunit similar to the prokaryotic  factor, which can
recognize the promoter and unwind the DNA double helix. In eukaryotes, these two
functions are carried out by a set of proteins called general transcription factors. The
RNA Pol II is associated with six general transcription factors, designated as TFIIA,
TFIIB, TFIID, TFIIE, TFIIF and TFIIH, where "TF" stands for "transcription factor" and
"II" for the RNA Pol II.
TFIID consists of TBP (TATA-box binding protein) and TAFs (TBP associated
factors). The role of TBP is to bind the core promoter. TAFs may assist TBP in this
process. In human cells, TAFs are formed by 12 subunits. One of them, TAF250 (with
molecular weight 250 kD), has the histone acetyltransferase activity, which can relieve the
binding between DNA and histones in the nucleosome.
The transcription factor which catalyzes DNA melting is TFIIH. However, before TFIIH
can unwind DNA, the RNA Pol II and at least five general transcription factors (TFIIA is
not absolutely necessary) have to form a pre-initiation complex (PIC).
Regolazione della trascrizione negli eucarioti
• Concetto di elementi cis-agenti and trans-agenti del
DNA e fattori trans-agenti.
• Fattori che intervengono nella regolazione della
trascrizione negli eucarioti:
• Interazioni DNA-proteina;
• Interazioni proteina-proteina;
• Struttura della cromatina;
• Architettura del nucleo;
• Compartimentazione cellulare.
Il controllo dell’espressione genica negli eucarioti può
essere
esercitato a molti livelli oltre che a livello trascrizionale.
Regolazione della trascrizione negli eucarioti
• La RNA polimerasi II è responsabile della trascrizione della
grande maggioranza dei geni eucariotici:
• Geni codificanti per mRNA;
• Geni codificanti per snoRNA;
• Geni codificanti per alcuni snRNA;
• Geni codificanti per microRNA.
• Negli eucarioti esistono altre due RNA polimerasi: RNA
polimerasi I localizzata nel nucleolo responsabile della sintesi
dell’rRNA grande; RNA polimerasi III si trova nel
nucleoplasma ed è responsabile della sintesi dell’tRNA,
dell’rRNA 5S e di alcuni snRNA.
Elementi regolatori dei geni codificanti proteine
• Fattori di trascrizione (attivatori o repressori) 
CREB, CTCF, FOG-1, GATA-1, NF-E2, NF-B, Sp1.
• Macchinario generale della trascrizione  RNA pol
II, TFIIB, TFIID (TBP + TAF), TFIIE, TFIIF, TFIIH,
Mediatore.
• Coattivatori e corepressori  complessi di
modificazione della cromatina (HAT, HDAC, CBP,
HMT, LSD1), complessi di rimodellamento della
cromatina (SWI/SNF, ISWI, SWR1).
• Fattori di allungamento  FACT, allungatore, TFIIS.
Elementi regolatori a lungo raggio
 Questi elementi degli eucarioti multicellulari comprendo:
• Enhancer;
• Silenziatori;
• Isolatori;
• Regioni di controllo del locus (LCR);
• Regioni di attacco alla matrice (MAR).
Alcuni enhancer che sono negli introni contribuiscono
all’espressione tessuto-specifica  enhancer nel secondo
introne del gene dell’apolipoproteina B (apoB delle LDL).
Enhancer e silenziatori
• Questi elementi sono di solito a 700 – 1000 o più bp
di distanza dal sito di inizio della trascrizione.
• Gli enhancer si possono trovare sia a monte che a
valle del gene, o dentro un introne, e possono
funzionare in entrambi gli orientamenti rispetto al
promotore.
• Un enhancer è lungo  500 bp e contiene  10 siti di
legame per fattori di trascrizione diversi.
• I silenziatori sono elementi simili agli enhancer, ma
reprimono l’espressione del gene.
Tre caratteristiche cruciali di
un elemento enhancer
Enhancer e silenziatori:
A distanza di 700-1000
o più bp dal sito di inizio
Della trascrizione.
Histone Acetylation
Acetylation of the lysine residues at the N terminus of histone proteins removes positive
charges, thereby reducing the affinity between histones and DNA. This makes RNA
polymerase and transcription factors easier to access the promoter region. Therefore, in
most cases, histone acetylation enhances transcription while histone deacetylation
represses transcription.
Acetylation and deacetylation
of the lysine residue.
Histone acetylation is catalyzed
by histone acetyltransferases
(HATs) and histone
deacetylation is catalyzed by
histone deacetylases (denoted
by HDs or HDACs). Several
different forms of HATs and
HDs have been
identified. Among them,
CBP/p300 is probably the most
important, since it can interact
with numerous transcription
regulators.
Isolatori
Un isolatore è una sequenza di DNA, di solito lunga
300 – 2000 bp, che ha la funzione di:
• Marcatore di un confine di cromatina, tra regioni di
eucromatina ed eterocromatina;
• Attività di blocco di un enhancer o di un silenziatore.
 Contiene un gruppo di siti di legame per proteine che
legano sequenze specifiche di DNA.
Gli isolatori funzionano come marcatori dei confini della cromatina e hanno attività di
blocco degli enhancer
Isolatore HS4: sito ipersensibile alla digesione da DNAasi I.
Gli elementi isolatori sono riconosciuti da almeno tre proteine
diverse che legano il DNA: fattore che lega CCCTC (CTCF), fattori
Di stimolazione a monte (USF) 1 e 1.
Regioni di controllo del locus (LCR)
• Le LCR sono sequenze di DNA che organizzano e
mantengono un dominio funzionale di cromatina attiva e fanno
aumentare la trascrizione dei geni a valle.
• Siti ipersensibili (HS) alla DNasi a monte del gruppo dei geni
della -globina.
• Le LCR sono presenti in altri loci genici fra cui: i gruppi di
geni che codificano le -globine, i pigmenti visivi, le proteine
del complesso maggiore di istocompatibilità, gli ormoni della
crescita umani, le serpine, le citochine delle cellule T helper di
tipo 2.
Regione di controllo del locus (LCR): prototipo di LCR è stato caratterizzato
come un gruppo di siti ipersensibili alla Dnasi I a monte del gruppo dei geni
della β-globina.
LCR per altri loci:
• Gruppi di geni codificanti le α-globine;
• Gruppi di geni codificanti i pigmenti visivi;
• Gruppi di geni codificanti le proteine del complesso maggiore di
istocompatibilità;
• Gruppi di geni codificanti gli ormoni della crescita umani;
• Gruppi di geni codificanti le serpine;
• Gruppi di geni codificanti le citochine delle cellule T helper di tipo 2.
Regioni di controllo del locus (LCR)
L’intero locus dei geni della -globina occupa  200
kb.
La LCR è necessaria per la trascrizione ad alto livello
di tutti i geni della famiglia della -globina.
La regolazione della formazione di “anse” di
cromatina è il meccanismo che controlla l’espressione
durante lo sviluppo dei geni della -globina.
La LCR interagisce con un solo promotore genico
alla volta, così da garantire l’espressione differenziale
dei diversi geni durante lo sviluppo.
Regioni di attacco alla matrice (MAR)
 L’organizzazione tridimensionale della cromatina ha anche un
ruolo centrale nel controllo della trascrizione negli eucarioti.
 Organizzazione in anse indipendenti della cromatina. La
formazione di queste anse dipende da elementi specifici di
sequenza del DNA che sono sparsi in tutto il genoma ad
intervalli di 5 – 200 kb.
 Queste sequenze di DNA sono chiamate regioni di attacco alla
matrice (MAR).
 Le MAR organizzano il genoma in  60000 anse di cromatina
con una dimensione media di 70 kb.
Regioni di attacco alla matrice (MAR)
 I geni attivi tendono a far parte di domini ad ansa piccoli,
mentre quelli inattivi sono associati a domini più grandi.
 Più del 70% delle MAR sono ricche di sequenze AT
 Le MAR si trovano vicino agli enhancers. Conferiscono
specificità tissutale e controllo durante lo sviluppo
dell’espressione genica reclutando fattori di trascrizione e
richiamando gli enzimi di rimodellamento della cromatina.
 MAR strutturali servono come ancore; MAR funzionali sono
dinamiche e aiutano a portare i geni sulla matrice nucleare.
Modello della regolazione trascrizionale da parte delle regioni di attacco alla matrice
(MAR)
I fattori di trascrizione
La regolazione dell’attività trascrizionale dei
geni avviene mediante cambiamenti della
quantità o dell’attività dei fattori di trascrizione.
Questi fattori influenzano la velocità di
trascrizione dei geni “target” positivamente o
negativamente tramite interazioni specifiche con
gli elementi regolatori del DNA e tramite
interazioni con altre proteine.
Meccanismi di regolazione della quantità e dell’attività dei
fattori di trascrizione
Meccanismi con cui agiscono fattori di trascrizione
attivatori per aumentare l’attività trascrizionale
 1) Stimolazione del reclutamento e del legame dei
fattori generali di trascrizione e della RNA pol II sul
nucleo del promotore per formare un complesso di
preinizio.
 2) Induzione di un cambiamento conformazionale od di
una modificazione post-traduzionale che stimola l’attività
enzimatica del macchinario generale di trascrizione.
 3)Interazione con i complessi di rimodellamento e di
modificazione della cromatina per permettere una
migliore accessibilità dei fattori generali di trascrizione o
di attivatori specifici al DNA.
Fattori di trascrizione: motivi del dominio che lega il
DNA
 Motivo elica-giro-elica (HTH) (es. repressore del trp, proteina
CAP, repressore lac, ecc.)  l’omeodominio (60
amminoacidi) è una variante del motivo HTH classico che è
presente in molti fattori di trascrizione che regolano lo
sviluppo (anche nei mammiferi).
Fattori di trascrizione: motivi del dominio che lega il
DNA
• L’omeodominio: un -elica contatta il solco maggiore del DNA
riconoscendo una sequenza di 6 pb (come nel HTH), mentre un braccio
flessibile interagisce specificamente con il solco minore del DNA.
Dominio a dito di Zn del recettore
dei glucocorticoidi (GR)
Motivo a dita di Zn
Fattori di trascrizione: motivi del dominio che
lega il DNA
 Motivo a dita di zinco: deriva il nome dallo schema della sua
struttura  un atomo di Zn coordina dei residui di cisteine e di
istidine, formando un ansa che ricorda la forma di un dito.
Struttura diffusa in molte famiglie di attivatori trascrizionali
(es. fattore Sp1, recettori degli ormoni steroidei).
Domini a cerniera di leucina
 Questo motivo è stato descritto per la prima volta nella
proteina C/EBP che lega CAAT box presente in molti
enhancer.
 Appartiene a questo gruppo di attivatori la proteina AP-1, un
eterodimero formato dall’unione di due protooncogeni c-Jun e
c-Fos, che regola l’espressione genica in risposta a vari
stimoli, quali, stress, infezioni virali e batteriche, e
l’attivazione da parte delle citochine.
Fattori di trascrizione: motivi del dominio che
lega il DNA
 Domini a cerniera lampo (leucine zipper): due lunghe strutture ad -elica
che contengono, contemporaneamente, il dominio di dimerizzazione ed il
dominio di legame al DNA. Queste strutture possono formare sia
omodimeri che eterodimeri.
Domini elica-ansa-elica basico (BHLH)
 E’ una struttura caratterizzata da due -eliche connesse da un ansa. I fattori
con questo motivo sono in genere eterodimeri, formati da un’elica più
lunga che lega il DNA attraverso amminoacidi basici.
 Appartengono a questa famiglia molti attivatori che hanno un ruolo nello
sviluppo, come Myo-D che svolge una funzione fondamentale nel
differenziamento delle cellule muscolari. Un altro esempio è la proteina cMyc, che è coinvolta nella trasformazione tumorale (linfoma di Burkitt).
Controllo combinatoriale della trascrizione
 Gli attivatori e gli inibitori sono coinvolti nella regolazione di
numerosi geni.
 E’ la combinazione tra diversi attivatori ed inibitori che rende
molto specifico il meccanismo di regolazione.
Sequenze di importazione ed esportazione nucleare
 Il traffico fra nucleo e citoplasma avviene attraverso i complessi dei
posi nucleari (NPC).
 I fattori di trascrizione, sintetizzati nel citoplasma, devono attraversare
questi pori per giungere nel nucleo.
 Le proteine sono indirizzate al nucleo da una sequenza amminoacidica
specifica chiamata sequenza di localizzazione nucleare (NLS); ne
esistono diverse, ma le più studiate sono le sequenze ricche di a.a
basici (Arg e Lys).
 Per l’uscita dal nucleo, le proteine devono anche avere una sequenza di
esportazione nucleare (NES); le meglio caratterizzate sono le piccole
sequenze idrofobiche ricche di Leu.
 Per il passaggio attraverso i pori le proteine con sequenze di
localizzazione e/o esportazione, si avvalgono di una famiglia di
recettori solubili, importine od esportine (carioferine).
Importazione nucleare
regolata
 I recettori degli ormoni
steroidei
Importazione nucleare regolata di NF-B
Risposta al cAMP e fattore CREB
P-CREB si lega a siti sul DNA chiamati
CRE (cAMP Response Element).
CREB è strettamente correlato, sia per
struttura che per funzione, a CREM
(cAMP Response Element Modulator)
ed a ATF-1 (Activating Transcription
Factor-1).
DNA Methylation
Methylation on the DNA's cytosine base is commonly used to repress transcription
Transcription of RNA Genes
The products of RNA genes include
rRNA, tRNA and small RNA
molecules. In the mammalian genome,
the three rRNA genes, 28S, 18S, and
5.8S, are clustered as a pre-rRNA gene.
which is transcribed by RNA
polymerase I. tRNA, 5S rRNA and the
U6 snRNA genes are transcribed by
RNA polymerase III. Most snRNA
genes are transcribed by RNA
polymerase II using the same
mechanism as transcribing protein
genes.
Transcription by RNA Polymerase I
RNA polymerase I is devoted to the
synthesis of pre-rRNA. Like RNA
polymerase II, a pre-initiation complex
(PIC) has to be formed before RNA
polymerase I can exert its function.
The PIC assembly for RNA polymerase I. The regulatory region of pre-rRNA gene contains a core element and an
upstream control element (UCE). Binding of two upstream binding factors (UBF) to both elements may induce DNA
looping, and subsequently recruiting TATA-binding protein (TBP) and TBP-associated factors (TAFI). Finally, RNA
polymerase I joins the complex and completes the assembly process.
Transcription by RNA
Polymerase III
Transcription of tRNA
gene
The regulatory region of
tRNA gene contains A box
and B box located inside the
transcription unit. The PIC
assembly begins with the
binding of TFIIIC to both
elements
The PIC assembly for the transcription of tRNA gene.
Duplicated Genes
Most proteins do not need duplicated genes, because the mRNA molecule transcribed from
one gene can be translated into many copies of its protein product. However, rRNA and
tRNA are the final gene products. In order to accelerate the production process, all species
contain an array of tandemly repeated RNA genes. The number of repeats ranges from tens
to 24,000.
Number of RNA genes.
*The X chromosome of fruit fly contains 250 copies of Pre-rRNAs, Y chromosome
contains 150 copies.
Most proteins do not need duplicated genes, There are four types of rRNA in mammalian
cells: 28S, 5.8S, 5S and 18S. In the human genome, 28S, 5.8S and 18S are clustered
together. They form a single transcription unit which will be separated by specific enzymes
after transcription. " Pre-rRNA" refers to their precursor. In humans, a repeat unit for the
pre-rRNA has about 40 kb in length, including a 13-kb transcription unit and a 27-kb
untranscribed spacer region. The transcription unit contains three spacers: ETS, ITS1 and
ITS2. They will be removed during RNA processing.