January 2018


Please read this section on translation for tomorrow. Read More…

Notes from "quest" writing

Just some things I thought you would like to know as I'm writing your quest Read More…

Telomeres, Telomerase and Hayflick

Telomeres and Telomerase:

After reading this, please read this short news piece from the journal "Nature."
Two problems:
  1. As we have discussed, the chemistry of replication leaves the lagging strand not fully replicated. There is a 3' extension of what had been the template. This would lead to shortening of the chromosome with each round of replication.
  2. We have not previously discussed this, but the ends of broken chromosomes, or just free ends of linear DNA, lead to recombination or “splicing” together of these fragments. We see the results of this in certain translocations we have discussed. Also, free ends of DNA tend to be degraded by enzymes quickly. So, what is special about the ends of our chromosomes that keeps this from happening?
Telomeres can be described as the specialized ends of chromosomes that protect the rest of chromosome and keep it stable.

Cancer and the Hayflick Limit

Our cells seem to go through a fixed number of divisions before they no longer can keep going. This is called the “Hayflick Limit” after the person who noticed it.
However, cancer cells are immortal…they overcome this limit. Also, there must be a way for cells to fix the problem so that we can “reset” the Hayflick limit when a new embryo is formed. One more thing: one-celled organisms don’t have a Hayflick limit. They can grow indefinitely. All of this pointed to some enzyme activity needed to maintain the ends.

The Sequence:

Well, since we are talking about DNA, it seems likely that there is some specific sequence that corresponds to the Telomere. There is. In humans and other vertebrates, the sequence is the short repeat 5'(TTAGGG)n, where “n” is an integer between 300 and 8000. So, that can be 50,000 bases.
  • How does it get there?
  • How does it achieve the goals above?


The enzyme Telomerase is a little unusual. It has a DNA polymerase activity and needs a 3' end to which it adds. But, it carries a short RNA that acts as its template. So, it finds the free 3' end, which is already longer than the newly replicated strand (due to the lagging-strand problem), uses the internal template to extend the repeated sequence over and over again. Because it uses RNA as a template, but polymerizes DNA, it is known as an RNA-dependent DNA polymerase, also known as “reverse transcriptase,” since it is the opposite of transcription. The image is stolen from Wikipedia and shows the repeating sequence
That’s how it gets there.
It fixes the shortening problem by just adding the sequence. This can then serve as a template for more lagging-strand synthesis. That will still leave a short 3' extension, but will re-lengthen the telomere.
It’s an interesting enzyme.I’ve found many other images with great detail on it, but decided to steal this one:

It shows that there is a long complex RNA that interacts with several proteins (Dyskerin, TERT and some smaller ones). The RNA is much more than the template. It folds into a complex structure that fits into the proteins. The reason I included this one because it makes reference to a couple of forms of a genetic disease known as “Dyskeratosis.” This is a version of premature aging where the effects are seen as premature organ failure, rather than an old-looking outward appearance. Does it make sense that mutations to genes in the telomerase complex would lead to that?

As for how it solves problem number two, the extended 3’ end will fold back around and, along with some accessory proteins, will form a complex, stable structure shown below in cartoon form:

This cartoon is from from:

A more realistic view of the DNA in it’s four-strand form is below: (Image from Wikipedia):


The green dot is a monvalent cation.
This depicts how the structure forms. Note that the strands are
Parallel, not antiparallel.
(image from this website:

Transcription and RNA Processing

Transcriptional regulation

As usual, images from wikicommons.
The first question is, how does the machine that makes RNA recognize a “gene?” In the past, I’ve called a “gene” the stretch of DNA encoding a protein
and its regulatory sequences. We can separate it into the “structural gene,” which contains the actual code of the protein as it would be found in the mRNA, and the regulatory sequences. A very basic regulatory region is called the “promoter.” It is a region that serves as the initial recognition site that says: “This is a gene, transcribe here.”
In addition to the hydrogen bonds that make up the base pairs, there are additional sites for hydrogen bonds and other contacts that can be made without “unzipping” the DNA, primarily along the major groove. In particular, the major grove is just about the right size for an alpha helix of protein to fit in. There, R-chains on the outside of the helix can make specific contacts with the bases and “read” the sequence (that is, bind in a sequence-specific manner). The first step in transcription is recognition of the promotor, almost always upstream from the structural gene. There are a couple of types of DNA binding proteins. Here are two examples:
Notice two things: the protein binds as a dimer. In fact, each is identical and the overall DNA sequence is a “palindrome.” Second, notice that one of the alpha-helices in each subunit is reaching down into the major groove, where it makes sequence specific contacts. This particular protein actually blocks access of the RNA polymerase in E. coli. But, similar proteins act to promote binding.
bZIPThis is a b-zip protein binding domain, of a class that acts to promote transcription in our cells. Notice the same two things. Even though the overall structure of the proteins is different (note that much of the structure is left out), the binding has some similarities.
The big picture is that the protein is reading the DNA composition and will bind to a sequence it recognizes. The DNA sequence that is recognized is called the promoter and the protein that binds is called, in general terms, a transcription factor. Things that bind DNA may stimulate or prevent transcription.
Here is a cool video on the process:

RNA Processing

Most of what I’ll be talking about here occurs specifically in
As usual, most images are from Wikicommons.
Following transcription, the “pre-mRNA” must be processed on its way out of the nucleus to the cytoplasm.
First, notice that the sequence that made up the promotor does not end up in the pre-mRNA. This is a general theme: With each step in the DNA→RNA→Protein pathway, the information needed to specify regulation of each step generally is lost as we go to the next step. This makes sense, since it is no longer needed.

Poly-A tail and GTP cap.

When the RNA polymerase reaches the end of the sequence that’s supposed to be transcribed, it hits a signal called the “polyadenylation signal.” The sequence in the DNA (coding strand) is 5'-AATAAA. Of course, the polymerase is reading the template strand, so you could say the “signal” really is 3' TTATTT. The sequence varies a fair bit. This signals the RNA polymerase to leave the DNA. The sequence now in the pre mRNA (AAUAAA, or something similar) recruits a protein complex that will cleave the mRNA near the 3' end and an enzyme called PAP, for Poly Adenyl Polymerase, uses ATP to add a long series of A’s to the end of the mRNA. These A’s, and there may be hundreds of them, are NOT encoded anywhere.
The utility of the tail seems to be in recruiting several poly-A binding proteins that protect the mRNA from nucleases that degrade it, facilitate the next steps in processing, the transport of the mRNA out of the nucleus to the cytoplasm, and regulation of translation.
Some prokaryotes do a version of poly-A tail also.
The other initial change is the addition of the GTP cap. Actually, it’s a modified G with a methyl group on position 7 of the base, cleverly called 7-methyl GTP. Wiki has a
short discussion of the mechanism of transfer. There is a specific enzyme that does the capping and the cap looks something like this:

Note the “inverted” 5'-5' link. The cap structure also protects the end of the mRNA from degradation.
RNA splicing:
The pre-mRNA has regions called “introns” (for “intervening sequences” with a cool “tron” ending), and Exons. The exons encode the protein and are what end up in the final mRNA. The introns are spliced out. We will talk about the mechanism and the implications later.
The final processed mRNA might look something like this:

processed Transcript
It comprises the coding sequence, the cap, the 5' and 3' “untranslated region” (UTR) and the poly-A tail. I think all the names are pretty self-explanatory. The 5' UTR, in particular, will contain sequences that contribute to regulating translation. Only the green stretch above will make it into the protein.

RNA Splicing part 1

Take a look at
this video which is presented by Cold Spring Harbor Lab, where I used to work years ago and where some of the work I discussed today was done.
Key words:
Spliceosome: (there’s a good descriptive name) It’s the assembly of proteins and RNA that carry out splicing.
snRNPs (pronounced “snurps”). These are “small nuclear ribonuclear proteins.” They are the components of the spliceosome.
snRNAs: small nuclear RNAs are the main catalytic components. It is the RNA that carries out the reaction. They have names like “U1,” and “U2.” etc.

Self Splicing:
The story about how RNAs got spliced became more obvious when self-splicing RNAs were found. I mentioned these today.
The general reaction looks like this:
There are two, successive reactions that occur in a concerted way sometimes using energy from an additional GTP that is brought in by the complex. The first takes a branch point sequence near the 3' end of the intron. The 2' OH of that branch point attacks the 5' exon/intron boundary. This splices the intron into a lariat and frees the 3' end of the exon, which then attack the 5' end of the next exon, resulting in the spliced exons and a lariat structure of the intron.
In the self-splicing form, “internal guide sequences” form base-paired structures that hold the players together and facilitate the attacks..
You can see below how the stem-loop structures formed using base pairs, then can fold into three-dimensional structures, facilitating catalysis.


You can see how the base pairing can be used to bring the active sites together in more detail here. By the way, these last two images are used without permission. The one below is from the Molecule of the Month blog/discussion and the one above is from Trends in Ecology and Evolution.

Normal mRNA splicing (non self-splicing)

The story is not all that different in the splicing of mRNA. There are guide sequences and a branch-point lariat. The chemistry varies a little as to who attacks whom when and where. But, the main difference is that the sequence of the intron is not that important for the structure to work. Instead of the intron folding into a complex structure, the separate RNAs in the snRNPs form those structures and then bind the specific sequences in the intron and at the intron-exon borders.

Two benefits of splicing

While no one believes that RNA splicing evolved
because of these benefits, these are real benefits organisms now enjoy because of it.
  1. Alternative splicing: Not all the exons are included when splicing of specific mRNAs takes place. Different versions of the mRNA may be formed. In the example I gave earlier, exons 15-19 could all be joined up in sequence, or, exon 15 could be spliced directly to 19, leaving out 16, 17 and 18. This results in different versions of the protein, with different functions. The domain structure of proteins makes this possible.
  2. Exon shuffling: Also because of the domain structure of proteins, it is possible to add exons to genes via recombination and create different versions of the protein. Any chunk of DNA that has portions of introns on its ends that then is spliced into another intron will do no damage to an existing gene (It can be spliced out at the mRNA level). But, it also provides the possibility, through alternative splicing, to evolve proteins with new functions.

The Operon

This is my blog on the Lac Operator. I'm also providing a link to another interactive site with a self quiz. Please run that activity also.
Read More…

DNA Replication

Introduction to DNA replication

How does "S" happen? Read More…