Summary

The DNA sequence information in the human genome is read and interpreted by proteins. Without interactions between proteins and DNA, life would not exist. My laboratory studies how proteins are directed to specific parts of the genome, so that they can do their specific jobs. Using a tool called the DNA microarray, we can determine the location of thousands of protein-DNA interactions in a single experiment. Then, we use this data to reconstruct the rules that the protein used to find out where it was supposed to go. Our studies may lead to better predictions of where defective proteins that cause cancer bind to the genome.

Research Description

The Lieb laboratory was established in the summer of 2002 at the University of North Carolina in Chapel Hill. We aim to understand a very basic unsolved problem in biology: How do proteins that interact with DNA find their proper targets in living cells? We study this problem in yeast, a single-celled fungus, and C. elegans, a very simple roundworm that lives in the soil. C. elegans is not the kind of worm you may have in mind: it is transparent, about the size of a piece of lint, and is barely visible with the naked eye.

Why do we use these simple creatures to study this problem?

Researchers often use simpler organisms, like yeast, to figure out the details of processes that are fundamental to all living things. Then these discoveries are tested in more complicated systems, where the knowledge can be used as the basis for creating drugs or other therapies to cure disease. The advantage is that yeast cells grow very quickly and are easy to manipulate in the lab, allowing discoveries to be made much more rapidly than they would be if human cells were studied first.

We use yeast and worms to study how DNA-binding proteins called transcription factors find their proper targets in living cells. Transcription factors control which genes get turned on and off, when those genes get turned on and off, and in what tissues they get turned on and off. Using a new research tool called the DNA microarray (see below), we are able to map all of the sites of interaction between any given transcription factor and the entire yeast genome (a "genome" is simply all of the DNA that makes up the chromosomes that are in a cell). We have begun to uncover some of the rules that proteins use to find their targets in yeast. For example, we were able to provide conclusive genome-wide evidence that DNA sequence information alone is not sufficient to direct a binding event. Even though many regions of the genome may contain a DNA sequence a transcription factor likes to bind, only sequences that are in particular parts of the genome are actually bound. We would also like to test whether the rules that proteins use in yeast also are used in a much more complicated system, C. elegans (these are the worms). Worms are simple compared to humans, but are much more complicated than a yeast cell. For example, worms have muscle, a nervous system, and a gut, all things yeast don't have. Therefore, worms are an important stepping stone in determining whether the way transcription factors behave in yeast is similar to the way they behave in humans.

Why is understanding how transcription factors find their targets important?

In humans, cancers occur when mutations in DNA lead to improper regulation of genes that control cell growth. It is likely that in all cancers, alterations in the abundance or binding specificity of at least one transcription factor contributes directly to uncontrolled cell proliferation. As one example, Burkitt's lymphoma is caused when a transcription factor called c-myc is turned on in improperly in immune cells, which in turn leads to inappropriate gene regulation and cancer. Another example is a protein called CTCF, which is involved in the occurrence of breast, prostate, and kidney tumors. When CTCF is mutated, the DNA binding spectrum of CTCF changes such that it no longer bind to its proper targets involved in regulating cell proliferation, but instead binds other targets. In both examples, which serve as general illustrations for numerous other cancers, alterations in the natural range of a transcription factor's gene targets are likely major factors in the development of cancer. We anticipate that our studies of DNA-binding specificity will lead to better predictions of the targets of human transcription factors that have the potential to cause cancer, which may ultimately lead to new therapies based on inhibiting binding to inappropriate targets. Our work may also lay the groundwork for experiments in mammalian systems to examine how any cancerous state affects the distribution of DNA-binding proteins, and how that altered distribution contributes to progression towards malignancy.

What are DNA microarrays?

Traditionally, biologists have studied one gene at a time, often focusing on a single gene for their entire career. DNA microarrays allow a single investigator to study all of the genes in an organism in parallel, in a single experiment. Microarrays have therefore greatly accelerated the pace of biological discovery. Even more important than the increase in throughput is our ability to for the first time obtain a complete picture of how genes act to together, in groups, to specify biological function. This was impossible before; the analogy would be trying to understand how a battle was won by studying the behavior and fate of an individual soldier. Now, we can zoom out and see the movement and behavior of entire regiments, while retaining the resolution to focus in on particular soldiers of interest, who may have performed critical functions.

DNA microarrays are simply arrays of thousands of discrete DNA sequences, for example PCR products or oligonucleotides, representing all 6,200 genes in Saccharomyces cerevisiae (baker's yeast), printed at high density onto a glass slide. A typical DNA microarray fabricated in our laboratory contains about 20,000 spots of DNA, which represent all of the genes and regulatory DNA in Saccharomyces cerevisiae. The spots are about 100 microns in diameter and are spaced at a center-to-center distance of 175 microns. A home-made robotic machine produces the DNA microarrays we use. It is housed in Fordham hall, and was produced from off-the-shelf electronics, motors, and motion-control software that are used in the semiconductor industry.

Because of the property of DNA that allows complementary DNA fragments to hybridize to each other, we can use these arrays to measure the relative levels of specific nucleic acids in biological samples. For example, we can separately grow yeast under two different conditions (for example cold and hot), and then extract mRNA from both yeast cultures. The RNA from each of the cultures can be labeled with a different fluorescent dye (for example red and green). The labeled RNAs are then mixed together and hybridized to the array. By the ratio of fluorescent signal coming from each spot, we can monitor the relative levels of every mRNA expressed by an organism under any given condition. The same can be applied to human cells, for example comparing gene expression from cancerous and non-cancerous tissue. There are many other uses for microarrays. My primary use of them is for mapping the position of protein-DNA interactions in the genome.