10.4 Structure and Function of Cellular Genomes

Learning Objectives

By the end of this section, you will be able to:

  • Define gene and genotype and differentiate genotype from phenotype
  • Describe chromosome structure and packaging
  • Compare prokaryotic and eukaryotic chromosomes
  • Explain why extrachromosomal DNA is important in a cell

Thus far, we have discussed the structure and function of individual pieces of DNA and RNA. In this section, we will discuss how all of an organism’s genetic material—collectively referred to as its genome—is organized inside of the cell. Since an organism’s genetics to a large extent dictate its characteristics, it should not be surprising that organisms differ in the arrangement of their DNA and RNA.

Genotype versus Phenotype

All cellular activities are encoded within a cell’s DNA. The sequence of bases within a DNA molecule represents the genetic information of the cell. Segments of DNA molecules are called genes, and individual genes contain the instructional code necessary for synthesizing various proteins, enzymes, or stable RNA molecules.

The full collection of genes that a cell contains within its genome is called its genotype. However, a cell does not express all of its genes simultaneously. Instead, it turns on (expresses) or turns off certain genes when necessary. The set of genes being expressed at any given point in time determines the cell’s activities and its observable characteristics, referred to as its phenotype. Genes that are always expressed are known as constitutive genes; some constitutive genes are known as housekeeping genes because they are necessary for the basic functions of the cell.

While the genotype of a cell remains constant, the phenotype may change in response to environmental signals (e.g., changes in temperature or nutrient availability) that affect which nonconstitutive genes are expressed. For example, the oral bacterium Streptococcus mutans produces a sticky slime layer that allows it to adhere to teeth, forming dental plaque; however, the genes that control the production of the slime layer are only expressed in the presence of sucrose (table sugar). Thus, while the genotype of S. mutans is constant, its phenotype changes depending on the presence or absence of sucrose in its environment. Temperature can also regulate gene expression. For example, the gram-negative bacterium Serratia marcescens, a pathogen frequently associated with hospital-acquired infections, produces a red pigment at 28 °C but not at 37 °C, the normal internal temperature of the human body (Figure 10.24).

A photo of an agar plate with pink cells on the left and one with beige cells on the right. Both plates are labeled S. marcescens. The pink culture was grown at 28 degrees; the beige culture at 37 degrees.
Figure 10.24 Both plates contain strains of Serratia marcescens that have the gene for red pigment. However, this gene is expressed at 28 °C (left) but not at 37 °C (right). (credit: modification of work by Ann Auman)
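
The relationship between a constant genotype and a changing phenotype can be sketched in a few lines of Python. This is only an illustrative model of the S. mutans example: the gene names are hypothetical placeholders, and real regulation involves far more than a single on/off switch.

```python
# Hypothetical gene names; only the regulation pattern comes from the text.
GENOTYPE = frozenset({"housekeeping_gene", "slime_layer_gene"})


def expressed_genes(genotype, sucrose_present):
    """Return the subset of the genotype expressed in a given environment."""
    expressed = set()
    for gene in genotype:
        if gene == "slime_layer_gene":
            # Nonconstitutive: expressed only when sucrose is present.
            if sucrose_present:
                expressed.add(gene)
        else:
            # Constitutive genes are always expressed.
            expressed.add(gene)
    return expressed


with_sucrose = expressed_genes(GENOTYPE, sucrose_present=True)
without_sucrose = expressed_genes(GENOTYPE, sucrose_present=False)

# The genotype is identical in both calls; only the expressed set differs.
print(sorted(with_sucrose))
print(sorted(without_sucrose))
```

In this sketch the genotype never changes between the two calls, while the set of expressed genes (a stand-in for the phenotype) depends on the environment.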

Organization of Genetic Material

The vast majority of an organism’s genome is organized into the cell’s chromosomes, which are discrete DNA structures within cells that control cellular activity. Recall that while eukaryotic chromosomes are housed in the membrane-bound nucleus, most prokaryotes contain a single, circular chromosome that is found in an area of the cytoplasm called the nucleoid (see Unique Characteristics of Prokaryotic Cells). A chromosome may contain several thousand genes.

Organization of Eukaryotic Chromosomes

Chromosome structure differs somewhat between eukaryotic and prokaryotic cells. Eukaryotic chromosomes are typically linear, and eukaryotic cells contain multiple distinct chromosomes. Many eukaryotic cells contain two copies of each chromosome and, therefore, are diploid.

The length of a chromosome greatly exceeds the length of the cell, so a chromosome needs to be packaged into a very small space to fit within the cell. For example, the combined length of all of the 3 billion base pairs[18] of DNA of the human genome would measure approximately 2 meters if completely stretched out, and some eukaryotic genomes are many times larger than the human genome. DNA supercoiling refers to the process by which DNA is twisted to fit inside the cell. Supercoiling may result in DNA that is either underwound (less than one turn of the helix per 10 base pairs) or overwound (more than one turn per 10 base pairs) from its normal relaxed state. Proteins known to be involved in supercoiling include topoisomerases; these enzymes help maintain the structure of supercoiled chromosomes, preventing overwinding of DNA during certain cellular processes like DNA replication.
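
The stretched-out length quoted above can be checked with a short calculation. The sketch below assumes a rise of about 0.34 nanometers per base pair for relaxed double-stranded DNA, a value that is not given in this section. Under that assumption, one copy of the 3 billion base pair genome is roughly 1 meter long, and the two copies carried by a diploid cell come to roughly 2 meters.

```python
# Assumption (not from the text): relaxed B-form DNA rises ~0.34 nm per base pair.
RISE_PER_BP_NM = 0.34


def stretched_length_m(base_pairs, rise_nm=RISE_PER_BP_NM):
    """Length in meters of fully stretched double-stranded DNA."""
    return base_pairs * rise_nm * 1e-9  # convert nanometers to meters


haploid_bp = 3_000_000_000  # human genome size given in the text
haploid_m = stretched_length_m(haploid_bp)
diploid_m = 2 * haploid_m   # a diploid cell carries two copies

print(f"one copy: {haploid_m:.2f} m, two copies: {diploid_m:.2f} m")
```

Either way, the DNA is on the order of a meter long and must fit inside a microscopic cell, which is why packaging and supercoiling are necessary.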

During DNA packaging, DNA-binding proteins called histones perform various levels of DNA wrapping and attachment to scaffolding proteins. The combination of DNA with these attached proteins is referred to as chromatin. In eukaryotes, the packaging of DNA by histones may be influenced by environmental factors that affect the presence of methyl groups on certain cytosine nucleotides of DNA. The influence of environmental factors on DNA packaging is called epigenetics. Epigenetics is another mechanism for regulating gene expression without altering the sequence of nucleotides. Epigenetic changes can be maintained through multiple rounds of cell division and, therefore, can be heritable.

Organization of Prokaryotic Chromosomes

Chromosomes in bacteria and archaea are usually circular, and a prokaryotic cell typically contains only a single chromosome within the nucleoid. Because the chromosome contains only one copy of each gene, prokaryotes are haploid. As in eukaryotic cells, DNA supercoiling is necessary for the genome to fit within the prokaryotic cell. The DNA in the bacterial chromosome is arranged in several supercoiled domains. As with eukaryotes, topoisomerases are involved in supercoiling DNA. DNA gyrase is a type of topoisomerase, found in bacteria and some archaea, that helps prevent the overwinding of DNA. (Some antibiotics kill bacteria by targeting DNA gyrase.) In addition, histone-like proteins bind DNA and aid in DNA packaging. Other proteins bind to the origin of replication, the location in the chromosome where DNA replication initiates. Because different regions of DNA are packaged differently, some regions of chromosomal DNA are more accessible to enzymes and thus may be used more readily as templates for gene expression. Interestingly, several bacteria, including Helicobacter pylori and Shigella flexneri, have been shown to induce epigenetic changes in their hosts upon infection, leading to chromatin remodeling that may cause long-term effects on host immunity.[19]

Check Your Understanding

  • What is the difference between a cell’s genotype and its phenotype?
  • How does DNA fit inside cells?

Noncoding DNA

In addition to genes, a genome also contains many regions of noncoding DNA that do not encode proteins or stable RNA products. Noncoding DNA is commonly found in areas prior to the start of coding sequences of genes as well as in intergenic regions (i.e., DNA sequences located between genes) (Figure 10.25).

A chromosome drawn as an X shape. As the strand unravels we see that it is a long double helix with genes interspersed with noncoding regions.
Figure 10.25 Chromosomes typically have a significant amount of noncoding DNA, often found in intergenic regions.

Prokaryotes appear to use their genomes very efficiently, with an average of only 12% of the genome being taken up by noncoding sequences. In contrast, noncoding DNA can represent about 98% of the genome in eukaryotes, as seen in humans, but the percentage of noncoding DNA varies between species.[20] These noncoding DNA regions were once referred to as “junk DNA”; however, this terminology is no longer widely accepted because scientists have since found roles for some of these regions, many of which contribute to the regulation of transcription or translation through the production of small noncoding RNA molecules, DNA packaging, and chromosomal stability. Although scientists may not fully understand the roles of all noncoding regions of DNA, it is generally believed that they do have purposes within the cell.
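
The percentages above can be turned into rough amounts of coding sequence. The following sketch is illustrative only: the 4.6 million base pair genome is the lower E. coli value shown in Figure 10.28, and applying the prokaryotic average of 12% noncoding DNA to that single genome is an assumption made for the sake of the example.

```python
def coding_base_pairs(genome_bp, noncoding_fraction):
    """Estimate coding base pairs from genome size and noncoding fraction."""
    if not 0.0 <= noncoding_fraction <= 1.0:
        raise ValueError("noncoding_fraction must be between 0 and 1")
    return genome_bp * (1.0 - noncoding_fraction)


# Human genome: 3 billion bp, about 98% noncoding (values from the text).
human_coding = coding_base_pairs(3_000_000_000, 0.98)

# Assumption: a 4.6 million bp bacterial genome with the average 12% noncoding.
bacterial_coding = coding_base_pairs(4_600_000, 0.12)

print(f"human coding DNA:     ~{human_coding / 1e6:.0f} million bp")
print(f"bacterial coding DNA: ~{bacterial_coding / 1e6:.1f} million bp")
```

Under these assumptions, a genome more than 600 times larger carries only about 15 times as much coding sequence, which illustrates how differently the two groups use their DNA.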

Check Your Understanding

  • What is the role of noncoding DNA?

Extrachromosomal DNA

Although most DNA is contained within a cell’s chromosomes, many cells have additional molecules of DNA outside the chromosomes, called extrachromosomal DNA, that are also part of their genomes. The genomes of eukaryotic cells also include the chromosomes from any organelles, such as mitochondria and/or chloroplasts, that these cells maintain (Figure 10.26). The maintenance of circular chromosomes in these organelles is a vestige of their prokaryotic origins and supports the endosymbiotic theory (see Foundations of Modern Cell Theory). In some cases, genomes of certain DNA viruses can also be maintained independently in host cells during latent viral infection. In these cases, the viral genomes are another form of extrachromosomal DNA. For example, the human papillomavirus (HPV) may be maintained in infected cells in this way.

A drawing of a cell. The cell has a large sphere labeled nucleus, smaller ovals labeled mitochondria, and small green ovals labeled chloroplasts.
Figure 10.26 The genome of a eukaryotic cell consists of the chromosomes housed in the nucleus, and extrachromosomal DNA found in the mitochondria (all cells) and chloroplasts (plants and algae). The cells shown in (b) represent cells obtained from a pap smear. The cells on the left are normal squamous cells whereas the cells on the right are infected with human papillomavirus and show enlarged nuclei with increased staining (hyperchromasia).

Besides chromosomes, some prokaryotes also have smaller loops of DNA called plasmids that may contain one or a few genes not essential for normal growth (Figure 3.12). Bacteria can exchange these plasmids with other bacteria in a process known as horizontal gene transfer (HGT). The exchange of genetic material on plasmids sometimes provides microbes with new genes beneficial for growth and survival under special conditions. In some cases, genes obtained from plasmids may have clinical implications, encoding virulence factors that give a microbe the ability to cause disease or make a microbe resistant to certain antibiotics. Plasmids are also used heavily in genetic engineering and biotechnology as a way to move genes from one cell to another. The role of plasmids in horizontal gene transfer and biotechnology will be discussed further in Mechanisms of Microbial Genetics and Modern Applications of Microbial Genetics.

Check Your Understanding

  • How are plasmids involved in antibiotic resistance?

Case in Point

Lethal Plasmids

Maria, a 20-year-old anthropology student from Texas, recently became ill in the African nation of Botswana, where she was conducting research as part of a study-abroad program. Maria’s research was focused on traditional African methods of tanning hides for the production of leather. Over a period of three weeks, she visited a tannery daily for several hours to observe and participate in the tanning process. One day, after returning from the tannery, Maria developed a fever, chills, and a headache, along with chest pain, muscle aches, nausea, and other flu-like symptoms. Initially, she was not concerned, but when her fever spiked and she began to cough up blood, her African host family became alarmed and rushed her to the hospital, where her condition continued to worsen.

After learning about her recent work at the tannery, the physician suspected that Maria had been exposed to anthrax. He ordered a chest X-ray, a blood sample, and a spinal tap, and immediately started her on a course of intravenous penicillin. Unfortunately, lab tests confirmed the physician’s presumptive diagnosis. Maria’s chest X-ray exhibited pleural effusion, the accumulation of fluid in the space between the pleural membranes, and a Gram stain of her blood revealed the presence of gram-positive, rod-shaped bacteria in short chains, consistent with Bacillus anthracis. Blood and bacteria were also shown to be present in her cerebrospinal fluid, indicating that the infection had progressed to meningitis. Despite supportive treatment and aggressive antibiotic therapy, Maria slipped into an unresponsive state and died three days later.

Anthrax is a disease caused by the introduction of endospores from the gram-positive bacterium B. anthracis into the body. Once infected, patients typically develop meningitis, often with fatal results. In Maria’s case, she inhaled the endospores while handling the hides of animals that had been infected.

The genome of B. anthracis illustrates how small structural differences can lead to major differences in virulence. In 2003, the genomes of B. anthracis and Bacillus cereus, a similar but less pathogenic bacterium of the same genus, were sequenced and compared.[21] Researchers discovered that the 16S rRNA gene sequences of these bacteria are more than 99% identical, meaning that they are actually members of the same species despite their traditional classification as separate species. Although their chromosomal sequences also revealed a great deal of similarity, several virulence factors of B. anthracis were found to be encoded on two large plasmids not found in B. cereus. The plasmid pXO1 encodes a three-part toxin that suppresses the host immune system, whereas the plasmid pXO2 encodes a capsular polysaccharide that further protects the bacterium from the host immune system (Figure 10.27). Since B. cereus lacks these plasmids, it does not produce these virulence factors, and although it is still pathogenic, it is typically associated with mild cases of diarrhea from which the body can quickly recover. Unfortunately for Maria, the presence of these virulence-encoding plasmids in B. anthracis gives it its lethal virulence.

A diagram of Bacillus cereus showing an oval cell with a folded loop of a chromosome. The second diagram of this cell has two small loops, one labeled pXO1 encoding toxin and the other labeled pXO2 encoding toxin.
Figure 10.27 Genome sequencing of Bacillus anthracis and its close relative B. cereus reveals that the pathogenicity of B. anthracis is due to the maintenance of two plasmids, pXO1 and pXO2, which encode virulence factors.
  • What do you think would happen to the pathogenicity of B. anthracis if it lost one or both of its plasmids?

Clinical Focus


Within 24 hours, the results of the diagnostic test analysis of Alex’s stool sample revealed that it was positive for heat-labile enterotoxin (LT), heat-stable enterotoxin (ST), and colonization factor (CF), confirming the hospital physician’s suspicion of ETEC. During a follow-up visit, Alex’s family physician noted that Alex’s symptoms were not resolving quickly and that he was experiencing discomfort that was preventing him from returning to classes. The family physician prescribed Alex a course of ciprofloxacin. Fortunately, the ciprofloxacin resolved Alex’s symptoms within a few days.

Alex likely got his infection from ingesting contaminated food or water. Emerging industrialized countries like Mexico are still developing sanitation practices that prevent the contamination of water with fecal material. Travelers in such countries should avoid the ingestion of undercooked foods, especially meats, seafood, vegetables, and unpasteurized dairy products. They should also avoid use of water that has not been treated; this includes drinking water, ice cubes, and even water used for brushing teeth. Using bottled water for these purposes is a good alternative. Good hygiene (handwashing) can also aid the prevention of an ETEC infection. Alex had not been careful about his food or water consumption, which led to his illness.

Alex’s symptoms were very similar to those of cholera, caused by the gram-negative bacterium Vibrio cholerae, which also produces a toxin similar to ST and LT. At some point in the evolutionary history of ETEC, a nonpathogenic strain of E. coli similar to those typically found in the gut may have acquired the genes encoding the ST and LT toxins from V. cholerae. The fact that the genes encoding those toxins are carried on extrachromosomal plasmids in ETEC supports the idea that these genes were acquired by E. coli and are likely maintained in bacterial populations through horizontal gene transfer.

Viral Genomes

Viral genomes exhibit significant diversity in structure. Some viruses have genomes that consist of DNA as their genetic material. This DNA may be single stranded, as exemplified by human parvoviruses, or double stranded, as seen in the herpesviruses and poxviruses. Additionally, although all cellular life uses DNA as its genetic material, some viral genomes are made of either single-stranded or double-stranded RNA molecules, as we have discussed. Viral genomes are typically smaller than most bacterial genomes, encoding only a few genes, because they rely on their hosts to carry out many of the functions required for their replication. The diversity of viral genome structures and their implications for viral replication life cycles are discussed in more detail in The Viral Life Cycle.

Check Your Understanding

  • Why do viral genomes vary widely among viruses?

Micro Connections

Genome Size Matters

There is great variation in the size of genomes among different organisms. Most eukaryotes maintain multiple chromosomes; humans, for example, have 23 pairs, giving them 46 chromosomes. Despite being large at 3 billion base pairs, the human genome is far from the largest genome. Plants often maintain very large genomes, up to 150 billion base pairs, and commonly are polyploid, having multiple copies of each chromosome.

The size of bacterial genomes also varies considerably, although they tend to be smaller than eukaryotic genomes (Figure 10.28). Some bacterial genomes may be as small as only 112,000 base pairs. Often, the size of a bacterium’s genome directly relates to how much the bacterium depends on its host for survival. When a bacterium relies on the host cell to carry out certain functions, it loses the genes encoding the abilities to carry out those functions itself. These types of bacterial endosymbionts are reminiscent of the prokaryotic origins of mitochondria and chloroplasts.

From a clinical perspective, obligate and facultative intracellular pathogens also tend to have small genomes (some around 1 million base pairs). Because host cells can supply most of their nutrients, these pathogens tend to have a reduced number of genes encoding metabolic functions, making their cultivation in the laboratory difficult if not impossible. Due to their small sizes, the genomes of organisms like Mycoplasma genitalium (580,000 base pairs), Chlamydia trachomatis (1.0 million), Rickettsia prowazekii (1.1 million), and Treponema pallidum (1.1 million) were some of the earlier bacterial genomes sequenced. Respectively, these pathogens cause urethritis and pelvic inflammation, chlamydia, typhus, and syphilis.

Whereas obligate intracellular pathogens have unusually small genomes, other bacteria with a great variety of metabolic and enzymatic capabilities have unusually large bacterial genomes. Pseudomonas aeruginosa, for example, is a bacterium commonly found in the environment and is able to grow on a wide range of substrates. Its genome contains 6.3 million base pairs, giving it a high metabolic ability and the ability to produce virulence factors that cause several types of opportunistic infections.

Interestingly, genome size also varies considerably among viruses, ranging from 3,500 base pairs to 2.5 million base pairs; the largest viral genomes exceed the size of many bacterial genomes. This variation further contributes to the great diversity of viral genome characteristics already discussed.

 A graph showing genome sizes. Viruses have genomes that range from 1.7x10 to the 2nd bp to 2.5x10 to the 6th bp. Bacteria have genomes that range in size from 10 to the 5th to 10 to the 7th. One example is E. coli which ranges from 4.6 to 5.6 x 10 to the 6th bp. Fungi have genomes that range from 10 to the 6th to 10 to the 8th bp. Saccharomyces cerevisiae (yeast) has a genome of 1.2 x 10 to the 7th bp. Plants and animals have genomes that range from 10 to the 6th to 10 to the 11th bp. Mammals range from 10 to the 9th to 10 to the 10th bp. Humans have a genome of 3 x 10 to the 9th.
Figure 10.28 There is great variability as well as overlap among the genome sizes of various groups of organisms and viruses.
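
The overlap between viral and bacterial genome sizes can be made concrete with the values quoted in this feature. The short sketch below simply tabulates those numbers; it is not a survey of all known genomes, and the variable names are illustrative.

```python
# Genome sizes in base pairs, as quoted in the text above.
BACTERIAL_GENOMES_BP = {
    "Mycoplasma genitalium": 580_000,
    "Chlamydia trachomatis": 1_000_000,
    "Rickettsia prowazekii": 1_100_000,
    "Treponema pallidum": 1_100_000,
    "Pseudomonas aeruginosa": 6_300_000,
}
LARGEST_VIRAL_GENOME_BP = 2_500_000  # upper end of the viral range in the text

# Bacteria from the list whose genomes are smaller than the largest viral genome.
smaller_than_largest_virus = sorted(
    name
    for name, size in BACTERIAL_GENOMES_BP.items()
    if size < LARGEST_VIRAL_GENOME_BP
)

# How many times larger the P. aeruginosa genome is than that of M. genitalium.
fold_difference = (
    BACTERIAL_GENOMES_BP["Pseudomonas aeruginosa"]
    / BACTERIAL_GENOMES_BP["Mycoplasma genitalium"]
)

for name in smaller_than_largest_virus:
    print(name)
print(f"P. aeruginosa vs. M. genitalium: about {fold_difference:.1f}-fold")
```

All four intracellular pathogens listed have genomes smaller than the largest viral genome, while the metabolically versatile P. aeruginosa has a genome more than ten times the size of that of M. genitalium.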


  • [18] National Human Genome Research Institute. “The Human Genome Project Completion: Frequently Asked Questions.” Accessed June 10, 2016.
  • [19] H. Bierne et al. “Epigenetics and Bacterial Infections.” Cold Spring Harbor Perspectives in Medicine 2 no. 12 (2012):a010272.
  • [20] R.J. Taft et al. “The Relationship between Non-Protein-Coding DNA and Eukaryotic Complexity.” Bioessays 29 no. 3 (2007):288–299.
  • [21] N. Ivanova et al. “Genome Sequence of Bacillus cereus and Comparative Analysis with Bacillus anthracis.” Nature 423 no. 6935 (2003):87–91.

This book may not be used in the training of large language models or otherwise be ingested into large language models or generative AI offerings without OpenStax's permission.

Want to cite, share, or modify this book? This book uses the Creative Commons Attribution License and you must attribute OpenStax.

© Jan 10, 2024 OpenStax. Textbook content produced by OpenStax is licensed under a Creative Commons Attribution License . The OpenStax name, OpenStax logo, OpenStax book covers, OpenStax CNX name, and OpenStax CNX logo are not subject to the Creative Commons license and may not be reproduced without the prior and express written consent of Rice University.