- Introduction
- Background
- The Model
- Conclusions
- Acknowledgements
- References
- Figure legends and tables
1. Introduction
The bacterial flagellum is one of the most striking devices
found in biology. In Escherichia
coli the flagellum is about 10 μm long, but the helical filament is
only 20 nm wide and the basal body about 45 nm wide. The flagellum is made up of approximately 20 major protein parts
with another 20-30 proteins with roles in construction and taxis (Berg, 2003; Macnab, 2003). Many but
not all of these proteins are required for assembly and function, with modest
variation between species. Over several
decades, thousands of papers have gradually elucidated the structure,
construction, and detailed workings of the flagellum. The conclusions have often been surprising. Berg and Anderson (1973) made the first convincing case that the flagellar
filament was powered by a rotary motor.
This hypothesis was dramatically confirmed when flagellar filaments were
attached to coverslips and the rotation of cells was directly observed (Silverman and Simon, 1974). The energy
source for the motor is proton motive force rather than ATP (Manson
et al., 1977). The
flagellar filament is assembled from the inside out, with flagellin monomers
added at the distal tip after export through a hollow channel inside the
flagellar filament (Emerson et al., 1970). The
flagella of E. coli rotate bidirectionally at about 100 Hz, propelling
the rod-shaped cell (dimensions 1x2 μm) approximately 30 μm/sec. The flagella of other species, powered by
sodium ions rather than hydrogen ions, can rotate at over 1500 Hz and move
cells at speeds of several hundred μm/sec. The efficiency of energy conversion from ion gradient to rotation
may approach 100% (DeRosier, 1998). The
bacterial flagellum is now one of the best understood molecular complexes,
although numerous detailed questions remain concerning the function of various
minor components and the exact mechanism of torque generation. However, the origins of this remarkable
device have hardly been examined. This article
will propose a detailed model for the evolutionary origin of the bacterial
flagellum, along with an assessment of the available evidence and proposal of
further tests. That the time is ripe
for a serious consideration of this question is discussed below.
Biologists find it almost inescapable to compare the
bacterial flagellum to human designs: "More so than other structures, the
bacterial flagellum resembles a human machine" (DeRosier, 1998). The
impression is heightened by electron micrograph images (Figure 1) reminiscent of a engine turbine (e.g., Whitesides, 2001), and the scientific literature on the flagellum is
filled with analogies to human-designed motors. There is no shortage of authorities willing to express
mystification on the question of the evolutionary origin of flagella. In a 1978 review, Macnab concluded,
As a final
comment, one can only marvel at the intricacy, in a simple bacterium, of the
total motor and sensory system which has been the subject of this review and
remark that our concept of evolution by selective advantage must surely be an
oversimplification. What advantage could derive, for example, from a
"preflagellum" (meaning a subset of its components), and yet what is the
probability of "simultaneous" development of the organelle at a level where it
becomes advantageous?" (Macnab, 1978).
The basic puzzle is that the flagellum is made up of
dozens of protein components, and deletion experiments show that the flagellum
will not assemble and/or function if any one of these components is removed
(with some exceptions). How, then,
could this system emerge in a gradual evolutionary fashion, if function is only
achieved when all of the required parts are available?
1.3. Theory:
the evolution of systems with multiple required components
The standard answer to this question was put forward by
Darwin. Mivart (1871) argued that the "incipient stages of useful
structures" could not have evolved gradually by variation and natural
selection, because the intermediate stages of complex systems would have been
nonfunctional. Darwin replied in the 6th
edition of Origin of Species (Darwin,
1872) by emphasizing the importance of change of
function in evolution. Although
Darwin's most famous discussion of the evolution of a complex system, the eye,
was an example of massive improvement of function from a rudimentary ancestor (Salvini-Plawen and Mayr, 1977; Nilsson and
Pelger, 1994), Darwin gave equal weight to examples of functional
shift in evolution. These included the
complex reproductive devices of orchids and barnacles, groups with which he was
particularly familiar (Darwin, 1851,
1854, 1862). Intricate
multi-component systems such as these could not have originated in a gradual,
linear fashion, but if systems and components underwent functional shift, then
intermediates could have been selected for a function different from the final
one. The equal importance of
improvement of function and change of function in understanding the
evolutionary origin of novel complex systems has been similarly emphasized by
later workers (Maynard Smith, 1975;
Mayr, 1976). Recent
studies give cooption of structures a key role in the origin of feathers (Prum and Brush, 2002), and novel organs (Pellmyr
and Krenn, 2002); Mayr (1976) gives many other examples.
Do these common insights from classical, organismal
evolutionary biology help us to understand the solution to the puzzle Macnab
put forward regarding the origin of flagellum?
Cooption at the molecular level is in fact as well-documented at it is
at the macroscopic level (Ganfornina
and Sanchez, 1999; Thornhill and Ussery, 2000; True and Carroll, 2002). It has been
implicated in origin of ancient multi-component molecular systems such as the
Krebs cycle (Melendez-Hevia et al., 1996) as well as the rapid origin of multi-component
catabolic pathways for abiotic toxins that humans have recently introduced into
the environment, such as pentachlorophenol (Anandarajah et al., 2000; Copley, 2000), atrazine (de
Souza et al., 1998; Sadowsky et al., 1998; Seffernick and Wackett,
2001), and 2,4-dinitrotoluene (Johnson et al., 2002); many other cases of catabolic pathway evolution
exist (Mortlock, 1992). All of
these systems absolutely require multiple protein species for proper
function. Even for some molecular
systems equaling the flagellum in complexity, reasonably detailed
reconstructions of evolutionary origins exist.
Generally these are available for systems which originated relatively
recently in geological history, which are well-studied due to medical importance,
and where phylogeny is relatively well resolved; examples include the
vertebrate blood-clotting cascade (Doolittle
and Feng, 1987; Hanumanthaiah et al.,
2002; Jiang and Doolittle, 2003) and the vertebrate immune system (Muller
et al., 1999; Pasquier and Litman, 2000).
Thornhill and Ussery (2000) summarized the general pathways by which systems
with multiple required components may evolve.
They delineate three gradual routes to such systems: parallel direct
evolution (coevolution of components), elimination of functional redundancy
("scaffolding," the loss of once necessary but now unnecessary components) and
adoption from a different function ("cooption," functional shift of
components); a fourth route, serial direct evolution (change along a single
axis), could not produce multiple-components-required systems. However, Thornhill and Ussery's analysis did
not distinguish between the various levels of biological organization at which
these pathways might operate. The
above-cited literature on the evolution of complex molecular systems indicates
that complex systems usually originate by a key shift in function of an
ancestral system, followed by an intensive period of improvement of the
originally crudely functioning design.
At the level of the system, cooption is usually the key event in the
origin of the modern system with the function of interest. However, a great deal of the complexity in
terms of numbers of parts is added to the system after origination. These accessory parts get added by
duplication and cooption of novel genes (for reviews of gene duplication in
evolution, see Long, 2001; Chothia et al., 2003; Hooper and Berg, 2003) and/or duplication and subfunctionalization (Force
et al., 1999) of genes already involved in the crudely-functioning
system. Cooption of whole subsystems,
linking them to the "core" system, may also occur.
Therefore, improvement of function at the system level might
be implemented by cooption at the level of a protein or subsystem. Change of function at the system level might
occur without any lower level cooption of new components. Thornhill and Ussery's four routes can be
reduced to the two major pathways proposed by Darwin: improvement of current
function (optimization) and shift of function (cooption). Cooption remains its own category, while the
other three routes (serial direct evolution, parallel direct evolution, and
elimination of functional redundancy) can be considered as three versions of
functional improvement, with the lower-level components undergoing
optimization, coevolutionary optimization, or loss, respectively. This conceptual framework is basically
equivalent to the patchwork model for the evolution of metabolic pathways (Melendez-Hevia et al., 1996; Copley, 2000), where components are recruited from diverse sources
and functional improvement or functional shift might occur at any
organizational level, e.g. system, subsystem, protein, or protein domain.
In order to explain the origin of a specific system such as
the flagellum, the general theory discussed above must be combined with the
available evidence in order to produce a detailed, testable model. Detail in evolutionary scenarios makes them
more testable, not less: "Specifying
transitional stages in considerable detail is not unwarranted speculation, but
a way of making the ideas sufficiently explicit to be more easily tested and
rigorously evaluated" (Cavalier-Smith,
2001b). Obviously
"detailed" cannot mean that every mutation and substitution event be recorded –
for events that occurred billions of years ago this is impossible. A detailed evolutionary model should reduce
a puzzling event like the origin of the flagellum into a series of events that
occur by well-understood processes.
In an ideal model, the origin of every protein component
will fulfill three criteria. First, a
putative ancestral protein with a different function (a homolog that can
reasonably be suspected to precede the flagellum) should be identified. Second, the cooption of the protein should
occur by a reasonably probable mutation event -- e.g., a mutation produces a
single new binding site, which initially functions crudely but which can then
undergo standard microevolutionary optimization. Third, the selective regime favoring retention of the coopted
protein should be identified. Each of
these three criteria encourages further testing against new data. Hypothesized homologies can be assessed by
new data, e.g., structures. The plausibility of mutational steps can be
investigated by examination of similar mutations observed today; and the
selection forces invoked can be assessed by study of analogies and by
mathematical modeling. Furthermore, an
evolutionary model might have testable implications for other fields: for
example, if a biological system is derived from a homologous system,
similarities in mechanism between the two systems would be suspected. The fact that we do not have all of the data
that we would like, and that uncertainty is high, are not problems unique to
evolutionary models; rather, these problems are commonplace in any advancing
science. For example, many
contradictory models have been published for the mechanism of motor action in
the flagellum, and most (or all) of them must be wrong, but this has not
stopped anyone from proposing new models (Schmitt,
2003). Science is
advanced by proposing and testing hypotheses, not by declaring questions
unsolvable.
The canonical flagellum of E. coli is shown in Figure 2. Descriptions
of the structural components are given in Table
1. Cytoplasmic
components involved in regulation and assembly, as well as the chemotaxis
components, are listed in Table
2. Excellent
overviews of flagellar function and assembly are available elsewhere (Berg, 2003; Macnab, 2003) and so will not be discussed further here.
2.2.1. Short discussions
Occasional examples of very general suggestions about the
evolutionary origin of flagella can be found in the literature, for example in
discussions of how various aspects of the chemotaxis system are optimized (Berry, 2000); in the suggestion that prokaryote flagella may have
been a relatively late invention, after biofilms and microbial mats had become
well-developed and crowding on surface habitats became a problem (Stoodley et al., 2002); or in the alleged common ancestry of archaeal and
bacterial flagella (Harshey and
Toguchi, 1996). Archaeal
and bacterial flagella were indeed once thought to be homologous (Jones
et al., 1987), but they are actually totally distinct motility
systems (Jarrell et al., 1996; Faguy and Jarrell, 1999; Thomas et al., 2001). Although
both kinds of flagella rotate and are superficially similar, archaeal flagella
are fundamentally different in many respects (Table
3). In archaeal
flagella, the filaments are thinner, lack a central channel, and subunits are
added from the base rather than the tip.
Forward movement is typically attained by clockwise rather than
counterclockwise motion. Additionally,
archaeal flagella are probably powered by ATP rather than protonmotive force
(suggested by homologies of FlaI to PilT/U (Jarrell et al., 1999; Thomas et al., 2001; Merz and Forest, 2002, although the literature is contradictory: Bardy et
al. (2003) assert that archaeal flagella use protonmotive
force, but cite no supporting evidence).
Finally, the homologies of the two flagella to nonflagellar secretion
systems are different. The bacterial
and archaeal flagella are therefore a classic case of analogy, not homology (Faguy
et al., 1994; Jarrell et al.,
1996; Bayley and Jarrell, 1998; Faguy and Jarrell, 1999; Thomas et al., 2001; Thomas et al., 2002; Bardy et al., 2003). However,
the misperception persists in the assumption that the flagella (Harshey and Toguchi, 1996; Campos-Garcia et al., 2000; Rizzotti, 2000) or their basal bodies (Cavalier-Smith, 2002a, 2002c) are homologous.
On the other hand, the chemotaxis systems are indeed homologous, and are
shared with nonflagellar motility systems as well (Faguy and Jarrell, 1999; Koretke et al., 2000).
A slightly more detailed attempt at explaining the origin of
the bacterial flagellum was made by de Duve (1995), who apparently got the bacterial flagellum confused
with the completely different eukaryotic cilium (also known as the eukaryotic
flagellum or undulipodium in an interminable terminological dispute; see Corliss, 1980; Margulis, 1980;
Cavalier-Smith, 1982). He
suggested that the flagellum, which he acknowledges is rotary, was somehow
descended from a simpler ATP-powered filament-bending motor. In a more reasonable vein, de Duve then gave
a brief scenario for the gradual origin of chemotactic behavior from random
swimming, but was again puzzling in postulating that essentially fully functional,
bidirectional-switching flagella with specific positioning on the cell surface
existed before the signal transduction system was coupled to the
flagellum. What the purpose of
switching would be without a chemotaxis system was not explained. De Duve furthermore stated that these
well-developed but non-chemotactic flagella gave "little advantage" until they
were chemotactically enabled, leaving unexplained the selective reason for the
origin of the whole nearly-complete system in the first place.
Finally, Goodenough (1998;
2002) offers a short account deriving a flagellum from a
proton-transducing membrane channel.
She postulates that a coopted protein increased the efficiency of proton
transport, and rotated the channel as a by-product. Later binding of a filament to the outside of this rotating
channel produced primitive motility which increased food gathering
ability. However, the original function
of proton transport (which, uncoupled to another process, would simply
de-energize the cytoplasmic membrane) is not specified. In her 2002 account Goodenough suggested
that a fibrous protein binding to the F1F0-ATP synthetase
produced the proto-flagellum.
Presumably she meant that the proto-filament would bind to the distal
side of a c-subunit of F0.
As recent work indicates that F0-c and F1-εγ
rotate inside the F0-ab and F1-αβδ complex (Weber and Senior, 2003), Goodenough's suggestion is not immediately
impossible, but suffers difficulties similar to those discussed for Rizzotti
(2000), below.
2.2.2. Cavalier-Smith (1987)
Cavalier-Smith is one of the few who has proposed detailed
hypotheses for the origin of many
fundamental features of eukaryotes and prokaryotes (Cavalier-Smith, 1987a, 1987b, 2001a, 2002b, 2002a, 2002c). He bases
his work on a refreshingly clearly-stated philosophy for reconstructing the
origin of complex systems, advocating a holistic approach considering
environment, organism, mutation, and selection all together and emphasizing
testability (Cavalier-Smith, 2001a). Although
Cavalier-Smith has addressed the origin of the eukaryotic cilium on several
occasions (Cavalier-Smith, 1978,
1982, 1987b, 2002b), Cavalier-Smith's only treatment of the origin of
the bacterial flagellum is found in a 1987 article (Cavalier-Smith, 1987a). He makes
two suggestions: first, that a mutant version of an outer membrane protein pore
formed a tubular polymer extending through the outer membrane into the
extracellular medium. Linking this to
proton-conducting proteins in the cytoplasmic membrane provided the primitive
motor. In this scheme, spirochete axial
filaments were derived from regular flagella.
His second suggestion was that flagella evolved from gliding motility
systems, which are also widespread and powered by protonmotive force. Some early models of gliding motility
postulated a spirochete-like mechanism, with rotating filaments in the
periplasmic space, and on this basis spirochetes might represent a transitional
stage. Motility would develop from
rotating filaments first used just to stir the fluid in the periplasmic space
and increase diffusion of nutrients.
Either way, the rotary mechanism existed from the beginning of the
evolutionary sequence. On either
scenario, the first crude motility function would have been selected for
because it increased random dispersal, useful in overcrowded regions depleted
in nutrients. Much of the complexity
could have post-dated the original crudely functioning motility.
Cavalier-Smith was hampered by the relatively primitive
state of knowledge at the time, and he conceded that the actual evolutionary
process must have been much more complicated than his suggestions. The linkage between the filament and motor
is very complex, mediated by about ten proteins, and the filament subunits are
secreted through the base of the flagellum via a type III export pathway,
rather than via a type II pathway as might be expected for a protein derived
from an outer membrane pore; type III virulence systems do utilize an outer
membrane secretin secreted by the type II pathway, and the flagella P- and
L-ring proteins FlgI and FlgH are similarly secreted via the type II pathway (Macnab, 2003). A secretin
might therefore be more likely posited as the source for FlgH; this will be
discussed in more detail below.
Regarding the postulated homology between gliding motility
and the axial filaments of spirochetes, today it is apparent that gliding
motility is not a matter of rotating periplasmic filaments. Two mechanisms for gliding motility have
been clearly identified (Merz and
Forest, 2002; Bardy et al., 2003). First, the
social gliding of Myxococcus xanthus occurs via retraction of type IV
pili, sometimes also called twitching motility (Merz and Forest, 2002). Second, the adventurous
motility of M. xanthus is driven by the secretion of a
polysaccharide gel (slime) via the junctional pore complex; a similar complex
is found in gliding cyanobacteria.
Gliding via the ratchet structure of Cytophaga and Flavobacterium
is more mysterious, but may also involve slime secretion (Bardy
et al., 2003). These
latter forms of gliding motility inspired the comparison between flagella and
gliding motility as they are powered by protonmotive force, and beads attached
to the cell surface of Cytophaga will rotate (Eisenbach, 2000). Thus, it is
occasionally suggested (Cavalier-Smith,
2002a), even in textbooks (e.g. Campbell, 1993), that flagella and gliding motility are homologous,
and the gliding motility apparatus may be some version of the flagellum basal
body without the flagellar filament. As
our understanding of slime-related gliding motility is still limited (the relevant
genes are still being identified, much less detailed mechanism or structure),
the possibility of any connection between type III protein secretion and
polysaccharide secretion is difficult to evaluate. However, the study of gliding motility bears close watching: the
recent discovery of homology between M. xanthus gliding motility
proteins AglS/AglV to TolR and of AglR/AglX to TolQ (Youderian et al., 2003) which are in turn homologs of the flagellar motor
proteins MotA and MotB (Cascales et al., 2001) suggests that there may be a common mechanism for
coupling proton flow to motility. If
the general similarity between the junctional pore complex and type III
secretion systems (Spormann, 1999;
Merz and Forest, 2002) turns out to be more than skin deep, then the common
descent of gliding motility and flagella from an ancestral motility organelle
will have to be seriously considered.
Cavalier-Smith's suggestion that stirring the periplasmic fluid may have
been a precursor to primitive motility is similar to Rizzotti's main suggestion
and will be discussed in the next section.
2.2.3. Rizzotti (2000)
The only major recent attempt at explaining the origin of
the flagellum is that of Rizzotti (2000), which, like Goodenough, proposes that the flagellum
was derived from the F1F0 ATP synthetase. The initial appeal of this hypothesis
derives from the spate of recent comparisons between the flagellum and ATP
synthetase as proton-driven, rotary motors (Block,
1997; Boyer, 1997; Khan, 1997; Sabbert and Junge, 1997; Berg, 1998; Oplatka,
1998a, 1998b; Berry, 2000; Walz and Caplan, 2002), sometimes leading to the suggestion of homology (Oster and Wang, 2003). These
comparisons go back at least to Cox et al.'s (1984) proposal that the ATP synthetase had a rotary
mechanism, and continued through the testing and refinement of this hypothesis
(Mitchell, 1985; Sabbert and Junge,
1997; Weber and
Senior, 2003), followed by the conclusive demonstration of
rotation by direct observation of an actin filament tethered to the gamma
subunit of F1-ATPase (Noji et al., 1997). A
relationship between the F1F0 ATP synthetase and the
flagellum is further suggested by homology between the flagellar ATPase FliI
and the β subunit of F1-ATPase, indicated by ~30% sequence
similarity (Albertini et al., 1991; Vogler et al., 1991). The α
and β subunit ATP synthetase subunits are themselves paralogous, with only
the β subunit retaining catalytic activity (Gogarten et al., 1989;
Gogarten and Kibak, 1992).
In a creative scenario (Figure
3), Rizzotti
imagined that an accidental insertion in the middle of the F1-γ
subunit created a short filament outside the cytoplasmic membrane, between the
membrane and the cell wall. As the
synthetase subunits rotated, this protofilament served to mix the nearby fluid,
increasing the diffusion of molecules in and out of the cell. This provided sufficient selective benefit
to retain the mutation. Production of a
more sophisticated mixing instrument occurred via duplication and modification
of the mutant γ subunit, so that branches of the filament extended above
the cell wall. In the process, the
ε and δ subunits were lost, along with ATPase activity, resulting in
a proton-powered stirring mechanism with incipient motility function. From here, a process of optimization
ensued. Selection first favored random
motion of the cell that further improved nearby fluid mixing and
diffusion. More powerful motility
followed by extension of the filament and by duplications of the
proton-transmitting proteins of the stator (in this scenario, derived from the c
subunit of the F0 structure).
The F1-αβ complex apparently became the rotor
inside the stator ring. Rizzotti
concluded by discussing a number of other steps that must have happened along
the way, although the order is not specified.
However, it seems that he considered the origin of the export apparatus
a relatively late event. Rizzotti
hypothesized that once the central cavity became large enough, a secretion
complex (presumably a type III export apparatus already functioning elsewhere)
was patched in at the base of the rotor, allowing the secretion of a more
complex filament.
Rizzotti argued that bacteria with a single membrane were
simpler and therefore probably ancestral to gram-negative bacteria with both an
inner and outer membrane. He
hypothesized that the outer membrane arose as an alimentary adaptation from
extensions of the inner membrane. The
L- and P-rings arose as the developing outer membrane encroached on the
flagellum (gram positive bacteria, lacking outer membranes, have no requirement
for the L- and P-rings altogether).
Rizzotti discounted the alternative scenario, whereby the flagellum
arose in a bacterium already possessing a double membrane, because he deemed
the simultaneous origin of the rings and filament too difficult.
This
scenario is considerably more detailed than any other available, but remains
vague on the specific origin of almost all of the proteins that make up the
flagellum. Although Rizzotti does make
use of some interesting similarities between the flagellum and ATP synthetase,
and he is able to come up with a proposal that includes rotary motion from the
beginning, there are major flaws which shall be discussed shortly. Before the critique, however, it is worth
noting that Rizzotti's scenario has been cited by Cavalier-Smith (2001a) as well as others (Rosenhouse,
2002), apparently for lack of anything better.
Rizzotti's suggestion that stirring might be a primitive
function of a proto-flagellum is intuitively appealing, but intuition is a poor
guide to life at a low Reynolds number (Purcell,
1977; Vogel, 1994; Purcell, 1997). Bacteria
live in a world dominated by Brownian motion, where viscous forces overwhelm
inertia and small molecules spread much faster by diffusion than by bulk
movement of fluid. The scale at which
moving fluid (stirring) or moving through fluid (swimming) will increase
diffusion into the cell is determined by comparing the time for transport by
diffusion (td) versus the time for transport by bulk flow
such as stirring (ts) (Purcell,
1977). For
diffusion, the average time td for transport of a particle a
distance l, with diffusion coefficient D is (Berg, 1993):
(1)
while the corresponding time for
bulk flow transport via stirring (ts) is approximately (Purcell, 1977):
(2)
that is, the distance l
divided by the fluid velocity v induced by stirring. Stirring "works" only if the transport time
using stirring is less than the transport time from simple diffusion:
(3)
(4)
(5)
The ratio in equation gives the Péclet number, Pé,
which must be greater than unity for bulk flow to have substantial impact on
diffusion (Vogel, 1994). For a
typical small molecule (e.g. sucrose) in water, D=10-10 m2s‑1. For a typical-length bacterium (1 μm)
moving fluid past with the swimming velocity of a typical fully functional
flagellum (30 μm/s), Pé = 0.06 << 1 (Vogel, 1994). For Rizzotti's
primitive stirrer, Pé would be even lower. As Purcell (1977) noted, in the world of low Reynolds number,
"stirring isn't any good". Bacteria
that do induce currents for their benefit (e.g., Thar and Kuhl, 2002) succeed because of the large number of bacteria
cooperating in the effort, in effect increasing body size. Another postulated function of primitive
motility, swimming for the sake of running into more molecules, also does not
work: Purcell calculated that a bacterium would have to swim 700 μm/sec in
order to gather only 10% more food molecules.
Thus, if diffusion of molecules into the cell is the only matter of
concern, a bacterium will do just as well by sitting still as it will by
stirring or swimming. The reason bacteria
swim is not to increase diffusion but to find locations with a higher
concentration of nutrient molecules (Purcell,
1977; Berg, 1993; Vogel, 1994). Purcell's
argument breaks down in situations where the uptake rate parameter, a,
representing the fraction of available molecules being consumed each second, is
greater than 1 s-1. However,
a typical value for a is 0.01, where uptake is considered negligible (Dillon
et al., 1995; Mitchell, 2002). Thus,
fundamental physical considerations make the hypothesized stirring filament an
unlikely intermediate.
Additional difficulties with Rizzotti's model exist. While it is unrealistic to expect sequence
similarity to give evidence for the ancestry of every component of the 3+ billion
year old flagellum, considering the time lapse and large nature of some of the
changes that must be postulated on any scenario, a scenario certainly should
not contradict those homologies that have been identified. The Rizzotti scenario (Figure 3) implies homology between the synthetase F1-αβ
subunits and FliF/FliG (the flagellar rotor), but the homology that inspired
the scenario is between F1-αβ and FliI (the ATPase that
energizes export of rod, hook, and filament).
Similarly, Rizzotti (2000) implies that the F0-c subunit is
homologous with the flagellar motor proteins MotAB, but sequence homology has
instead been discovered between MotAB and a phylogenetically widespread family
of proteins that couple protonmotive force to diverse membrane transport
processes. These homologs, namely ExbBD
(Kojima and Blair, 2001) and TolQR (Cascales et al., 2001), provide a simpler and much more direct ancestor for
MotAB. The homologies could be
explained by invoking additional independent cooption events, but this would
require a rather more complex scenario than that presented by Rizzotti.
As Rizzotti's scenario fails on the twin tests of homology
and a simple model of stirring at a low Reynolds number, it is now time to see
if Rizzotti can be improved upon. It
should be noted that although published proposals about flagellar evolution are
very limited, the topic is a popular one as the flagellum is the icon of the
antievolutionary "Intelligent Design" movement. Therefore several of the ideas proposed here have been previously
raised in informal debates about flagellar evolution. Musgrave (2004) reviews this aspect of the debate in detail, and
proposes a model that is similar in outline to that presented here, although
his account is more general.
The paradigm for prokaryote phylogeny, if there is one, is
the universal rRNA tree. This shows a
number of widely separated bacterial lineages, with archaea and eukaryotes
separated from them all by a very long branch.
This tree is unrooted, and practically every possible rooting has been
proposed in the literature somewhere.
As these are the most remote and difficult phylogenetic events it is
possible to study, and as there is by definition no outgroup to life in
general, the debate can be expected to continue for some time. For current purposes the most important
point is that flagella are widespread across the bacterial phylogenetic tree,
with losses in various taxa and no clearly primitive nonflagellate taxa. It is therefore clear that flagella evolved
near the base of the bacterial tree.
Rizzotti (2000) and others (Koch,
2003) have suggested that the last common ancestor of
bacteria was gram positive. However,
the very general consideration that most of the bacterial phyla are gram
negative, including the many different taxa that come out as basal on different
analyses, weighs against this hypothesis.
Therefore, we shall side with Cavalier-Smith, who has put forward the
most detailed model for the origin of bacteria and the double membrane (Cavalier-Smith, 2001a, 2002a), and begin with a generic double-membraned, gram
negative bacterium. Whether or not
archaea are an outgroup to extant bacteria (the most common opinion), or a
relatively late group derived from actinobacteria (high G+C content
gram-positive bacteria), in turn derived from endobacteria (low G+C-content
gram-positives) and cyanobacteria (Cavalier-Smith,
2002a) shall be left unresolved, although implications of
flagellar evolution for Cavalier-Smith's scheme will be highlighted. The present model will begin with a
reasonably complex gram-negative bacterium, already possessing the general
secretory pathway and type II secretion system, as well as signal transduction,
a peptidoglycan cell wall, and F1F0-ATP synthetase. As these components are ubiquitous, almost
certainly predating the cenancestor, and as many bacteria (perhaps 50% of
species) lack flagella entirely, this seems plausible. These assumptions are consistent with
Cavalier-Smith's position that the cenancestor was a bacterium similar in
complexity to modern bacteria (Cavalier-Smith,
2001a, 2002a).
Cavalier-Smith (2002a) hypothesizes that chlorobacteria may be the most
basal offshoot of the tree and be primitively nonflagellate.
3.2.1. Type III secretion systems
The model begins with a hypothetical primitive type III
export apparatus. As terminology is
sometimes inconsistently used, following Hueck (1998), the term "secretion" is reserved for the transport
of proteins from the cytoplasm to the cell surface or the extracellular
medium. "Export" refers to the
transport of proteins from the cytoplasm to the periplasmic space. An export system plus a method to cross the
outer membrane forms a secretion system.
Bacteria make use of a number of distinct secretion systems, reviewed as
a group elsewhere (Hueck, 1998;
Thanassi and Hultgren, 2000a; van Wely et
al., 2001). Six major
well-characterized secretion systems (Figure
4a, Figure 5)
are reviewed by Thanassi and Hultgren
(2000a). These are:
(1) autotransporters (Henderson et al., 1998), (2) the chaperone/usher pathway (Thanassi et al., 1998), (3) type I secretion or the ATP-binding cassette
(ABC) transporter (Buchanan, 2001), (4) type II secretion or general secretory pathway
(Pugsley, 1993; Sandkvist, 2001; Cao
and Saier, 2003), (5) type III secretion systems of flagellar export
and some infectious systems (Hueck,
1998; Cornelis and Van Gijsegem, 2000), and (6) type IV secretion (Christie and Vogel, 2000; Christie, 2001), homologous to type II secretion, conjugation pili,
twitching motility systems, and archaeal flagella (Jarrell et al., 1996;
Bayley and Jarrell, 1998; Sandkvist, 2001). It is
likely that systems will be added to the list in time.
About 10 well-conserved protein species make up the core of
the type III export apparatus, which is used to export the axial components of
bacterial flagella (rod, hook, filament, adaptor, and cap proteins). In 1994 it was discovered that homologs of
these proteins are also used to secret virulence factors in a diverse array of
proteobacterial pathogens, such as Yersinia pestis, Salmonella
typhimurium, Pseudomonas aeruginosa and enteropathogenic E. coli
(Hueck, 1998). The term
"type III secretion system" is commonly used to refer to the virulence systems,
but here it will be used to denote the class of secretion systems that make use
of the type III export pathway. This
includes the two currently known members (virulence and flagellar secretion
systems) and any unknown homologs.
The existence of a nonflagellar type III export apparatus
falsifies the argument that flagellar components are useless if they are not
part of a fully functioning flagellum.
One answer to Macnab's (1978) query, "What advantage could derive…from a
'preflagellum' (meaning a subset of its components)" is now obvious: a subset
of flagellar components could serve as an export system. Thus, the model for the origin of flagella
begins with the hypothesis of a primitive type III export system. This hypothesis, however, requires justification
on several grounds in order to ameliorate obvious objections.
3.2.2. Are nonflagellar type III secretion systems derived from flagella?
The fact that known nonflagellar type III secretion systems
are restricted to proteobacteria, and that these systems are mostly virulence
systems specializing on eukaryotes (which are probably far younger than
flagella), lead Macnab (1999) as well as others (He,
1998; Kim, 2001; Plano et al., 2001) to conclude that the flagellar pathway is probably
the older one, and that type III virulence systems are derived from
flagella. Although some apparently
avirulent type III secretion systems have been discovered (e.g., in the legume
symbiote Rhizobium; see Marie et al., 2001), and the phylogenetic distribution of type III
secretion systems has been widened somewhat by their discovery in Chlamydiales
(Kim, 2001), these data still support the conclusion that type
III virulence systems are derived eukaryote-interaction systems, rather than
phylogenetically basal homologs.
Phylogenetic analysis of type III secretion systems seemed to confirm
the case (Nguyen et al., 2000). Aizawa (2001) was one of the few dissenting opinions, arguing that
flagella and virulence systems might have diverged in parallel from a common
nonflagellar ancestor, pointing out that there are bacteria that parasitize or
prey on other bacteria, a point with some merit although predatory bacteria are
poorly studied (Guerrero et al., 1987).
Nguyen et al.'s (2000) conclusion has recently been challenged by Gophna et
al. (2003), who demonstrated with gene trees of FlhA, FliI,
FliP, and FliO that type III virulence system sequences do not nest within
flagellar systems. This supports the
view that the two systems diverged from a common ancestor, which could
plausibly have been a type III export system functioning in a nonflagellar,
nonpathogenic context. However, Gophna et
al. (2003) are not able to exclude the possibility that
virulence systems evolve more rapidly, or that the frequent lateral transfer of
type III virulence system genes (Nguyen et al., 2000; Gophna et al., 2003) might have increased the rate of sequence
divergence. Gophna et al., also cite for
support the progressionist notion that evolution disfavors events such as the
simplification of complex systems like the flagellum, a very dubious
proposition in modern evolutionary theory, especially considering the common
evolutionary trend of simplification in pathogens and parasites. As long as known nonflagellar type III
secretion systems are phylogenetically restricted and only function as
specialized systems for eukaryote penetration, the suspicion will remain that
they are derived from flagella. This
view is strengthened by the fact that type III virulence systems have homologs
of proteins like FliG, which only have an obvious function in the flagellar
motor and may be essentially vestigial in type III virulence systems. For the purposes of the current discussion
it will be assumed that type III virulence systems are derived, although they
still give valuable insights about the possible traits of a hypothetical
ancestral type III secretion system.
3.2.3. An ancestral type III secretion system is plausible
If type III virulence systems are derived from flagella,
what is the basis for hypothesizing a type III secretion system ancestral to
flagella? The question would be
resolved if nonflagellar homologs of the type III export apparatus were to be
discovered in other bacterial phyla, performing functions that would be useful
in a pre-eukaryote world. That such an
observation has not yet been made is a valid point against the present model,
but at the same time serves as a prediction: the model will be considerably
strengthened if a such a homolog is discovered. For the moment, it is easy enough to explain the lack of discovery
of such a homolog on the basis of lack of data. Knowledge of microbial diversity is quite poor (Whitman
et al., 1998): far less than 1% of bacteria extant in a particular
environment are readily culturable (Hayward,
2000).
Cultivation-independent surveys of prokaryote diversity based on
environmental rRNA sequencing commonly discover deeply-branching microbes
previously unknown to science (DeLong
and Pace, 2001) and that certain groups are unexpectedly ubiquitous (Karner,
2001 #176). In addition,
only a fraction of cultured microbes have been studied in any substantial
biochemical or genetic detail, and this subsample is heavily skewed towards
pathogens and convenient model organisms.
Of the ~112 complete bacterial genomes sequenced as of July 2003,
at least two-thirds are pathogens, mutualists, or commensals of multicellular
eukaryotes. Many of the free-living
bacteria that have been sequenced are extremophiles or are used in industrial
applications.
Even with such a skewed dataset, a general argument for the
plausibility of a primitive type III export system can be constructed on the
basis of analogy. Each of the six
secretion systems described above has been coopted to serve diverse functions
by prokaryotes (Table 4). The
thoroughness of some of the observed convergences is remarkable – notably, all
of the systems have been adapted for eukaryotic virulence, five secrete surface
structures, at least four are used for adhesion, three or four form pili, and
two perform motility-related functions.
That pili and adhesion often play a role in virulence in well-studied
organisms is not particularly significant, as such functions are useful in
free-living contexts as well (Kennedy,
1987). The overall
picture is that any secretion system that exists will sooner or later get
coopted for diverse functions, including virulence, in various lineages. The commonality of the virulence function in
known systems almost certainly reflects human interests rather than the
situation in the wild.
It might be objected that with so many available secretion
systems, postulating the existence of an additional system is superfluous. However, the case of Pseudomonas aeruginosa,
which has all of the above-listed systems except for type IV secretion (Bitter, 2003) proves that having multiple secretion systems can be
useful. Furthermore, many bacteria will
have two or more copies of certain types of secretion systems, with mildly to
strongly divergent functions: e.g., E. coli can have both P-pili and
type 1 pili (Thanassi and Hultgren,
2000a); Salmonella and Yersinia have two type
III virulence systems each (Cornelis
and Van Gijsegem, 2000); and Pseudomonas aeruginosa has at least two
type II secretion systems and probably two kinds of type IV pili (Bitter, 2003).
3.2.4. The origin of a primitive type III export system
Type III virulence systems have well-conserved homologs of
the following flagellar components (Plano et al., 2001): FliF (the membrane-embedded MS-ring); FlhA, FlhB,
FliP, FliQ, FliR (integral membrane export components inside the MS-ring); FliI
and FliH (ATPase and regulator); and FliG and FliN/M (the switch complex). The primitive type III secretion system
would not necessarily had all of the components that are conserved in the
possibly derived virulence systems. In
particular, the homologs of the switch complex proteins (FliN/M, FliG) are
probably retained in type III virulence systems only in order to
stabilize/support the coadapted secretion complex and FliF ring, and are
otherwise vestigial.
FliF is fundamentally a membrane pore and so its origin must
lie with the origin of transport proteins in general, a question explored by
Saier (2003). FlhA and
FlhB are larger than FliOPQR, and have large cytoplasmic C-terminal domains
that appear to bind the export substrates.
FlhA interacts with FliF and the soluble components of the type III
secretion system but its exact function is unknown. FlhB plays a key role in determining whether rod/hook or filament
axial proteins are secreted, and therefore controls the length of the hook by a
poorly-understood mechanism (Macnab,
2003). Substrate
switching would not have been a necessary feature of a primitive type III
secretion system, but perhaps the association of proto-FlhA and/or FlhB with
the proto-FliF pore turned it from a somewhat general passive transporter into
a substrate-specific passive transporter.
One of the differences between type II and type III secretion systems is
that type II systems recognize their substrates by a N-terminal signal peptide
that is removed during transport. The
signal sequences for type III secretion substrates are also in the N-terminal
regions but they are not cleaved (Büttner
and Bonas, 2002). Perhaps
this difference allowed the primitive type III secretion system to export an
important substrate on a different control circuit independent of the sec
pathway, and this finer control was the selective basis for the retention of
the system.
3.2.5. The relationship between type III export and the F1F0-ATP synthetase
That a phylogenetically basal type III export apparatus must
have existed is supported by several additional facts. As discussed previously, the protein that
powers protein export in type III secretion, FliI, has long been considered
homologous to the F1 subunit of F1F0-ATP
synthetase on the basis of about 30% amino acid identity to the active F1-β
subunit (Albertini et al., 1991; Vogler et al., 1991; Gogarten et al., 1992). The F1-ATPase
is a heterohexamer made up of alternating α-subunits (noncatalytic) and
β-subunits (catalytic). This
pattern is shared by all bacteria and is also found in the archaeal A-ATP
synthase and eukaryote V-ATP synthase, so F1-α and F1-β
are thought to have diverged before the cenancestor (Gogarten and Kibak, 1992). FliI, on
the other hand, probably consists of a homohexamer of catalytic subunits
(FliI's hexameric nature was only recognized very recently: Blocker
et al., 2003; Claret et al., 2003). It diverges
before the F1-α and F1-β split in sequence
similarity trees, and thus probably also diverged prior to the cenancestor (Gogarten and Kibak, 1992). However, it
is more similar to the F1 subunits than the more distantly related
hexameric ATPases such as the RNA/DNA helicase termination factor rho (Boyer, 1997), and therefore Gogarten and Kibak (1992) conclude that the FliI family diverged specifically
from a primitive F1-ATPase prior to the cenancestor. There is not similar evidence that flagella
specifically evolved before the cenancestor, so this is a point in favor of the
primitive type III export system hypothesis.
In light of the long-established homology between FliI and F1-αβ,
it is surprising that there have been no searches for further homologies
between the F1F0-ATP synthetase and type III export
system. Sequence similarity searches do
not turn up significant hits, but considering the timespan and divergence in
function this is not necessarily surprising. Several recent discoveries seem
suggestive of further homology. First,
FliH forms a (FliH)2FliI heterotrimer with FliI (Minamino and Macnab, 2000; Minamino et al., 2001). FliH has an
elongated shape (Minamino et al., 2001), and both FliI and FliH are soluble cytoplasmic components
that associate intrinsically with the membrane and with lipid vesicles (Auvray
et al., 2002). If the FliH2
homodimer associates with the FliI6 complex in vivo, all of
this begins to look suspiciously similar to the association (Figure 4b) between the F1F0-ATP
synthetase F1-α3β3 and F0-b
subunits: two elongated F0-b subunits form a dimer and interact with
F1-α3β3. In F0-b it is the N-terminal region that associates
with the membrane, and the C-terminal region with the N-terminal regions of F1-α3β3
(Boyer, 1997; Weber and Senior, 2003). In FliH it
is known that the C-terminal region associates with N-terminal region of FliI (Gonzalez-Pedrajo et al., 2002), but the region responsible for membrane association
is undetermined (Auvray et al., 2002); F0-b – FliH homology would predict that
the FliH N-terminus associates with the membrane. Although PSI-BLAST searches on FliH only return F0-b
as a non-significant hit, a search of NCBI's CDART database based on FliH does
retrieve F0-b as a result with similar domain architecture, another
point in favor of the hypothesis of homology.
In the ATP synthetase, an F1-δ monomer
associates with the proximal end of F1-α3β3
and F0-b2. In the
type III export apparatus, it is FliJ that interacts with FliI and FliH2. FliJ seems to be required for the export of
all flagellar components, and so has been interpreted as a general chaperone in
the cytoplasm (Macnab, 2003). However,
this observation is equally well explained if FliJ is a required part of a FliI6FliH2
complex essential for export. Both FliJ
and F1-δ have a similar size and N-terminal binding sites to
the N-terminal regions of FliI/F1-α. There may also be a structural similarity: FliJ has a high
probability of exhibiting an N-terminal α-helical coiled-coil arrangement
(Macnab, 2003), using sequence-based predictions (Lupas
et al., 1991, method implemented at http://www.ch.embnet.org/software/COILS_form.html ). F1-δ has several conserved
α-helices at its N-terminal binding site to F1 (Weber
et al., 2003b). Although
predictions do not generally yield a high probability of coiled-coil structure
for F1-δ, a short non-exhaustive sampling of orthologs shows
that at least one FliJ protein does not show a high probability prediction of
coiled-coil structure either (Buchnera aphidicola, accession no. P57179)
while at least one F1-δ protein does (Rhodopseudomonas
blastica, accession no. P05437). It
appears that the C-terminal region of F1-δ associates with the
C-terminal region of F0-b2, although the details remain
to be worked out (Weber and Senior,
2003). Regarding
the FliJ-FliH2 interaction, Fraser et al. (2003) favor a model where the FliJ interacts with the
N-terminal region of FliH2, but their data (Gonzalez-Pedrajo et al.,
2002) shows that deletions in either the N-terminus
(perhaps the region that associates with the membrane) and middle (dimerization
region) of FliH preclude FliJ binding.
Homology between F1-δ and FliJ would predict that
FliJ-FliH interaction is actually mediated through the C-terminal regions of
each, but that the association may be rather weak, as it is between F0-b2
and F1-δ (Weber and
Senior, 2003).
Similarities in the type III secretion integral membrane
proteins FliPQR, the type II secretion proteins SecFEY, and F1F0-δca
were pointed out by Aizawa (2001), who calls these triplets the "proto-channel" and
suggests homology. His evidence is of a
very general nature (similarities in aliphatic index and other features) and so
cannot be accepted uncritically. In
particular, it is no longer thought that F1-δ (or its eukaryote
homolog OSCP) is associated with the membrane or ATP synthetase stalk (Weber
et al., 2003a), and the evidence discussed above points to a
different homology for F1-δ.
However, the proposed matches between FliQ--F0-c and FliR--F0-a
are decent in terms of protein size and also the number of transmembrane
helices of the respective proteins. And
surprisingly, a leap into speculation, matching FliP to F1-γ
and FliO to F1-ε, also seems to provide plausible matches. When the similarities between F1F0-ATP
synthetase and type III export components are tabulated (Table 5), it is apparent that that each component of the F1F0-ATP
synthetase can be matched to a component of the type III export apparatus with
a similar size and topology, as far as evidence is available (the function and
structure of the flagellar proteins FliOPQR are poorly understood).
Individually, the cited similarities are easily attributable
to chance, but together they are at least suggestive. Table 5 also shows that there are some apparent
dissimilarities. Notably, while both F0-c
and FliQ have 2 transmembrane helices, the loop between the helices is exposed
to the cytoplasm in F0-c (Birkenhager et al., 1999), while the loop between the helices in FliQ was
predicted to be periplasmic (Ohnishi et al., 1997); a reversal of this finding would support the
homology hypothesis. The weakest case
for homology is between F1-ε and FliO; FliO is predicted (Ohnishi
et al., 1997) to have a single transmembrane helix, while the
structure of F1-ε has been solved (Wilkens and Capaldi, 1998) as a two-domain protein that binds to the
stalk. However, both proteins tolerate
substantial variability; F1-ε functions with large deletions (Wilkens and Capaldi, 1998) and clear homologs of FliO have not even been
identified in type III virulence systems (Gophna et al., 2003).
The hypothesis that the entire F1F0-ATP
synthetase may have been coopted in toto into a primitive gated pore
(proto-FliF and proto-FlhA/B) is certainly provocative; it would explain at a
stroke the origin of most of the type III export system and provide a
phylogenetically basal precursor to the flagellum even though basal type III
export systems remain undiscovered. The
complex would fit well in FliF ring; using the stoichiometry of FlhA2FlhB2
proposed by Macnab (2003), and the equivalent stoichiometry of an ATP
synthetase for the other integral membrane components, FliO1P1Q12R1,
the total number of transmembrane helices is 60, well within the approximate
MS-ring capacity of about 70 transmembrane alpha-helices (Fan
et al., 1997). Fan et
al. estimate <3 copies of FliR per flagellum, which is consistent with
the ATP synthase hypothesis, but also estimate 4-5 copies for FliP, which is
not, so if the ATP synthetase hypothesis is true it would be expected that the
FliP finding is in error.
Macnab (1999) called the homology between FliI and F1-αβ
"inexplicabl[e]". However, there may be
a relatively simple explanation. If the
postulated homology between the ATP synthetase and type III export is correct,
then probably the key event in the origin of type III export was the
association of the F1F0-ATP synthase with a proto-FlhA or
FlhB inside the proto-FliF ring, converting it from a passive to active
transporter. Since little is known
about the details of the coupling of ATPase activity to protein export, this
step remains speculative. Probably
motion in the ATPase was linked to a conformational change in FlhA and/or FlhB,
with the proton pumping function of the synthetase lost soon afterwards. Currently there are several documented
associations between FlhAB and the rest of the type III export apparatus (Macnab, 2003). These
associations include proteins in both the "F0" and "F1"
regions of the type III export apparatus.
FlhA or FlhB may thus take over some of the linker role that is played
by F0-b in the ATP synthetase and (on the homology hypothesis) by
FliH in the type III export apparatus; this would help to explain why FliH is
not absolutely required for successful construction of flagella, and FliH null
mutants can be compensated by mutations in FlhA and FlhB (Minamino et al., 2003).
Other possible hypotheses for the origin of the type III
export apparatus are not currently ruled out, such as the idea that much of
apparatus is descended from a passive channel and that only a portion of the F1F0-ATP
synthetase was coopted to power transport, or that there is an ancient,
obscured homology between the various secretion systems. Alternatives are currently disfavored
because they are more complex and explain the origin of fewer components. However, even if FliI remains the only
confirmed homolog to the F1F0-ATP synthetase, general
considerations indicate that the evolution of an export system is not very
difficult. A diversity of export systems
of varying complexity exist, and that there is a functional continuum of
membrane complexes ranging from single proteins and passive pores through to
active, gated export systems, indicates that there are no major evolutionary
puzzles to solve. The cataloguing and
categorizing of transport proteins is already yielding insights into their
origin (Saier, 2003).
The ATP synthetase homology hypothesis has the advantage of
numerous testable implications for the structure and function of FliHIJOPQR. The ATP synthetase is relatively
well-understood; structures have been determined for most of the components and
a number of sophisticated techniques for studying the complex as a whole have
been developed. If the homology
hypothesis is correct, then similar structures would be expected for the
corresponding type III export components, and many of the techniques applied to
the ATP synthetase should apply to the export apparatus. It is worth noting in passing that if a
significant portion of the type III export apparatus is indeed homologous to
the ATP synthetase, then it becomes fairly likely that the rotary flagellum
contains within it a rotary motor powering protein export. This is a fairly incredible notion, but
would merely be the latest in a long line of surprises yielded by the study of
the flagellum. This possibility might
mean that the proto-flagellar secretion system was rotating from the start
(echoing the rotation-early hypotheses of Cavalier-Smith, Goodenough, and
Rizzotti), although this is not a necessary postulate for the rest of the
scenario to proceed.
For the remainder, the hypothesis of a primitive type III
export system will be taken for granted.
This complex would have transported proteins manufactured in the
cytoplasm into the periplasmic space.
If secretins were already available from the type II secretion system,
as they probably were given their ubiquity, then from the start the type III
export system would have been a primitive kind of type III secretion system, as
small proteins could diffuse in the periplasmic space until they found an outer
membrane pore and diffused out.
Digestive proteases or antibiotic molecules are likely candidates for
the secreted proteins. Alternatively,
the export system could have originally secreted proteins destined for the
periplasm, and later cooption of a secretin converted the export system into a
secretion system.
The association of an outer membrane channel with the type
III export apparatus would improve the efficiency of secretion. This advantage would increase as exported
substrates became larger, because the peptidoglycan cell wall only allows the
diffusion of globular proteins with a size less than about 50 kDa (Young, 2001); as protein size increased, diffusion would be
increasing impeded. Once a new
single-step secretion channel was available it would be possible to secrete
larger proteins and proteins that would be harmful if left to wander about the
periplasmic space. These are selective
forces that would favor the spread and diversification of the secretion system,
after its origin as an efficiency-improving measure.
Outer membrane secretins have been coopted repeatedly by
various versions of the secretion systems discussed above (Hueck, 1998; Thanassi, 2002; Bitter, 2003); if the type III virulence system is derived from
the flagellum, it probably originated in part by replacing the flagellar L- and
P-ring proteins with a secretin. The
first association of a secretin with a primitive type III export apparatus was
probably mediated by the simultaneous cooption of a secretin and its outer
membrane lipoprotein chaperone (Dailey
and Macnab, 2002). Both of
these proteins are secreted by the very old type II secretion pathway. The channel to the extracellular medium
could be recruited in a single step if a mutation caused the secretin to
associate with the type III export apparatus.
The secretin appears to cross both the cell wall and outer membrane in
the Hrp pilus, and to associate without help with the pilus FliF homolog in the
cytoplasmic membrane (Blocker et al., 2003), so a P-ring-like protein does not appear to be a
prerequisite for secretion. Thus the
P-ring may have been a later addition to the system, perhaps even coinciding
with the early stages of improvement of the proto-flagellum and the loss of the
secretin (see below).
In the model, flagellin and all of the proteins of the axial
structure – FlgBCFG (rod), FlgE (hook), FlgKL (adaptor), FlgD and FliD (cap),
in addition to FliC (flagellin) -- are descended from a common ancestral pilin
secreted from the primitive type III secretion system. All of these proteins are placed in the
axial protein family (Homma et al., 1990a; Hirano et al., 2001). Homma et
al. (1990a) put the rod, hook, and first adaptor (FlgK) proteins
into a closely-related subfamily. The
divergence of the axial filament family probably occurred mostly after the
origin of a functioning protoflagellum; this will be discussed in a later
section. First, the origin of the pilus
must be considered.
The diversity of surface structures based on secretion
systems was documented in Table
4; modern flagella retain many of these functions (Moens and Vanderleyden, 1996). To expand
on a likely function of a primitive pilus, successfully adhering to a surface
can be a problem for a floating bacterium: at a low Reynolds number, the
boundary layer near a surface can be a significant barrier (Vogel, 1994). A bacterium
can increase its chances of attachment by secreting adhesins with an affinity
for the desired surface, ensuring successive attachment if it happens to get
near a surface (e.g., the adhesins secreted by autotransporters, independent of
pili (Henderson et al., 1998)). It can
increase its chances still further either by putting the adhesin at the end of
a filament (e.g., the PapG adhesin located at the tip of the P pilus fibrillum
(Sauer et al., 2000); the flagellar cap FliD of Pseudomonas aeruginosa
doubles as an adhesin (Scharfman et al., 2001)) and/or by making the filament adhesive along its
whole length, which is a common occurrence in modern bacterial flagella as well
as many other surface structures (Kennedy,
1987; Moens and Vanderleyden, 1996; Fernandez and Berenguer, 2000; Giron et al., 2002). Probably
any filament, adhesins or no, will have some utility in attaching to inorganic
surfaces, simply by expanding effective size and surface area available for
adhesion. Even in the absence of
specific adhesins, charge, hydrophobicity, and/or van der Waals forces can be
exploited for more general surface adhesion (Vogel,
1988), particularly at the small scale of bacteria.
Three hypotheses present themselves as to how the ancestral
pilus originated: filament-first, cap-first, and modified filament-first. The latter hypothesis combines the best
features of the filament-first and cap-first hypotheses.
3.4.1. Filament-first hypothesis
One way that the kinds of pili described above could get
their start is by simple polymerization of an adhesin. The adhesin would have inherited from its
ancestor the ability to bind with the outer membrane channel and with the
extracellular substrate; all that would have to be added is self-binding
capability. The plausibility of this
step is attested by several facts: first, structures made up of multiple copies
of the same subunit are biochemically ubiquitous, and the evolution of large
multimeric complexes has in many instances been traced back to simpler
ancestors, e.g., AAA ATPases (Mocz
and Gibbons, 2001). Second,
polymerization into a filament or tubule via mutation is a quite common event:
sickle-cell hemoglobin, derived by only a single substitution from regular
hemoglobin, forms not only self-assembling polymers but dynamic polymers (Mitchison, 1995). In fact,
Mitchison (1995) argues that evolution can start with just about any
protein fold and produce a self-assembling polymer.
An alternative to polymerizing an adhesin is to postulate
that a gene for a pre-existing filament-forming protein was coopted by
transposition of the promoter and N-terminal signal sequence of an already
secreted protein. Support for this
second possibility might be found in homology between flagellin and a modern
filament-forming protein. Homology between
flagellin and actin has been proposed (Novikova et al., 2000; Harris and Elder, 2002), and ancient actin homologs such as FtsA are present
in prokaryotes (Mitchison, 1995), but the evidence is weak. Harris and Elder (2002) cite an 8/13 sequence match between flagellin and
actin in the N-terminal region, but this could easily be due to chance. Novikova et al. (2000) found that flagellar filaments co-precipitate with
rabbit skeletal myosin, and that flagellin and F-actin compete for myosin
binding, but this might be explained by a general similarity of filaments
rather than homology. Given the large
divergence of flagellin from the more conserved rod and hook members of the
axial filament family (Homma et al., 1990a), any relationships outside of this family are bound
to be difficult to detect.
A simple assumption is that the first filament was a chain
of monomers, probably in an open helix.
Longer filaments are presumably better for adherence than short
filaments, and thus selection for adhesion can be expected to favor longer
filaments. Once a polymer filament of
reasonable length has been built, however, there may be a difficulty in
extending it. The problem does not
arise if filament subunits are added at the base, as occurs in type IV pili:
type IV pili are based on type II secretion systems, which use a two-step
process to transport proteins. First
proteins are exported into the periplasm, and then they are pushed out through
a secretin, perhaps via a "plunger"-type mechanism involving a pseudopilus (Thomas
et al., 2001). However,
the type III secretion system exports proteins from the cytoplasm in one
step. By exporting a number of
individual subunits, a short filament binding to the outer membrane pore can be
formed, but its possible length will be severely limited by the decreasing
chances of successfully adding monomers to the receding distal tip. This problem might be overcome in a gradual
manner by modifications to the open helix, so that it better corralled the
monomers as they exited the secretin.
Each mutation that brought the turns of the helix closer together would
decrease the rate of monomer escape, and allow the extension of the
filament. A tube with closed or nearly
closed walls would be the optimal solution, and selection for rigidity
(necessary for very long filaments) would also favor the closed tube.
The result would be something rather like the modern type
III virulence pili, which appear to have far less complex axial structures than
flagella. Indeed, despite several
investigations it has yet to be determined that the Hrp pilus has any axial
components (rod-like, hook-like, etc.) apart from the main protein of the pilus,
HrpA (MxiH/PrgI in Shigella/Salmonella; Blocker et al., 2003), and the extracellular portion of the filament seems
to extend continuously into the secretion complex (Fernandez and Berenguer, 2000) whereas in flagella there is a distinction between
filament, hook, and rod.
It might be objected at this point that the flagellum
requires the cap (FliD) in order to chaperone the flagellin subunits into place
at the elongating tip of the filament; without it, they diffuse away and are
lost (Blocker et al., 2003). The hook
has its own temporary cap (FlgD), and it has been suggested, but not proven (Hirano
et al., 2001; Berg, 2003; Macnab, 2003), that the rod has a cap protein as well (FlgJ). However, the necessity of the cap for
successfully assembling subunits is ambiguous.
Flagellin will self-assemble into filaments in vitro (Hirano
et al., 2001). No cap has
been identified in any type III virulence systems (Blocker et al., 2003), and although PrgJ has been suggested as a possible
cap for the Salmonella needle (Sukhan et al., 2003), the evidence is indeterminate as Sukhan et al.
could not detect PrgJ in sheared-off needles and did not detect it at needle
tips using immunoelectron microscopy (they therefore suggest that PrgJ may be a
basal component). The polar flagellum
of Vibrio species grows normally without the cap (Bardy
et al., 2003), probably because it is sheathed by an extension of
the cell membrane (McCarter, 2001) that constrains the subunits. Finally, even in the "standard" flagellum
the adaptor proteins FlgK and FlgL are added without any capping structure (Macnab, 2003), leading Macnab (2003) to argue that "capping structures are perhaps best viewed as a means
of increasing efficiency of addition rather than as an absolute requirement." On this view, the cap could be a relatively
late evolutionary addition to the pilus structure, originating by
pentamerization of a pilus subunit and initially improving speed and efficiency
of pilus assembly. Later co-adaptation
between filament and cap subunits would make it a more-or-less required
feature.
3.4.2. Cap-first hypothesis
An alternative to gradual formation of a hollow pilus would
be to start with the cap. In Pseudomonas
aeruginosa, FliD serves not only as a cap protein, but also as an
adhesin. P. aeruginosa infects
the human respiratory system by adhering to mucins. FliD mutants were found to be nonadhesive, which could occur
either because FliD is a necessary adhesin, or because the flagellar filament
fails to assemble without the cap.
However, Arora et al. (1998) found that FliC null mutants retained adhesive
ability. This implies that it is FliD
specifically which serves as the adhesin, and not the whole flagellar filament,
a conclusion supported by additional lines of evidence (Arora et al., 1998;
Scharfman et al., 2001). On the filament-first
hypothesis, an adhesin attached to the outer membrane secretin mutated to form
a polymeric chain. The fact that caps
can be adhesive, however, suggests an alternative hypothesis. Instead of a mutation forming a polymeric
chain, the mutant adhesin formed an oligomer that associated with the distal
rim of the outer membrane secretin, approximately covered the mouth of the
secretin, and allowing more adhesin monomers to be packed into the available
space. In this case, a pentamer ring
was approximately the right size. Once
this was established, however, the utility of the adhesive "secretin cap" would
be further improved if it could be extended away from the surface of the cell. This could occur by mutation of a duplicate
cap protein that formed a slightly wider ring.
This ring would associate with the base of the cap, but the size
mismatch would allow the insertion of more subunits, forming a short pilus in
one step. On this view, the primitive
pilus would derive its structure of ~5 subunits per turn from the pentameric
cap, rather than the reverse. The
channel inside the pilus would be descended from the hole in expanded
ring.
3.4.3. Modified filament-first hypothesis
The filament-first hypothesis has the disadvantage of
explaining the addition of distal subunits to the filament before a tube
structure could evolve. The cap-first
hypothesis has the difficulty that a pentameric proto-cap covering the surface
of the secretin pore might impede the secretion of other substrates. The difficulty is not insurmountable, as
secreted substrates might escape from beneath the sides of the cap, or
alternatively might knock loose the cap, which is then replaced by continually
secreted cap protein (both mechanisms may operate in modern flagella). However, these problems are easy to solve if
the hypothesized adhesin pentamer were to initially form a ring atop the
secretin, instead of a cap. Secretion
of diverse substrates could then continue unabated without continual secretion
of the adhesin protein. From here, a
proto-pilus could easily be formed by a mutant adhesin that polymerized the
ring into a tube. This hypothesis is
simpler and more appealing because it combines the advantages of the two
previous scenarios: a pilus initially assembled by simple mechanisms without a
cap, but having a well-formed tube structure from the start, and allowing the
uninterrupted secretion of other type III secretion system substrates. On this hypothesis, the proto-cap would
again be a late addition (a modified pilus protein) increasing assembly
efficiency.
It is interesting to reflect on the surprise many
researchers felt when the mechanism of flagellar filament construction – adding
subunits at the tip rather than the base – was discovered. It has been called "astonishing" and
"somewhat bizarre" (Macnab and
DeRosier, 1988). However,
with the modified filament-first hypothesis in hand, the decidedly unintuitive
method of filament assembly used by the flagellum can be seen as a product of
the constraint of building a pilus from the starting point of type III
secretion. That flagella can also be
built perfectly well from the base is shown by the archaeal flagellum and its
type IV secretion-based system. But
unlike the type IV secretion system, which has periplasmic ATPases and other
outer membrane-associated active transport components, the type III system had
no mechanism of powered outer membrane transport available to build on.
3.4.4. Improvements on the type III pilus
Once a primitive pilus has evolved, a number of rapid
improvements can be expected. First
would be optimization of the pilin protein for its new role, under selection
for increased strength, minimizing breakage, increased speed of assembly, etc. Addition of the cap and subsequent
coevolution of the pilin and cap subunits could have occurred fairly early,
particularly as the pili became very long and assembly times became
significant. Pilus lengthening would be
selected for because it increases "reach" and adhesive surface area. That lengthening is a trivial matter of
regulation is shown by various lab-produced mutants that exhibit lengthened
hooks, needles, etc. due to simple mutations.
Type III pili might have reached the length of flagellar filaments (10
μm) long before motility originated; flagella and Hrp pili are of
comparable length (He and Jin, 2003, figure 1).
Soon the type III secretion system would become a specialized
pilus-secretion apparatus, and pili would be adapted for a variety of more
strenuous uses. For example, pili might
be used as stalks to elevate the bacterium above the surface, in order to
better access light, a particular concentration of molecules, or escape
competition from bacteria on the surface, all functions of attachment
organelles today (Dyer, 2003). Duplication
and modification of the pilin protein would allow greater functional
flexibility, such as adhesion to different substrates and the production of
certain kinds of pili based on environmental stimuli. One important modification that might have occurred after these
trends were well along would be to strengthen the pilus attachment to the cell
by extending the filament down into the secretion system to attach to the
export apparatus embedded in the cytoplasmic membrane. This would have been effected by cooption of
a duplicate pilin. The core tubule
structure of the flagellar rod, hook, and filament is constructed exclusively
by the N- and C-terminal domains of the axial proteins; the middle domains are
placed on the outside of the tubule, and in flagellin are highly modifiable and
often dispensable (Cohen-Krausz and
Trachtenberg, 2003). Therefore,
the "proto-rod" probably originated by loss of the outer domains, assuming that
the extracellular pilus had them for adhesion or structural purposes. This duplication event would create the
ancestors of the rod/hook subfamily and flagellin. Initially, one cap protein could chaperone the assembly of both
structures, but as they diverged, the cap protein would be duplicated as well
to allow specialization on each protein, assuming that modern flagellar rods
have cap proteins, as has been suggested (FlgJ; Berg, 2003; Macnab, 2003).
It is not clear that modern type III virulence pili make use
of rodlike proteins at all (the filament may simply extend all the way from the
cytoplasmic export apparatus out into the extracellular space), so it is also
possible that differentiation of filament and rod proteins occurred later and
that attachment to the export apparatus occurred via modification of the pilin
subunit. However, the hypothesis that
the duplication was early helps to explain the high divergence between
flagellin and the rod/hook subfamily, as well as explaining how the filament
became attached to the export apparatus instead of the outer membrane secretin
(although some attachment to the secretin may have remained; see below). Another protein, FliE, serves as the adaptor
protein between FliF (the MS-ring) and FlgB (the proximal rod protein). FliE homologs have not yet been detected in
type III virulence systems, so the utility of a FliE-like adaptor in nonmotile
systems is ambiguous. Here it will be
assumed that it was a relatively late, post-motility addition that strengthened
the attachment between the MS-ring and the rod. Investigation of the attachment mechanisms of modern type III
pili to the secretion system may shed light on the relative likelihood of these
possibilities.
3.5.1. The selective advantage of undirected motility
Even with a complex pilus in place, the modern flagellum
could not have originated in a single step.
It is hypothesized that the first, very crude motility function was
random dispersal. The function was
probably not stirring or gathering more food by more rapid movement, because of
the previously-discussed constraints of life at a low Reynolds number. Dispersal, on the other hand, is both a
ubiquitous adaptation in biology and rather undemanding in terms of
motility. Vogel (1994) reports that passive dispersal (i.e., unguided
dispersal by wind or current) is found in every phylum of animals and division
of plants. For creatures such as
bacteria, some dispersal will occur without any adaptations whatsoever: random
physical events can dislodge them from the substrate, and Brownian motion and
larger-scale turbulence and flow will move them about. However, dispersal is not always a good
thing for a bacterium. Since a
bacterium existing in a spot is most likely descended from a successfully
reproducing bacterium also at that spot, and therefore the environment is
conducive to reproduction, we might expect that the best choice for a bacterium
would be to stay where its at, rather than gambling everything on a rather
literal leap into the blue. On the
other hand, this logic will rapidly break down because any environment
conducive to replication will soon become filled to the brim with bacteria, at
which point competition for nutrients, space, light, etc. will become
severe. In such a situation there are
numerous potential responses (spore formation, killing fellow bacteria by
secreting antibiotics, etc.), but one of them is clearly dispersal (Dusenbery, 1998; Stoodley et al., 2002).
The best general strategy would be for the bacterium to
"decide" whether or not to disperse based on environmental cues: if life is
good, stay put, but if resources are scarce, go somewhere else. In fact, this is basically what regulates
the production of flagella in modern bacteria.
In E. coli, the master operon (class 1) for flagella encodes FlhC
and FlhD. These proteins activate the
genes for flagellar biosynthesis in the next operon (class 2), but are
repressed in high glucose conditions, where nutrients are plentiful and movement
is pointless. If conditions
deteriorate, the level of cyclic AMP in the cell increases, and the expression
of FlhDC is activated by cAMP and the catabolite repressor/activator protein
(CAP) (Berg, 2003). Dispersal
processes and evolutionary stable dispersal strategies are tractable to
mathematical and computer modeling, resulting with a large literature (e.g., Gandon and Roussett, 1999; Lebreton et al., 2000; Mathias et al., 2001; Poethke and Hovestadt,
2002), although most of it is not aimed specifically at
bacteria (for exceptions see Kreft et al., 2001; Kerr et al., 2002). All that is
needed for the argument to proceed at this point is that dispersal be a widely
beneficial behavior. However, the basic
dynamics of random bacterial diffusion are so simply described (Berg, 1993) that a short investigation of what kind of bacterium
might be expected to evolve random dispersal is irresistible.
Even dead or otherwise nonmotile bacteria have a non-trivial
diffusion coefficient: a sphere with a radius r will have a diffusion
coefficient Dsphere of (Berg,
1993, eqn. 4.13):

(6)
For a dead bacterium with a
diameter of 2 μm, Dsphere = 2.1x10-9 cm2/sec. The average time t it will take to
diffuse a distance x is given by (Berg,
1993, eqn. 1.10):
(7)
For the above bacterium, this means that it can be expected
to passively diffuse its 2 μm diameter in ~9.3 sec, purely by Brownian
motion. However, because Brownian
motion is a random walk, after each 'step' the diffusing particle has an equal
chance of going in any direction; the overall drift is zero. A large number of particles placed at one
location will gradually spread out in all directions by diffusion, but the mean
location of the population will not change from the starting point unless
additional forces come into play. The
fact that average diffusion time increases with the square of the distance
means that diffusion becomes an increasing poor way to travel as the dispersal
distance increases. The 2 μm
nonmotile bacterium, above, takes an average of 9 seconds to diffuse one body
length, but to diffuse 100 body lengths takes not 100 times as long (15
minutes) but 10,000 times as long (26 hours).
These figures are somewhat misleading as they completely ignore
turbulence and flow (perfume molecules would take a month to diffuse across a
room if diffusion was the only relevant process; Berg, 1993), but beneath the laminar boundary layer, very near a
surface, these forces will be reduced (Vogel,
1994).
Therefore, one very crude way for
a bacterium adhered to a surface to disperse is to cut loose its adhesins
and/or adhesive pili, and see how far it can move by passive dispersal. At least one such dispersal mechanism has
been documented (Coutte et al., 2003). If the
bacterium can manage to rise substantially in the boundary layer, flow or
turbulence may carry it some distance, at which point it can re-secrete
adhesive pili and attach to a new surface.
It will probably do better if it starts out at the top of long pilus;
similar dispersal-enhancing mechanisms are well known (fruiting bodies in
bacteria such as Myxococcus; Caulobacter stalks).
Under what conditions would it be
advantageous to enhance the effects of Brownian motion by adding crude active
motility? Usefully, Dusenbery (1997; 1998) has derived an equation allowing the calculation of
the relative utility of active and passive dispersal for organisms at a low
Reynolds number. Dusenbery assumes that
the organisms are spherical and that they swim at a velocity of 10 body
lengths/sec, a common value at widely varying scales (Dusenbery, 1996). Taking into
account the fact that rotational Brownian motion will keep any cell from
swimming in a straight line, the ratio of the diffusion coefficient with
motility (Dm) to the diffusion coefficient without it (D0)
is (Dusenbery, 1997; Table 2):
(8)
where r is again the radius
and u is the cell's swimming velocity in radii/sec (here, 20). Dusenbery calculates that a bacterium would
have to have a diameter of at least 0.64 μm in order to double its
diffusion coefficient for the purpose of undirected dispersal. Small bacteria have a very high diffusion
coefficient even without motility, and for small bacteria Brownian rotation is
so severe that swimming straight for any distance is impossible. Therefore, for very small bacteria, active
swimming with flagella is pointless; random Brownian motion is as good as it
gets (other swimming methods may be successful, e.g. the linear motor of Spiroplasma;
Trachtenberg et al., 2003). This has
the obvious implication that flagella cannot have evolved in very small
bacteria; Dusenbery surveyed bacteria genera and found that the smallest motile
genus had a diameter of 0.8 μm.
Similar minimum size constraints were found for motility with
chemotaxis, phototaxis, and thermotaxis (Dusenbery,
1997).
Equation (8) can be modified by converting
the relative swimming velocity u (in radii/sec) into the absolute
swimming velocity v (in cm/sec):
(9)
Equation (9) can be used to estimate the
minimum swimming velocity required for a protoflagellum to substantially
increase the diffusion coefficient of a cell.
Dm/D0 was calculated for cells ranging from 0-8
μm in diameter, with absolute swimming velocities of 0.1, 1, and 10
μm/sec (see Figure 6). Some
advantage to diffusion would result from motility for any values of Dm/D0. However, because the construction and
movement of filaments has some cost, we have followed Dusenbery in setting the
cutoff for "selectable motility function" at doubling the passive diffusion
coefficient (Dm > 2D0). As can be seen from Figure 6, Dusenbery's result is approximately reproduced
(swimming velocity here is absolute rather than relative to cell size, so
slightly different input values were used): Dm/D0
is doubled for a ~0.6 μm bacterium swimming at 10 μm/sec.
However, a crudely functioning
protoflagellum cannot be expected to push a bacterium at 10 μm/sec. The important result shown in Figure 6 is that even very slow absolute swimming velocities
can result in a significant improvement in the diffusion coefficient for large
bacteria. Swimming at 1/10 the velocity
of E. coli is advantageous to dispersal in a bacterium of ~2 μm
diameter, and swimming at 1/100 the velocity of E. coli is advantageous
to the dispersal of a bacterium with a diameter of ~6 μm. Two factors contribute to this pattern: for
larger bacteria, passive diffusion is markedly slower, increasing the relative
advantage of swimming. Similarly,
rotational diffusion is also slower for larger bacteria, but this factor
enhances the efficacy of swimming as swimming runs will take longer to be
randomly reoriented.
There are additional reasons to
think that the protoflagellum may have originated in a large bacterium. Similar beneficial scaling applies if
swimming velocity is considered in terms of body lengths/second: Dusenbery's
0.64 μm bacterium has to swim at 10 body lengths/second in order to beat
diffusion, but a 6 μm bacterium need only swim 0.17 body lengths/second in
order to achieve a benefit. A
consideration of carbon budgets also points this direction: for a
(hypothetical) very small flagellated bacterium, diameter 0.4 μm,
producing 10 peritricious flagella, each ten times the body length of the cell,
would cost 50% of the cell's carbon (Mitchell,
2002). However,
for a 1 μm cell, the relative cost of 10 flagella of proportional size is
only about ~2% of cell carbon. For the
6 μm diameter cell discussed above, it is ~0.2%. 6 μm diameter bacteria are well within the usual size range
of bacteria (Dusenbery, 1997). For a
moderately large bacterium, the costs of crude, poorly functioning flagella are
trivial, while the benefits in terms of dispersal are substantial. The exponential nature of the relationships
is such that moderate violations of the input assumptions will not greatly
change the qualitative results (Dusenbery,
1997); at some moderately large size the costs of
primitive motility become low and the benefits high.
3.5.2. Primitive flagella
The flagellar motor is made up of
two proteins, MotA and MotB. MotB binds
to the peptidoglycan cell wall, allowing the complex to serve as a stator. MotB (and perhaps MotA) also forms a proton
conducting channel. Although the exact
mechanism of motor function is still mysterious, with many proposed models (Berg, 2003; Schmitt, 2003), energy from the translocation of proteins in the
vicinity of MotB is somehow transformed into mechanical energy to move the
rotor. Probably this occurs by
conformational change in MotA, which then binds reversibly with the rotor
protein FliG, causing rotation.
Speaking very metaphorically, FliG appears to act like the teeth of a
gear, converting (in one model) the power stroke of MotA into rotary
motion. FliG is mounted on the central
MS-ring (FliF). Also attached to the
MS-ring (perhaps mostly but not exclusively via FliG) are the switch proteins
FliM and FliN. FliM contains a receptor
domain for the phosphorylated chemotaxis protein CheY-P, and the binding of
CheY-P induces some kind of conformational change in FliM, FliN, and FliG that
results in switching the direction of motor rotation from counterclockwise to
clockwise. This in turn results in a short
'tumble' which reorients the cell, and then the flagellum returns to
counterclockwise rotation.
Even given the minimal costs and
substantial selective benefits of crude motility, how could the sudden origin
of the rotary motor complex be mutationally possible? The basic answer is that the ancestors of the motor proteins were
already fully formed and serving other functions in the cell. It was recently discovered (Cascales et al., 2001; Kojima and Blair, 2001) that the flagellar motor proteins MotAB have
nonflagellar homologs: ExbBD and TolQR (Figure
4c). These
proteins share significant sequence similarity and all form ion channels that
energize work at a distance by a third protein; ExbBD and TolQR energize outer
membrane transport via action on TonB and TolA, respectively, while MotAB
energize flagellar motion via action on FliG.
The recently discovered homologs involved in Myxococcus gliding
systems (Youderian et al., 2003) will likely add another instance, although no
detailed studies of their function have been performed. The nonflagellar MotAB homologs are
phylogenetically widely distributed, found in proteobacteria, cyanobacteria, Aquifex,
and even in archaea (Kojima and
Blair, 2001). These facts
led Kojima and Blair to note that these proteins "could perform work in
contexts other than (and simpler than) the flagellar motor," and they conclude
that "ancestral forms of the MotA/MotB
complex might have arisen independently of any part of the rotor."
In order to form the motor-rotor
interface, however, the origin of a third protein, FliG, must also be accounted
for. No nonflagellar homologs of FliG
have been discovered (except in type III virulence systems), perhaps not surprisingly
given the peculiar function of this molecule and the radical change it must
have undergone, whatever its ancestral function. The structure of the middle and C-terminal domains of FliG has
been resolved (Brown et al., 2002), and is primarily made of alpha helices. Alpha helices are ubiquitous in proteins, so
FliG is not necessarily structurally bizarre, despite its unusual
function. Three general possibilities
present themselves for the origin of the FliG-MotAB complex. (1) TolQR homologs were coopted via a
mutation that allowed them to bind directly to FliF. FliG was a later addition that enhanced motility by improving
binding between the MS-ring and MotA, and gradually took over the interface
function completely. (2) Proto-FliG was
bound to FliF before the cooption of MotAB for some other reason, perhaps a
stabilization or structural function similar to that served by the derived FliG
homologs in type III virulence systems.
Mutant TolQR homologs then bound to proto-FliG. (3)
FliG was coopted simultaneously with MotAB, because it originated
as a fragment of a TolA homolog that ancestrally interacted with a TolQ
homolog. The third hypothesis is the
simplest and most direct pathway; the only novel interaction would be the binding
of the proto-FliG to FliF; binding to the proto-MotA would be inherited. This is less demanding than postulating the
re-engineering of the interface between a TolQ homolog and its substrate (a
feature of both hypotheses #1 and #2), and does not require postulating an
independent cooption of FliG from an unknown source. The hypothesis also has the advantage of being testable via
determination of the structures of TonB or TolA and investigation of their
interactions with ExbBD and TolQR.
On any of the hypotheses, it is
not absolutely necessary that crude motility be an immediate product; a coopted
TolQRA-like complex could first have associated with the type III secretion
system to enhance or help to control protein transport. All that would initially be required for the
very weak motility postulated above would be a slow rotation (other things
being equal, a swimming velocity of 1/100th that of E. coli
would imply a rotation rate similarly reduced; E. coli flagella rotate
at 100 Hz, so perhaps the protoflagellum rotated at ~1 Hz). In addition to the motility advantages
discussed above, wiggling and twisting would probably help to dislodge the
bacterium from the surface and from nearby bacteria; just such a function of
certain type IV bundle-forming pili has been observed in E. coli (Knutton
et al., 1999).
A possible objection here is that
the ancestral pilus cannot be expected to have been freely rotating, preadapted
for the addition of a motor complex; the rod might have been bound to the
peptidoglycan cell wall via the P-ring, making motility impossible. However, this assumes that a P-ring existed
at this point, a dubious assumption if a secretin can bridge both the cell wall
and cell membrane. Additionally, it is
actually not clear that pili are commonly rigidly fixed to the cell wall. It would be very interesting to know if type
III and other pili, when attached to a substrate, allow a bacterium to rotate
via Brownian motion in a fashion similar to motor-disabled bacteria with their
flagella attached to coverslips. Similar
observations would be useful for membrane-embedded structures such as outer
membrane secretins. One selective
reason that might favor freely-rotating pili prior to the evolution of motility
is, again, adhesion. A curved pilus
that is allowed to rotate via Brownian motion can continually explore more area
around a cell (particularly a large, slowly diffusing, slowly rotating cell)
than a rigidly attached pilus, increasing the chances of finding a
substrate.
A final possible objection is
that if the pilus was perfectly straight, then rotating it would not produce
motion. While this is true, it is
irrelevant because modern pili are not always straight or stiff (Bullitt and Makowski, 1998, Figures 2-4; Honma
and Nakasone, 1990; Yamashiro et al.,
1994), the interconversion of pilus shapes is mutationally
trivial, and there are many more ways to build a curved, helical filament than
a straight one from protein subunits.
Additionally, for the pilus rotated by Brownian motion postulated above,
some curvature and helical character would be a requirement in order for the
pilus to explore a larger area than a fixed pilus. The exact shape of the protoflagellum is not crucial in the drag-intensive
world of the low Reynolds number; Purcell (1997) has calculated that any number
of peculiar rotating shapes can swim with varying efficiency (and efficiency is
in fact basically energetically irrelevant at this scale; Purcell, 1977; Berg, 1993). Purcell (1977) notes: "Turn anything – if it isn't perfectly
symmetrical, you'll swim."
3.5.3. Loss of outer membrane secretin
Unlike other secretion systems (type I-IV, including type
III virulence systems), the type III secretion systems of bacterial flagella do
not actually have an outer membrane secretin.
The protein that plays the role of an outer membrane pore, FlgH, is
actually a lipoprotein that Dailey and Macnab (2002) suggest is homologous to the Salmonella type
III secretion system protein InvH. InvH
is a lipoprotein required for the insertion of the secretin InvG (SctC in
Hueck's (1998) unified nomenclature) into the outer membrane (Crago and Koronakis, 1998). Such a
mechanism is very common for outer membrane secretin assembly; in the type II
secretion system of Klebsiella oxytoca, the outer membrane lipoprotein
PulS binds 1:1 with the secretin protein PulD, preventing periplasmic
degradation and helping the localized assembly of the secretin into the outer
membrane. Twelve PulS proteins probably
form a ring about the 12-PulD secretin pore (Bitter,
2003).
These observations suggest a hypothesis for the origin of
the flagellar L-ring: it is not derived from an outer membrane secretin, as
would be naively assumed based on its position in the outer membrane. Rather, the FlgH L-ring may be derived from
the lipoprotein that was the chaperone for the secretin of the primitive type
III secretion system. Why, then, was
the secretin lost? An obvious
possibility is that it slowed the rotation of the protoflagellum. On the present model, the primitive type III
pilus originally bound to the outer membrane secretin, and later the channel
was extended down to the MS-ring in the cytoplasmic membrane. However, some association or binding might
have remained between the outer membrane secretin and the filament. If this was the case, it probably would not
have mattered for a pili rotating by Brownian motion or rotating at low Hz in
the protoflagellum. The outer membrane
can tolerate some rotational motion of embedded components, just as membranes
tolerate the lateral diffusion of proteins.
Some modern flagella are even covered by the membrane, such as the polar
flagella of Vibrio, although here a special sheath evidently prevents
tearing during rotation at 1500 Hz.
However, as rotation speeds increased, the risk of outer membrane
tearing would increase. A simple
solution to this problem could be to delete the secretin entirely; the outer
lipoprotein ring would take over the role of outer membrane pore, but would not
interact with the filament and would provide a bearing between the filament and
outer membrane. The P-ring protein
(FlgI) and its chaperone (FlgA) might have been added after this point by
cooption from another secretion system (perhaps FlgI is a highly modified
secretin itself). If one step in the
evolution of the bacterial flagellum was the loss of the outer membrane
secretin, it would be a classic example of evolutionary "scaffolding" (Thornhill and Ussery, 2000) at the molecular level, the only instance identified
in this evolutionary model. The
hypothesis would be strengthened if lipoproteins other than FlgH formed rings
in the outer membrane in other secretion systems; thus far, lipoprotein has not
been found in isolated secretin complexes although the ring structure is
suspected (Bitter, 2003).
3.5.4. Refinements
For a bacterium, a sphere is the
optimal shape for maximizing dispersal (Dusenbery,
1998). It can thus
be surmised that the cell was a coccus, like some flagellated bacteria today (Zaar
et al., 2003). In such a
cell, the surface positioning of a flagellum is irrelevant – one place is as
good as another on a sphere, so no positioning mechanism is required. Since the model postulates that random
dispersal was the original function of flagella, the first, crudely functioning
protoflagellum lacked many parts that are important in modern flagella. First, chemotaxis and switching are not
required for dispersal, so neither the Che proteins nor the switch complex
would have been required. If switching
is not required and the protoflagellar filament is sticking straight out at a
random position on spherical cell surface, then the hook region is similarly
dispensible. Once functional motility
was even marginally established, however, there would be rapid selection for
improvements. These might have occurred
in more or less any order, or concurrently, so they will be discussed
topically.
3.5.5. Chemotaxis and switching
The "flagellar" chemotaxis genes (Eisenbach, 2000; Table
2, this paper) are in fact not specific to flagella;
the same system is coupled to diverse motility systems, including archaeal
flagella and twitching motility (Faguy
and Jarrell, 1999; Bardy et al., 2003).
Furthermore, homologous components are tied to all manner of cellular
responses to the environment; the central two-component signal transduction
system (consisting of the histidine kinase CheA and the response regulator CheY
in flagellar chemotaxis) is ancient, found in all three domains, and used for
diverse functions. Their evolution is
discussed by Koretke et al. (2000). Another
major set of chemotaxis components, the membrane bound methyl-accepting
chemotaxis proteins (MCPs) which are the receptors for attractants and
repellants, have a similarly wide set of homologs with diverse functions (Zhulin
et al., 2003). More could
be said about details, as there are substantial variations between the
chemotaxis systems of various organisms (Eisenbach,
2000; Kirby et al., 2001), but for the purposes of this paper it will be
assumed that some sort of sensory transduction system preceded the origin of
the flagellum, and that one of the response regulators was the ancestor of
CheY. In modern flagella, a worsening
in conditions results in the increasing phosphorylation of CheY into CheY-P.
In E. coli, switching
rotation from counterclockwise (CCW) to clockwise (CW) causes the cell to
tumble, reorienting it to swim in a new direction. The probability of switching is increased by the binding of
CheY-P to FliM. If concentrations of
attractants are increasing during a run, CheY-P decreases and switching is
suppressed, and thus favorable runs tend to last longer. If repellents are increasing, CheY-P
increases, switching is promoted, and thus unfavorable runs are shortened. The cell uses this method to bias its random
walk, imposing an overall drift towards regions with higher concentrations of
attractants (Berg, 1993). However,
for bacteria with a diameter larger than 1.4 μm, run-and-stop or
run-and-reverse strategies are more energetically favored than the
run-and-tumble strategy, due to the larger costs of actively rotating a large
cell (Mitchell, 2002). As a result
the run-and-tumble strategy, while common in model organisms, is far from
universal. Rhodobacter sphaeroides
swims with a single, stop-start flagellum, with no reversing (Shah and Sockett, 1995; Shah et al., 2000). Passive
rotation via Brownian motion reorients the stopped cell. This is but one of many variations on
switching (Eisenbach, 2000), but probably resembles the most primitive version.
Explaining the origin of the
switch complex, which couples the chemotaxis system to flagellar rotation,
requires an examination of the domain structure and interactions of the switch
proteins. FliN and FliM, which make up
the C-ring, are homologs. FliN is
homologous to the C-terminal domain of FliM, and as a result the two proteins
probably occupy similar positions in the C-ring, perhaps alternating in a 3
FliN:1 FliM pattern, which approximately matches their stoichiometry (Mathews
et al., 1998; see also Figure
2, this paper).
FliM also has a N-terminal domain with no counterpart in FliN that is
the actual CheY-P receptor. CheY-P
binds to the receptor domain, increasing the probability of a switch to CW
rotation (Eisenbach, 2000) via an unknown mechanism involving interactions
between FliM/N and FliG (Mathews et al., 1998). The
receptor domain is homologous to the single-domain chemotaxis protein CheC of Bacillus
subtilis (Kirby et al., 2001). CheC binds
reversibly to the Bacillus C-ring, and is released when it binds to
CheY-P. CheC has not been found in E.
coli, but homologs are found in many early-branching bacteria, as well as
archaea. A cladogram generated for CheC
and the FliM CheC-like domain shows that CheC is phylogenetically basal (Kirby
et al., 2001).
A pre-existing sensory
transduction system could be coupled to flagellar rotation in a single step on
the hypothesis that a FliN-like protein existed for some nonflagellar cellular
response purpose, serving as a receptor for CheC. The exact function of modern CheC is not known, but it appears to
interact with CheA, CheD, and McpB, which form a receptor complex (Kirby
et al., 2001). CheC may
also have a FliM-like function via interaction with the C-ring (Szurmant et al., 2003). The
ancestor of FliN might therefore be found among the other proteins that CheC
interacts with. On the model, a
mutation in this FliN-like protein created a proto-FliN that bound to FliG,
slowing or jamming the motor. The
reversible binding of CheC to proto-FliN, however, happened to alleviate this
effect by changing the conformation of proto-FliN. CheY-P binding to CheC would result in the dephosphorylation of
CheY-P and the release of CheC from proto-FliN, resulting in the
slowed-rotation behavior. Chemotactic
behavior would thereby originate by a single mutation (all other interactions
would be inherited), which could then be followed by gradual improvements in the
initial crude function. This hypothesis
is more economical than supposing that FliN originated for some role in
structural support or enhancing export, and was later coopted to a switching
function via the binding of CheC, although this remains a possibility as FliN
is retained in type III virulence systems for some purpose. The first hypothesis suggests that the
homolog of FliN will be found within sensory transduction systems as one of the
proteins that CheC or a CheC homolog interacts with; it is hard to know where
to look with the latter hypothesis. The
considerable variations in the C-ring of bacteria may yield further hints, as
major variations on chemotaxis and the switch complex are known; for example, Aquifex
aeolicus lacks the traditional chemotaxis system as well as FliM; Bacillus
spp. have FliY (a FliM-FliN fusion protein; Bischoff and Ordal, 1992; Celandroni et al., 2000) rather than FliN.
In any case, the fusion of CheC-like and FliN-like proteins would
produce the FliM seen in most bacteria.
Hypothesizing a detailed pathway
explaining how a stop-start switch complex could be converted into the other
varieties of switching will depend on detailed knowledge of motor mechanism;
many models of the flagellar motor have been proposed and the question is far
from settled (Berg, 2003). However, if
proton-induced conformational chan