This is an open-access article distributed under the terms of the Creative Commons Attribution License. This article was made open access via a Publish and Read agreement between the Microbiology Society and the corresponding author’s institution.
Plasmids, extrachromosomal DNA molecules commonly found in bacterial and archaeal cells, play an important role in bacterial genetics and evolution. Our understanding of plasmid biology has been furthered greatly by the development of mathematical models, and there are many questions about plasmids that models would be useful in answering. In this review, we present an introductory, yet comprehensive, overview of the biology of plasmids suitable for modellers unfamiliar with plasmids who want to get up to speed and to begin working on plasmid-related models. In addition to reviewing the diversity of plasmids and the genes they carry, their key physiological functions, and interactions between plasmid and host, we also highlight selected plasmid topics that may be of particular interest to modellers and areas where there is a particular need for theoretical development. The world of plasmids holds a great variety of subjects that will interest mathematical biologists, and introducing new modellers to the subject will help to expand the existing body of plasmid theory.
Keywords: plasmids, modelling, bacterial evolution, mobile genetic elements, horizontal gene transfer
Supporting data for Figure 1 can be found at 10.6084/m9.figshare.23744571 [1]
Plasmids are extrachromosomal DNA molecules common in many bacteria [2]. They replicate independently from the chromosome (and from other DNA molecules in the cell), and often exist in the cell in multiple copies. They can be transmitted vertically to daughter cells on host cell division and in some cases horizontally to other bacteria. The simplest plasmids are effectively parasites of their hosts: they colonize the host and use its cellular machinery to reproduce themselves. But they also form part of the bacterial genome, and genes located on plasmids have effects on the metabolic processes of their hosts, on the host phenotype, and therefore on host fitness. The most studied of these plasmid-borne genes are antibiotic resistance genes, which are a serious threat to the continuing effectiveness of antibiotics in clinical use [3, 4].
The term ‘plasmid’ was coined by Lederberg [5] to refer to any genetic determinant outside the chromosome, including chloroplast and mitochondrial genomes and certain viruses, but is now restricted to the simple extrachromosomal DNA molecules found in bacteria, archaea and some eukaryotes. Historically, plasmids were of interest for two primary reasons: horizontal transfer between cells and antibiotic resistance. The earliest plasmids discovered were capable of transferring themselves between bacterial cells [2, p. 9], and this novel method of horizontal gene transfer was of great interest to bacterial geneticists.
Antibiotic resistance plasmids were also discovered early on: this greatly increased interest in plasmids as a clinically important contributor to the spread of antibiotic resistance [6]. Today, a much greater variety of plasmids are known and studied by biologists. As components of the bacterial genome, plasmids play an important role in the evolution of bacterial populations; therefore, a clear understanding of all aspects of plasmid biology is necessary for a full understanding of bacterial genetics.
Plasmids are also of interest in their own right, as evolving biological entities. Mathematical modelling can make, and has already made, an important contribution to understanding the biology of plasmids and their role in the ecology and evolution of bacteria. The existing literature (reviewed in [7]) includes models focusing both on fundamental plasmid biology and on particular contributions of plasmids to bacterial populations, especially antibiotic resistance. Mathematical approaches are diverse and depend on the question at hand: we briefly review them in Box 1.
A variety of methods have been applied to modelling plasmids and their hosts. The oldest and most widely used are deterministic differential equation models that divide the host population into groups based on plasmid content, in the simplest case just plasmid-carrying and plasmid-free bacteria. These models track the population sizes of the groups, coupling the population dynamics with the dynamics of the plasmid. This technique has been applied to exploring the effects of conjugation on plasmid dynamics [112, 220] and to the ‘plasmid paradox’ [221–224] (discussed further in Boxes 4 and 5, respectively), as well as many other questions [69, 121, 122, 203]. Some models use difference equations in a similar fashion [182, 225]. Stochastic simulations of such models, usually with stochastic rates paralleling the transition rates of the deterministic models, are also common [225, 226]. Other models have used analytical stochastic approaches [198]; branching process models are particularly useful when modelling the invasion of a plasmid or the early spread of a novel plasmid variant [205, 227, 228]. While most plasmid models include population dynamics, some models, in the tradition of population genetics, fix the population size. Based on extensions of the classical Moran model [229], these models only consider changes in the relative frequencies of cell types: these can be deterministic [68, 230] or stochastic [230, 231]. The final common approach is the use of individual-based simulations [102, 206, 232, 233], which allow a realistic incorporation of many biological processes at the cost of reduced analytical tractability. Although most models are of a host population, there are some models of plasmids within a single cell, both deterministic [234, 235] and stochastic [199]. Of course, many studies combine multiple modelling methods: particularly common is the combination of an analytically tractable model of some sort with a simulation [225, 228, 233].
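To make the fixed-population-size approach concrete, the following is a minimal Python sketch of a stochastic Moran-type process with two cell types, plasmid-carrying and plasmid-free. It is not taken from any of the cited models: the fitness cost `cost` (assumed to be below 1), the per-birth plasmid loss probability `loss_prob`, and the function names are illustrative assumptions.

```python
import random

def moran_step(k, N, cost, loss_prob, rng):
    """One Moran update; k is the number of plasmid-carrying cells out of N."""
    # Pick the reproducing cell with probability proportional to fitness:
    # carriers have relative fitness 1 - cost, plasmid-free cells have 1.
    w_carrier = k * (1.0 - cost)
    w_free = N - k
    parent_is_carrier = rng.random() * (w_carrier + w_free) < w_carrier
    # Assumption: the offspring of a carrier loses the plasmid
    # with probability loss_prob (segregational loss).
    child_is_carrier = parent_is_carrier and rng.random() >= loss_prob
    # The offspring replaces a uniformly chosen cell, keeping N constant.
    dead_is_carrier = rng.random() * N < k
    return k + int(child_is_carrier) - int(dead_is_carrier)

def simulate_moran(k0, N, cost, loss_prob, steps, seed=0):
    """Return the trajectory of the number of plasmid-carrying cells."""
    rng = random.Random(seed)
    k = k0
    trajectory = [k]
    for _ in range(steps):
        k = moran_step(k, N, cost, loss_prob, rng)
        trajectory.append(k)
    return trajectory

if __name__ == "__main__":
    traj = simulate_moran(k0=50, N=100, cost=0.05, loss_prob=0.01, steps=5000)
    print("final number of plasmid-carrying cells:", traj[-1])
```

Because only relative frequencies matter here, the sketch has no growth or resource dynamics; without horizontal transfer, the plasmid-free state (k = 0) is absorbing.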
The biology, the specific question and the goal of the model determine which processes and features are taken into account. Models of a host population usually incorporate a few common biological processes. The most basic of these is the host population dynamics, which includes a growth model (often either exponential or Monod growth), competition between cells (often via a Lotka–Volterra model or explicit inclusion of a common resource), washout of cells (for chemostat models) and fitness costs or benefits to carrying particular plasmids. Horizontal transfer is frequently included: see Box 4 for a discussion of modelling horizontal transfer. Finally, loss of the plasmid during segregation is also a common model component: this can be modelled explicitly as a component of a stochastic model, but in ODE models it is usually modelled as a flux of cells from a plasmid-carrying to a plasmid-free compartment at a constant per capita rate. Other model features include physiological changes in the cells or age structure in the population [102, 112], migration [68], explicit modelling of the segregation of plasmids on host cell division [182, 228] and plasmid replication during the cell cycle [236], and many others.
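The common components listed above can be sketched as a two-compartment ODE model. This is a minimal illustration, not a specific published model: it assumes logistic (Lotka–Volterra-type) competition with shared capacity `K`, mass-action conjugation at rate `gamma`, a multiplicative growth cost `cost`, and segregational loss as a constant per capita flux `seg`. All names and parameter values are hypothetical.

```python
def plasmid_ode(state, r, cost, K, gamma, seg):
    """Right-hand side for plasmid-free (F) and plasmid-carrying (P) densities."""
    F, P = state
    crowding = 1.0 - (F + P) / K   # logistic competition for shared capacity K
    conj = gamma * F * P           # mass-action conjugation: F cells become P
    loss = seg * P                 # segregational loss: P cells become F
    dF = r * F * crowding + loss - conj
    dP = r * (1.0 - cost) * P * crowding - loss + conj
    return (dF, dP)

def integrate(state, t_end, dt, **params):
    """Fixed-step classical fourth-order Runge-Kutta integration."""
    steps = int(round(t_end / dt))
    for _ in range(steps):
        k1 = plasmid_ode(state, **params)
        k2 = plasmid_ode(tuple(s + 0.5 * dt * k for s, k in zip(state, k1)), **params)
        k3 = plasmid_ode(tuple(s + 0.5 * dt * k for s, k in zip(state, k2)), **params)
        k4 = plasmid_ode(tuple(s + dt * k for s, k in zip(state, k3)), **params)
        state = tuple(s + dt / 6.0 * (a + 2 * b + 2 * c + d)
                      for s, a, b, c, d in zip(state, k1, k2, k3, k4))
    return state

if __name__ == "__main__":
    # Illustrative values only; real parameters must come from data.
    F, P = integrate((1e6, 1e3), t_end=100.0, dt=0.01,
                     r=1.0, cost=0.1, K=1e9, gamma=1e-9, seg=0.01)
    print("plasmid-carrying fraction:", P / (F + P))
```

Swapping the logistic term for Monod growth on an explicit resource, or adding a washout term for a chemostat, changes only `plasmid_ode`; the integrator stays the same.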
The parameters associated with these models usually need to be assigned values, although for some models it may be possible to obtain analytical results (e.g. [223]). Many studies incorporate experimental and modelling work in the same study [98, 220, 232], and then the parameters can be estimated from the accompanying experiments. In the absence of codesigned experiments, parameter values or plausible parameter ranges usually need to be obtained from the literature, either directly or by additional parameter estimation from published data. For models with one or few parameters, it may be possible to explicitly explore the plausible parameter space. Otherwise, depending on the goals and scope of the study, a parameter sensitivity analysis might be required. Parameter estimates are usually obtained from in vitro experimental studies. Parameter estimation is often non-trivial (see also Box 4 for the estimation of transfer rates). The parameterization of in vivo models is notoriously difficult. An example of thoughtful and sophisticated parameter estimation for an in vivo model is the method developed by Tepekule et al. [226], who make use of various kinds of data and observations from multiple different sources, including microbiome time series data, to parameterize their model.
While a large number of plasmid models exist, this number is still small in light of the biological interest of plasmids. For those interested in learning about plasmids, there is a large variety of existing reviews covering plasmids and their evolution generally [6, 8, 9], the diversity of plasmids [10–12], antibiotic resistance plasmids [4, 13–15], the physiological functions of plasmids [16–21], and interactions between plasmids and between hosts and plasmids [14, 22–26]. Why, then, another review article?
The diversity and complexity of plasmids can easily be overwhelming for those without prior knowledge, which includes many theoretical biologists who originally have a background in a subject other than biology, such as mathematics, physics, or computer science. In particular, as soon as a novice sets up a model, many questions about meaningful biological assumptions appear. To lower the hurdle to start working on plasmids, we here present an overview of plasmid biology explicitly targeted at modellers new to the field.
An introduction to plasmids for modellers could certainly have various levels of biological detail and complexity: here, we want to go beyond a basic and largely conceptual level. From our own experience, we mostly see the need for a review that, while introductory, does not oversimplify the biology. While no prior knowledge about plasmids is required to read the article, some background about bacteria and microbiology is assumed. In the interest of accessibility to modellers with differing knowledge of microbiology, we provide a glossary of terms used: terms in the glossary are marked with an asterisk on first use. In the next three sections, we focus on plasmids themselves, discussing the variety of plasmids, their gene content and the important physiological functions they carry out; in the fifth section, we explicitly turn to the relationship between plasmids and their hosts, considering plasmid–host interactions, as well as interactions between plasmids. In four boxes, we highlight selected key topics related to plasmids and their hosts that are especially captivating for modellers, and provide a few references to relevant modelling work as a starting point for further reading. At the end of the article, we discuss several areas where we see particular scope (and need) for more modelling. Hopefully, the review will enable readers to begin working on models that will expand our understanding of plasmids and their hosts.
Plasmids are extremely widespread in the wild: bacteria and archaea from almost all taxonomic groups have been found to contain plasmids [8]. They also exist in eukaryotes, particularly fungi (see e.g. [27, 28]), and even in mitochondria within eukaryotic cells [29]; but we shall focus our attention on bacterial (and archaeal) plasmids. Precise quantification of the frequency of plasmids in different taxa is complicated by variable search effort: plasmids from bacteria of clinical importance or from model species are overrepresented among sequenced plasmids, particularly those from the Gammaproteobacteria, where antibiotic resistance plasmids have been the subject of extensive study [30].
Plasmids themselves may be naturally classified on the basis of biologically relevant properties. Perhaps the most important of these is the capacity for horizontal transmission* between bacteria by conjugation (the details of conjugation are discussed below). Conjugative plasmids carry all of the genes necessary for conjugation, and are therefore self-transmissible; mobilizable plasmids do not carry the full transfer machinery, but have a sufficient subset to be able to undergo conjugation in the presence of a conjugative plasmid which supplies the rest; the remaining plasmids are nontransmissible. It has been estimated that about half of all plasmids are nontransmissible, with the remaining half approximately equally divided between conjugative and mobilizable [31, 32]; however, recent results suggest that a large fraction of plasmids that are traditionally classified as nontransmissible might in fact be mobilizable, and mobilizable plasmids might make up the majority of all plasmids [33]. The capacity for conjugation is connected to two other biologically relevant properties: plasmid size and the number of copies of the plasmid maintained in a host cell. In general, conjugative plasmids are large and have a low copy number (typically one or a few copies per cell), while small plasmids, which tend to have a high copy number, are more often mobilizable or nontransmissible [8]: see Fig. 1. Nontransmissible plasmids exhibit a much larger range of sizes than transmissible plasmids, and there are some nontransmissible plasmids that are even larger than typical conjugative plasmids, which may be in the process of becoming accessory chromosomes* [31, 34]. Plasmids may also be classified as broad or narrow host-range plasmids, depending on whether they are capable of becoming established in a large variety of hosts, or are reliant on a particular group of bacteria [16].
Plasmids are also classified based on topology, which has particular effects on the biology of replication: the majority of plasmids are circular, but some linear plasmids have been found [35–38]. Beyond this division of plasmids based on broad properties, it is natural to develop a systematic biological classification of plasmid types. The classical approach is to divide plasmids into incompatibility groups. We say that two plasmids are in the same incompatibility group if they are incompatible; that is, if they cannot be stably maintained together in the same cell line [39]. This concept is perhaps a bit counterintuitive – plasmids are in the same group if they are incompatible with each other – but it produces natural kinds because the cause of incompatibility is usually interference between common regulatory systems on the incompatible plasmids [20]: this mechanism of incompatibility is shown in Fig. 2(a), and discussed further below. Because incompatible plasmids share fundamental genes, incompatibility groups may be looked on as ‘species’ of plasmids, and the production of new incompatibility groups as plasmid speciation [40, 41]. It has been suggested that in the modern world of abundant plasmid sequences, the concept of incompatibility groups could be replaced altogether in favour of directly comparing sequences of fundamental plasmid genes; this has been argued to address perceived conceptual limitations, such as the ability of single point mutations to create new incompatibility groups [42, 43]. These methods have used the sequences of the replication proteins (REP classification), the conjugative transfer proteins (MOB classification), or the entire plasmid to reconstruct phylogenetic relationships between plasmids [10, 44–47].
We shall see below that plasmids can fuse with each other and separate again, and they gain or lose segments by the movement of mobile genetic elements or by recombination: therefore the concepts of a plasmid type and the identity of a plasmid over time can be quite fluid in practice, with plasmids that belong to multiple incompatibility groups, or that significantly change their structure or gene content while remaining in the same incompatibility group. For the purposes of modelling, whether different plasmids count as being of the same type will generally simply have to be imposed by the modeller.

Distribution of plasmid lengths among mobility classes of plasmids. Plasmid length data are taken from PLSDB (retrieved 1 March 2023 [237]) and the distribution is derived by kernel density estimation in R [238] with the ggplot2 library [239]. The top panel shows all plasmids in the database; the bottom four panels show plasmids from the four bacterial families with the most sequences in the database. The graph was cut off at 300 000 bp for reasons of scale: 1844 plasmids in the database (5.34% of the total) are longer than the limit; the longest plasmid in the database is 4 605 385 bp. The plasmid sequences in PLSDB have been annotated with MOB-typer [240] to identify putative relaxases* or conjugative genes (see the explanation of these terms in the text). Those plasmids with neither are nontransmissible, those with both are conjugative, and those with only the relaxase are mobilizable; note that this means that those mobilizable plasmids with an oriT but no relaxase are not recognized as mobilizable. The ‘weird’ class includes those plasmids that had conjugative genes but no relaxase: these would be nontransmissible, but nonetheless have all the genes for a secretion system and mating pair formation. It is possible that these are misidentified conjugative plasmids, which have an unknown relaxase, or misidentified mobilizable or nontransmissible plasmids, which do not actually have conjugative genes.
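The classification rule stated in this caption can be written as a small decision function. This is only a sketch of the rule as described here: the two Boolean inputs are hypothetical stand-ins for the presence of a putative relaxase and of conjugative genes in an annotation, and the function does not reproduce the interface or output format of MOB-typer.

```python
def mobility_class(has_relaxase: bool, has_conjugative_genes: bool) -> str:
    """Assign a mobility class from two annotation flags (hypothetical inputs)."""
    if has_relaxase and has_conjugative_genes:
        return "conjugative"       # relaxase plus conjugative genes
    if has_relaxase:
        return "mobilizable"       # relaxase only
    if has_conjugative_genes:
        return "weird"             # conjugative genes but no relaxase
    return "nontransmissible"      # neither

if __name__ == "__main__":
    for flags in [(True, True), (True, False), (False, True), (False, False)]:
        print(flags, "->", mobility_class(*flags))
```

As the caption notes, a mobilizable plasmid that carries an oriT but no relaxase would be labelled nontransmissible under this rule.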

Mechanisms limiting coinfection of bacteria by multiple plasmids. (a) Unstable inheritance. Because the two plasmid types share a common copy number, random segregation at host cell division eventually ensures that they are separated into distinct hosts carrying only one plasmid type. (b) Surface and entry exclusion. The plasmids encode proteins which prevent the host cell from being a recipient in conjugation. (c) Destruction of the incoming plasmid. Novel plasmids are degraded by plasmid-encoded immune systems, such as CRISPR/Cas* or restriction enzymes* once in the host cell.
The understanding of plasmids must of course start with the plasmid itself, as a molecule, a sequence of nucleotides, and a collection of genes.
The genes carried on a plasmid may be broadly divided into two groups based on their function: those with plasmid-specific functions and those that primarily affect the host phenotype. The genes responsible for plasmids’ own housekeeping functions make up the plasmid backbone, while the other genes carried by a plasmid are called payload or accessory genes [16]. The housekeeping genes of the plasmid typically fall into a few classes: there are the genes responsible for the replication of the plasmid, a stability system that ensures the plasmid is stably inherited across host generations and, if the plasmid is transmissible, the genes responsible for conjugation or mobilization and overcoming host defences to establish the plasmid in the recipient cell [11]. The most important of these are the genes controlling replication. They will include the oriV (origin of vegetative replication, the sequence at which plasmid replication begins), as well as the genes for any proteins necessary for replication and for the components of a system to control replication of the plasmid [11]. Typically, the origin of replication and a few genes involved in replication form the minimum subsequence of the plasmid that is capable of replication in a host, called the ‘basic replicon’*, which constitutes an absolutely minimal plasmid [48]. The simplest plasmids – often called cryptic plasmids – may have no payload genes [49–51]; they therefore do nothing but hang around in the cell. But many plasmids, and for obvious reasons the plasmids of greatest interest, carry genes that contribute to the phenotype of the host cell. The most studied plasmid payload genes are antibiotic resistance genes. These are found extremely frequently, in a wide variety of hosts, and encoding resistance to a wide variety of antibiotics [3, 4, 52]; some aspects of plasmid-borne antibiotic resistance and models thereof are discussed in Box 2. 
Plasmids may also encode resistance to other environmental dangers, such as heavy metals or other toxins [53–55]. A second large class of payload genes provide some new metabolic process [56, 57]; this might include, for example, the ability to metabolize a new substrate for growth [58–60]. Virulence factors are also often found on plasmids [61]. Plasmids also carry genes responsible for social interactions between bacteria (discussed below). Particularly interesting examples of plasmid-borne traits include the gall-forming properties of the plant pathogen Agrobacterium tumefaciens, which are caused by the horizontal transfer of a portion of a plasmid to the host plant [62], and the symbiosis between nitrogen-fixing rhizobial bacteria and leguminous plants, in which both the genes responsible for nitrogen fixation and for interactions with the plant host are found on plasmids [63]. An extensive list of functions observed on plasmids may be found in [1, pp. 4–5]. Sometimes, even essential genes are found on plasmids [64, 65], although extrachromosomal replicons with the unique copy of an essential gene are sometimes categorized as secondary chromosomes* or chromids* instead of plasmids. Why particular genes are found on plasmids rather than on chromosomes is a longstanding question. Numerous hypotheses and models have been developed to explain the distribution of genes (see e.g. [63, 66–69]). Possible explanations include the advantages of mobility for local adaptation when there is patchy or temporally varying selection [63, 66]. In different conditions the same genes may be located on different replicons: it has been hypothesized that the majority, if not all, of bacterial genes have spent time on plasmids and on chromosomes over evolutionary time [66].
Antibiotic resistance genes are frequently located on plasmids [241, 242], and the serious threat of antibiotic resistance means that understanding the contribution of plasmids to its evolution is one of the most important applications of plasmid biology. The reasons antibiotic resistance genes are so often found on plasmids may include that plasmids allow bacteria to temporarily acquire resistance genes that are not needed in nonselective environments [52], or that the presence of plasmids in multiple copies increases either the dosage of resistance genes [243, 244], or the rate of emergence of new resistance alleles [90, 245], or both. Horizontal transmission of plasmids enables antibiotic resistance to spread very quickly in bacterial communities [4, 13, 246]. The human gut is a site of substantial plasmid transfer. Resistance plasmids within the gut continue to evolve [247] and are transferred between members of the microbiota within individuals and spread across individuals, as shown by León-Sampedro et al. [248] for the plasmid pOXA-48 in the gut of hospital patients. During antibiotic treatment, not only the pathogen population but also the patient’s normal microbiome is exposed to antibiotic pressure, inadvertently selecting for resistance (so-called bystander selection). However, resistance plasmids are also found in human populations without strong exposure to antibiotics [249]. Resistance in commensal bacteria can be problematic for two reasons. First, some commensal species, such as Escherichia coli, Klebsiella pneumoniae, or Staphylococcus aureus, can turn pathogenic and cause infections of the urinary tract, the lung, or wounds, and are a major cause of nosocomial infections. Second, resistance plasmids may spread from commensal bacteria to obligate pathogens [250–253]. Despite the immense clinical relevance of plasmids, modelling studies are biased towards chromosomal resistance.
Nonetheless, a body of theoretical literature on plasmid-mediated resistance has accumulated [15, 254], and plasmids are increasingly attracting the attention of modellers. For example, Svara and Rankin [207] developed an ODE model to determine under which treatment conditions – antibiotic dose and intervals between administrations – resistance on a conjugative plasmid would be favoured over resistance on the chromosome. Their model also includes an incompatible plasmid without the resistance gene, leading to six different cell types that compete with each other. Other models have, for example, addressed how patterns of drug use determine the prevalence of resistance plasmids within the microbiota at the level of the individual [226] and the level of the population [254] and examined the interaction of plasmid-borne resistance with specific modes of drug action [212]. Models furthermore often complement in vitro experiments to help explain and interpret the observed dynamics of plasmid-mediated antibiotic resistance (e.g. [245, 255]).
The genes on plasmids are not distributed arbitrarily, but tend to have an organized structure. This structure is often modular, with genes with related functions located together on the plasmid; there are then segments of the plasmid for replication, conjugation, different payload genes, and so on [11]; see, for example, the plasmids depicted in Fig. 3. In addition, the genes carried on plasmids are often located within other, nested mobile genetic elements*: a plasmid-borne gene may be in a gene cassette*, which is integrated into an integron*, which is carried by a transposon*, which is located on the plasmid [16]. The modular structure suggests that plasmids may frequently evolve by the gain or loss of entire modules, whether by recombination* with chromosomes or other plasmids or by the movement of mobile genetic elements. This also means that plasmids are frequently genetic mosaics, which may combine components of several mobile genetic elements together with pieces of multiple original plasmids and chunks of chromosomes [70–75]. Large plasmids may contain multiple basic replicons (e.g. F, see [76]); this means that they can replicate starting from any one of their several oriV sequences using the replication mechanism encoded by that basic replicon. Usually, only one will be active at a time, since otherwise there would be conflicts between the replication processes; often different replicons will be active in different hosts, extending the plasmid host range [12].

Illustration of important processes in plasmid biology with examples of three different plasmids. (a) The low-copy-number (~1 copy per cell) plasmids p42a (conjugative) and p42d (nontransmissible) of Rhizobium etli CFN42 (GenBank accession numbers CP000134.1 and U80928.5 [256, 257]) fuse into a cointegrate and then are transferred by conjugation from a host to a new recipient. The two cells have been pulled together by a pilus, which has then been retracted. A single strand of a plasmid copy is transferred, while the other strand remains within the cell in circular form. Afterward, a second strand will be synthesized in both cells to generate double-stranded DNA. p42d is also referred to as pSym, since it contains most of the genes responsible for rhizobial symbiosis in this strain; it is normally transferred horizontally by cointegration with p42a [191, 258]. (b) The copies of the small, high-copy-number (~14 copies per cell) resistance plasmid pB1000 (GenBank accession number GU080070.1 [180]) have been segregated between daughter cells at host division; cell division is almost complete. Inside each bacterial cell, the plasmids and chromosome are depicted. The diagrams of the plasmids show the open reading frame* of each gene on the plasmid as an arrow coloured by the function of the encoded protein. The region of pB1000 marked in the replication colour contains the plasmid origin of replication and the coding sequences for the RNAs involved in regulation of replication. The origins of replication of p42a and p42d are located inside the RepA gene; the origin of transfer of p42a is marked with a dot in the mobilization colour.
In addition to the formation of mosaics by accumulation of portions of plasmids and other replicons, several plasmid copies may fuse into one molecule, forming a plasmid multimer; plasmids frequently exist in hosts as a mixture of monomers and multimers of various sizes [77–80]. Multimerization occurs not only between plasmids of the same type, but also of distinct types, forming plasmid cointegrates that exhibit the properties of both their components [72, 81–84]: this is depicted in Fig. 3(a). The formation of plasmid multimers and cointegrates is driven by homologous recombination* or recombination of transposable elements located on one or the other plasmid [85–87], and sometimes by more exotic processes (reviewed in [22]); to counteract multimerization, plasmids frequently encode multimer resolution systems.
The backbone genes of plasmids have functions related to the ‘life’ of the plasmid itself, and are responsible for its maintenance in the host, its propagation (vertically or horizontally) and similar housekeeping. These functions constitute the physiology of the plasmid, and their presence is what distinguishes a plasmid from a simple fragment of DNA that might be picked up by transformation*.
It is a defining attribute of plasmids that they replicate in their host cells autonomously from the chromosome. Although most plasmids rely partially on host proteins for replication, they also carry genes essential to their replication and responsible for its regulation. The regulation of replication ensures that the plasmid is present in the host population at a fixed number of copies per cell. Control of copy number is necessary to ensure the stable maintenance of the plasmid: if the copy number falls too low, there is a greater risk of producing plasmid-free segregants, while if the copy number is allowed to grow without limit, the plasmid will impose a heavy fitness cost on its hosts; Box 3 discusses the multilevel selection acting on traits such as plasmid copy number. We have already seen that there are low- and high-copy-number plasmids; but there is much copy number variation among high-copy-number plasmids, from around 10 copies to hundreds (e.g. [78, 88]). The copy number is not exactly identical from cell to cell, but is subject to some noise, which can be reduced by a partitioning system [89]. The plasmid copy number may of course be subject to evolution, even over short time scales when the plasmid carries an important host function [90–92].
Multilevel selection
Because plasmids depend on a bacterial host to survive and propagate, they offer an excellent example of multilevel selection: the fitness of plasmids depends both on their own properties (of replication, maintenance, etc.) and indirectly on the fitness of their hosts [66]. As we have seen, plasmid-borne genes frequently affect host fitness as well as the fitness of the plasmid itself: this includes both direct effects on host phenotype and the indirect fitness effects discussed in the text. Interactions between these two levels have important implications for the evolution of plasmids and for their contribution to the evolution of their hosts. For example, there is a trade-off between rates of vertical and horizontal transmission of plasmids [15, 100, 128]: the costs to the host of conjugation mean that increased horizontal transmission reduces vertical transmission. A similar trade-off also leads to conflicting selection pressures on the plasmid copy number: a plasmid variant with a higher copy number outcompetes a variant with a lower copy number at the intracellular level, but if the copy number increases too much, the burden on the host cell may become too high [129, 259].
Allele dynamics on multicopy plasmids
An effect of multicopy plasmids on bacterial evolution is the possibility for loci on multicopy plasmids to be heterozygous (reviewed in [8]). The dynamics of alleles on multicopy plasmids depend on processes at two levels – intracellular plasmid replication and segregation and population dynamics at the cellular level. If the alleles have fitness effects, the host fitness depends on the plasmid composition within the cell, and various forms of dominance or heterozygote advantage are possible. As illustrated in Fig. 4, random segregation of plasmid copies at host cell division changes the plasmid composition from mother to daughter cells, which is termed segregational drift [260]. In the long term, this leads to the loss of heterozygosity on plasmids unless there is selection for maintaining the two plasmid types together. Segregational drift does not alter the allele frequency in the bacterial population per se, but since it leads to the generation of wild-type homozygous cells, it reduces the number of cells carrying the mutant allele, which increases the strength of genetic drift. This means novel mutations on plasmids have a higher chance of being lost than those on a monoploid chromosome, at least in the absence of gene dosage effects [98, 228, 261]. Replication of plasmids may also be a source of drift: selection of plasmids to replicate, if at least partially random, creates a ‘rich-get-richer’ effect that leads to more extreme biases in plasmid content in offspring [230]. The evolutionary dynamics on multicopy plasmids have recently received substantial attention from both experimentalists [90, 261] and theoreticians [228, 230] and from both together [98, 245, 260, 262]. Early models have studied the dynamics of incompatible multicopy plasmids, which is a closely related problem [198, 200, 201].

Schema of the effect of segregational drift on a novel allele located on a plasmid. A novel allele originally arises on one copy of the plasmid, and by random segregation wild-type and mutant plasmids eventually end up in different, homozygous cells. We here describe one way of modelling the process mathematically [228]. In the simplest case, cell dynamics are modelled by a birth-death process with per capita birth and death rates $b_i$ and $d_i$, which may depend on the number of mutant plasmids $i$ in a given cell. Prior to cell division, plasmids are replicated such that the cell contains twice the original number. Two possible models for plasmid replication are ‘regular replication’, where each copy is replicated exactly once, and ‘random replication’, where copies are randomly picked for replication one by one (cf. [198]). Each daughter cell then receives half of those plasmids with segregation being random with respect to mutant and wild-type variants. With a copy number of $n$ (so that the cell holds $2n$ copies after plasmid replication) and $x$ mutant copies after replication, the probability that one daughter cell receives $j$ and the other one $x - j$ mutant copies is then given by $\left(2 - \delta_{j,x/2}\right)\binom{x}{j}\binom{2n-x}{n-j}\Big/\binom{2n}{n}$, where $\delta_{i,j}$ denotes Kronecker’s delta. Adapted from [90, 245].
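The segregation probability in the caption above can be evaluated directly. The following is a minimal sketch, not code from [228]: the function names are hypothetical, and n is taken to be the number of copies each daughter receives (so the mother holds 2n copies after replication), which is the reading implied by the binomial coefficient in the denominator.

```python
from math import comb

def segregation_probability(n: int, x: int, j: int) -> float:
    """Probability that the daughters receive j and x - j mutant copies.

    n: plasmid copies each daughter receives (the mother holds 2n after replication)
    x: mutant copies in the mother after replication (0 <= x <= 2n)
    j: mutant copies in one daughter; the outcome {j, x - j} is unordered
    """
    if not (0 <= x <= 2 * n) or not (0 <= j <= x) or j > n or x - j > n:
        return 0.0
    # Hypergeometric probability that a given daughter draws j mutants among its n copies.
    p = comb(x, j) * comb(2 * n - x, n - j) / comb(2 * n, n)
    # Factor (2 - delta_{j, x/2}): the daughters are interchangeable unless j == x - j.
    return p if 2 * j == x else 2 * p

def segregation_distribution(n: int, x: int) -> dict:
    """Distribution over unordered outcomes, indexed by the smaller count j <= x - j."""
    return {j: segregation_probability(n, x, j) for j in range(x // 2 + 1)}
```

For example, with n = 2 and a single mutant copy that has been duplicated by regular replication (x = 2), the daughters are both heterozygous with probability 2/3 and are separated into one wild-type and one mutant homozygote with probability 1/3.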
The replication control system ensures that the replication rate per plasmid copy per host generation is greater than one when the copy number is below the target and less than one when the copy number exceeds the target [1, p. 31]. The copy number therefore rises towards the target through replication, or falls towards it as host cell divisions dilute the plasmids faster than they replicate. The rate of replication is regulated by negative feedback: one common mechanism is by control of the production of a plasmid-encoded Rep protein that is responsible for initiating the replication of the plasmid by binding to the oriV and recruiting the necessary enzymes for replication. Expression of Rep is repressed by some trans-acting* product encoded by the plasmid: a separate, specialized repressor*, Rep itself, or another protein cotranscribed with Rep. As the plasmid copy number increases, more of this repressor is produced, and it prevents the production of Rep protein, creating a negative feedback loop [2, 17, 18, 93–95]. This mechanism explains the origin of plasmid incompatibility due to a common basic replicon. Since the repressors are trans-acting, two distinct plasmids with the same basic replicon will contribute to repressing each other’s replication, and therefore will share a common copy number. In the absence of selection for both plasmid types, this means that after a few generations random segregation* at host cell division will have separated the plasmid types into distinct host cells, and thus the two plasmids cannot be stably maintained together [20]; see Fig. 2(a). When replication is permitted by the control system, the selection of the plasmid copies to be replicated seems to be at random in at least some cases [96, 97], although it is difficult to definitively confirm this experimentally, and it may not be the case for all plasmids [98]. The mechanisms of replication are reviewed in [19].
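The separation of two incompatible plasmids by random segregation can be illustrated with a small simulation. This is a hedged sketch under strong simplifying assumptions that are not taken from the cited studies: the two types share a fixed copy number n, every copy is duplicated exactly once before division, each daughter receives exactly n copies drawn at random, and there is no selection or conjugation. The function name and parameter values are hypothetical.

```python
import random

def generations_until_separated(n, start_a, rng):
    """Follow one cell lineage until it carries only one of two plasmid types.

    n: shared copy number per newborn cell (hypothetical parameter)
    start_a: initial number of type-A copies (the remaining n - start_a are type B)
    rng: a random.Random instance, passed in so that runs are reproducible
    """
    a, generations = start_a, 0
    while 0 < a < n:
        # After replication the cell holds 2a copies of A and 2(n - a) copies of B.
        pool = ["A"] * (2 * a) + ["B"] * (2 * (n - a))
        # The followed daughter receives a random half of the replicated copies.
        daughter = rng.sample(pool, n)
        a = daughter.count("A")
        generations += 1
    return generations

rng = random.Random(1)
for n in (4, 10, 20):
    runs = [generations_until_separated(n, n // 2, rng) for _ in range(2000)]
    print(f"copy number {n}: mean generations to separation = {sum(runs) / len(runs):.1f}")
```

The sketch tracks a single lineage rather than a whole population, so it only indicates how quickly heterozygous cells disappear along a line of descent; the dependence on copy number can be explored by varying n.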
When a host cell divides, the plasmid copies present end up in one or other of the daughter cells (see Fig. 3b). In the absence of any intervention by the plasmid, this will happen essentially at random: the plasmids in one portion of the cell when it divides go to one daughter, and the plasmids elsewhere go to the other. For plasmids with a high copy number, this may be sufficient to stably vertically transmit the plasmid: with $n$ copies, the probability of producing a plasmid-free daughter by random segregation is $2^{1-n}$, which may be sufficiently small to be negligible. Indeed, some high-copy-number plasmids do not seem to have any other partitioning mechanism [96]. In practice, this is quite successful in reducing plasmid loss to a very low level [99].
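The value quoted above follows if each of the n copies independently ends up in either daughter with probability 1/2: all copies go to the first daughter with probability 2^(-n), likewise for the second, and the two events are mutually exclusive. The sketch below checks the closed form against brute-force enumeration; the independence assumption and the function names are ours, and this model differs from the equal-split segregation described for Fig. 4.

```python
from itertools import product

def plasmid_free_probability(n: int) -> float:
    """Closed form: P(one daughter receives no plasmid) = 2^(1 - n)."""
    return 2.0 ** (1 - n)

def plasmid_free_probability_enumerated(n: int) -> float:
    """Same quantity, by enumerating all 2^n equally likely assignments of copies."""
    outcomes = list(product((0, 1), repeat=n))  # 0/1 = which daughter gets each copy
    # A plasmid-free daughter arises exactly when all copies land in the same daughter.
    empty = sum(1 for o in outcomes if sum(o) in (0, n))
    return empty / len(outcomes)

for n in (1, 2, 4, 10):
    print(n, plasmid_free_probability(n), plasmid_free_probability_enumerated(n))
```

With 4 copies the loss probability per division is 0.125, whereas with 20 copies it is already below 2 in a million, which illustrates why random segregation may suffice at high copy number.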
Low-copy-number plasmids additionally need an active partitioning system that ensures that both daughter cells receive a copy of the plasmid. This can be done by localizing plasmids in separate regions of the cell before cell division (e.g. [100, 101]). A common class of such systems functions by encoding a protein complex and a cis-acting* site on the plasmid. The protein complex binds a pair of plasmids together at their respective copies of the cis-acting site and then physically separates the pair at cell division, possibly by binding to a point on the cell membrane or another host structure [21].
Even high-copy-number plasmids need to ensure that they are not made unstable by multimerization. The replication control systems we have discussed above control the number of copies of the basic replicon in the cell; this means that if multimers of the plasmid form, the number of physical molecules containing the plasmid falls without a derepression of replication (as the number of copies of the basic replicon remains the same). Since the number of physical molecules is what determines the probability of producing a plasmid-free daughter cell, multimerization reduces the stability of the plasmid. Moreover, larger plasmid multimers are replicated more often, since more copies of the basic replicon means more oriVs at which to initiate replication. A cell with multimers is thus likely to obtain more multimers, and reduce the plasmid’s stability further, in the so-called ‘dimer catastrophe’ [102]. To counteract this effect, plasmids generally encode a multimer resolution system, which converts multimers back to monomers by site-specific recombination* [21, 103, 104].
Another group of plasmid functions, variously called postsegregational killing, host killing, toxin–antitoxin systems, or plasmid addiction, are often included with stability systems, but we shall consider them separately below.
Perhaps the most famous physiological function of plasmids is the ability to transfer themselves from one host to another. Transfer by conjugation requires physical contact between the donor and recipient cell, which is usually, at least in Gram-negative* bacteria, achieved using a proteinaceous structure called a pilus: see Fig. 3(a). Usually, a single strand of the plasmid is transferred into the recipient cell, and the second strand is then resynthesized in both cells, meaning that conjugation also involves duplication of the plasmid; however, in some plasmids of Streptomyces, the entire double-stranded plasmid is translocated from the donor to recipient [105, 106]. In some cases, there is the possibility of retrotransfer, the transfer of genetic material from the recipient back to the donor, during conjugation [107]. Some plasmids can undergo horizontal transfer through more exotic mechanisms [108], sometimes involving cooperation between donor and recipient [109], but here we focus on common conjugation. In addition to the pili, which bring the donor and recipient into contact so that conjugation can occur, the typical conjugation machinery consists of four parts.
The first is the oriT, or origin of transfer, the sequence on the plasmid at which transfer begins. One strand of the plasmid is nicked* here by the second component, a relaxase*, which binds to the free end of the strand. The relaxase interacts with a type IV coupling protein (T4CP, the third component), and is transferred, together with the attached plasmid strand, by a type IV secretion system (T4SS, the fourth component) into the recipient cell. Once the entire strand has passed into the recipient cell, the relaxase catalyses the ligation* of the transferred strand back into a circular form [31]. This mechanism explains the distinction between conjugative and mobilizable plasmids discussed above: conjugative plasmids carry all of these components, while mobilizable plasmids encode only the oriT and corresponding relaxase, but not the expensive T4SS (and may or may not carry the T4CP), and therefore need a conjugative plasmid to provide the missing components. Some mobilizable plasmids only have the oriT, and rely on another plasmid even for a matching relaxase; these generally have a reduced rate of transfer [22, 33, 110]. The T4SS, T4CP and pili are each large protein complexes, which are costly to express, and the process of conjugation itself is energetically costly [25]. Moreover, some bacteriophages* bind to the proteins of the pilus (so-called ‘male-specific phages’), so bacterial cells expressing pili are at greater risk of infection. Therefore the conjugative phenotype is tightly regulated, to avoid imposing a large fitness cost on the host cell, and is almost always unexpressed [24, 111]. Signals that lead to expression of conjugation may include the detection of suitable recipients or specific environmental conditions [17, 111]; also, a temporary period of expression typically occurs immediately after a novel plasmid is first transferred to a recipient, before the repression becomes effective [112]. 
The modelling of conjugation in general, and conjugation rates in particular, is discussed in Box 4.
Dating back to Stewart and Levin [221], the majority of modelling studies consider a well-mixed population (such as a liquid culture) and assume that conjugative plasmid transfer follows the principles of mass-action kinetics [263]. Transfer is thus proportional to the densities of plasmid-free and plasmid-carrying cells and a conjugation coefficient γ. The framework is similar to classical epidemiological models with infected and uninfected patients. Based on the model shown in Fig. 5, Levin et al. [220] showed that mass-action kinetics with a constant γ captures the plasmid dynamics well in populations with constantly dividing cells (but not during lag phase or close to stationary phase). Models based on this approach have been used to address a large range of questions on plasmid dynamics, such as the plasmid paradox described in Box 5 [223], the question of which genes are carried on plasmids [69, 264], the dynamics of cooperative genes and cheating [265], the evolution of antibiotic resistance [207, 226] (see also Box 2) and the role of conjugation in the rate of adaptation [208, 227]. A simplifying assumption in most of these models is that the transfer coefficient γ is constant in time and independent of, for example, resource availability [263]. Dependence of γ on a resource C can be modelled by a Monod function (see e.g. [266]). Spatial models of plasmid transfer are in the minority. One of the first examples of a spatial model is the individual-based lattice model by Krone et al. [232], in which zero, one, or two cells may reside on the sites of a lattice. Plasmid-free cells receive plasmids at stochastic rates that depend on the number of donors and transconjugants* in a local neighbourhood and the amount of nutrients in a ‘nutrient neighbourhood’. The model is combined with experiments on agar surfaces, capturing the experimental observations well.
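Returning to the well-mixed case, the mass-action framework can be sketched numerically as follows. This is an illustrative toy model and not the model of Fig. 5: it lumps donors and transconjugants into one plasmid-carrying class, assumes Monod growth in a batch culture, a multiplicative plasmid cost and no segregational loss, and all names and parameter values are hypothetical placeholders. Setting `k_gamma` switches γ from a constant to a Monod function of the resource.

```python
def monod(c, vmax, k):
    """Monod saturation function: vmax * c / (k + c)."""
    return vmax * c / (k + c)

def simulate(r0, p0, c0, t_end, dt=0.001, psi_max=1.0, k_growth=0.25,
             gamma_max=1e-9, k_gamma=None, cost=0.05, e=5e-7):
    """Forward-Euler integration of a batch-culture mass-action conjugation model.

    r, p: densities of plasmid-free and plasmid-carrying cells
    c: resource concentration; e: resource consumed per new cell
    k_gamma=None keeps gamma constant; otherwise gamma follows a Monod function of c.
    """
    r, p, c, t = r0, p0, c0, 0.0
    while t < t_end:
        psi = monod(c, psi_max, k_growth)            # resource-dependent growth rate
        gamma = gamma_max if k_gamma is None else monod(c, gamma_max, k_gamma)
        transfer = gamma * r * p                     # mass-action conjugation term
        dr = psi * r - transfer                      # plasmid-free cells
        dp = psi * (1 - cost) * p + transfer         # plasmid carriers pay a growth cost
        dc = -e * psi * (r + (1 - cost) * p)         # resource used up by growth
        r, p, c = r + dr * dt, p + dp * dt, max(c + dc * dt, 0.0)
        t += dt
    return r, p, c

# Compare a constant gamma with a resource-dependent gamma over 24 time units.
print(simulate(1e5, 1e5, 100.0, 24.0))
print(simulate(1e5, 1e5, 100.0, 24.0, k_gamma=0.25))
```

Forward Euler with a small fixed step is used only to keep the sketch dependency-free; a stiff or long-running model would be better served by an adaptive ODE solver.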
How much do plasmids conjugate? The donor and recipient species, the plasmid, environmental factors and coinfecting plasmids influence the rate of plasmid transfer [267]. There is, surprisingly, no standard method for estimating conjugation rates [268–270]. Many studies do not estimate the conjugation coefficient γ but report quantities such as the number of transconjugants per donor or per recipient. Methods to estimate the conjugation coefficient γ are mostly derived from models that describe plasmid spread by ordinary differential equations [220, 266, 269]. For example, Levin et al. [220] solved their ODEs (Fig. 5) to obtain the expression