The biophysical nature of cells: potential cell behaviours revealed by analytical and computational studies of cell surface mechanics

Background The biophysical characteristics of cells determine their shape in isolation and when packed within tissues. Cells can form regular or irregular epithelial structures, round up and form clusters, or deform and attach to substrates. The acquired shape of cells and tissues is a consequence of (i) internal cytoskeletal processes, such as actin polymerisation and cortical myosin contraction, (ii) adhesion molecules within the cell membrane that interact with substrates and neighbouring cells, and (iii) processes that regulate cell volume. Although these processes seem relatively simple, when combined they unleash a rich variety of cellular behaviour that is not readily understandable outside a theoretical framework. Methods We perform a mathematical analysis of a commonly used class of model formalisms that describe cell surface mechanics using an energy-based approach. Predictions are then confirmed through comparison with the computational outcomes of a Vertex model and 2D and 3D simulations of the Cellular Potts model. Results The analytical study reveals the complete possible spectrum of single cell behaviour and tissue packing in both 2D and 3D, by taking the typical core elements of cell surface mechanics into account: adhesion, cortical tension and volume conservation. We show that from an energy-based description, forces and tensions can be derived, as well as the prediction of cell behaviour and tissue packing, providing an intuitive and biologically relevant mapping between modelling parameters and experiments. Conclusions The quantitative cellular behaviours and biological insights agree between the analytical study and the diverse computational model formalisms, including the Cellular Potts model. This illustrates the generality of energy-based approaches for cell surface mechanics and highlights how meaningful and quantitative comparisons between models can be established. Moreover, the mathematical analysis reveals direct links between known biophysical properties and specific parameter settings within the Cellular Potts model.


Background
The assembly of a multicellular organism is achieved, alongside cell division, apoptosis and differentiation, by changing cell shape and tissue topology. Such cellular and tissue dynamics are highly determined by the cell's physical properties. Therefore, to increase our understanding of morphogenesis, we need quantitative analyses and descriptions of the biomechanical properties of cells and tissues. However, despite the recent advances in this field [1][2][3][4][5][6][7][8][9][10][11][12], we still lack detailed knowledge as well as computational power to approach cell mechanics from a molecular perspective. Accordingly, there are several multicellular modelling approaches that waive the fine detail of specific molecular interactions by describing cell properties on a biophysical cellular level (see [13] for a review). Such approaches are analogous to the simplification of considering the surface tension of a liquid droplet, instead of explicitly specifying the molecular interactions involved. These cellular models maintain the desired mesoscopic cell features, while facilitating the comparison between simulations and experimental data, given its reduced number of variables and effective parameters [14][15][16]. Although such mesoscopic approaches lack direct links between the molecular level and the macroscopic biological outcome, they remain powerful instruments for comprehending cellular behaviour, since they allow the investigation of novel regulatory elements and biochemical processes. Lecuit and Lenne [17] evaluated an overarching class of such mesoscopic biophysical models, the Cell Surface Mechanics (CSM) models. These describe how biologically-generated tensions -namely adhesionderived tension, cortical tension and tension due to the cell's internal pressure -affect (i) the displacement of the cell's membrane, (ii) modify the cell's shape and (iii) influence its dynamics. Nevertheless, major issues prevent us from gaining optimal understanding from these models.
Firstly, it is not always clear how to quantitatively link experimental observations to model implementations. We will therefore discuss how biophysical properties are linked to subcellular components, such as the structure and mechanics of the cytoskeleton and the interactions between adhesion molecules among neighbouring cells. On a pragmatic level, this leads us to derive a protocol for CSM class simulations [17], to determine and modify expected cell behaviour in simulations and help establish a link with experimental observations. A second source of confusion relates to the wide variety of computational implementations used to describe tissues. It is not yet evident how a model-to-model comparison can be drawn, confusing the biology community when trying to compare common insights and features between different theoretical studies. Likewise, modellers are frequently not aware of the common features and artefacts of their implementations, making them susceptible to overlooking valuable information from parallel studies. We address this second concern in two ways: (i) we initially discuss in detail how a commonly used lattice-based model formalism -the Cellular Potts model (CPM) -can (and should) be seen as being part of the CSM class, and how to determine its parameters and expected dynamics, whilst ensuring appropriate scaling; and (ii) we compare CPM simulations to analytical predictions as well as other CSM implementations, and highlight overlaps and interesting deviations.
We use the CPM as a reference model formalism from the CSM class. Originally, CPM was proposed to describe cellular dynamics based on differential adhesion, thereby defining tissue surface tension relations [18,19], to consolidate Steinberg's Differential Adhesion Hypothesis (DAH) [20]. Classical CPM simulations, in which cells display an internal pressure and positive adhesion coupling constants, successfully captured cell sorting phenomena [18,19,21]. However, in the classical formulation the CPM did not include the effects of cortical tension. Ouchi et al. [22] have proposed an extension with a term representing the surface constraint tension (to modify the relationship between cell motility within aggregates and adhesion strength [23]), alongside using negative instead of positive coupling constants for the adhesion (discussed later). Recently, such a surface constraint term has been identified as the cortical tension, the third "ingredient" underlying cell surface mechanics [17,24]. Since then, cortical tension within the extended CPM has been used in several multicellular modelling studies, for example, in the description of cortical-tension dependent cell shape alterations in dendritic cells [14]; in reverse engineering of the critical parameters determining Drosophila's eye geometry [24]; and to understand germ-layer organisation in zebrafish [15]. A number of other computational model formalisms have been proposed which, analogous to the CPM, use an energy-based description for volume conservation, adhesion and cortical tension [25][26][27][28]. They adopt, however, different ways of describing the cell shape as well as its dynamics. Such models have been used to explore the role of the interfacial tension, reproducing, through parameter changes of the CSM, different forms of epithelial cell packing [25]; plausible mechanisms underlying cell intercalation [27]; and again, the Drosophila's eye geometry [26].
Surprisingly, despite the growing acceptance of the concept and importance of cortical tension from both modelling and cell biology communities, a clear explanation for the role of cortical tension in cell surface mechanics is still lacking. In fact, although several analytical studies regarding CSM have been published [29,30], it is still unclear what cell behaviour can be expected even when just a single cell is considered in terms of its cortical tension. For multicellular modelling, such a baseline understanding is vital to pinpoint the role of each biophysical parameter, allowing us to distinguish and predict beforehand individual-and collective-cell-driven mechanisms. We used the CPM formalism to numerically verify our analytical predictions and to determine their implications in actual cell dynamics, highlighting the fact that the analytical results are independent of the specific simulation framework.

The cell surface mechanics model
From empirical evidence, it is well known that cells maintain and regulate their size through osmotic pressure control [31]; that internal cellular cortical regulation, through myosin contraction, deforms cells; and that interactions with other cells influence individual shapes, since when two cells are brought together to adhere, their contact interface will extend dependent on their adhesion strengths [32]. Energy-based models for cell surface mechanics bring these three main observations together through a description of three independent energy-terms ( Figure 1A). Pressure is described by the energy variation due to volume deviation relative to the cell's target volume (resting volume). The contractility driven by the cell cortex actin-myosin complex is described by an elastic (Hookean) tension applied to its membrane, i.e. the energy increases quadratically with the deviation from the membrane's resting length. The adhesion between two interfaces, such as cell-cell or cell-extracellular contacts, can be simply described as a constant surface tension term. An in-depth comparison how and to which extent those biophysical processes can be linked to the underlying cell biology can be found in [17]. Although the effective surface tension -interfacial tension -that follows from these terms is an equilibrium property, it also proves to be a powerful concept in understanding the dynamics of cells and tissues [17,33]. Most studies until now considered 2D epithelial cell layers, although some work has been done on 3D tissue (e.g. [14]). Because in epithelial tissue cell adhesion and cell-cell contacts are typically restricted to a small zone close to the apical cell surface, it is generally considered reasonable to treat epithelial cell packing as a 2D problem [24,25].
Although the nomenclature varies throughout the literature, in all 2D studies mentioned above the energy function E takes the form of E (p, a J, λ p , P, λ a , A) = Jp+λ p (p − P) 2 where p and a are the perimeter and area of the cell (see Figure 1A). The function uses five parameters for the cellular properties: J, an energy per contact length due to adhesion to other cells or the surrounding medium; P, the membrane resting length; and A, the target cell area (resting area); while the constraints are modulated by the Lagrange multipliers λ p and λ a (comparable to elastic constants), which weigh the relative tension contributions of actin-myosin contraction and cell deformations, respectively. Although modifications of the above energy function could and have been proposed (see, e.g., [34]), almost all studies on CSM have been using this basic framework, sometimes further simplified (see, e.g., [19,25]), or extended with additional terms that, for example, capture chemotaxis, the microstructure of the extracellular matrix or fluid dynamics [35][36][37]. These extensions, such as combining CSM with chemotaxis, can trigger highly intricate and sophisticated dynamics [38]. Nevertheless, Figure 1 Cell surface mechanics. (A) Illustration of the three main forces regulating the cell surface mechanics: adhesion-driven tension, cortical tension and pressure. Adhesion is described as a constant tension J along the surface (green arrows). The cortical tension is described by an elastic tension with equilibrium length P and elasticity constant of λ p (orange spring). (B) The interfacial tension is defined as the altogether adhesiondriven and cortical tensions. (C) Deformations away from the cell's target area A generates a pressure (white arrows).
understanding the dynamics of the core CSM model is an essential ground step to enable understanding of the full process and in interpreting the meaning and consequences of any subsequent model extension.
Note that the above equation is a simplification which assumes that the cell is completely surrounded by homogeneous contacts (which could be other cells or medium). In the case of an heterogeneous cell environment, the first term, in its most general form, should be written as σ J σ p σ , where σ indicates surrounding medium and/or neighbouring cells; J σ the energy per contact length specifically for the interface between the cell and σ ; and p σ the length of that given interface. The sum over all of the cell interfaces' lengths gives its perimeter p = σ p σ . For the initial analysis we will assume the environment to be homogeneous. There has been ample discussion whether it is more reasonable to use positive or negative J-values [22,24]. Part of the issue is that Jvalues may be used to encode several biological processes, making it a challenge to quantify or ascribe to them a biological meaning (see [29] for a detailed discussion). We therefore assume that its sign (as well as the sign of P, see discussion on mapping between J and P below) is undetermined. It is nonsensical, however, to consider negative values for the perimeter and area constraints, and it seems unreasonable to use a negative target area. Moreover, while in many modelling studies no perimeter constraint is being used (corresponding to λ p = 0), it is nonsensical to have no area constraint. We therefore assume that λ p and A are always non-negative and λ a is positive. We initially focus on a 2D cell, and later extend our analysis to 3D tissues.
Note that the formalism, besides discarding any intracellular detail, also describes cell surfaces without explicit "surface elements", whose movement could be followed over time and would require energy to move closer/away from each other (when not affecting its perimeter or area). While clearly being a coarse simplification, this reduced level of membrane complexity is what allows CSM models to capture complex tissue dynamics involving many cells. (Note that while numerically CSM dynamics might be calculated through displacements of introduced surface elements, they are not relevant for the energy calculation of the configuration, and hence for the dynamics itself.) From the energy function above, we can derive important quantities that will greatly facilitate the understanding of cell and tissue dynamics. Firstly, the cell's interfacial tension γ -the work required to extend the membrane by a unit area -is expressed in 2D as the change in energy per unit perimeter length ( Figure 1B) and depends on both the adhesion and the cortical tension, We want to emphasise that while both forces contribute to the cell interfacial tension, only adhesion is specific to the cell-cell or cell-medium interactions at the interface. Also, although on a regulatory level adhesion and cortical tension might be biologically linked, they can be physically separated within this energy description. The interfacial tension can be split up into a length-independent component, J − 2λ p P, and a component which depends linearly on the perimeter length, 2λ p p. We can thus write where τ is defined as the length-independent component of the interfacial tension. The sign of τ is undetermined, while the length-dependent component is always nonnegative.
The pressure within the cell that contributes to a force per unit membrane area can be represented as the work required per unit volume decrease or, equivalently, the decrease in energy per unit volume increase (in 2D, area increase) ( Figure 1C): The force F applied at a certain point of the cell's membrane due to the above energy function (Eq. 1) is the negative of the local gradient of the function at that point, ∇E, which can be elegantly rewritten in terms of changing area and perimeter: The force F represents a vector at a point x on the membrane, which can be decomposed into an interfacial tension force F γ = −γ ∇p and a pressure driven force F = ∇a. An imbalance between these forces should lead to a dynamical change of the cell shape. It is important to note that biological cells are highly dissipative objects, for which viscous forces greatly exceed inertial forces (low Reynolds number, see [39]). Thus, the motion of cell deformations due to the force F acting on the cell membrane is usually assumed to be overdamped, with inertial terms being negligible when compared to dissipative terms, leading to first-order dynamics, i.e. F ∝ d x/dt [13,40].
As one of our aims is to clarify how model parameters determine cell and tissue behaviour, we will first characterize the equilibrium states of single cells. To do so, we use the above definitions and derived variables.

Single cell analysis
To understand the role of the biophysical parameters on the cell mechanics we started asking two basic questions. Firstly, what are all the possible equilibrium cell sizes, what is their stability, and how are these defined by the parameters; and secondly, what is the parameter regime for which, at equilibrium, the cell's interfacial tension (γ ) is positive. The answer to the former question tells us for which cell size(s) the forces acting on the cell are in balance, while the answer to the latter question enables us to determine whether the cell shape is stable. In 2D, when γ > 0 the cell tends to minimise its perimeter relative to the enclosed area a. Under this condition, the circle is the cellular shape that best minimises the a/p ratio [41].
We begin by assuming that the cell has a circular shape, as this is the minimal-energy solution given positive interfacial tension. In such a scenario, the cell's perimeter and area cannot be independently specified. Instead both can be specified as a function of the cell radius r, with p = 2πr and a = πr 2 , respectively. Given the circular shape assumption, the energy E can be expressed as a function of r, where R p = P/2π and R a = √ A/π are the reciprocal target radii of P and A, Once again, from this energy function we can extract the cell's interfacial tension (γ ) and pressure ( ), but now as a function of the radius. For the circular cell, Eq. 2, Eq. 4 read as

Positive interfacial tension at the equilibrium cell radii
Under positive interfacial tension, energy reduction leads to surface minimization. When the interfacial tension is negative, the tendency of reducing the surface for a determined area is lost, as there is no energy cost to additional surface -in fact, the tendency is to increase the length of the interface, causing cells which are constrained in area to adopt different non-circular and ruffled shapes. Thus, to predict the qualitative behaviour of the cell regarding shape, it is therefore important to determine the sign of the interfacial tension at the equilibrium radius.
To determine the equilibrium cell radii, we first take the derivative of E with respect to the radius of the cell, By definition, at equilibrium the energy gradient is zero (provided r * > 0, because an equilibrium radius r * < 0 is nonsensical, and r * = 0 is always an equilibrium, independent of the energy gradient). Therefore, the equilibrium radius can be found by solving for ∂E ∂r = 0 (Eq. 8). To avoid directly analysing this cumbersome cubic equation, we rewrite Eq. 8 into a form from which we can directly determine the sign of the interfacial tension at equilibrium. The first step is to substitute Eq. 7 into Eq. 8, which yields Then we bring in the equilibrium radius r * (for which the energy gradient is zero) giving us For a cell to have a stable circular shape, the interfacial tension at the equilibrium radius has to be positive, i.e. γ (r * ) > 0. From the requirement γ (r * ) > 0 it follows that where, as before, τ describes the perimeter-independent component of the interfacial tension, τ = J − 4πλ p R p . Moreover, from Eq. 10 we find that at equilibrium γ (r * ) = r * (r * ). The requirement γ (r * ) > 0 therefore also implies that r * (r * ) > 0, and thus − 2πλ a r * r * 2 − R 2 a > 0 , (12a) again provided r * > 0. By combining inequalities Eq. 11 and Eq. 12, the requirement for an equilibrium radius to have a positive interfacial tension becomes −τ 4πλ p < r * < R a .
Two conclusions can be drawn from this inequality. First, a solution is only possible when Consequently, the interfacial tension at r * > 0 is going to be negative for any τ < −4πλ p R a , independently of the specific value of r * . Secondly, when τ > −4πλ p R a , Eq. 13 is always fulfilled, and hence it follows that the interfacial tension at r * has to be positive. This can be concluded in the following way: When τ > −4πλ p R a , it is not immediately clear that the interfacial tension is positive, but it does directly imply that −τ 4πλ p < R a . Only when −τ 4πλ p < r * < R a the interfacial tension is positive, but there seem to be other possibilities which could give a negative interfacial tension, namely r * < −τ 4πλ p < R a and −τ 4πλ p < R a < r * . Now let's assume that the first situation applies, i.e. r * < −τ 4πλ p < R a . Because of r * < −τ 4πλ p , we know from Eq. 11b that γ (r * ) < 0. Consequently, to fulfil Eq. 10, r * < 0 as well. But this implies that (Eq. 12) r * > R a , which renders this approach inconsistent. Using the inverse reasoning, it is clear that the other situation, i.e. r * > R a , is also not possible. Therefore, when τ > −4πλ p R a , it automatically follows that −τ 4πλ p < r * < R a , and so the interfacial tension at r * is always positive. This allows for a straightforward calculation to determine if the interfacial tension at r * is positive. This is the case when The condition τ > −4πλ p R a thus determines an important parameter range in which the circular shape is stable. When the circular shape is unstable, the shape of the cell is hard to predict and will critically depend on the specific implementation of the cell surface and surface dynamics within the diverse CSM model formalisms. Even though the cell shape will be unpredictable, in this parameter regime there is no inherent conflict between fulfilling the area constraint (equivalent to solving (a * ) = 0) and fulfilling the effective perimeter constraint (equivalent to solving γ (p * ) = 0); hence, it is expected that for the shape that will be taken up, a * = A and p * = P − J/ 2λ p . In contrast, when the circular shape is stable, we can analyse the cell radius equilibria for any CSM implementation, as follows next.

Cell equilibrium radii and their stability
The cell equilibria radii are determined by finding the radii for which ∂E/∂r = 0, which involves the solving of a reduced cubic equation on r. We start by rewriting Eq. 8, separating the terms that are dependent on r from the r-independent term, τ (see Figure 2): With Eq. 16 we can assess how the number of equilibria depends on the two composite parameters, τ and = 3λ a . If < 0, the right-hand side term 2πλ a r r 2 − 3 cannot become negative (it is nonsensical to consider r < 0), so by itself this term does not give rise to non-trivial equilibria. The only non-trivial equilibrium then occurs when τ balances the monotonically increasing right-hand side term (see Figure 2A). This solution is unique and stable, the equilibrium radius is positive if τ < 0 (r * = ρ 3 , Figure 2C), but negative and therefore nonsensical if τ > 0. Consequently, when < 0 and τ > 0 the only equilibrium is the trivial solution r * = ρ 0 = 0, which is stable, while when τ < 0 the trivial solution r * = ρ 0 = 0 is unstable (Figure 2A, C). In contrast, if > 0, the right-hand side term 2πλ a r r 2 − 3 is not monotonically increasing and hence there can be three real roots (see Figure 2B). Nevertheless, the possible equilibria, their sign and their stability can be easily understood if one recognizes that modifying the value τ corresponds to a vertical shift in the function ∂E/∂r (Figure 2A, B). By starting with τ = 0 (for which ∂E/∂r is an antisymmetric function) and then shifting the graph vertically, all possible equilibria combinations occur successively (Figure 2A, B). For two parameter combinations the number of equilibria changes ( Figure 2B, not taking negative r-values into account). The lower dotted line shows the transition, at τ = 0, from two to three equilibria, while the upper dotted line shows the transition from three equilibria to one equilibrium. The latter transition corresponds to a fold bifurcation, while the former transition is a non-standard bifurcation that bears resemblance to a transcritical bifurcation, but formally does not correspond to one. We will therefore henceforth refer to it as a 'pseudo-transcritical' bifurcation. The fold bifurcation takes place when the minimum of the ∂E/∂r function equals zero. The radius at which ∂E/∂r has a minimum is given by r = √ , which can be easily determined by setting ∂ 2 E ∂r 2 = 0: Therefore, the transition from two to zero positive equilibria takes place when when > 0. (As discussed above, when < 0, ∂E ∂r is monotonically increasing, which means that there is no such transition, see Figure 2A).
To obtain convenient expressions of the abovementioned equilibria, we define here the aggregate param- , the stable non-trivial positive equilibrium is given by the following equation (note that in this case α is a negative real number) When τ < α 2π the above equation is still correct, but inconvenient if used in a non-symbolic computational environment. This is because in this parameter regime arccos 2πτ α returns a complex number, with the cos() operation returning again real numbers. Therefore, in order to prevent calculations involving complex numbers in the intermediate step, we rewrite ρ 1 for τ < α 2π as (again, note that α is still a negative real number) Note that both equations are mathematically equivalent, differing only in how promptly they can be numerically evaluated. When > 0, 0 < τ < − α 2π , the second non-trivial equilibrium, which is non-stable (again, α is a negative real number), is given by Finally, as discussed above, when < 0, τ < 0 there is only one non-trivial positive equilibrium, which is stable ( Figure 2A). In this case α is a positive imaginary number. We therefore introduce another aggregate parameter ζ , which is positive and real when < 0, namely ζ = − α 2π 2 . The equilibrium radius is then given by The parameter conditions for the four possible dynamic behaviours of a 2D circular shaped cell are summarised in Table 1. These conditions determine regions of different IV τ > 4πλ a 3 2 cell behaviour, as shown in Figure 3A. The analysis reveals that, within a cell surface mechanics description, all possible cell behaviours are captured by four modes only, and that these can be distinguished based only on the two aggregate parameters, τ and .

3D spherical cell
When an equivalent analysis is performed for the 3D case, we again find that there are only four qualitatively different behaviours possible for a single cell. Moreover, these behaviours can again be fully determined by only two aggregate parameters. Below, we briefly present the outline of the analysis, leaving the details for Appendix A.

Energy function, interfacial tension and pressure
In 3D, the energy function becomes The absence of positive tension along the membrane will lead to an unpredictable cell shape, at which a * = A, and p * = P − J/ 2λ p . Region II (green area): The cell has a positive interfacial tension at the equilibrium radius. However, when the radius is sufficiently smaller than the equilibrium radius, the interfacial tension is negative. Region III (blue area): a circular cell shape is always stable. There are two possible stable equilibria (bistable regime), a positive r * = ρ 1 and r * = 0, separated by an energy maximum at ρ 2 . Region IV (pink area): the only stable equilibrium is r * = 0, i.e. cells shrink and disappear. (B) Bifurcation diagram for a 3D spherical cell, plotted as a function of the aggregate parameters μ and ν. Qualitatively, the same four regions as obtained for the 2D case are observed for the spherical cell. Again, the cell behaviour is fully determined by two degrees of freedom, here shown for the (ν, μ)-space.
where s and v are the surface area and volume of the cell, respectively. Again, five parameters describe the cellular properties: J, an energy per surface area unit due to adhesion with the surrounding medium or neighbouring cells; S, the membrane rest surface area; V , the target cell volume. The constraints are modulated by the Lagrange multipliers λ s and λ v , which weigh the relative tension contributions of actin-myosin contraction and cell deformations, respectively. Again we consider that J-values can be both positive and negative, but that negative values for the Lagrange multipliers are nonsensical. Using the surface and volume of a sphere (s = 4πr 2 and v = 4 3 πr 3 respectively), we re-write E for a spherical cell as a function of r, where R s = 1 2 S π and R v = 3 3V 4π . As before, interfacial tension and pressure can be defined as Again, the interfacial tension has a surface-areadependent and a surface-area-independent component, 8πλ s r 2 and J − 8πλ s R 2 s , respectively. As in 2D, we denote the surface-area-independent component of the interfacial tension by τ , i.e. τ = J − 8πλ s R 2 s and γ = τ + 8πλ s r 2 . (Note that the expression for τ differs from the 2D case.)

Cell equilibrium radii, their interfacial tension and stability
The first step is to determine whether the interfacial tension at an equilibrium radius is expected to be positive. As discussed for the 2D case, this does not require specific knowledge regarding the radius at equilibrium. Following a comparable analysis as performed for the 2D case, we derive in Appendix A the following condition for a positive interfacial tension in 3D: which is very comparable to the condition expressed in Eq. 14b for the 2D case.
In the next step we look at the equilibrium radii themselves. As for the 2D case, the cell radii are determined by finding the radii for which ∂E/∂r = 0, which now involves solving a quintic equation of r. In Appendix A we show that the derivative of E with respect to the radius of the cell can be written as where τ is, as previously, the surface-area-independent part of the interfacial tension, here given by J − 8πλ s R 2 s , which can be either positive or negative; and the other lumped parameters are respectively a = 4 3 πλ v , b = 8πλ s , and c = 4 3 πλ v R 3 v , with a strictly positive and b and c strictly non-negative for any reasonable parameter choice.
The first observation is that ∂E ∂r r=0 = 0 for any parameter choice. This is an important difference with the 2D case. It implies that in 3D, a cell with a radius close to zero slows down its rate of shrinkage or expansion (depending on the stability of the zero-radius equilibrium), while in 2D, the rate of shrinkage or expansion typically remains non-zero for very small radii (given by ∂E ∂r r=0 = τ ). This difference has a clear impact on the cell dynamics, and plays a role in computational implementations of CSM formalisms.
The closed-form solutions to ∂E ∂r = 0 for the 3D case are not included here because their full expressions are hardly informative. Note, however, that its numerical evaluation can be easily performed if needed.
As in 2D, the number of equilibria and their stability depend on the sign of τ . If τ is negative, there is only one stable positive equilibrium radius, whereas if τ is positive two scenarios are possible: no equilibrium at all or two non-trivial equilibria, one lower and unstable, and another greater and stable, as summarized in Figure 3B, and derived in Appendix A.
The two regimes for which τ > 0 are separated by a fold bifurcation. The transition from two to no equilibria implies that, at the bifurcation, ∂E ∂r r=r * = 0 and ∂ 2 E ∂r 2 r=r * = 0. The bifurcation lines that separate all the possible dynamical regimes can be captured by defining a two-dimensional parameter (μ, ν)-plane (see 3 2 . In this plane, the bifurcation lines are defined where f (μ) = sinh 1 3 arcsinh (μ) . Figure 3B shows the full 3D bifurcation diagram.
Thus, by calculating μ and ν, which are dependent on the parameters of the CSM, one can immediately establish (without the need for simulations) the expected behaviour of a single spherical cell. The distinguishing feature from the 2D case regards the dynamics (but not the stability) close to zero radius.

Cellular Potts model (CPM)
The CPM, a prominent member of the class of cell surface mechanics formalisms, distinguishes itself by being a lattice-based formalism. This property requires special attention on how to relate the CPM numerical descriptors to the variables of the analytical description of CSM. Specifically, it is crucial to explain how the variables p and a of the analytical description relate to the lattice-based descriptors of perimeter and area in a CPM simulation.
A cell within the CPM consists of a set of lattice (or grid) points that together define a region in a two dimensional (2D) or three dimensional (3D) lattice. In 2D, the set of lattice points (i, j) that define a specific cell are indicated by a unique identifier σ i,j . There are as many possible cell shapes as the degrees of freedom defined by the number of permutations of lattice points composing the cell σ , i.e. the bulk area n σ = ij δ σ ij ,σ (where δ is Kronecker's delta function). This is an important feature of the CPM compared to other methods that treat cells as points or centre-of-mass based entities, as it allows an indefinite freedom for cell deformation, being only limited by the spatial resolution. Although many CPM studies explicitly relate the lattice spatial unit x to a physical length (e.g. 1 μm), relevant model parameters, such as target area or adhesion energies are still often expressed in terms of the lattice arbitrary spatial unit only. To link such latticepoint-based parameters to the above analysis, as well as to understand how a change in spatial resolution affects the parameters, we here explicitly take the resolution of the lattice k = 1/ x into account. We thus distinguish between the bulk area n σ , expressed in number of lattice points, and the actual area of a CPM cell a σ , expressed in x 2 . These two areas only differ in that n σ is the number of lattice points composing the CPM cell σ , whereas a σ = n σ /k 2 is the physical area of a CPM cell, and k the factor describing the resolution of the lattice. The area a in the analytical description maps to the CPM area a σ . The explanation on how the CPM cell perimeter p σ relates to the perimeter p of the analytical description is not as simple as for the area, and is therefore deferred to the next section.
In a CPM simulation the cell shape dynamics are calculated using a Monte Carlo algorithm. In a random sampling through the lattice, for each lattice point a local shape change (σ ij → σ i j ) is considered, i.e. a protrusion/retraction of a cell into a neighbouring cell or medium. An energy change E = E(σ i j ) − E(σ ij ) is calculated which defines a probability p(σ ij → σ i j ) for that event to happen, The energy function E from which the dynamics are computed is either Eq. 1 for 2D lattices or Eq. 22 for 3D lattices, as explained previously. In Eq. 28, the parameter Y represents a yield -the ability of the membrane to resist a force, in many studies, including this one, set to zero -and T a simulation temperature which determines the extent of fluctuations. The higher the temperature, the more probable it is to observe energetically unfavourable cell shape deformations.

Contact length perimeter and contact surface area in the CPM
The CPM cell perimeter p σ does not have a straightforward connection to the perimeter p (or surface area s in 3D) of the analytical description. In fact, the CPM cell perimeter is a source of much confusion and implementation errors within the community, requiring clarification. The source of the confusion stems from the fact that there is no direct way of mapping the perimeter or surface area of a cell onto a 2D or 3D lattice (although the area (in 2D) and volume (in 3D) are readily available). When determining the (local) perimeter, one can not simply take the total edge length between neighbouring lattice points belonging to different cells, because this leads to large grid effects, with diagonally oriented edges presenting a √ 2 ≈ 41% larger edge length than horizontally or vertically oriented edges [35,42]. It could seem that the cell's discrete boundary could be approximated by a continuous description, i.e. using interpolation techniques. However, not only can these techniques be computationally expensive, they also presume that there is a clear definition of the ordered set of points belonging to the cell boundary, which, for two reasons, does not hold for the CPM. Firstly, the CPM does not keep track of "boundary sites", the boundary being solely a consequence of the set of lattice points constituting the cell, therefore not providing any ordering. Secondly, the boundary of a CPM cell is not necessarily a single simple closed curve. Rather, the fraction of lattice points occupied by the cell drops from unity to zero when passing the edge of the cell, allowing for discontinuities in the boundary at the level of the lattice site, while still capturing a single, continuous boundary at the interface level. Thus, interpolation techniques render themselves ill-defined in the context of the CPM, and can therefore not be used. In short, the perimeter p (or surface area s in 3D) does not follow directly from the lattice where cells are embedded. Nevertheless, it is an important quantity which has to be determined in CPM simulations, because both the adhesion-driven tension and the cortical tension depend on it. Therefore an alternative method has to be used to compute the perimeter in the CPM, to ensure that comparisons can be drawn between the CPM and the more general CSM. To understand how to correctly calculate it, we have to look at how these tensions are implemented in the CPM formalism. Solving the forces is done implicitly via the energy variation E caused by a local deformation (σ ij → σ i j ), which involves an effective computation of the local perimeter variations, i.e. p = p σ i j − p σ ij . Instead of calculating p in terms of a change of contour length, a local neighbourhood is defined around the lattice point σ i,j for which the state change is evaluated. Then the algorithm sweeps all the lattice points defined by this neighbourhood, summing the number of lattice points that do not belong to this cell before and after the evaluated state change to obtain a measure of the change in contact between the cell and its neighbours. Usually, a radial neighbourhood is used. Figure 4A shows neighbourhoods of different sizes. The set of lattice points (x, y) around a site (x 0 , y 0 ) that form the neighbourhood is determined by the neighbourhood radius: where r is the lowest radius that satisfies this inequality [43]. Note that often neighbourhood level rather than radius is used, based on the sequence of all possible neighbourhoods ranked by size. It is clear that while the neighbourhood is grid-based, at larger neighbourhoods the shape approximates a circle. This is quantified in Figure 4B, indicating that the area divided by πr 2 , the expected area if it were a circle, converges to unity for larger neighbourhood levels.
In the CPM the perimeter of the cell is approximated by counting for each lattice point belonging to the cell within the predefined neighbourhood all the lattice points not belonging to that cell. Therefore this computation involves two nested loops, each with the algorithmic complexity of the specific spatial dimensions, i.e., O(n 2 ) for 2D and O(n 3 ) for 3D. This approach leads to neighbouring lattice points being counted multiple times. Figure 4C illustrates this effect when a second level (or Moore) neighbourhood is being used. While for this configuration some lattice points are counted only once, others are counted up to 5 times. For a 20th level neighbourhood ( Figure 4D), there is an effective zone around the cell, with decreasing contribution to the cell's perceived perimeter at larger distance from the cell. So how can such a proxy for perimeter be related to the geometric length measure? To make the link between the perceived perimeter and the true perimeter, we consider a cell whose cell edge can be approximated by a non-curved interface at the scale of the neighbourhood, while the neighbourhood itself is sufficiently large to be considered to be more or less circular. Figure 4E illustrates for such a scenario how the perceived perimeter within the CPM is calculated. It shows for a number of lattice points, indicated in red, their contribution to the perceived perimeter, indicated in blue. The upper row illustrates the discrete process as is performed in a CPM simulation, while the lower row illustrates a continuous approximation of this process. Two important features are noteworthy. First, any lattice point whose distance to the cell edge lies within the neighbourhood radius contributes to the perceived perimeter, but the lattice points close to the cell boundary contribute more than the lattice points further away from the boundary. Secondly, each lattice point should be regarded to occupy a specific area. Given that for each lattice point this area is multiplied with the area within its neighbourhood that extends outside of the cell, the used proxy for perimeter length in the CPM, p CPM , has dimensions of area times area, or L 4 , where L indicates dimension of the quantity length. This implies that a correction factor ξ (with dimension L 3 ) is needed to scale back from this proxy with dimension L 4 to the effective length p, with dimension L. This correction factor corresponds to a scaling factor ξ that multiplies per infinitesimal perimeter length. For large neighbourhoods, in 2D, this factor converges to: as illustrated in Figure 4E. In this equation, r indicates the radius of the used neighbourhood; x gives the position of the 'lattice sites' contributing to the perceived perimeter (red dots in Figure 4E), relative to the cell edge, with negative values indicating internal and positive values indicating external to the cell; and x is varied over the interval [ 0, r + x], defining the distance to the cell edge within the neighbourhood's protruding region (i.e., the position within the blue circular segments in Figure 4E). The first integral captures the fact that contributions to the perceived perimeter are generated by lattice sites located from a distance equivalent to the neighbourhood radius inwards, up to the cell edge. The second integral captures the area of the neighbourhood extending outside of the cell. Often in CPM studies, the neighbourhood level instead of the radius is reported. Therefore, in order to determine ξ via Eq. 29, the level has to be mapped to the radius first (see Table 2). This problem is equivalent to the sum of two squares representation, and hence an analytic expression for such a conversion does not exist, neither in 2D nor in 3D, and needs to be calculated iteratively [44].
Once ξ is determined, the 2D actual geometric perimeter p can be approximated by Given that Jp equates to J CPM p CPM , it follows that for the actual adhesion energy Similar relationships for the actual geometric membrane rest length and perimeter constraint follow, namely  Reciprocally, known adhesion energies, membrane rest lengths, and perimeter constraints, respectively, can be appropriately scaled within the CPM formalism by applying the corrections In 3D, the proxy used for surface area in CPM simulations has dimensions volume times volume, or L 6 , while the required correction factor ξ to scale back to the effective surface area, with dimensions L 2 is given by The ways to scale s, J, S, and λ s between the geometric and CPM parameters in 3D are analogous to the scaling of p, P, and λ p in 2D, If one aims for a fair comparison between CPM simulations and either the analytical results shown throughout this paper, or the simulations performed with other CSMbased formalisms, the factor ξ has to be determined for the chosen neighbourhood and the parameters adapted accordingly. Moreover, without applying such a correction no direct quantitative link can be made between biophysical parameters used in CPM simulations and experimental measurements.
The analytical results given in Eq. 29 and Eq. 33 assume a perfectly circular neighbourhood. Small neighbourhoods can deviate substantially from this approximation, as shown in Figure 4B. We therefore measured the residual error in perimeter length after applying the correction factor. Figure 4F and H show the residual error for 2D and 3D, respectively, while Figure 4G and I show the same data magnified. For neighbourhoods with a radius equal or larger than 5, both in 2D and in 3D, the error remains within 6%. For small neighbourhoods, however, the errors can be large, with a 59% mismatch for a 2nd level neighbourhood (see Table 2). Good choices are the 5th level neighbourhood (0.56% error) and the 6th level neighbourhood (0% error). In 3D, the often used 3rd level neighbourhood presents a 27% error, while the 7th level neighbourhood presents a 6% error. In general it is best to use a numerically established correction factor ξ for small neighbourhoods, rather than directly applying Eq. 29 and Eq. 33 (see Table 2).
The choice of the neighbourhood is therefore very important, because it defines both the "localness" of the interfacial tension computation as well as its dependence on the grid geometry, which can be a source of angular bias in the cell's shape [35]. In general, a radial neighbourhood spanning a wider region will take more lattice points into account and will be closer to the circle (or sphere in 3D), leading to a better approximation of the perimeter. However, the neighbourhood should be sufficiently small compared to the size of the cells themselves, as with increasing neighbourhood radius the information integration becomes less local. Hence, there is a trade-off between being local, which is desired as it improves spatial resolution, and isotropy of the computation itself. Moreover, larger neighbourhoods are computationally more costly.
In addition, the same neighbourhood level/radius needs to be used in all computations involving perimeter, such as the energy change calculations stemming from adhesion as well as from perimeter conservation.
In order to prevent anisotropic bias stemming from the neighbourhood choice, one may opt to increase its radius while concurrently increasing the spatial resolution (e.g. a 20th level neighbourhood would already be very close to being perfectly isotropic for most CPM studies). This way, a good compromise between the localness and isotropy of the interfacial tension computation can be achieved.

CPM spatial resolution
In many of the previous CPM studies (e.g. [21,45]), the absolute space scale was not always deemed crucial and hence often not reported. However, due to a wider application of the CPM, including its role as a predictive tool of experimental data, it has become important to explicitly state the spatial scale (e.g. [14,46,47]). Particularly, it is often relevant to run simulations at different levels of resolution, to allow for more precision, to reduce anisotropy, or to achieve a different spatial scale of temperaturedriven fluctuations (the latter being directly linked to the lattice space scale). We therefore present here how one can easily adapt the simulation parameters to a change in the spatial resolution. After deciding on a certain scaling for each lattice point in the simulation, all parameters should be changed in such a way that the cell dynamics remain similar. This corresponds to deriving a new parametrisation, J , λ p , P , λ a , A (or J , λ s , S , λ v , V in 3D), such that the energy derivative (Eq. 5) remains equal to that prior to the scaling. When the space scale is refined by a factor k, then any 2D cell σ with a perimeter p σ and area a σ will have both a new perimeter p σ = kp σ and a new area a σ = k 2 a σ after rescaling by k. To keep the balance between the different forces as given in Eq. 5 when the spatial resolution is changed, the following equalities should hold: By substituting p σ = kp σ and a σ = k 2 a σ into Eq. 1, it follows that which, substituted into Eq. 35b gives Hence, in order to generate an equivalence with the cell dynamics before scaling, the following adjustments have to be made to the parameters: This analysis can be extrapolated to 3D, giving the following parameter correspondence: Note that the validity of this scaling does not depend on any assumptions regarding the shape of cells. Figure 5 shows simulations at different resolutions (k ∈ {1, 2, 5, 10}), with respective scaled parameters. We observe similar cell shape, perimeter length, and area, as well as the same overall patterning, namely the formation of three tissue rings driven by differential-adhesion cell sorting. Note, however, that the time-scale changes, with dynamics slowing down when the spatial resolution increases. For instance, the rounding up of initial clusters, driven by a negative energy gradient and happening on a short time-scale, is, expressed in lattice points, inversely related to the spatial scale, hence k times slower in terms of lattice points, but k 2 times slower in terms of actual geometric length ( Figure 5B, upper row). The coalescence of small clusters into larger ones happens on an even slower time-scale. This results from the comparatively lower amplitude of the temperature-driven fluctuations when the spatial resolution is increased. Many small, neutral or almost neutral fluctuations allow cells and small clusters to slowly move around, alike a random walk. Therefore, the mean distance travelled by cells, expressed in lattice points, scales with x 2 = 1/k 2 , which in actual geometric length implies a slowing down of the displacement by a factor k 3 , as shown in the simulations (see Figure 5B, remaining rows). Consequently, simulations at a 10-fold higher resolution evolve 1000 times slower, requiring 10 5 more computational time in 2D (given that the field is 10×10 = 100 times larger), and 10 6 more computational time in 3D. The computational strength of the CPM lies in the fact that an energy change E caused by a local shape change can be calculated locally and hence very efficiently, in both both 2D and 3D simulations. As a consequence, both in 2D and 3D computational time scales linearly with the number of cells. The observation that, in sharp contrast, a 10-fold higher resolution requires a 100,000 or even a million times more calculations (in 2D and 3D, respectively), shows that to obtain a reasonable and feasible time-scale for simulations, it is essential to very carefully choose the resolution (which can cause simulations to run either much faster or much slower than other CSM implementations), with increasing cell numbers inflicting much less a computational time cost.
One-to-one mapping between adhesion energies J and target perimeter P An important insight that follows from the mathematical analysis is that all possible dynamics within a CSM model can be perfectly captured by two descriptors only, which is significantly smaller than the number of parameters defining the model itself. Because of this, a degenerate mapping can be derived for different sets of adhesion energies (J) and perimeter constraints (P) that perfectly conserve the dynamics (i.e. not only in the limit close to target radius, not only for circular cells, but for any possible configuration). This result is highly relevant, since it has been claimed in the literature that the dynamics for negative J-values would be fundamentally different from simulations which use positive J-values [22]. We here show, both analytically and through simulations, that a simple scaling allows for using positive or negative values (or a mixture of both) without affecting the dynamics. The energy description for each individual cell σ in a tissue, neighbouring one or more cells σ and/or medium M can be written as follows: The term σ J τ (σ ),τ (σ ) p σ ,σ depicts the total tension associated with all the adhesion contacts between the cell σ and the neighbours σ , while the term J τ (σ ),M p σ ,M depicts the tension with the medium. The division by two captures that the energy linked to the tension between cells is shared by both cells, while this not the case for the tension with the medium [19]. Note that this division simply represents the fact that all interfacial energy contributions are only counted once. The adhesion energies J τ (σ ),τ (σ ) and J τ (σ ),M are assigned to the contacts between the two cell types τ (σ ) and τ (σ ), and between the cell type τ (σ ) and the medium M, respectively. Similarly, the variables p σ ,σ and p σ ,M denote the contact length between cells σ and σ , and between cell σ and the medium M, respectively. So the total perimeter of the cell σ is p σ = p σ ,M + σ p σ ,σ . All the other variables and parameters retain the same meaning as described before for Eq. 1. We first define an energy description with modified J-and P-values, after which we require both descriptions to be equivalent (except for a possible vertical offset in the energy landscape, which does not affect the dynamics). With offsets to the parameters, the energy description looks as follows: The equation presented above can be expanded into which is simply Because the dynamics of the system described by E σ will not be different from E σ if E σ = E σ + const., it is easily recognizable that the terms 2λ p P σ P σ + λ p P 2 σ , which do not depend on any variable, do not contribute at all to the dynamics. Therefore, in order for the dynamics of both systems to be equal, the remaining terms One can find now what P σ should become to compensate for the change in the adhesion-energies J τ (σ ),τ (σ ) : which, put in a more simplified form, gives: where J w = 1 2 σ J τ (σ ),τ (σ ) p σ ,σ + J τ (σ ),M p σ ,M /p σ is the weighted mean adhesion-driven interfacial tension. This simple expression shows the balance that has to exist to keep the dynamics the same if the adhesion energies are to be changed, negative-or positive-wise. The equation shows that in general this compensation depends on the extent of the contact length of cell σ with other neighbouring cells, i.e. p σ ,σ (this dependence is implicit in J w ), and it is therefore not possible to find a unique mapping between a change in one specific adhesion energy J τ (σ ),τ (σ ) and the perimeter constraints P σ . The reason is that if one changes a specific adhesion energy by adding J τ (σ ),τ (σ ) (J τ (σ ),τ (σ ) → J τ (σ ),τ (σ ) + J τ (σ ),τ (σ ) ), the equilibrium contact lengths may change for that cell. This changes the weights (of the weighted average J w ), i.e. p σ ,σ , and probably also the cell shape, as the cell will now have a new equilibrium partitioning of its contact lengths with the neighbouring cells.
In contrast, in the specific case that all adhesion energies are changed concomitantly there is a perfect oneto-one mapping between the change in adhesion and the change in perimeter constraint. By choosing J τ (σ ),τ (σ ) = J cell,cell to be equal for all σ , σ , and J τ (σ ),M = J cell,medium = 1 2 J cell,cell to be equal for all σ , Eq. 45 can be rewritten as In short, increasing or decreasing all cell-cell adhesion energies with a constant J cell,cell (and all cell-medium adhesion energies with half of this value) can be perfectly compensated by concurrently increasing or decreasing all perimeter constraints with the same value divided by 4λ p . CPM simulations confirm this one-to-one mapping between the change J and the corresponding change P σ needed to keep the dynamics the same ( Figure 6). Importantly, even in the case that J is sufficiently negative to change all initially positive J-values into negative J-values, the dynamics remain unaffected ( Figure 6B). In 3D the same correspondence holds, provided the obvious changes in notation are made, so the relationship is for the general case, and for the specific case of a concurrent change of all J-values with J cell,cell (but cell-medium adhesion energies with half of this value). When applying this method, it is important to realize that there is no such scaling possible for the parameter λ p . One consequence is that one cannot rescale a model which does not take a perimeter constraint into account (λ p = 0) into one which does. That is, the rescaling only works once the perimeter constraint is already taken into account. Likewise, to be explicit, rescaling the model such that P σ = 0 does not mean that perimeter constraint has been taken out of the equation. It rather implies a perimeter constraint with a rest length of zero.

Single cell simulations
Having discussed fundamental issues regarding anisotropy, perimeter and scaling, we next study whether the analytical results can be corroborated by CPM simulations. To illustrate all predicted qualitatively distinct dynamics, we performed 2D single cell CPM simulations for each of the four described regions. Given that the type of dynamics is only dependent on two aggregate parameters, τ and , we arbitrarily defined four sets of parameters, one for each of those regions, indicated in Figure 7A. The expected equilibrium area a * and perimeter p * are shown in Figure 7B.
In region I, a cell of initial radius 50 x grows rapidly in size ( Figure 7C1). In this region, the cell is expected to have a negative interfacial tension at the circular shape. Therefore, the circular shape is unstable and the area and perimeter constraints can be both fulfilled independently, while the cell takes up a complex shape. In such a case, the stable equilibrium area a * is directly defined by the cell's target area (a * = A), while the equilibrium perimeter p * is determined by an interfacial tension of γ = 0 (p * = − τ 2λ p = P − J 2λ p ). Figure 7C2 shows the time dynamics of the perimeter and radius of four cells with different initial sizes, the blue line indicating the cell ( Figure 7C1). The dashed lines indicate the expected perimeter and radius at the unstable equilibrium with circular cell shape, while the solid lines indicate the expected perimeter and radius when both constraints are fulfilled independently in a complex cell shape. After 500 Monte Carlo Step (MCS), the predicted p * = 2000 x and a * = 31416 x 2 (or r * = 100 x) perfectly match the simulations, while the cell shapes are clearly deviating from being circular.
In region II, the simulated cell shows a more isotropic shape that approaches a circular shape ( Figure 7D1). The simulated cells starting at different radii all converge to the same equilibrium perimeter and area ( Figure 7D2). While the analytical equilibrium area a * is approached very closely by the numerical simulations, the analytical equilibrium perimeter p * = 619 x deviates considerably from what is observed in the simulations (which is around 800 x, see Figure 7D2). This deviation can be explained by the membrane fluctuations that are due to the stochastic nature of the CPM update scheme. Close to region I the interfacial tension at equilibrium is only marginally larger than zero. Consequently, stochasticity can easily cause the formation of excess perimeter, causing its length to deviate from the analytical, deterministic prediction. This effect strongly depends on the simulation temperature parameter T. When this parameter is lowered, the difference between the predicted and numerical observed perimeter at equilibrium is reduced. Note, however, that low simulation temperatures can also introduce undesirable grid effects [35].
In region III, the cell dynamics exhibits bistability. In Figure 7E2, the dashed lines indicate the analytical unstable equilibrium (p * = 194 x, a * = 2995 x 2 ) and the solid lines the stable non-trivial equilibrium (p * = 482 x, a * = 18468 x 2 ), with the other stable equilibrium given by p * = 0 x, a * = 0 x 2 . Figure 7E1 shows the simulations of two different cells, one starting below the unstable steady-state (initial radius r = 30 x), the other starting above the unstable equilibrium (initial radius r = 36 x). While the larger cell increases in size, the smaller cell eventually disappears. Note that in this region there is hardly any discrepancy between the analytical and observed values, due to a much higher interfacial tension at equilibrium.
In region IV, the simulated cells exhibit a very smooth membrane, due to an even higher interfacial tension. The interfacial tension dominates over the cell's pressure at any cell size, triggering a continuous size decrease until the cell vanishes ( Figure 7F1). Both the perimeter and area concomitantly decrease until the trivial equilibrium p * = 0 x, a * = 0 x 2 is reached ( Figure 7F2).
To further determine the level of agreement between the CPM simulations and the mathematical analysis, we performed 10,000 simulations of a single cell, varying the J-value as well as the initial radius of the cell. From the analysis, we know that cells should either shrink to a zero size (in region IV, as well as too small cells in region III), or should reach an equilibrium area and perimeter. We therefore registered after 10,000 MCS whether the cell had disappeared, and, if not, what its size and perimeter had become (averaged over the last 1,000 MCS). We then plotted those measurements on top of the analytically predicted bifurcation diagram, using the corresponding τ -value as the bifurcation parameter (Figure 8), in the following way. For each cell that had disappeared, we plotted the initial perimeter and area (yellow and pink regions, respectively). The boundary between initial sizes that do or do not lead to shrinkage should correspond to the unstable equilibrium in region III (solid red line), while in region IV cells should always disappear. For each cell that did not disappear, we plotted the mean and standard deviation of the final perimeter and area (blue dots, intensity indicates fraction of cells not disappearing). Those points should correspond to the stable equilibrium in region I, II and III (solid blue line). We indeed find a close correspondence between the analytical and computational results (Figure 8), but with some remarkable differences. First, we find that region IV, in which cells are never sustained, starts at slightly lower τ -values in the numerical simulations, and that the stable equilibrium in region III lies at slightly lower values and the unstable equilibrium at slightly higher area values. Both are due to the fact that, while the analysis was done for a perfectly round cell, such a shape is not actually obtained in the CPM.  This stems from the stochastic fluctuations that form an intrinsic part of the CPM, rather than from the fact that the CPM is lattice-based (as discussed before, this only causes a small deviation, especially when using 6th level neighbourhood, as done here). Those fluctuations cause the cell to temporarily deviate from a round shape, increasing the amount of perimeter per area. As will be discussed in the next section, this causes small changes in the position of the bifurcation lines, as well as in the equilibrium area and perimeter.
Secondly, in the analysis we find a sharp transition from region I (with a negative interfacial tension for an equilibrium circular cell, leading to a non-circular cell shape) and region II (with a positive interfacial tension, in which a circular shape is preferred). In the computer simulations we find that this transition is more blurred: when the interfacial tension becomes small, the forces restoring a round shape from thermally-driven shape deformations become small as well. Hence, close to the region I/II boundary the cell shape increasingly deviates from a round shape, resulting in a smooth transition. Still, overall, those simulations present a close match between the expected perimeter and area from the mathematical analysis. It illustrates that the CPM, albeit a lattice-based, discrete and stochastic model formalism, still presents equilibrium dynamics as expected from the basics of cell surface mechanics.
Likewise, the analytical 3D bifurcation diagram ( Figure 3B) correctly predicts the single cell dynamics in 3D CPM simulations ( Figure 9A), although again small deviations are observed close to region boundaries (data not shown).

Epithelial cell packing
In the previous section we have pointed out deviations from the analytical results when cell shapes differ too much from a perfect circle (in 2D) or perfect sphere (in 3D). Clearly, within a packed tissue context such an assumption of circularity is invalid. We therefore questioned whether the analysis could be extended to noncircular shapes.
For 2D we focus on epithelial tissue. Simple epithelium is normally a cell monolayer tissue made of cells strongly adhering to each other in a plane of cell junctions. The cellular organization of the epithelium derives from the mechanical properties, particularly at the level of cell junctions. Several modelling studies have looked whether the morphological properties of epithelial tissue can be derived from simple rules describing its mechanical properties [24,25]. Here, we try to approach the characterization of epithelial cell packing using the CSM description. We explore the assumption that the behaviour of an epithelial sheet is no more than the collective behaviour of its individual cells. Due to high cell density within epithelial tissues, cells are typically densely packed and exhibit hexagonal shapes. We therefore seek how a cell behaves when it is constrained to a hexagonal shape.
Du et al. [28] numerically solved the energy function for an infinite system of hexagonal cells. Instead of writing the energy function for the hexagonal cell, we rather consider a broader class of shapes, which includes the hexagon, but also the circle, through a parametrisation which links the amount of perimeter to the amount of surface area. We do so by choosing for any cell shape an arbitrary length factor l (which could be the longest radius, the length of a rib or any other convenient choice), and then introducing the constants k p and k a , which scale the length factor l to the perimeter p and area a, respectively: For example, a hexagonal cell can be described by k p = 6 and k a = 3 √ 3 2 , which can be derived when taking for l the polygon's side length (but any choice for l will lead to the same result). Likewise, a circular cell can be described by k p = 2π and k a = π, typically taking for l the radius r.
The energy E as function of l reads as E l| J, λ p , L p , λ a , L a , k p , k a = Jk p l + λ p k p l − k p L p 2 + λ a k a l 2 − k a L 2 a 2 .
While the parameters k p and k a are constants that relate the perimeter and area with l, L p and L a are the reciprocal target lengths of the target perimeter P and area A.
The analysis from the single cell case can be easily performed for this more general case, see Appendix B, yielding The analysis shows that not only for the circular shape but for any choice of specific cell shape, all possible dynamics within the class of CSM models can be fully captured by a two-parametric bifurcation diagram, defined by the aggregate parameters τ and and determined by its perimeter/area balance. Due to the generalization, τ and , as well as the bifurcation lines between region I and II (γ (l * ) = 0) and between III and IV ( ∂ 2 E ∂l 2 l=l * = 0) become a function of the perimeter/area balance itself. The overall structure of the bifurcation diagram, however, does not change; the same qualitative results as obtained for the circular cell hold for any other cell shape, including the hexagonal shape. Figure 10A shows the bifurcation diagram specifically for the hexagonal case. Comparing it with Figure 3A indeed shows that the specific parameter values for the bifurcations change, but the overall structure does not.
We next determined whether this analytical result can indeed correctly describe cell behaviour of hexagonal cells in a multicellular context. We therefore compared the analytical predictions to published simulation results of a vertex model [25], as well as to a series of tissue simulations using the CPM formalism.

Vertex model cell packing simulations
To show that the results regarding the epithelial cell packing are a general consequence of the CSM model and not a quality particular to the CPM or any other numerical implementation, we took published data about the modelling of epithelial cell packing that uses a different mathematical formalism and compared it to our analytical predictions. For this, we used the work reported in Farhadifar et al. [25], where the authors used the Vertex model to investigate the mechanical . We predict one more bifurcation line, = 0, while in [25] region II and III are considered a single region, the so called "Hexagonal Network".
properties of epithelial cell packing. Their approach describes the CSM tensions using the same energy description as presented here, compare Eq. 1 with Box 1 in [25].
To compare with the results from [25], we introduce the correspondence between their notation and ours: J = , λ p = , λ a = K, A = A 0 and P = 0. The fact that they assumed P = 0 in their approach means that our description of the CSM holds more generally, because with such an assumption the influence of the cortical tension on the interfacial tension is always positive. The authors also introduced a normalised tension and normalised contractility (see Eq. 84), which were used to depict their biophysical parameter space.
The derivation of the conditions defining regions I-IV for this parametrization is straightforward from Eq. 52f, Eq. 52g and Eq. 52h, and is given in Appendix D. From the condition γ (l * ) > 0 one derives > − √ k a 2k p ; the condition ∂E ∂l l=0 = 0 simply yields = 0; and the condition As is shown in Figure 10B, for the particular case of the hexagon, for which k p = 6 and k a = 3 √ 3 2 , the three bifurcation lines become: By representing the bifurcation diagram of Figure 10A in the equivalent ( , )-space, we show that the analytical results are perfectly matched by the Vertex model simulations of [25] (Figure 10B): Eq. 53a, which separates the positive from the negative interfacial tension regime, matches the boundary obtained by simulation (green dots in Figure 10B, extracted from [25]). Eq. 53b, which separates the regime with a single positive stable equilibrium from the bistable regime in our analysis, is not present in their simulation bifurcation diagram. Nevertheless, when we follow our analysis, the so-called Case II of their parameter space should be located within the bistable regime. Indeed, a snapshot of their simulation reveals a huge variation in cell size, strongly suggesting bistability (see Figure 2H in [25]). Finally, Eq. 53c, which separates region III and IV, lies right on top of their numericallyderived line that separates the "Hexagonal network" and "cell vanishing" regimes. Please note that a comparable analytical effort has been published to understand the boundaries between the different observed behaviours [30]. That study, however, did not note the distinction between region II and III, and they derived an approximation of the boundary between region III and IV, while the solution presented here is exact. Consequently, unlike [30], our analytically derived boundary between region III and IV matches perfectly the numerical simulations over their full domain.
These results show that the tissue properties which were defined in [25] as "Soft Network", "Hexagonal Network" and "Cell Vanishing" perfectly correspond to region I, region II/III, and region IV, respectively, as presented in this paper. It highlights that such tissue properties are basically a manifestation of the mechanical properties of individual cells and not due to collective cell behaviour per se (except that collective cell behaviour is needed to give rise to the hexagonal cell shapes). In fact, knowledge on the expected cell shape combined with single cell analysis turned out to be sufficient to account for the whole range of observed tissue behaviour.

CPM cell packing simulations
We then asked if a comparable level of correspondence can be found between the CPM simulations and the mathematical analysis when considering a homogeneous tissue. As for the single cell case, we performed several simulations in order to illustrate and compare the different parameter spaces. Again we defined four sets of parameters, one for each region ( Figure 11A). A bifurcation diagram with the expected equilibria in each region, under the assumption of a hexagonal cell shape, is shown in Figure 11B. (For comparison, the equilibria for the circular case are shown as well.) In region I the cells in the cluster show a fringed border ( Figure 11C1). In this region, the cells are expected to have a negative interfacial tension when taking up a hexagonal shape. Consequently, the hexagonal shape is unstable, and the tissue is formed by dynamically changing, non-hexagonal cells, with tissue topology alterations by neighbour changes, during which some cells loose their contact, while others come into contact with each other (so-called T1 transitions, see [19]). In this regime, both perimeter and area constraint are fulfilled independently, the equilibrium area a * defined by the cell's target area and the equilibrium perimeter p * determined by a zero cell interfacial tension, as shown in Figure 11B, C2. The predicted p * = 2, 000 x and a * = 31, 416 x 2 (or l * = 110 x) are perfectly matched by the simulations.
In region II, the simulated cell cluster takes up a honeycomb shape, with each individual cell approaching the expected hexagonal shape ( Figure 11D1). After initiation of the simulation, all cells rapidly converge to the same equilibrium perimeter and area ( Figure 11D2). As was also observed for the single cell simulations, the observed equilibrium perimeter is larger than the expected equilibrium perimeter p * = 648 x, due to the stochasticity of the CPM update scheme. For this particular parameter choice the difference is around 20%, but it increases when the equilibrium is followed towards region I and decreases when followed towards region IV, with lower simulation temperatures yielding a much closer match.
In region III, the cells exhibit bistability ( Figure 11E1, F1). Figure 11E1 illustrates that when a simulation is initiated with cells that are larger than the unstable equilibrium (p * = 221 x, a * = 3, 509 x 2 ), the cells evolve towards the non-trivial stable equilibrium (p * = 490 x, a * = 17, 336 x 2 , Figure 11E2). In contrast, in a simulation initiated with cells below the unstable equilibrium ( Figure 11F1), most cells vanish (p * = 0 x, a * = 0 x 2 ), but some, due to stochastic fluctuations, pass the In all simulations, = 3, 675. In regions I-IV, τ = −10 5 , −2 · 10 4 , 10 5 , and 2 · 10 5 , respectively. (B) Blue lines: Analytically derived bifurcation diagram of the cell dynamics, assuming hexagonal cell shape, as a function of τ (or equivalently J), with fixed at 3,675. Orange lines: corresponding bifurcation diagram for circular cell shape. Both equilibrium perimeter p * and area a * are shown. For J > 67, 011 cells take up a hexagonal shape, so p * and a * are trivially related; for J < 67, 011, p * and a * vary independently. Solid and dashed lines represent stable and unstable equilibria, respectively. Dots refer to the equilibria at simulation parameter settings, which can be stable (blue circles), unstable with respect to size (open circles), or unstable with respect to shape (open diamonds). The bifurcation points are indicated by black dashed lines. (Note the discontinuity in the p * -axis.) (C1-G1) Snapshots of epithelial packing simulations in the different regions, as indicated. Initial cell size in (E1) was larger than the unstable equilibrium (p * = 221, a * = 3, 509), whereas in (F1) it was smaller. All snapshots are displayed at the same scale, except for (F1), enlarged by a factor 1.75. Hexagonal cells, defined as cells with exactly six neighbours, are coloured green, all others light blue. unstable equilibrium to end up in the non-trivial stable one ( Figure 11F2). Note that due to the increased interfacial tension compared to the previous simulations, fluctuations that deviate the cell shape from perfect hexagonality are much less pronounced, and hence the simulations match the analytical results more precisely.
In region IV, the cell edges are even smoother due to the high interfacial tension ( Figure 11G1). However, all cells vanish rapidly, reaching the trivial equilibrium (p * = 0 x, a * = 0 x 2 , Figure 11G2).
Overall, the predicted behaviour for a hexagonal cell closely matches the numerical CPM simulations of multiple cells, illustrating that again the tissue behaviour can be captured by CSM considerations at the cellular level. A complete overview of the parameters used in the 2D analysis, the general results for any cell shape, and the specific ones for circular and hexagonal cells, are given in Table 3.

3D cell packing
In densely-packed 3D tissues, cells take up the form of rhombic dodecahedrons (equivalent to the hexagonal shape in 2D tissues), which is the most efficient spacefilling packing as it requires the smallest amount of surface area per volume. Again we first derive a general solution for any cell shape, introducing a surface scaling factor, k s = s l 2 , and a volume scaling factor k v = v l 3 . In Appendix E we find that after introducing such a generalization for cell shape, the bifurcation lines we found previously for the spherical shape (Eq. 27) become 3 2 , and f (μ) = sinh 1 3 arcsinh (μ) . Again, we find that the bifurcation diagram does not qualitatively change due to this generalization of cell shape, and that the specific shape of individual cells only modifies the precise parameter values at which the bifurcations occur. Given that for the rhombic dodecahedron , the analytical expectation of the behaviour of 3D densely-packed tissue becomes defined by the two aggregate parameters with the bifurcation lines located at Again, when the analytical results are compared to CPM simulations of a cell tissue, we find a very close correspondence between them. Figure 9B shows 3D CPM simulations of a small tissue for each of the four possible regions. It shows the individual cells within a cluster that is not in direct contact with the medium. As for the 2D case, in region I the cells take up an irregular shape, fulfilling both the surface area and the volume requirement. In region II, the cell shape is very regular, indeed very closely corresponding to a rhombic dodecahedron. In region III, the initial size of the individual cells determines whether the cells disappear or not, while in region IV all cells eventually disappear. The boundaries between those regions of different qualitative behaviour, as observed in the 3D CPM simulations, again closely match the analytical predictions (data not shown).

Discussion
Our analytical results, in combination with computer simulations, have elucidated how the biophysical parameters of CSM models can be translated into qualitative cell dynamics. Some observations and underlying assumptions made during this process merit further discussion. To start with, in region I of the cellular dynamics, the observed behaviour is driven by a negative interfacial tension. However, it is questionable whether such a parameter regime is biologically and physically reasonable; the main issue being that negative interfacial tension implies the intermingling of neighbouring cells. This phenomenon is not observed in low resolution cell shape CSM models, which often use, for example, a single line element for the whole interface between two cells (e.g. [25]), effectively inhibiting cells from intermingling. However, in the CPM the cell's interface is resolved at much higher detail, clearly exposing the intermingling between cells (see Figure 11C1). From a biological perspective, such dynamics might be considered pathological and unrealistic, and one should be weary of using such a dynamical regime to capture normal biological processes.
On the other range of the spectrum, it is also worth discussing whether it is reasonable to assume that cells can completely disappear (i.e. shrink to zero) due to their cell surface mechanics. One could argue that the elastic description of area and perimeter conservation becomes unreasonable when deviations become large, and that while water can freely enter and leave the cell, ultimately the dry cellular mass should prevent full disappearance. Nevertheless, experimental biological studies have been showing pressure-driven apoptosis [48,49], which has been used as a possible interpretation for vanishing cells in the CPM [50]. Another explanation has been proposed by Marinari et al. [16], in a combined experimental

∂E ∂l
Energy variation per length change  and modelling study of the Drosophila notum. They showed that in a 2D CSM description of the notum an increased pressure resulted in the local disappearance of cells, which was then attributed to the process of delamination, when cells leave the epithelial plane and enter the (non-described) underlying tissue. Given that the most reasonable dynamics are found for (a large range of ) well-balanced parameter choices, it will be important for most theoretical studies which are based upon CSM concepts -including the CPM -to apply the insights derived from our study: (i) to choose parameters sensibly, (ii) to ensure model behaviour matches experimental data, and (iii) to prevent potential modelling artefacts.
Although the analysis presented here is relevant for the large class of CSM models that use the same underlying building blocks, there is obviously a wide range of different model descriptions in use to capture cell dynamics. These alternative models cannot always be easily encapsulated within the same theoretical framework. For instance, other common strategies to describe the biophysics of cells is using the so-called subcellular elements method [51,52] or by modelling of cellular material as a continuous medium taking stress and strain tensors explicitly into account [53]. These modelling strategies grant a finer description of forces, not just at the membrane, as with CSM formalisms, but also within the cell. Although this level of detail is important when studying the cell rheology, quite often the simplifications defining CSM models make them better suited for morphodynamical simulations, where part of the individual cell detail may be exchanged for the feasibility of many-cell simulations.
Finally, we argue that the proposal suggested by Ouchi et al. [22] -to use negative instead of positive Jvalues (adhesion energy constant) to solve the apparent mismatch (regarding the relationship between adhesion and cell motility) between models and experimentsfails to reconcile experiments and simulations. Using both theory and simulations, we have shown here that this limitation is not particular to the CPM, but rather present in any modelling framework that describes the strength of adhesion through an effective lower tension.

Conclusions
The analytical results presented here have numerous practical applications. First, they allow researchers to determine the expected dynamics due to interactions between volume conservation, adhesion and cortical tension in 2D and 3D CSM models. Table 4 presents a hands-on guide to be used as a protocol for determining the dynamics of any CSM study. Additionally, this study has shown how the CPM formalism fits within the realm of CSM models, and how biophysically quantitative simulations should be performed within this formalism. Table 5 provides a guide for the usage of the CPM in such a quantitative fashion. Furthermore, our analytical results directly predict the (change in) dynamics, given specific (modifications in) cellular properties, which can be used when linking simulations to biophysical experiments and predicting the experimental outcome. Finally, we have shown that the amount of possible dynamics is much more limited than the parameter space involved originally suggested. This implies that there is no unique combination of volume conservation, adhesion and cortical tension that leads to a given dynamical behaviour. One consequence is that it is impossible to derive from a single set of experimental observations a unique set of specific parameters that capture those dynamics. Using the analysis presented here, however, it becomes possible to specifically define the set Table 4 For the perplexed: how to determine the expected dynamics in any CSM model Step 1. Write the Hamiltonian or energy function in the standard form as used in this paper (i.e.
Step 2. When using CPM, first the effective values of J, and P and λ p in 2D, or S and λ s in 3D, have to be determined, or inversely the correct values of J CPM etc. have to be assigned. This is done by calculating the perimeter scaling factor ξ , which depends on the neighbourhood radius, in its turn depending on the neighbourhood level. The value of ξ can be obtained either by using the theoretical formulae Eq. 29, Eq. 33, for 2D and 3D respectively, in the case that the neighbourhood is sufficiently large, or by using the numerical estimates given in Table 2, for small neighbourhoods that can present significant deviations from those theoretical values. Transformations between the effective and CPM parameter values are then given by Eq. 31, Eq. 32 for 2D, and Eq. 34 for 3D. Note that because of the way interfacial tension is implemented in the CPM, J = ξ J CPM for cell-medium interfaces and ξ 2 J CPM for cell-cell interfaces.
Calculate k p , k a (for 2D) cq. k s , k v (for 3D) for the cell shape of interest. The values for typical shapes in 2D and 3D, namely circle and hexagon, cq. sphere and rhombic dodecahedron, are given in Table 3 and Table 6.
The expected behaviour of a single cell or tissue is given by Figure 3; the bifurcation lines which form the transitions between the different zones are given in Table 3 (for 2D) and Table 6 (for 3D).

Rescale spatial resolution
When the spatial resolution of a CPM model is k-fold increased, the required changes in the standard set of kinetic parameters are given in Eq. 38 and Eq. 39.

Resizing neighbourhood
When the neighbourhood used in a simulation is changed, the value of J CPM , and P CPM and λ p CPM in 2D, or S CPM and λ s CPM in 3D, have to be modified, such that the effective values remain the same (Eq. 31, Eq. 32, Eq. 34). This can be achieved by setting J CPM = ξ old ξnew J CPM , where ξ old and ξ new are the perimeter scaling factor before and after resizing the neighbourhood, respectively. Likewise, P CPM = ξnew ξ old P CPM , λ p CPM =  Table 4.

Concurrent rescaling of J and P
It is possible to concurrently change all J values (for example from all being positive to all being negative) in a CPM, or in fact any CSM simulation, without causing any change in the dynamics of the model (Figure 6), by means of well-chosen shifts in the membrane rest lengths of all cells (P σ and S σ in 2D and 3D, respectively). The required shifts in the rest lengths are given in Eq. 47 (for 2D) and Eq. 49 (for 3D). In contrast, it is not possible to change only a subset of the J values without causing changes in dynamics. Specifically, in such a case, it is still possible to keep for a specific configuration the weighted mean adhesion-driven interfacial tension constant (using Eq. 46 for 2D and Eq. 48 for 3D), but those weighted means are expected to change over time (for example due to cell sorting), generating an imbalance and hence a change of dynamics on the long run.
of combinations of cellular properties that could explain the observed behaviour. It enables the pinpointing of the critical relationship between cell surface mechanics and tissue dynamics, which are not immediately clear from computational studies alone.
We can use Descartes' rule of signs to determine the highest number of equilibria larger than zero (remember that r * < 0 is nonsensical). When τ < 0, there is only one sign change, limiting the number of equilibria larger than zero to at most one, while when τ > 0, two sign changes limit it to at most two.
Further insights can be obtained by looking at the second derivative of the energy E, The slope of the function ∂E ∂r at the equilibrium r * = 0 is 8πτ , implying that this equilibrium is always stable for τ > 0 (positive slope), and always unstable for τ < 0 (negative slope). (Recall that the force on the cell membrane is positive in the direction of the energy gradient steepest descent.) Moreover, because ∂E ∂r r=0 = 0 and ∂E ∂r r→∞ = ∞, a negative slope implies an odd number of equilibria larger than zero, while a positive slope implies an even number. Combined with the insights derived from Descartes' rule of signs, we can therefore conclude that when τ < 0, there is at least and at most one equilibrium for r > 0, i.e. there is always one single equilibrium. Also, this equilibrium has to be stable, since the slope at the equilibrium has to be positive. When τ > 0, there can be either no equilibria, or two positive equilibria. When there are two equilibria, the lower one will be the unstable and the higher one the stable. The next step is to determine those parameter regimes for τ > 0 without and with two positive equilibria. Both regimes are separated by a bifurcation for which we want to find a parametrisation. We only need to analyse the subset of our parameter space for which τ > 0. The bifurcation is a fold bifurcation, and the transition from two to no equilibria implies that, at the bifurcation point, ∂E ∂r r=r * = 0 and ∂ 2 E ∂r 2 r=r * = 0. Moreover, we know that r * > 0. Hence, at the bifurcation point This reduces, at the bifurcation point, the fifth and fourth order equations into a cubic and a quadratic equation, respectively. The cubic equation is a reduced cubic equation (see Eq. 64g, left-hand side). If we define , the cubic equation can be written as Given that b is strictly non-negative and a strictly positive, φ ≥ 0. We first study when φ > 0. The only positive solution of this cubic equation is then given by [54,55] λv 3 2 and f (μ) = sinh 1 3 arcsinh (μ) .
From the quadratic equation (Eq. 64g, right-hand side) one can derive the value of τ , Because both a and φ are positive, it still holds that when ν < 0, there is only one, stable positive equilibrium, while when ν > 0, the bifurcation line ν = f (μ) μ − f (μ) separates the region with two positive from the region without equilibria. Since f (μ) < μ for any positive μ, the line lies in the first quadrant of the (μ, ν)-plane. There are two equilibria as long as The different regimes are shown in Figure 3B, and the bifurcation lines are given in Table 6. Alternatively, for the specific case when φ = 0 (meaning that λ s = 0), the solution of Eq. 65 is simply Moreover, in this case the quadratic equation (Eq. 64g, right-hand side) reduces to a linear equation, allowing again to derive the value of τ at which the bifurcation occurs, where ν = τ 12a = J 16πλ v (taking into account that the specific case of φ = 0 only occurs when λ s = 0, and hence τ is equal to J) and μ = ψ = separates the region with two positive equilibria from the region without equilibria, again located in the first quadrant of the (μ , ν )-plane.

B Cell packing: derivation of the equations for the hexagonal cell
In order to derive the equations for a hexagonal cell shape, we write the equations for a general family of shapes, for which the perimeter and area can be parametrised as p = k p l and a = k a l 2 , respectively, with l being a basic length scale. Substituting a = k a l 2 , p = k p l, A = k a L 2 a and P = k p L p into the general equation E (p, a) = Jp+λ p (p − P) 2 + λ a (a − A) 2 yields E l| J, λ p , L p , λ a , L a , k p , k a = Jk p l + λ p k p l − k p L p 2 + λ a k a l 2 − k a L 2 a 2 .
The interfacial tension γ and pressure are then given by γ = ∂E ∂p = J + 2k p λ p (l − L p ) = τ + 2k p λ p l , (71a) where τ = J − 2λ p k p L p , with the same meaning as before. From Eq. 71a, it follows that the length l at which the interfacial tension is zero, is given by The derivative of the energy function E (l) is easily obtained by differentiation of Eq. 70: = k p J + 2λ p k p l − L p + 2k a l 2k a λ a l 2 − L 2 a , (73b) = k p γ − 2k a l . (73c) Thus, as before, at equilibrium zero tension implies zero pressure. The first bifurcation line can therefore be determined by combining Eq. 71b and Eq. 72: Using a same reasoning as for the circular shape (see Eq. 11b, Eq. 12b and text below it), it follows that the interfacial tension at any non-trivial equilibrium is always positive when τ > −2λ p k p L a , and negative otherwise.
The stability of the trivial equilibrium l * = 0 can be determined as follows. Evaluating the derivative of the energy function (Eq. 73c) at l * = 0 gives ∂E ∂l l * =0 = k p γ (0) = τ .
Consequently, as for the circular cell shape, the condition τ = 0 defines the second bifurcation line, with τ < 0 implying an unstable trivial equilibrium, and τ > 0 a stable one. To find the third bifurcation line requires analysis of the second derivative of the energy function, given by = k p (2λ p k p ) − 2k a 2λ a k a (L 2 a − l 2 ) + 8k 2 a λ a l 2 , (76b) = 2k 2 p λ p − 4k 2 a λ a (L 2 a − l 2 ) + 8k 2 a λ a l 2 , (76c) = k p 2k p λ p − 4k 2 a λ a L 2 a k p + 12 k 2 a λ a k p l 2 , (76d) where β = 4 k 2 a λ a k p and = L 2 a 3 − k 2 p λ p 6k 2 a λ a are aggregate parameters. We have pointed out before that the third bifurcation line can be found when the inflection point of the energy function coincides with the equilibrium cell
In the cell sorting CPM simulations, a Moore neighbourhood was used. For the first simulation, k = 1;

D Notation correspondence to Farhadifar et al. (2007)
In the paper of Farhadifar et al. [25] a same CSM energy description was used to simulate epithelial cell packing.
In order to make a straightforward comparison, we introduce a mapping to their notation. In their study, L p = 0, and consequently τ = J. The parameter equivalences are The authors also define the composite parameters normalised tension and normalised contractility , which are used to depict their biophysical parameter space. These are defined as: can be expressed as a function of : The bifurcation line defining negative or positive interfacial tension is given by τ = −2k p λ p L a . Given that L p = 0, both τ = J and = J, and thus this condition can be expressed in their notation as For the hexagon this yields which was also derived in [30]. Because L p = 0, the bifurcation line at τ = 0 is simply given by = 0. Using the derivation in the previous section, the third bifurcation line is given by which for the hexagon yields The interfacial tension and pressure are then given by γ = ∂E ∂s = J + 2k s λ s (l 2 − L 2 s ) = τ + 2k s λ s l 2 , (91a) where τ = J − 2k s λ s L 2 s . The first step is to find the cell size l for which the interfacial tension is zero: Because zero tension implies zero pressure, the first bifurcation line is given by