547
Views
0
CrossRef citations to date
0
Altmetric
Research Articles

Dynamical systems of self-organized segregation

&

ABSTRACT

We re-consider Schelling’s (1971) bounded neighborhood model as put into the form of a dynamical system by Haw and Hogan (2018). The aim is to determine how tolerance can prevent (or lead to) segregation. In the case of a single neighborhood, we explain the occurring bifurcation set, thereby correcting a scaling error. In the case of two neighborhoods, we correct a major error and derive a dynamical system that does satisfy the modeling assumptions made by Haw and Hogan (2020), staying as close as possible to their construction. We find that stable integration is then only possible if the populations in the two neighborhoods have the option to be in neither neighborhood. In the absence of direct movement between the neighborhoods, the problem is furthermore equivalent to independent single neighborhood problems.

1. Introduction

Segregation is a phenomenon in which people are separated based on factors such as ethnicity, religion or political preferences. It can manifest in a number of domains of everyday life, like residential districts, schools or workplaces, to name a few. In this paper, we focus on residential segregation in a district of a city. The broad mathematical definition of segregation that we use and the principles governing it, however, can be applied to a multitude of seemingly disparate situations, reaching as far as the division of members within a political party.

While residential segregation may be perceived by many as a phenomenon of the past, segregation still exists today in the twenty-first century. In some locations, patterns of segregation remain similar to the patterns of segregation of the past. For instance, the South African city of Chatsworth was created to segregate Indians during the Apartheid and even now, in the twenty-first century, Indians continue to make up the majority of Chatsworth’s population (Jones, Citation2019). In the USA, from 1980 to 2000, white and black segregation declined while Asian and Pacific Islander segregation increased (Iceland, Citation2004).

Segregation has several adverse effects on specific populations for both individuals and groups. Multiple studies (De la Roca et al., Citation2014, Williams and Collins, Citation2001, Cutler & Glaeser, Citation1997, Lewis, Citation1966) have provided empirical evidence that segregation, combined with poverty, leads to a multitude of problems. These problems include the reduction of contact of individuals with positive role models, increased exposure to violence, the promotion of a “culture of poverty,” aggravation of differences in individual socioeconomic status and differences in educational and employment opportunities. Studies also show that residential segregation has negative effects on the educational attainment and labor participation of minority ethnic groups and immigrant groups (Bolt, Citation2009, Musterd, Citation2003). Results from a study based in the Netherlands (Fajth & Bilgili, Citation2020) show that the physical concentration of non-western immigrants in a neighborhood leads to lower social integration, evidenced by lowered numbers of social ties between non-western immigrants and their native Dutch neighbors in such areas.

Residential segregation may sometimes have positive effects on specific categories of people. According to Bolt et al. (Citation2010), the clustering of ethnic minorities in a neighborhood may provide a sense of security, well-being and identity, particularly for individuals unable to speak the language of their host country. While discussing the larger impact of segregation on the societal level is beyond the scope of this paper, it is undeniable that segregation has significant effects on the lives of individuals of certain populations. This draws attention to the need to understand the causal factors of segregation in order to tackle the problem.

A possible approach to explaining these causal factors is to find out to what extent a mathematical analysis can answer the sociological questions posed in (Schelling, Citation1971, Haw & Hogan, Citation2018) and in (Haw and Hogan, Citation2020). Our aim is to determine how tolerance can prevent (or lead to) segregation. While the directions of research of sociological phenomena are ultimately decided by sociology, we will see that mathematics allows us to fully understand certain aspects of a model. In particular, with mathematics, we can determine whether a model is simply unsuitable for predicting real-world phenomena: already what we mean by “unsuitable model” can be phrased in purely mathematical terms (see below). The mathematical field that describes dynamic behavior like segregation is known as the theory of dynamical systems.

1.1. A dynamical systems approach to segregation

Thomas Schelling introduced the principles for a dynamical system of location-based segregation in his 1971 paper “Dynamic models of segregation.” This approach is particularly important as it captures the dynamic nature of segregation. Schelling’s Bounded Neighborhood (BN) model allows us to study unorganized segregation in bounded districts. In place of Schelling’s term “unorganized segregation,” however, we take up the suggestion of Haw and Hogan (Citation2020, footnote on p. 221) and speak of self-organized segregation. Self-organized segregation is driven by individual preferences (Schelling, Citation1971, p. 143) and stands in opposition to “organized segregation” like the South African Apartheid, which is segregation that occurs as a result of the practices of organizations and institutions.

Schelling’s BN model enables us to study how individual incentives and perceptions of differences can lead to collective segregation (1971, p. 145). The model describes a residential district (or a school, church, political party, etc.), referred to as a BN, in which the individuals are divided into two populations. For the case of a single neighborhood with two populations Schelling (Citation1971) used the example of a black population and a white population. To emphasize the far-reaching applications of our study, we follow (Haw & Hogan, Citation2018, Citation2020) and refer to X–population (majority) and Y–population (minority). The respective population densities X(t)0 and Y(t)0 are non-negative and depend continuously on time. The fact that X is a density implies that 0XXmax=1. For the Y–population we deviate from this and choose kYmax=Xmax=1, with k1 taking the role of a ratio parameter k=XmaxYmax between majority and minority. This makes the population ratio a parameter of the system that can be prescribed at will. As a result, the density of the Y–population satisfies 0YYmax=1k. We use the term “composition” to refer to a particular combination of the X– and Y–population densities in the BN. We refer to a BN with only one population type residing in it as “segregated” and to a BN with both population types as “mixed.”

The BN model focuses on how individual choices based on discriminatory behavior can lead to segregation. Individuals from each population type possess characteristics or features that are distinct from those of individuals from the other population type. Based on these differing characteristics, individuals of each population type show discriminatory individual behavior toward individuals of the other population type. Here, we use Schelling’s definition of “discriminatory” as “an awareness, conscious or unconscious, of sex or age or religion or color or whatever the basis of segregation is, an awareness that influences decisions on where to live, whom to sit by, what occupation to join or to avoid, whom to play with or whom to talk to” (Citation1971, p. 144). Individuals take into consideration the ratio of the two population types when deciding whether to move into, out of or remain in a BN. Their tolerance is built into the model in the form of tolerance limits, the maximum ratio of the two populations that they tolerate in a BN for them to continue residing in the BN or to move to the BN. These tolerance limits are in turn allocated to each population type via tolerance schedules. It is based on the tolerance schedules that the compositions of a BN changes over time as individuals move in and out or remain, thus allowing us to study the dynamics of the neighborhood and how self-organized segregation comes about. We further explain this in detail in § 2.1.

In the BN model, there exist two locations where individuals from the two populations can reside in – the BN and a location in which their population type “predominates or […] does not matter” (Schelling, Citation1971, p. 167). Throughout this paper, we refer to the latter as “the reservoir.” The BN is preferred over the reservoir by every individual, and individuals do not take into consideration the ratio of the two populations in the reservoir. Note that we do not follow Haw and Hogan (Citation2020) who use one reservoir for each population and consider both reservoirs to be segregated. Rather, we think of the reservoir as being the rest of the city. It does not include the BN, and we are not interested in the dynamics of the reservoir. The model is only concerned with the dynamics of the neighborhood.

Upon parameter variation, the BN model may display foldFootnote1 bifurcations. At such a bifurcation a mixed stable equilibrium and a mixed unstable equilibrium meet and vanish – or are both born, depending on the direction of the parameter variation. In the bifurcation set of (in § 2.3 below) this happens when crossing the green surface. In section 2, we explain this bifurcation set, which gives us a more comprehensive understanding of the model when predicting the complex social phenomenon of segregation. The fold bifurcations turn out to be organized by (dual) cusp bifurcations at the red line in , where the two parts of the green surface meet to form a cusp. These parts of the bifurcation set form the discriminant set of a cubic polynomial already obtained in (Haw & Hogan, Citation2018), the polynomial (8) governing all mixed equilibria. The explanation of these bifurcations is facilitated by the surface in , which is the bifurcation diagram detailing how the mixed stable equilibrium born in a fold bifurcation moves over to a second mixed unstable equilibrium to vanish in a second fold bifurcation. The bifurcation set in is completed by a blue surface where the remaining mixed unstable equilibrium merges with the empty neighborhood.

Figure 1. The dynamics of the BN model. Left: the bifurcation set consists of two surfaces separating the three open regions of parameters with the three different behaviours of the system. The smooth surface, blue, is given by (4) and the cuspy surface, green, is the discriminant set given by (9). At the red line, the dark green and light green parts of (9) meet to form a cusp. Right, bottom: the two segregated equilibria (1,0) and (0,1k), both red, attract every point except for the ones on the line separating their basins of attraction and the points on that line tend to the empty equilibrium (0,0), blue. The parameter values (a,b,k)=(2,0.2,2) are between the smooth surface given by (4) and the two co-ordinate planes {a=0} and {b=0}. Right, middle: a saddle, green, has detached from the origin and its stable manifold now separates the basins of attraction of the two segregated equilibria, which still attract all other points. The parameter values (a,b,k)=(2,2,2) are between the smooth and the cuspy surface. Right, top: there is a third attracting equilibrium, red, which is mixed. The two segregated equilibria have basins of attraction bounded by the stable manifolds of the two saddles, both green. These lines also form the boundary of the basin of attraction of the attracting mixed equilibrium. The parameter values (a,b,k)=(2,8,2) are in the third open region of the bifurcation set, the one bounded only by the cuspy surface given by the discriminant (9).

Figure 1. The dynamics of the BN model. Left: the bifurcation set consists of two surfaces separating the three open regions of parameters with the three different behaviours of the system. The smooth surface, blue, is given by (4) and the cuspy surface, green, is the discriminant set given by (9). At the red line, the dark green and light green parts of (9) meet to form a cusp. Right, bottom: the two segregated equilibria (1,0) and (0,1k), both red, attract every point except for the ones on the line separating their basins of attraction and the points on that line tend to the empty equilibrium (0,0), blue. The parameter values (a,b,k)=(2,0.2,2) are between the smooth surface given by (4) and the two co-ordinate planes {a=0} and {b=0}. Right, middle: a saddle, green, has detached from the origin and its stable manifold now separates the basins of attraction of the two segregated equilibria, which still attract all other points. The parameter values (a,b,k)=(2,2,2) are between the smooth and the cuspy surface. Right, top: there is a third attracting equilibrium, red, which is mixed. The two segregated equilibria have basins of attraction bounded by the stable manifolds of the two saddles, both green. These lines also form the boundary of the basin of attraction of the attracting mixed equilibrium. The parameter values (a,b,k)=(2,8,2) are in the third open region of the bifurcation set, the one bounded only by the cuspy surface given by the discriminant (9).

Figure 2. Left: bifurcation diagram showing saddles (green) and stable nodes (red) meeting at the fold line (blue). The surface has 0X1, ends at b=1 and we only draw it for k1. Right: for better orientation the bifurcation diagram is rotated to a top view. The saddles are no longer shown (otherwise all of b1 would be green) and the fold line forms a cusp at (k,b)=(3,9) where it is projected along its vertical tangent.

Figure 2. Left: bifurcation diagram showing saddles (green) and stable nodes (red) meeting at the fold line (blue). The surface has 0≤X≤1, ends at b=1 and we only draw it for k≥1. Right: for better orientation the bifurcation diagram is rotated to a top view. The saddles are no longer shown (otherwise all of b≥1 would be green) and the fold line forms a cusp at (k,b)=(3,9) where it is projected along its vertical tangent.

From the BN model one can construct a model of segregation in two neighborhoods. Discarding the reservoir (think of a small city with only two districts) the X– and Y–populations move between the two neighborhoods according to their preferences based on tolerance schedules. This is the Two Neighborhood (TN) model of (Haw and Hogan, Citation2020) treated in section 3. We show in § 3.1 how to correct the dynamical system of (Haw and Hogan, Citation2020) to fully represent the TN model. In § 3.2, we then find that all points where the tolerance schedules of neither neighborhood are exceeded form an equilibrium – leading to lines and seas of equilibria for large parts of the parameter space (see in § 3.2 below). This makes the model unsuitable for purposes of modeling real-life phenomena, where “unsuitable” has a precise mathematical meaning to be explained now.

Mathematical systems use mathematical concepts to describe real-life phenomena and to predict their behavior. They are rough approximations of the exact relationships describing phenomena in the real world, after focusing on only a single ingredient of the complex relationship between the variables. The BN model describes how segregation in a BN changes as individual tolerances change. It is an abstraction of the particular relationship between individual discriminatory behaviors and segregation, taking into account the tolerances of individuals, while in reality, multiple other factors affect segregation as well. We do not expect the BN model to describe the real world (or parts of it) in an exact fashion. In reality, the real-world system of segregation can be understood as being governed by the model’s equations plus “small” changes and uncertainties. In deterministic systems like the ones used in (Haw & Hogan, Citation2018, Citation2020) the latter are given by small perturbations. Such perturbations should be sufficiently small, so small that the dynamics remains qualitatively unchanged under perturbation. The mathematical term for this concept is that of structural stability. Thus, having a structurally stable dynamical system as a model is a necessary condition for drawing conclusions on the real world. This is not possible if the system is not structurally stable. We explain the important concept of structural stability in more detail in § 2.3 below, but found it helpful to already here emphasize its impact.

David Haw and John Hogan explicitly formulated the BN model as a dynamical system in their 2018 paper “A dynamical systems model of unorganized segregation.” In their 2020 paper “A dynamical systems model of unorganized segregation in two neighborhoods,” they extended their 2018 dynamical system to create the TN model of self-organized segregation in two neighborhoods. In the present paper, we examine both their 2018 and 2020 papers with the following aims.

  1. Perform an in-depth analysis of the 2018 single neighborhood model by assessing in particular the structural stability of the system. Prior to our analysis we additionally correct the scaling used in (Haw & Hogan, Citation2018) – which, as it turns out, has little consequences and qualitatively the error does not matter.

  2. Correct the differential equations for the two-neighborhood model as put forward in (Haw and Hogan, Citation2020). We conclude that Haw and Hogan’s two neighborhood model is not a suitable model as it leads to dynamical systems that are not structurally stable.

  3. Examine the physical implications of the results obtained from both models. We summarize the predictions made possible by the models and point out how stable integration – in the form of trajectories toward a mixed equilibrium – becomes a possibility.

  4. Demonstrate that the two-neighborhood model with a reservoir population – as briefly presented in (Haw and Hogan, Citation2020) – is equivalent to two independent cases of the single-neighborhood model. We also show that the models have no periodic dynamics.

Our mathematical analysis confirms that the dynamics of the BN model is governed by the two surfaces (4) and (9) shown in the left part of (in § 2.3 below). We furthermore detail the bifurcations that occur when the parameters are moved across (9) and how this is governed by the surface shown in . Our analysis also points out that the reformulation of Haw and Hogan’s (Citation2020) TN model as a dynamical system has non-isolated equilibria, thus preventing structural stability. To achieve aim 3, we detail what our results suggest about how segregation based on individual preferences – in particular on discriminatory behavior – comes about. Allowing a reservoir population in the TN model is proposed as an extension in (Haw and Hogan, Citation2020, p. 245) and we use the discussion section to show how this simply reduces the TN model to two independent BN models. Prior to this we use the Poincaré index to rule out periodic orbits.

1.2. Outline

We treat two different models – the single neighborhood (BN) model and the two neighborhood (TN) model. These were both introduced in the present section, but for the model itself and for the analysis we use separate sections for each model. Section 2 introduces the basic concepts and theory behind the BN model and presents an overview of the dynamical system. In this section, we also correct the scaling in Haw and Hogan’s (Citation2018) paper. Furthermore, we provide an in-depth qualitative analysis and present our results. Section 3 contains our evaluation of Haw and Hogan’s (Citation2020) TN model. We argue that the given mathematical formulation inaccurately depicts their described model. We consequently introduce a new formulation and perform a qualitative analysis of the system. It turns out that Haw and Hogan’s TN model describes dynamical systems that are not structurally stable. Discussion of the two models and interpretation of the results of our analysis are presented together in section 4. We also prove that both models have no periodic orbits and touch upon Haw and Hogan’s (Citation2020) two-neighborhood model with a reservoir population. Lastly, in section 5, we suggest possible areas of improvement for the single- and the two-neighborhood model and future directions of research.

2. Population dynamics in a bounded neighborhood

To keep the present paper self-contained, we re-build the dynamical system of (Haw & Hogan, Citation2018) from scratch, also explaining the meaning of the model parameters. We re-do the scaling and explicitly show that reducing the model size from three parameters to two parameters is indeed possible. Our analysis unveils that the model leads to a structurally stable dynamical system – despite the empty neighborhood being a degenerate equilibrium. We also detail the bifurcation diagram, explaining how the BN model can exhibit stable mixed equilibria.

2.1. Model

Individual discriminatory behavior is translated into segregation via tolerance schedules, which are specific to each population. The tolerance schedule of a given population is a function that keeps track of how many members of the population can tolerate how many members of the other population.

The upper limits for the X– and Y–populations are denoted as Xmax and Ymax, respectively. We have Xmax=1=kYmax for the ratio parameter k1 – the ratio of majority to minoriy – and accordingly have Ymax=1k. For k>1 the Y–population is in minority. The phase space (in which the dynamics takes place) is the rectangle

P=(X,Y)R20X1,0Y1k

with boundary

P=(X,Y)P X=0orX=1orY=0orY=1k.

The dynamics in P is derived from the tolerance schedules RX of the X–population and RY of the Y–population. These describe the maximal ratios YX and XY abided by the X– and Y–populations, respectively (Schelling, Citation1971). Thus, for

RX(X)=YXandRY(Y)=XY

the system is in equilibrium. Taking linear tolerance schedules

RX(X):=a(1X)andRY(Y):=b(1kY)

with tolerance parameters a and b, this results in the parabolas

(1) Y=aX(1X)andX=bY(1kY).(1)

The X–population increases inside the first parabola, i.e. for

(X,Y)(X,Y)PY<aX(1X)

and decreases outside. Similarly, the Y–population increases inside the second parabola and decreases where X>bY(1kY).

2.1.1. Dynamical system

The simplest way to encode this as a dynamical system is to put

(2a) X˙:=dXdt=aX(1X)Y(2a)
(2b) Y˙:=dYdt=bY(1kY)X.(2b)

This makes the empty neighborhood (X,Y)=(0,0) an equilibrium. However, the dynamics of (2) does not leave the phase space P invariant. Indeed, for X=0, the right-hand side of (2a) is negative (or zero, at the equilibrium) resulting in members of the X–population moving out of the BN without living there in the first place. Similarly, the vector field (2) points downward at the boundary part {(X,Y)P|Y=0} of the phase space. Note that (2) does point into P at the remaining boundary points where X=1 or Y=1k.

This problem is remedied by Haw and Hogan (Citation2018) who multiply the right-hand sides of (2a) by X and of (2b) by Y, respectively, hence adjusting the dynamical system (2) to

(3a) X˙=X[aX(1X)Y]=:f(X,Y)(3a)
(3b) Y˙=Y[bY(1kY)X]=:g(X,Y).(3b)

This makes the axes {(X,Y)P|X=0} and {(X,Y)P|Y=0} invariant, in particular the corner points (X,Y)=(1,0) and (X,Y)=(0,1k) become equilibria.

2.1.2. Preliminary analysis

The Jacobian of the vector field (3) reads as

J(X,Y)=2aX3aX2YXY2bY3kbY2X.

Hence, the X–segregated equilibrium (X,Y)=(1,0) is attractive as J(1,0) has negative eigenvalues a and 1. Similarly, J(0,1k) has negative eigenvalues 1k and bk whence the Y–segregated equilibrium (X,Y)=(0,1k) is attractive as well.

However, the adjustment from (2) to (3) has the drawback of turning the empty (X,Y)=(0,0) into a degenerate equilibrium. Indeed,

J(0,0)=0000

and one has to go to second order to conclude that this equilibrium is repelling for ab>1 for the product of the tolerance parameters. In (below in § 2.3), we can see that for ab<1, the points (X0,Y0) on a line in the phase space are attracted to (0,0), i.e. there are initial compositions (X0,Y0) for which everybody moves out of the BN. Thus, the qualitative behavior changes when crossing the surface

(4) ab=1.(4)

All other equilibria are still obtained by intersecting the parabolas given in (1). In the cases where this results in a single mixed equilibrium (Xe,Ye) we have

detJ(Xe,Ye)<0

and this yields a saddle point, see again . As ab1 the saddle (Xe,Ye) moves toward (0,0) and reaches it at ab=1. The remaining phase portrait in has three mixed equilibria and in this case one of them is attractive, while the other two are again saddles. We continue our analysis in § 2.3 and first discuss a possible simplification of the BN model.

2.2. Scaling

Haw and Hogan (Citation2018) propose the scaling

(5) (t,X,Y)(tˆ,Xˆ,Yˆ)=(at,X,1aY)(5)

to remove the tolerance parameter a in (3), at the cost of replacing the tolerance b by β=ab and the population ratio k by α=ak. However, this would turn the Jacobian J(1,0) from

a101into1101

and thus change the ratio of eigenvalues from a to 1, while a scaling like (5) is only able to globally scale the eigenvalues (by a, the time reparametrization) but not their ratio. Indeed, the scaling (5) turns (3) into

(6a) X˙=X[X(1X)Y](6a)
(6b) Y˙=Y[bY(1kY)1aX]=Ya[βY(1αY)X](6b)

where we have dropped the hats. The extra factor 1a in (6b) is inevitable as now the Jacobian J(1,0) has eigenvalues 1 and 1a, so their ratio does remain equal to a.

2.2.1. Topological equivalence

The extra factor 1a does not influence the nullclines, whence in particular the equilibria of (3) have positions independent of the value of the tolerance a — provided that we replace the tolerance b by β and the ratio k by α. In fact, the dynamical systems (3) and

(7a) X˙=X[X(1X)Y](7a)
(7b) Y˙=Y[abY(1akY)X](7b)

are topologically equivalent, i.e. qualitatively they display the same dynamics. To prove this we switch to a better suited notation. Let x=(X,Y)P denote the vector of possible states whence

h:PR2x(f(x),g(x))

is the vector field in (3). The flow φt of x˙=h(x) collects all solutions: for every xP and all times tR with φt(x)P we have that

ddtφt(x)=h(φt(x)).

The trajectory of xP under φt is the curve {φt(x)|tR}. In cases where going backward in time leads out of the phase space P we simply restrict time to tτx with τx0 the time at which the trajectory passes through P. Similarly, let ψt denote the flow of (7).

Proposition 2.1

There is a homeomorphism η:PP that provides a topological equivalence between the flow φt and the flow ψt.

Note that fixing a, b and k in (3) yields exactly the values ab and ak in (7b), for which the two systems have the same nullclines.

Proof. To construct the homeomorphism η, a bijective mapping that is continuous and for which the inverse η1 is also continuous, we use the so-called stable manifolds

Ws(xe):=xPlim tφt(x)=xe

of the equilibria xe, consisting of the points attracted to xe. For an attractive equilibrium, this is the basin of attraction, and for a saddle, this is a line separating two basins of attraction. For sufficientlyFootnote2 small radius δ>0 let Wδs(xe):={xWs(xe)|xxe∥=δ} where x∥=X2+Y2 is the Euclidean length of x=(X,Y). For a saddle, Wδs(xe) consists of two points, one on each arc of the stable manifold, while for the segregated equilibria, Wδs(xe) is a quarter-circle.

For xP, we measure the time τ(x)R it takes until the trajectory of x intersects one of the sets Wδs(xe). We then use the flow ψt to move by t=τ(x) back in time (or forward if τ(x) is negative), i.e. we let

η(x):=(ψτ(x)φτ(x))(x)=ψτ(x)(φτ(x)(x)).

Outside of the equilibria this defines a diffeomorphism (differentiable with differentiable inverse). By also mapping each equilibrium to itself, we obtain a homeomorphism on all of P for which ψtη=ηφt, i.e. which conjugates the two flows. On trajectories leaving P at some negative time we then rescale time to ensure that η maps P to P.□

In case there are periodic orbits, the period cannot be changed by a conjugating homeomorphism. Originally, this was the reason to weaken the concept of topological conjugacy to that of topological equivalence by allowing for time to be rescaled along the trajectories. Then, occurring periodic orbits can be mapped to each other as well.

2.2.2. The discriminant set

Substituting the first parabola of (1) into the second parabola yields a quartic expression that vanishes at X=0 (the equilibrium at the origin) and at the roots of the cubic

(8) a2bkX32a2bkX2+ab(1+ak)Xab+1(8)

(the mixed equilibria). Since omitting the global factor 1a in (6b) does not change the stability type of equilibria, all sections of the bifurcation set with constant tolerance a=const. of the majority (see in § 2.3 below) are scaled versions of each other. In particular, the computation

(9) b±=9ak2a2k2±2ak(ak3)3a(4ak)(9)

by Haw and Hogan (Citation2018) remains valid and still yields the discriminant set of (8) separating the region with 6 equilibria from the region with 4 equilibria, see again . For (a,b,k) in the discriminant set of (8), i.e. satisfying the EquationEq. (9), the dynamical system (3) has 5 equilibria and a fold (or saddle node) bifurcation takes place, except when additionally ak=3 where the saddle undergoes a dual cusp bifurcation (see § 2.4 for more details).

The reduction from three parameters a,b,k to two parameters α,β simplifies the mathematics, but the interpretation of the product ab=α of tolerances and of the product ak=β of tolerance times population ratio is not obvious. For this reason, we avoid in the sequel the use of α and β and keep referring to the tolerances a,b and the ratio k. Note that the systems of Eqs. (6) and (7) do coincide if we put a=1, thus making b a kind of relative tolerance parameter.

2.3. Analysis: structural stability

One of the key concepts in the theory of dynamical systems is the concept of structural stability. A vector field h is said to be structurally stable if for all sufficiently small perturbations hε of h the flows φtε of hε and φt of h are topologically equivalent, i.e. they describe qualitatively similar dynamics.

Having a structurally stable system ensures that the mathematical results obtained for the model under consideration are applicable to the real world. Indeed, it is improbable that the real-world phenomenon of self-organized segregation is exactly described by the vector field (3). If the dynamics of (3) is structurally stable, then for all small perturbations the dynamics behave similarly. It suffices therefore that (3) is close enough. To keep the model’s predictions in line with what is observed in real life, however, one has to improve the model where the phenomena discovered in the dynamics of (3) differ from what is observed in the real world.

Unfortunately, the degeneracy of the equilibrium x=(0,0) prevents the vector field h=(f,g) in (3) from being structurally stable. Merely subtracting ε(x,y) from h yields hε(x,y)=h(x,y)ε(x,y) for which the empty BN (0,0) has the Jacobian

Jε(0,0)=ε00ε

making this an attractive equilibrium for any ε>0 and thus preventing a topological equivalence between φtε and φt. This can be remedied by removing a small disk Dδ:=Dδ(0,0)={xP|x∥<δ} from the phase space P (or rather a quarter-disk). For ab>1 the resulting phase space Pδ:=PDδ is still invariant under the flow. If additionally (a,b,k) does not satisfy (9), then all remaining equilibria are non-degenerate. The segregated BN (1,0) and (0,1k) are attractive, and in the interior of Pδ, there is a saddle and possibly another saddle and a mixed attractive equilibrium.

This mixed stable equilibrium x=(Xe,Ye), if it exists, is obtained by solving the cubic EquationEq. (8), which yields unwieldy formulas. However, the qualitative behavior is independent of the exact position of this equilibrium. The boundary of the basin of attraction consists of the stable manifolds of the two saddles.

Proposition 2.2

Let [0,1]λ(a(λ),b(λ),k(λ)) be a curve in parameter space that does not cross (4) or (9). Then, for all parameter values on this curve the vector fields (3) are topologically equivalent.

The proposition is formulated in such a way that the situation near the degenerate equilibrium x=(0,0) is included and the proof takes place on the original phase space P.

Proof. It suffices to show that the vector fields for λ=0 with flow φt and λ=1 with flow ψt are topologically equivalent. The equilibria and their stability types coincide, and so we can simply adapt the proof of proposition 2.1. The initial diffeomorphism becomes η(x)=ψτ(x)(ξ(φτ(x)(x))) where ξ maps the sets Wδs(xe), xe the equilibria of φt, to their counterparts in the flow ψt. Mapping each equilibrium of φt to its counterpart in ψt yields the conjugating homeomorphism. Then, time is again rescaled along trajectories that cut through P to ensure that P is mapped to P.□

In particular, for ab>1 the dynamical system (3) is structurally stable on Pδ with δ sufficiently small (as explained below), if (a,b,k) does not satisfy (9). The two possible phase portraits are given in . Also, all systems (3) with ab<1 have equivalent flows on P, represented by the third phase portrait in .

The discriminant set (9) defines the cuspy surface in , and EquationEq. (4) defines the smooth surface asymptotic to the (a,k)–plane and to the (b,k)–plane. This confirms that gives a faithful representation of the possible dynamics defined by (3), except for the dynamics at the bifurcations. The bifurcation at (4) concerns the degenerate equilibrium (0,0). We follow Haw and Hogan (Citation2018) and do not further consider this empty BN.

When determining the “sufficiently small” size δ of the disk Dδ to be removed from P to obtain Pδ, we have to make sure that the saddle xe does not lie in that disk – all while xe converges to (0,0) as ab1. To achieve a uniform size δ, we need a uniform bound ββ0>1 on β=ab. Here, we are allowed to choose β0=1.5 or β0=1.05 or whatever fixed β0>1 we find suitable.

2.4. Results: bifurcations

A bifurcation occurs where the system loses structural stability. As shown in § 2.2, we may fix the value of the tolerance a of the majority and so for simplicity’s sake we set a=1. Then β=b is the (relative) tolerance of the minority and α=k is the population ratio, while (4) turns into the equation b=1, so we restrict to bb0>1. The discriminant (9) yields a cuspy curve in the (k,b)–plane {a=1,bb0} with apex (k,b)=(3,9). Here, a dual cusp bifurcation takes place that organizes the whole bifurcation set, see proposition 2.3 below. Fixing a=1 the dynamical system (3) becomes

(10a) X˙=X[X(1X)Y](10a)
(10b) Y˙=Y[bY(1kY)X].(10b)

Next to the empty BN and the two segregated BN there are 1–3 equilibria that are the solutions of a cubic equation, which at the discriminant set defined by

(11) b±=9k2k2±2k(k3)34k(11)

turns from having one real solution to having three real solutions.

Proposition 2.3

At (k,b)=(3,9) the dynamical system (10) undergoes a dual cusp bifurcation.

In particular, when crossing one of the two lines (11) originating from (k,b)=(3,9) a fold bifurcation takes place where a saddle and a node meet and vanish. That (10) undergoes a dual cusp bifurcation and not a cusp bifurcation means that the third mixed equilibrium (next to the two saddles) is an attractive equilibrium.

Proof. Dividing the cubic (8) by 6bk (recall that we restricted to a=1) and translating XX+23 puts (8) into the standard form

16X3+μX+ν

of a cubic, with

μ=16k118andv=16bk118k+181.

For a negative coefficient of X3 we would have the cusp bifurcation of (Guckenheimer & Holmes, Citation2002, Figure 7.1.1), but since 16>0, we have the dual cusp bifurcation instead. In particular, two of the equilibria meet and vanish in a fold bifurcation along paths crossing (11)□

Omitting the tolerance parameter a in the picture of the bifurcation set (by putting a=1) allows us to draw a bifurcation diagram where we choose the majority X as the vertical co-ordinate. This results in where every point on the green surface corresponds to a saddle and every point on the red surface corresponds to the mixed stable node. Where the two surfaces meet we have fold bifurcations and the dual cusp bifurcation takes place where the fold line has a vertical tangent. Note that the two bifurcation diagrams of (Haw & Hogan, Citation2018, ) can be obtained by taking sections k=const. with 3<k<4 and k>4, respectively.

To complement the representation of the dynamics in we give in the phase portraits along a b–line with k=3.5 fixed. This clearly shows how the attracting equilibrium born in the first fold bifurcation moves over to the other saddle to vanish during the second fold bifurcation. At the apex (k,b)=(3,9) of the cusp, the phase portrait is similar to the middle one in , but with a non-hyperbolic saddle.

Figure 3. Phase portraits for a=1, k=3.5 and (bottom to top and left to right) b=11, b=b(3.5), b=12, b=16, b=b+(3.5), b=17. Saddles are shown in green, stable nodes in red and non-hyperbolic equilibria in blue.

Figure 3. Phase portraits for a=1, k=3.5 and (bottom to top and left to right) b=11, b=b−(3.5), b=12, b=16, b=b+(3.5), b=17. Saddles are shown in green, stable nodes in red and non-hyperbolic equilibria in blue.

3. The two-neighborhood model

A possible extension of Schelling’s BN model is to consider two BNs and study the movement of members of each population type between them. In § 3.1, we introduce Haw and Hogan’s (Citation2020) dynamical system for the resulting TN model and we explain the inaccuracies of their mathematical formulation. We then correct this formulation to a non-smooth system of differential equations. In § 3.2, we perform a qualitative analysis of the corrected system. We concentrate on Case I of (Haw and Hogan, Citation2020) where both BNs have the same X–tolerance schedule RX and the same Y–tolerance schedule RY. The problems already become apparent in this case, which allows us to simplify the notation. Before summarizing our results we comment on Cases II, III and IV where RX or RY (in Case IV both) become dependent on the BN.

3.1. Model

The sociological model proposed in (Haw and Hogan, Citation2020) and the mathematical description they give in terms of a dynamical system do not coincide, as we lay out in § 3.1.1. In § 3.1.2, we therefore change the mathematical description to a dynamical system that does correspond to the sociological model of (Haw and Hogan, Citation2020). The corrected system is no longer smooth, but nevertheless has a unique solution for every initial condition.

3.1.1. Haw and Hogan’s (2020) formulation

Haw and Hogan (Citation2020) extended Schelling’s BN model by considering two BNs labeled BN1 and BN2. For BNi the X– and Y–populations are denoted by Xi and Yi, respectively. They assume that any individual leaving one BN must enter the other BN, and there is no reservoir. The two BNs thus form a closed system and the total X– and Y–populations are given by Xtotal=X1+X2=1 and Ytotal=Y1+Y2=1k, which yields

(12) X2=1X1andY2=1kY1(12)

and in this way we decouple the dynamics of BN1. Indeed, by differentiating (12) we get

(13) dX2dt=dX1dtanddY2dt=dY1dt(13)

for BN2 whence the initially 4–dimensional system is split into two 2–dimensional systems, of which one mirrors the dynamics of the other.

Haw and Hogan (Citation2020) make similar assumptions for their TN model as those made by Schelling (Citation1971) for the BN model. A new assumption made for the TN model, however, is that each individual residing in one of the two BNs is only concerned with the population ratio of the BN of residence when deciding whether to remain or leave (Haw and Hogan, Citation2020, p. 223). Using the same tolerance schedules RX and RY (in the present Case I for both BNs), they then present the system

(14a) X˙1=f(X1,Y1)f(X2,Y2)=f(X1,Y1)f(1X1,1kY1)(14a)
(14b) Y˙1=g(X1,Y1)g(X2,Y2)=g(X1,Y1)g(1X1,1kY1)(14b)

governing the dynamics of BN1. Here f and g are still the functions defined in (3), but now applied in both BN1 and BN2. At first glance, the system (14) seems to reflect their described model. Upon closer inspection, however, one can observe that these equations are an incorrect mathematical formulation of the stated model.

Given the assumption that individuals only care about the population ratio of the BN that they currently reside in, the first term f(X1,Y1) on the right-hand side of EquationEq. (14a) accounts for the movement of the X1–population in and out of BN1, subject to the presence of the Y1–population. For f(X1,Y1)0 this reflects that the members of the X1–population remain in or leave BN1 according to the tolerance schedule. When f(X1,Y1)>0, the size of the Y1–population is tolerated by the whole X1–population and all X–members remain in BN1. The situation is most transparent if we additionally assume that f(X2,Y2)=0, i.e. in BN2 the X–population is in equilibrium. Then X˙1>0 and thus f(X1,Y1)>0 results in an increase in X1. The only reasonable explanation is that the X–population in BN1 is increasing as a result of X–members moving in from elsewhere besides BN1 or BN2. This, however, contradicts the assumption that we have a closed system with Xtotal=X1+X2. Thus, when f(X1,Y1)>0, Haw and Hogan’s (Citation2020) differential equation for X1 fails to adequately represent their stated model. This similarly applies to f(X2,Y2) and to the terms in (14b).

3.1.2. A new dynamical system

For the differential equations to correctly represent the model of (Haw and Hogan, Citation2020) the following conditions must be met.

  1. When X1(X1RX(X1)Y1)>0, we need to replace f(X1,Y1) in (14a) by 0. Indeed, members of the X–population in BN1 are satisfied with the population ratio and remain in BN1. As individuals only look at the composition of their own BN and we have a closed system of two BNs, no members of the X–population from outside enter BN1 and there is no contribution resulting from the population ratio in BN1.

  2. Similarly, when f(X2,Y2)>0, we need to replace f(X2,Y2) in (14a) by 0.

  3. When g(X1,Y1)>0, we need to replace g(X1,Y1) in (14b) by 0.

  4. When g(X2,Y2)>0, we need to replace g(X2,Y2) in (14b) by 0.

Hence, we have to adjust the differential equations (14) according to where the point (X1,Y1,X2,Y2) lies with respect to the parabolas Yi=XiRX(Xi) and Xi=YiRY(Yi), i=1,2. Therefore, we define

(15a) F1out:={(X1,Y1)Pf(X1,Y1)<0}(15a)
(15b) F1in:={(X1,Y1)Pf(X1,Y1)>0}(15b)
(15c) F2out:={(X1,Y1)Pf(1X1,1kY1)<0}(15c)
(15d) F2in:={(X1,Y1)Pf(1X1,1kY1)>0}(15d)
(15e) G1out:={(X1,Y1)Pg(X1,Y1)<0}(15e)
(15f) G1in:={(X1,Y1)Pg(X1,Y1)>0}(15f)
(15g) G2out:={(X1,Y1)Pg(1X1,1kY1)<0}(15g)
(15h) G2in:={(X1,Y1)Pg(1X1,1kY1)>0}.(15h)

The corrected dynamical system then reads as

(16a) X˙1=F1(X1,Y1)F2(X1,Y1)(16a)
(16b) Y˙1=G1(X1,Y1)G2(X1,Y1)(16b)

where

F1(X1,Y1):=f(X1,Y1)00when(X1,Y1)F1outf(X1,Y1)=0(X1,Y1)F1inF2(X1,Y1):=f(1X1,1kY1)00when(X1,Y1)F2outf(1X1,1kY1)=0(X1,Y1)F2inG1(X1,Y1):=g(X1,Y1)00when(X1,Y1)G1outg(X1,Y1)=0(X1,Y1)G1inG2(X1,Y1):=g(1X1,1kY1)00when(X1,Y1)G2outg(1X1,1kY1)=0(X1,Y1)G2in.

The dynamics of BN2 can still be obtained from that of BN1 using (13), which yields

(X2(t),Y2(t))=(1X1(t),1kY1(t)).

Before a qualitative analysis of the new system (16) can be conducted we must first confirm that (16) still has unique solutions (Blanchard et al., Citation1998, p. 66).

Proposition 3.1

At every point (X1,Y1)P the system (16) is locally Lipschitz continuous and thus satisfies the condition for existence and unicity of solutions for all initial conditions (X1,Y1).

A function H:PR is locally Lipschitz continuous if for every xP there exist constants δ>0 and K such that the inequality

|H(y)H(z)|K|yz|

is satisfied for all y,zDδ(x)={uP|ux∥<δ}. This strong form of continuity is satisfied by every continuously differentiable system – take for the Lipschitz constant K a local bound on the derivative.

Proof. As linear combinations of locally Lipschitz continuous functions are again locally Lipschitz continuous, we only treat F1 (the arguments for F2,G1 and G2 are similar). On F1out and F1in the function F1 is continuously differentiable and therefore locally Lipschitz continuous with K an upper bound of the length F1 of the total derivative of F1 on Dδ(x). At a point x=(X1,Y1) on the parabola Y1=X1RX(X1), the function F1 is still continuous, and we use that the ingredient f of F1 is continuously differentiable everywhere, hence with a Lipschitz constant K in x. The inequality

|f(y)f(z)|K|yz|

for y,zDδ(x) becomes

|F1(y)F1(z)|K|yz|

when both y and z are in F1out, while when both y and z are in F1in the latter simply reads as |00|K|yz|. When yF1in and zF1out, we have that

|F1(y)F1(z)|=|0f(z)||f(y)f(z)|K|yz|

as f(y)0 and f(z)<0 (and similarly when yF1out and zF1in).□

Note that during the proof we replaced F1in by its closure F1in to include the points on the parabola Y1=X1RX(X1).

3.2. Analysis

Again the scaling (5) allows a=1 to be achieved in (16), at the cost of replacing (k,b) by (α,β)=(ak,ab) and multiplying the right-hand side in (16b) by 1a. Haw and Hogan (Citation2020) scale time by 1k instead – effectively this means an extra scaling of time by 1α and correspondingly the vector field is multiplied by α. The extra scaling of time by 1α has the advantage that

αf(X2,Y2)=(1X1)[αX1(1X1)(1αY1)]

and

αg(X2,Y2)=1a(1αY1)[βY1(1αY1)(1X1)]

have no α in the denominator, so we keep it. For the nullclines, the parameters are again reduced from (a,b,k) to (α,β). By taking the tolerance a=1 we have α=k keeping its interpretation of the majority–minority ratio, while β=b can be interpreted as the relative tolerance of the Y–population (with respect to the X–population).

3.2.1. Equilibria

The segregated (1,0), (0,1α) have both right hand sides in (16) read as 00 and are always equilibria, even though the vector field is not differentiable on the union of the four parabolas. On F1outF2outG1outG2out the vector field (16) coincides with (14) and the analysis of (Haw and Hogan, Citation2020) remains valid. In particular, the mixed (12,12α) is also always an equilibrium. Additional equilibria (Xe,Ye) can be obtained from the roots Xe=X1 of a polynomial p6(X1) of degree 6. These 39 equilibria are the intersections of the two cubic nullclines computed by Haw and Hogan (Citation2020).

The cubic X1–nullcline always stays inside F1outF2out — except if the parabolas Y1=X1(1X1) and αY1=1αX1(1X1) overlap (so F1in and F2in intersect) and the right-hand side of (16a) is identically zero on F1inF2in. Then, the part of the cubic Y1–nullcline inside F1inF2in is a line of equilibria. Similarly, where the parabolas X1=βY1(1αY1) and X1=1βY1(1αY1) overlap, the part of the cubic X1–nullcline inside G1inG2in is a line of equilibria. If both F1inF2in and G1inG2in, then the common intersection F1inF2inG1inG2in is even a “sea of equilibria” with its boundary formed by parts of the parabolas. All such infinite sets of equilibria are centered at the mixed equilibrium (12,12α).

In particular, where valid the polynomial p6(X1) is never zero and where there are only finitely many equilibria, these are the segregated (1,0), (0,1α), both attracting and the mixed (12,12α), a saddle (see § 3.2.2 for more details). Before the bifurcations found by Haw and Hogan (Citation2020) can occur, the saddle has already exploded into an infinite set of equilibria. Note that the X1– and Y1–axes are no longer nullclines of the system whence the empty (0,0) is no longer an equilibrium, unlike in the single BN model.

The existence of a line or a sea of equilibria immediately shows that for such parameter values, the system (16) is not structurally stable. Let us therefore compute the parameter values (α,β) for which the parabolas start overlapping. Equating

αX1(1X1)=1αX1(1X1)

yields

X1=12±12α2α,

so the change occurs at α=2. Similarly,

βY1(1αY1)=1βY1(1αY1)

yields

Y1=12α±12αβ2αβ

and the change occurs at β=2α. We therefore have the bifurcation set shown in .

Figure 4. Bifurcation set of (16) in the (α,β)–plane (or, equivalently, for tolerance a=1 in the (k,b)–plane). Counterclockwise (starting top right) the five pictures of nullclines have parameter values (α,β)=(2.5,6),(1.5,6),(2,4),(1.5,2),(2.5,2). In blue the cubic X1–nullcline (dark) and the regions F1in, F2in (light) and in red the cubic Y1–nullcline (dark) and the regions G1in, G2in (light). Where an Fiin and a Gjin intersect the colours mix to light purple. Only in the two top pictures G1inG2in and only in the top right and bottom right pictures F1inF2in. Correspondingly, the top right picture features a (dark purple) sea of equilibria F1inF2inG1inG2in. The middle left picture is also not structurally stable as small perturbations can lead to both the bottom left picture as to one of the other three pictures (the latter are clearly not structurally stable, displaying infinitely many equilibria).

Figure 4. Bifurcation set of (16) in the (α,β)–plane (or, equivalently, for tolerance a=1 in the (k,b)–plane). Counterclockwise (starting top right) the five pictures of nullclines have parameter values (α,β)=(2.5,6),(1.5,6),(2,4),(1.5,2),(2.5,2). In blue the cubic X1–nullcline (dark) and the regions F1in, F2in (light) and in red the cubic Y1–nullcline (dark) and the regions G1in, G2in (light). Where an Fiin and a Gjin intersect the colours mix to light purple. Only in the two top pictures G1in∩G2in≠∅ and only in the top right and bottom right pictures F1in∩F2in≠∅. Correspondingly, the top right picture features a (dark purple) sea of equilibria F1in∩F2in∩G1in∩G2in. The middle left picture is also not structurally stable as small perturbations can lead to both the bottom left picture as to one of the other three pictures (the latter are clearly not structurally stable, displaying infinitely many equilibria).

3.2.2. Phase portraits

The adjustment from (14a) to (16a) is most severe on F1inF2in, where the parabolas intersect. For (X1,Y1)F1inF2out the term f(X1,Y1) is positive and gets adjusted to zero, but the term f(X2,Y2) is also positive, so the right-hand side of (16a) remains positive. Similarly, on F1outF2in the right-hand side of (16a) remains negative. On F1outF2out the cubic X1–nullcline separates the positive values below from the negative values above. Thus, we have the minor adjustment that the positive values below the X1–nullcline become smaller (but remain positive) within F1inF2out and that the negative values above the X1–nullcline become larger (but remain negative) within F1outF2in. The major adjustment is that the X1–nullcline gets broadened to F1inF2in where the parabolas intersect. The same applies mutatis mutandis to the adjustment from (14b) to (16b).

Proposition 3.2

For the adjusted system (16) the segregated equilibria (1,0) and (0,1α) are attractive.

Haw and Hogan (Citation2020) could simply compute the Jacobians in these points to prove that these are both stable nodes for (14). The system (16) is not differentiable at the segregated equilibria.

Proof. We focus on (1,0) as the argument for (0,1α) is similar. Take δ>0 so small that

Dδ(1,0)F2outG1out

and consider the vector field (16) restricted to Dδ(1,0). Between the two nullclines, the vector field points downward right, while on the two nullclines, the vector field points to the right and points downwards, respectively. Below the X1–nullcline the vector field points upward right and all trajectories reach the X1–nullcline in finite time. To the right of the Y1–nullcline the vector field points downward left and all trajectories reach the Y1–nullcline in finite time. Hence, all initial conditions (X1,Y1)Dδ(1,0) have trajectories that tend to (1,0) as t.□

The basins of attraction of the segregated equilibria are clearly larger than a mere disk Dδ, see e.g. , left. Within F1outF2outG1outG2out the Jacobian of (16) reads as

(17) J(X1,Y1)=(1+α)+6αX1(1X1)α1a1a[(α+β)+6αβY1(1αY1)](17)

Figure 5. Phase portraits of (16) for (α,β)=(1.5,2), left, and for (α,β)=(2,4), right—the choice a=1 implies k=α and b=β, respectively. The left phase portrait is structurally stable while the right phase portrait is not – a fact that can be derived only by looking at small perturbations (see figure 4). Note that both phase portraits are invariant under a rotation of 180—the dynamics of BN2 coincides with that of BN1.

Figure 5. Phase portraits of (16) for (α,β)=(1.5,2), left, and for (α,β)=(2,4), right—the choice a=1 implies k=α and b=β, respectively. The left phase portrait is structurally stable while the right phase portrait is not – a fact that can be derived only by looking at small perturbations (see figure 4). Note that both phase portraits are invariant under a rotation of 180∘—the dynamics of BN2 coincides with that of BN1.

where the global factor 1a in the second row corrects the scaling error of Haw and Hogan (Citation2018, Citation2020). The determinant

detJ(12,12α)=14a[2α2+(2α)β]

is negative for α<2 whence the isolated mixed equilibrium is a saddle. This yields for (α,β) with β<2α<4 the phase portrait in , left. The basins of attraction of (1,0) and (0,1α) are separated by the stable manifold of the mixed equilibrium. Recall that for β>2α we have that G1inG2in and (12,12α) is not isolated.

For (α,β) on the boundary β=2α the vector field (16) is only Lipschitz continuous in (12,12α). Taking for β<2α<4 the limit (α,β)(α,2α) shows that the isolated mixed equilibrium keeps behaving like a saddle when the limit is reached. This is also true when taking for β<2α<4 the limit (α,β)(2,β). In particular, for (α,β)=(2,4), the mixed equilibrium is isolated and behaves like a saddle, see , right.

When β>2α (and α<2) the vector field is differentiable on G1inG2in, but the entries in the second row of (17) have been replaced by 0. This yields a zero eigenvalue, corresponding to the line of equilibria. When α>2 (but β<2α) the first row of (17) consists of zeroes, corresponding to the line of equilibria in F1inF2in. When β>2α>4, the equilibria in F1inF2inG1inG2in have zero Jacobian (both eigenvalues vanish). Note that on the boundaries of these lines and sea of equilibria, the vector field (16) still vanishes, but is not differentiable.

3.2.3. Different tolerance schedules

In (Haw and Hogan, Citation2020) the vector field (14) has f and g depend on the BN, with f(X1,Y1) replaced by

(18a) f1(X1,Y1):=X1[a1X1(1X1)Y1](18a)

and f(X2,Y2) replaced by

(18b) f2(X2,Y2):=X2[a2X2(1X2)Y2].(18b)

Thus, BN1 and BN2 have different tolerance parameters a1 and a2, respectively. A scaling like (5) cannot change their ratio, which is denoted by

γ:=a2a1.

Similarly, g(X1,Y1) and g(X2,Y2) get replaced by

(18c) g1(X1,Y1):=Y1[b1Y1(1kY1)X1](18c)
(18d) g2(X2,Y2):=Y2[b2Y2(1kY2)X2](18d)

with different tolerance parameters b1 and b2, while the population ratio k=XtotalYtotal remains the same for both BNs. Scaling both Y1 and Y2 by 1a1 but t by 1k leadsFootnote3 to

α:=a1k,β1:=a1b1,β2:=a1b2

with again an extra factor 1a1 in the Y˙1–equation, but otherwise reducing parameters from (a1,a2,b1,b2,k) to (α,β1,β2,γ).

Generalizing (16) to (18) does not change the fundamental problem. While the symmetry line reflecting the parabola f1=0 into the parabola f2=0 may shift up or down, on their intersection the right-hand side of the X˙1–equation still has to vanish. The situation is similar for g1=0, g2=0 and the Y˙1–equation.

The bifurcations to further equilibria found in (Haw and Hogan, Citation2020) all require the cubic X1–nullcline to evolve extrema. However, before this can happen the X1–nullcline must have a horizontal inflection point, after which the parabolas f1=0 and f2=0 start to intersect. This similarly applies for the Y1–nullcline. Thus, before the bifurcations detailed in (Haw and Hogan, Citation2020) can take place, the adjustment of the vector field has already led to lines and seas of equilibria. Note that an intersection of parabolas no longer necessarily leads to infinitely many equilibria, as the “other” nullcline is no longer enforced to pass through, but a bifurcation does require the “other” nullcline to pass through the intersection of parabolas.

Again, such dynamics is not structurally stable. Also, for tolerance parameters dependent on the BN, the only phase portraits with finitely many equilibria have the two attracting segregated equilibria and a mixed saddle. As already explained by Haw and Hogan (Citation2020), the saddle is no longer fixed at (12,12α), though.

3.2.4. Results

The corrected system (16) fails to be everywhere differentiable, but in a mild way. It is still locally Lipschitz continuous, and hence for every initial condition, there is a unique solution – all these together form the flow of (16). Mutatis mutandis this applies to the system with right-hand sides (18) as well.

Our qualitative analysis yields a bifurcation set in , with three of the four open regions displaying dynamics that is not structurally stable. For the system (14), Haw and Hogan (Citation2020) were able to find bifurcations to attractive mixed equilibria, and we discuss in the concluding section 5 that it might be worthwhile to change the sociological model in such a way that (14) becomes a valid description.

A word of warning on the use of structural stability: for parameter values like (α,β)=(1.5,2) deep inside the structurally stable region of , the smallness requirement on perturbations is not particularly restrictive. However, the closer (α,β) is chosen to the boundary of that region, the smaller the perturbations are allowed to be. This phenomenon is not specific to the structurally stable part of the TN model, but is “built-in” to the definition of structural stability: for a perturbation to be sufficiently small may mean very small.

4. Discussion

We have achieved a complete understanding of the dynamics defined by (3) and by (16). We explained how the dynamics of (3) can be made structurally stable when the product of tolerances abβ0 is above a threshold β0>1 and detailed the occurring bifurcations. The system (16) is not structurally stable for (α,β) outside the small wedge {0<β<2α<4}. For (α,β) inside the wedge, the dynamics consists of two stable segregated equilibria, with basins of attraction separated by the stable manifold of a mixed saddle, see , left.

4.1. Interpretation

The BN model is very simple, and this is not only its weakness but also its strength. Among the weaknesses is the choice of the phase space

P=[0,1]×[0,1k]

where the relative size Ymax=1kXmax of the minority population is built into the model in such a way that there are immediate further consequences. For instance, where the segregated equilibrium (1,0) may grant one place per member of the X–population, the segregated equilibrium (0,1k) then grants on average k places to members of the Y–population. The capacity of the BN is simply not part of the model as the X–slots and the Y–slots are filled independently; the interdependence in (3) solely stems from the tolerances.

Another weakness of the BN model is that (2) not leaving P invariant is remedied by simple multiplication with X and Y, respectively. This turns the empty (0,0) into a degenerate equilibrium. The model still captures that the composition of a BN with a low number of residents does not exceed the tolerance of most individuals of either population. Consequently, individuals start moving in when the initial composition of the BN is close to the origin. The composition then changes over time as residents remain or leave, depending on whether their tolerances are exceeded. Eventually, the BN reaches a composition that corresponds to a stable node (or, exceptionally, to a saddle).

Two stable nodes present for all parameter values are (1,0) and (0,1k). This shows that regardless of how tolerant the two populations are, the BN can segregate. For parameter values outside the cuspy region, the initial composition merely decides whether the BN segregates to an all X–population or an all Y–population – and the precise parameter values determine which basin of attraction is larger (but that is already a quantitative statement that might be out of the scope of the BN model).

For parameter values inside the cuspy region, there is a third stable node, with a mixed population. The precise position of the parameters determines the position of the two saddles (and of the mixed stable node) and thus determines the sizes of the three basins of attraction. The node the dynamics leads to depends on the basin of attraction the initial composition lies in. Where the trajectory passes close to the stable manifold of a saddle, BN tipping can be achieved by pushing the state (X(t),Y(t)) of the system to the other side of that stable manifold.

An interesting prediction made by the BN model is that a stable node corresponding to a mixed BN only exists for parameter values inside the cuspy region. It is thus not sufficient to simply increase the values of the tolerance parameters to obtain a mixed BN. Instead, such an increase has to be achieved in such a way that the parameter values enter the cuspy region and one still has to ensure that the trajectory is in the basin of attraction of the mixed node. The phase portraits in convincingly show how increasing the tolerance b of the minority (Y) — with population ratio k and tolerance a of the majority (X) fixed – first leads to the creation of a mixed stable node and, as the tolerance b is increased even further, to the annihilation of that mixed stable node. Ultimately, the basin of attraction for the segregated stable node of an all Y population is increased, while the basin of attraction for an all X population has decreased. Note that the mathematical assumption that the tolerance a remains fixed during the increase in the tolerance b probably does not reflect real life and a might increase as well, keeping the system in the cuspy region.

The TN model on P with β<2α<4 has dynamics rather similar (although not topologically equivalent) to the BN model on Pδ with parameter values outside the cuspy region (but satisfying ab>1). The population trajectories lead to neighborhood compositions where BN1 is segregated with one type of population and BN2 is likewise segregated with the other type of population. The initial composition of the neighborhoods merely decides which particular population each BN is eventually composed of (unless we have an exceptional initial composition on the stable manifold of the saddle). The model is not well fitted to describe real-world scenarios that have parameter values (α,β) with α2 or β2α.

4.2. Further results

We include in this discussion section two further results that concern both models. One result is that there are no periodic orbits; the only orbits for which the past completely stays inside the phase space are the 3 or 5 equilibria and the 2 or 4 heteroclinic orbits from saddles to attractive equilibria. All other orbits leave the phase space Pδ (for the BN model), respectively P (for the TN model) in backward time. The second result concerns a two-neighborhood model with reservoir – but without direct movements between the neighborhoods. Such a model is brought back to two individual BN models.

4.2.1. Absence of periodic orbits

We can discard the possibility of periodic orbits discussed in (Haw & Hogan, Citation2018). The Poincaré index of a periodic orbit is +1 and has to be equal to the sum of the Poincaré indices of the equilibria within the periodic orbit (Guckenheimer & Holmes, Citation2002, § 1.8). The occurring mixed equilibria are stable nodes (index +1), saddles (index 1), equilibria undergoing a fold bifurcation (index 0), a non-hyperbolic saddle (index 1) and continuous sets of infinitely many equilibria (such a set has the Poincaré index 1 of a saddle).

This shows that the mixed stable node has to lie within a periodic orbit. A periodic orbit around the stable node has to intersect both nullclines twice. The direction at an X–nullcline is counterclockwise, and the direction at a Y–nullcline is clockwise, a contradiction. Hence, there are no periodic orbits.

4.2.2. Two neighborhoods with reservoir

A possible extension of the TN model consists of two BNs along with the option of an individual to be in neither BN, but elsewhere (in the reservoir). Haw and Hogan (Citation2020) already presented this case and described the reservoir as two extra segregated BNs. Instead, we keep our interpretation of the reservoir standing for the rest of the city, the dynamics of which we are not interested Footnote4 in. To move between the BNs, all individuals go through the reservoir: there is no direct movement. We now argue that this model is equivalent to two independent single BN models.

Haw and Hogan (Citation2020) denote the X,Y populations in BNi as (Xi,Yi), i=1,2. They also denote the X,Y populations in the reservoir as X3=1X1X2 and Y3=1kY1Y2. Note that the numbers 1 and 1k then no longer give the total population in BN1 and BN2. Instead, this opens the possibility that X3=0 or Y3=0 and correspondingly Haw and Hogan (Citation2020) present the differential equations

(19a) dX1dt= {f1(X1,Y1)When f1(X1,Y1)0     or        X3>0max(0,f2)When f1(X1,Y1)>0      and    X3=0(19a)
(19b) dY1dt={g1(X1,Y1)When g1(X1,Y1)0       or     Y3=0max(0,g2)When g1(X1,Y1)>0     and    Y3>0(19b)

with fi(Xi,Yi) and gi(Xi,Yi) given by (18). However, our interpretation of the reservoir standing for the rest of the city rules out the possibility of an empty reservoir. Then, the system (19) indeed reduces to (3).

The assumptions made in this model are the same as the assumptions made in the case of two independent single BN models. Individuals’ decisions to stay or move to the reservoir are based on whether the tolerances are exceeded. Accordingly, the TN model with reservoir is equivalent to two independent single BN models. These arguments generalize from two to multiple BN, proving the following result.

Proposition 4.1

In a system of several BNs where movement only occurs through the reservoir (that cannot be depleted), the dynamics in any of the BNs is that of an independent single BN.□

As we model the change in densities of the two populations and not the movement of individuals, the problem of one individual simultaneously moving to two different BNs does not arise.

5. Conclusion

We examined Schelling’s (Citation1971) and Haw and Hogan’s (Citation2018, Citation2020) BN and TN models for two populations, X and Y. The two models demonstrate how individual discriminatory preferences can lead to segregation. We carried out a qualitative analysis of both the BN model and the corrected TN model, determining the equilibria and their stability. In the corrected TN model the only dynamical attraction is to segregated equilibria, but in the BN model there is a region of parameters for which a significant part of the phase space is attracted to a mixed equilibrium. Schelling and Haw and Hogan developed their models to understand how segregation can emerge without external intervention, even among tolerant populations. It is therefore critical that these models are free from mathematical problems that undermine modeling results.

Both models have areas that require improvement. The TN model does not have great predictive power as its systems may not be structurally stable, limiting the freedom to set parameter values. The primary cause is the assumption that individuals only consider the composition of their own BN when deciding whether to remain or leave. Future research should attempt to revise this assumption, e.g. by having all individuals taking the population ratio of both BNs into account. Where both f1>0 and f2>0 in (18), the mathematical difference f1(X1,Y1)f2(X2,Y2) can then be thought of as sending the X–population to the greater attraction. However, this has serious consequences on the sociological interpretation of the model as individuals may then move even if the tolerance is not exceeded.

Future studies can improve the BN model by obtaining demographic data with different time points to construct reasonable and realistic tolerance schedules and parameter estimates. Haw and Hogan (Citation2018, Citation2020) study how replacing the linear tolerance schedules by polynomial or exponential tolerance schedules affects the dynamics. The parabolas (1) get replaced by functions that still have a unique maximum on [0,1] and [0,1k], respectively. One-bump functions were also obtained in an empirical study by Clark (Citation1991), but with functions that are not differentiable at the maximum. Schelling (Citation1971) already showed how such corners can be obtained with piecewise linear tolerance schedules. Monotonically decreasing tolerance schedules consisting of two linear pieces should keep the BN model simple enough to allow for a thorough study, while rendering the dynamical system more realistic.

Acknowledgments

We would like to thank Dr Özge Bilgili for guiding us toward a more-nuanced understanding of the complexity of the effects of residential segregation. We also thank the anonymous referee for improving the presentation of our results.

Disclosure statement

No potential conflict of interest was reported by the authors.

Notes

1 Also known as saddle-node bifurcations.

2 So small that making δ even smaller or 10% larger does not change the shape of Wδs(xe).

3 Next to γ as the scaling replaces the tolerance a2 in (18b) by αγ=a2k.

4 In particular, we do not exclude the possibility that there is a district in the city with only members of the X–population or a district with only members of the Y–population.

References

  • Blanchard, P., Devaney, R., & Hall, G. (1998). Differential Equations. Brookes/Cole.
  • Bolt, G. (2009). Combating residential segregation of ethnic minorities in European cities. Journal of Housing and the Built Environment, 24(4), 397–405. https://doi.org/10.1007/s10901-009-9163-z
  • Bolt, G., Özüekren, A. S., & Phillips, D. (2010). Linking integration and residential segregation. Journal of Ethnic and Migration Studies, 36(2), 169–186. https://doi.org/10.1080/13691830903387238
  • Clark, W. A. V. (1991). Residential preferences and neighborhood racial segregation: A test of the Schelling segregation model. Demography, 28(1), 1–19. https://doi.org/10.2307/2061333
  • Cutler, D. M., & Glaeser, E. L. (1997). Are ghettos good or bad? The Quarterly Journal of Economics, 112(3), 827–872. https://doi.org/10.1162/003355397555361
  • De la Roca, J., Ellen, I. G., & O’Regan, K. M. (2014). Race and neighborhoods in the 21st century: What does segregation mean today? Regional Science and Urban Economics, 47, 138–151. https://doi.org/10.1016/j.regsciurbeco.2013.09.006
  • Fajth, V., & Bilgili, Ö. (2020). Beyond the isolation thesis: Exploring the links between residential concentration and immigrant integration in the Netherlands. Journal of Ethnic and Migration Studies, 46(15), 3252–3276. https://doi.org/10.1080/1369183X.2018.1544067
  • Guckenheimer, J., & Holmes, P. (2002). Nonlinear Oscillations, Dynamical Systems, and Bifurcations of Vector Fields (7th ed.). Springer.
  • Haw, D. J., & Hogan, S. J. (2018). A dynamical systems model of unorganized segregation. The Journal of Mathematical Sociology, 42(3), 113–127. https://doi.org/10.1080/0022250X.2018.1427091
  • Haw, D. J., & Hogan, S. J. (2020). A dynamical systems model of unorganized segregation in two neighborhoods. The Journal of Mathematical Sociology, 44(4), 221–248. https://doi.org/10.1080/0022250X.2019.1695608
  • Iceland, J. (2004). Beyond Black and White: Metropolitan residential segregation in multi-ethnic America. Social Science Research, 33(2), 248–271. https://doi.org/10.1016/S0049-089X(03)00056-5
  • Jones, R. (2019, April 26). Apartheid ended 29 years ago. How has South Africa changed? National geographic. Retrieved March 5, 2021, from https://www.nationalgeographic.com/culture/2019/04/how-south-africa-changed-since-apartheid-born-free-generation/
  • Lewis, O. (1966). The culture of poverty. Scientific American, 215(4), 19–25. https://doi.org/10.1038/scientificamerican1066-19
  • Musterd, S. (2003). Segregation and integration: A contested relationship. Journal of Ethnic and Migration Studies, 29(4), 623–641. https://doi.org/10.1080/1369183032000123422
  • Schelling, T. C. (1971). Dynamic models of segregation. The Journal of Mathematical Sociology, 1(2), 143–186. https://doi.org/10.1080/0022250X.1971.9989794
  • Williams, D. R., & Collins, C. (2001). Racial residential segregation: A fundamental cause of racial disparities in health. Public Health Reports, 116, 404–416. https://doi.org/10.1016/S0033-3549(04)50068-7