ABSTRACT
We re-consider Schelling’s (1971) bounded neighborhood model as put into the form of a dynamical system by Haw and Hogan (2018). The aim is to determine how tolerance can prevent (or lead to) segregation. In the case of a single neighborhood, we explain the occurring bifurcation set, thereby correcting a scaling error. In the case of two neighborhoods, we correct a major error and derive a dynamical system that does satisfy the modeling assumptions made by Haw and Hogan (2020), staying as close as possible to their construction. We find that stable integration is then only possible if the populations in the two neighborhoods have the option to be in neither neighborhood. In the absence of direct movement between the neighborhoods, the problem is furthermore equivalent to independent single neighborhood problems.
1. Introduction
Segregation is a phenomenon in which people are separated based on factors such as ethnicity, religion or political preferences. It can manifest in a number of domains of everyday life, like residential districts, schools or workplaces, to name a few. In this paper, we focus on residential segregation in a district of a city. The broad mathematical definition of segregation that we use and the principles governing it, however, can be applied to a multitude of seemingly disparate situations, reaching as far as the division of members within a political party.
While residential segregation may be perceived by many as a phenomenon of the past, segregation still exists today in the twenty-first century. In some locations, patterns of segregation remain similar to the patterns of segregation of the past. For instance, the South African city of Chatsworth was created to segregate Indians during the Apartheid and even now, in the twenty-first century, Indians continue to make up the majority of Chatsworth’s population (Jones, Citation2019). In the USA, from 1980 to 2000, white and black segregation declined while Asian and Pacific Islander segregation increased (Iceland, Citation2004).
Segregation has several adverse effects on specific populations for both individuals and groups. Multiple studies (De la Roca et al., Citation2014, Williams and Collins, Citation2001, Cutler & Glaeser, Citation1997, Lewis, Citation1966) have provided empirical evidence that segregation, combined with poverty, leads to a multitude of problems. These problems include the reduction of contact of individuals with positive role models, increased exposure to violence, the promotion of a “culture of poverty,” aggravation of differences in individual socioeconomic status and differences in educational and employment opportunities. Studies also show that residential segregation has negative effects on the educational attainment and labor participation of minority ethnic groups and immigrant groups (Bolt, Citation2009, Musterd, Citation2003). Results from a study based in the Netherlands (Fajth & Bilgili, Citation2020) show that the physical concentration of non-western immigrants in a neighborhood leads to lower social integration, evidenced by lowered numbers of social ties between non-western immigrants and their native Dutch neighbors in such areas.
Residential segregation may sometimes have positive effects on specific categories of people. According to Bolt et al. (Citation2010), the clustering of ethnic minorities in a neighborhood may provide a sense of security, well-being and identity, particularly for individuals unable to speak the language of their host country. While discussing the larger impact of segregation on the societal level is beyond the scope of this paper, it is undeniable that segregation has significant effects on the lives of individuals of certain populations. This draws attention to the need to understand the causal factors of segregation in order to tackle the problem.
A possible approach to explaining these causal factors is to find out to what extent a mathematical analysis can answer the sociological questions posed in (Schelling, Citation1971, Haw & Hogan, Citation2018) and in (Haw and Hogan, Citation2020). Our aim is to determine how tolerance can prevent (or lead to) segregation. While the directions of research of sociological phenomena are ultimately decided by sociology, we will see that mathematics allows us to fully understand certain aspects of a model. In particular, with mathematics, we can determine whether a model is simply unsuitable for predicting real-world phenomena: already what we mean by “unsuitable model” can be phrased in purely mathematical terms (see below). The mathematical field that describes dynamic behavior like segregation is known as the theory of dynamical systems.
1.1. A dynamical systems approach to segregation
Thomas Schelling introduced the principles for a dynamical system of location-based segregation in his 1971 paper “Dynamic models of segregation.” This approach is particularly important as it captures the dynamic nature of segregation. Schelling’s Bounded Neighborhood (BN) model allows us to study unorganized segregation in bounded districts. In place of Schelling’s term “unorganized segregation,” however, we take up the suggestion of Haw and Hogan (Citation2020, footnote on p. 221) and speak of self-organized segregation. Self-organized segregation is driven by individual preferences (Schelling, Citation1971, p. 143) and stands in opposition to “organized segregation” like the South African Apartheid, which is segregation that occurs as a result of the practices of organizations and institutions.
Schelling’s BN model enables us to study how individual incentives and perceptions of differences can lead to collective segregation (1971, p. 145). The model describes a residential district (or a school, church, political party, etc.), referred to as a BN, in which the individuals are divided into two populations. For the case of a single neighborhood with two populations Schelling (Citation1971) used the example of a black population and a white population. To emphasize the far-reaching applications of our study, we follow (Haw & Hogan, Citation2018, Citation2020) and refer to –population (majority) and –population (minority). The respective population densities and are non-negative and depend continuously on time. The fact that is a density implies that . For the –population we deviate from this and choose , with taking the role of a ratio parameter between majority and minority. This makes the population ratio a parameter of the system that can be prescribed at will. As a result, the density of the –population satisfies . We use the term “composition” to refer to a particular combination of the – and –population densities in the BN. We refer to a BN with only one population type residing in it as “segregated” and to a BN with both population types as “mixed.”
The BN model focuses on how individual choices based on discriminatory behavior can lead to segregation. Individuals from each population type possess characteristics or features that are distinct from those of individuals from the other population type. Based on these differing characteristics, individuals of each population type show discriminatory individual behavior toward individuals of the other population type. Here, we use Schelling’s definition of “discriminatory” as “an awareness, conscious or unconscious, of sex or age or religion or color or whatever the basis of segregation is, an awareness that influences decisions on where to live, whom to sit by, what occupation to join or to avoid, whom to play with or whom to talk to” (Citation1971, p. 144). Individuals take into consideration the ratio of the two population types when deciding whether to move into, out of or remain in a BN. Their tolerance is built into the model in the form of tolerance limits, the maximum ratio of the two populations that they tolerate in a BN for them to continue residing in the BN or to move to the BN. These tolerance limits are in turn allocated to each population type via tolerance schedules. It is based on the tolerance schedules that the compositions of a BN changes over time as individuals move in and out or remain, thus allowing us to study the dynamics of the neighborhood and how self-organized segregation comes about. We further explain this in detail in 2.1.
In the BN model, there exist two locations where individuals from the two populations can reside in – the BN and a location in which their population type “predominates or […] does not matter” (Schelling, Citation1971, p. 167). Throughout this paper, we refer to the latter as “the reservoir.” The BN is preferred over the reservoir by every individual, and individuals do not take into consideration the ratio of the two populations in the reservoir. Note that we do not follow Haw and Hogan (Citation2020) who use one reservoir for each population and consider both reservoirs to be segregated. Rather, we think of the reservoir as being the rest of the city. It does not include the BN, and we are not interested in the dynamics of the reservoir. The model is only concerned with the dynamics of the neighborhood.
Upon parameter variation, the BN model may display foldFootnote1 bifurcations. At such a bifurcation a mixed stable equilibrium and a mixed unstable equilibrium meet and vanish – or are both born, depending on the direction of the parameter variation. In the bifurcation set of (in 2.3 below) this happens when crossing the green surface. In section 2, we explain this bifurcation set, which gives us a more comprehensive understanding of the model when predicting the complex social phenomenon of segregation. The fold bifurcations turn out to be organized by (dual) cusp bifurcations at the red line in , where the two parts of the green surface meet to form a cusp. These parts of the bifurcation set form the discriminant set of a cubic polynomial already obtained in (Haw & Hogan, Citation2018), the polynomial (8) governing all mixed equilibria. The explanation of these bifurcations is facilitated by the surface in , which is the bifurcation diagram detailing how the mixed stable equilibrium born in a fold bifurcation moves over to a second mixed unstable equilibrium to vanish in a second fold bifurcation. The bifurcation set in is completed by a blue surface where the remaining mixed unstable equilibrium merges with the empty neighborhood.
From the BN model one can construct a model of segregation in two neighborhoods. Discarding the reservoir (think of a small city with only two districts) the – and –populations move between the two neighborhoods according to their preferences based on tolerance schedules. This is the Two Neighborhood (TN) model of (Haw and Hogan, Citation2020) treated in section 3. We show in 3.1 how to correct the dynamical system of (Haw and Hogan, Citation2020) to fully represent the TN model. In 3.2, we then find that all points where the tolerance schedules of neither neighborhood are exceeded form an equilibrium – leading to lines and seas of equilibria for large parts of the parameter space (see in 3.2 below). This makes the model unsuitable for purposes of modeling real-life phenomena, where “unsuitable” has a precise mathematical meaning to be explained now.
Mathematical systems use mathematical concepts to describe real-life phenomena and to predict their behavior. They are rough approximations of the exact relationships describing phenomena in the real world, after focusing on only a single ingredient of the complex relationship between the variables. The BN model describes how segregation in a BN changes as individual tolerances change. It is an abstraction of the particular relationship between individual discriminatory behaviors and segregation, taking into account the tolerances of individuals, while in reality, multiple other factors affect segregation as well. We do not expect the BN model to describe the real world (or parts of it) in an exact fashion. In reality, the real-world system of segregation can be understood as being governed by the model’s equations plus “small” changes and uncertainties. In deterministic systems like the ones used in (Haw & Hogan, Citation2018, Citation2020) the latter are given by small perturbations. Such perturbations should be sufficiently small, so small that the dynamics remains qualitatively unchanged under perturbation. The mathematical term for this concept is that of structural stability. Thus, having a structurally stable dynamical system as a model is a necessary condition for drawing conclusions on the real world. This is not possible if the system is not structurally stable. We explain the important concept of structural stability in more detail in 2.3 below, but found it helpful to already here emphasize its impact.
David Haw and John Hogan explicitly formulated the BN model as a dynamical system in their 2018 paper “A dynamical systems model of unorganized segregation.” In their 2020 paper “A dynamical systems model of unorganized segregation in two neighborhoods,” they extended their 2018 dynamical system to create the TN model of self-organized segregation in two neighborhoods. In the present paper, we examine both their 2018 and 2020 papers with the following aims.
Perform an in-depth analysis of the 2018 single neighborhood model by assessing in particular the structural stability of the system. Prior to our analysis we additionally correct the scaling used in (Haw & Hogan, Citation2018) – which, as it turns out, has little consequences and qualitatively the error does not matter.
Correct the differential equations for the two-neighborhood model as put forward in (Haw and Hogan, Citation2020). We conclude that Haw and Hogan’s two neighborhood model is not a suitable model as it leads to dynamical systems that are not structurally stable.
Examine the physical implications of the results obtained from both models. We summarize the predictions made possible by the models and point out how stable integration – in the form of trajectories toward a mixed equilibrium – becomes a possibility.
Demonstrate that the two-neighborhood model with a reservoir population – as briefly presented in (Haw and Hogan, Citation2020) – is equivalent to two independent cases of the single-neighborhood model. We also show that the models have no periodic dynamics.
Our mathematical analysis confirms that the dynamics of the BN model is governed by the two surfaces (4) and (9) shown in the left part of (in 2.3 below). We furthermore detail the bifurcations that occur when the parameters are moved across (9) and how this is governed by the surface shown in . Our analysis also points out that the reformulation of Haw and Hogan’s (Citation2020) TN model as a dynamical system has non-isolated equilibria, thus preventing structural stability. To achieve aim 3, we detail what our results suggest about how segregation based on individual preferences – in particular on discriminatory behavior – comes about. Allowing a reservoir population in the TN model is proposed as an extension in (Haw and Hogan, Citation2020, p. 245) and we use the discussion section to show how this simply reduces the TN model to two independent BN models. Prior to this we use the Poincaré index to rule out periodic orbits.
1.2. Outline
We treat two different models – the single neighborhood (BN) model and the two neighborhood (TN) model. These were both introduced in the present section, but for the model itself and for the analysis we use separate sections for each model. Section 2 introduces the basic concepts and theory behind the BN model and presents an overview of the dynamical system. In this section, we also correct the scaling in Haw and Hogan’s (Citation2018) paper. Furthermore, we provide an in-depth qualitative analysis and present our results. Section 3 contains our evaluation of Haw and Hogan’s (Citation2020) TN model. We argue that the given mathematical formulation inaccurately depicts their described model. We consequently introduce a new formulation and perform a qualitative analysis of the system. It turns out that Haw and Hogan’s TN model describes dynamical systems that are not structurally stable. Discussion of the two models and interpretation of the results of our analysis are presented together in section 4. We also prove that both models have no periodic orbits and touch upon Haw and Hogan’s (Citation2020) two-neighborhood model with a reservoir population. Lastly, in section 5, we suggest possible areas of improvement for the single- and the two-neighborhood model and future directions of research.
2. Population dynamics in a bounded neighborhood
To keep the present paper self-contained, we re-build the dynamical system of (Haw & Hogan, Citation2018) from scratch, also explaining the meaning of the model parameters. We re-do the scaling and explicitly show that reducing the model size from three parameters to two parameters is indeed possible. Our analysis unveils that the model leads to a structurally stable dynamical system – despite the empty neighborhood being a degenerate equilibrium. We also detail the bifurcation diagram, explaining how the BN model can exhibit stable mixed equilibria.
2.1. Model
Individual discriminatory behavior is translated into segregation via tolerance schedules, which are specific to each population. The tolerance schedule of a given population is a function that keeps track of how many members of the population can tolerate how many members of the other population.
The upper limits for the – and –populations are denoted as and , respectively. We have for the ratio parameter – the ratio of majority to minoriy – and accordingly have . For the –population is in minority. The phase space (in which the dynamics takes place) is the rectangle
with boundary
The dynamics in is derived from the tolerance schedules of the –population and of the –population. These describe the maximal ratios and abided by the – and –populations, respectively (Schelling, Citation1971). Thus, for
the system is in equilibrium. Taking linear tolerance schedules
with tolerance parameters and , this results in the parabolas
The –population increases inside the first parabola, i.e. for
and decreases outside. Similarly, the –population increases inside the second parabola and decreases where .
2.1.1. Dynamical system
The simplest way to encode this as a dynamical system is to put
This makes the empty neighborhood an equilibrium. However, the dynamics of (2) does not leave the phase space invariant. Indeed, for , the right-hand side of (2a) is negative (or zero, at the equilibrium) resulting in members of the –population moving out of the BN without living there in the first place. Similarly, the vector field (2) points downward at the boundary part of the phase space. Note that (2) does point into at the remaining boundary points where or .
This problem is remedied by Haw and Hogan (Citation2018) who multiply the right-hand sides of (2a) by and of (2b) by , respectively, hence adjusting the dynamical system (2) to
This makes the axes and invariant, in particular the corner points and become equilibria.
2.1.2. Preliminary analysis
The Jacobian of the vector field (3) reads as
Hence, the –segregated equilibrium is attractive as has negative eigenvalues and . Similarly, has negative eigenvalues and whence the –segregated equilibrium is attractive as well.
However, the adjustment from (2) to (3) has the drawback of turning the empty into a degenerate equilibrium. Indeed,
and one has to go to second order to conclude that this equilibrium is repelling for for the product of the tolerance parameters. In (below in 2.3), we can see that for , the points on a line in the phase space are attracted to , i.e. there are initial compositions for which everybody moves out of the BN. Thus, the qualitative behavior changes when crossing the surface
All other equilibria are still obtained by intersecting the parabolas given in (1). In the cases where this results in a single mixed equilibrium we have
and this yields a saddle point, see again . As the saddle moves toward and reaches it at . The remaining phase portrait in has three mixed equilibria and in this case one of them is attractive, while the other two are again saddles. We continue our analysis in 2.3 and first discuss a possible simplification of the BN model.
2.2. Scaling
Haw and Hogan (Citation2018) propose the scaling
to remove the tolerance parameter in (3), at the cost of replacing the tolerance by and the population ratio by . However, this would turn the Jacobian from
and thus change the ratio of eigenvalues from to , while a scaling like (5) is only able to globally scale the eigenvalues (by , the time reparametrization) but not their ratio. Indeed, the scaling (5) turns (3) into
where we have dropped the hats. The extra factor in (6b) is inevitable as now the Jacobian has eigenvalues and , so their ratio does remain equal to .
2.2.1. Topological equivalence
The extra factor does not influence the nullclines, whence in particular the equilibria of (3) have positions independent of the value of the tolerance — provided that we replace the tolerance by and the ratio by . In fact, the dynamical systems (3) and
are topologically equivalent, i.e. qualitatively they display the same dynamics. To prove this we switch to a better suited notation. Let denote the vector of possible states whence
is the vector field in (3). The flow of collects all solutions: for every and all times with we have that
The trajectory of under is the curve . In cases where going backward in time leads out of the phase space we simply restrict time to with the time at which the trajectory passes through . Similarly, let denote the flow of (7).
Proposition 2.1
There is a homeomorphism that provides a topological equivalence between the flow and the flow .
Note that fixing , and in (3) yields exactly the values and in (7b), for which the two systems have the same nullclines.
Proof. To construct the homeomorphism , a bijective mapping that is continuous and for which the inverse is also continuous, we use the so-called stable manifolds
of the equilibria , consisting of the points attracted to . For an attractive equilibrium, this is the basin of attraction, and for a saddle, this is a line separating two basins of attraction. For sufficientlyFootnote2 small radius let where is the Euclidean length of . For a saddle, consists of two points, one on each arc of the stable manifold, while for the segregated equilibria, is a quarter-circle.
For , we measure the time it takes until the trajectory of intersects one of the sets . We then use the flow to move by back in time (or forward if is negative), i.e. we let
Outside of the equilibria this defines a diffeomorphism (differentiable with differentiable inverse). By also mapping each equilibrium to itself, we obtain a homeomorphism on all of for which , i.e. which conjugates the two flows. On trajectories leaving at some negative time we then rescale time to ensure that maps to .□
In case there are periodic orbits, the period cannot be changed by a conjugating homeomorphism. Originally, this was the reason to weaken the concept of topological conjugacy to that of topological equivalence by allowing for time to be rescaled along the trajectories. Then, occurring periodic orbits can be mapped to each other as well.
2.2.2. The discriminant set
Substituting the first parabola of (1) into the second parabola yields a quartic expression that vanishes at (the equilibrium at the origin) and at the roots of the cubic
(the mixed equilibria). Since omitting the global factor in (6b) does not change the stability type of equilibria, all sections of the bifurcation set with constant tolerance of the majority (see in 2.3 below) are scaled versions of each other. In particular, the computation
by Haw and Hogan (Citation2018) remains valid and still yields the discriminant set of (8) separating the region with equilibria from the region with equilibria, see again . For in the discriminant set of (8), i.e. satisfying the EquationEq. (9(9) (9) ), the dynamical system (3) has equilibria and a fold (or saddle node) bifurcation takes place, except when additionally where the saddle undergoes a dual cusp bifurcation (see 2.4 for more details).
The reduction from three parameters to two parameters simplifies the mathematics, but the interpretation of the product of tolerances and of the product of tolerance times population ratio is not obvious. For this reason, we avoid in the sequel the use of and and keep referring to the tolerances and the ratio . Note that the systems of Eqs. (6) and (7) do coincide if we put , thus making a kind of relative tolerance parameter.
2.3. Analysis: structural stability
One of the key concepts in the theory of dynamical systems is the concept of structural stability. A vector field is said to be structurally stable if for all sufficiently small perturbations of the flows of and of are topologically equivalent, i.e. they describe qualitatively similar dynamics.
Having a structurally stable system ensures that the mathematical results obtained for the model under consideration are applicable to the real world. Indeed, it is improbable that the real-world phenomenon of self-organized segregation is exactly described by the vector field (3). If the dynamics of (3) is structurally stable, then for all small perturbations the dynamics behave similarly. It suffices therefore that (3) is close enough. To keep the model’s predictions in line with what is observed in real life, however, one has to improve the model where the phenomena discovered in the dynamics of (3) differ from what is observed in the real world.
Unfortunately, the degeneracy of the equilibrium prevents the vector field in (3) from being structurally stable. Merely subtracting from yields for which the empty BN has the Jacobian
making this an attractive equilibrium for any and thus preventing a topological equivalence between and . This can be remedied by removing a small disk from the phase space (or rather a quarter-disk). For the resulting phase space is still invariant under the flow. If additionally does not satisfy (9), then all remaining equilibria are non-degenerate. The segregated BN and are attractive, and in the interior of , there is a saddle and possibly another saddle and a mixed attractive equilibrium.
This mixed stable equilibrium , if it exists, is obtained by solving the cubic EquationEq. (8)(8) (8) , which yields unwieldy formulas. However, the qualitative behavior is independent of the exact position of this equilibrium. The boundary of the basin of attraction consists of the stable manifolds of the two saddles.
Proposition 2.2
Let be a curve in parameter space that does not cross (4) or (9). Then, for all parameter values on this curve the vector fields (3) are topologically equivalent.
The proposition is formulated in such a way that the situation near the degenerate equilibrium is included and the proof takes place on the original phase space .
Proof. It suffices to show that the vector fields for with flow and with flow are topologically equivalent. The equilibria and their stability types coincide, and so we can simply adapt the proof of proposition 2.1. The initial diffeomorphism becomes where maps the sets , the equilibria of , to their counterparts in the flow . Mapping each equilibrium of to its counterpart in yields the conjugating homeomorphism. Then, time is again rescaled along trajectories that cut through to ensure that is mapped to .□
In particular, for the dynamical system (3) is structurally stable on with sufficiently small (as explained below), if does not satisfy (9). The two possible phase portraits are given in . Also, all systems (3) with have equivalent flows on , represented by the third phase portrait in .
The discriminant set (9) defines the cuspy surface in , and EquationEq. (4)(4) (4) defines the smooth surface asymptotic to the –plane and to the –plane. This confirms that gives a faithful representation of the possible dynamics defined by (3), except for the dynamics at the bifurcations. The bifurcation at (4) concerns the degenerate equilibrium . We follow Haw and Hogan (Citation2018) and do not further consider this empty BN.
When determining the “sufficiently small” size of the disk to be removed from to obtain , we have to make sure that the saddle does not lie in that disk – all while converges to as . To achieve a uniform size , we need a uniform bound on . Here, we are allowed to choose or or whatever fixed we find suitable.
2.4. Results: bifurcations
A bifurcation occurs where the system loses structural stability. As shown in 2.2, we may fix the value of the tolerance of the majority and so for simplicity’s sake we set . Then is the (relative) tolerance of the minority and is the population ratio, while (4) turns into the equation , so we restrict to . The discriminant (9) yields a cuspy curve in the –plane with apex . Here, a dual cusp bifurcation takes place that organizes the whole bifurcation set, see proposition 2.3 below. Fixing the dynamical system (3) becomes
Next to the empty BN and the two segregated BN there are 1–3 equilibria that are the solutions of a cubic equation, which at the discriminant set defined by
turns from having one real solution to having three real solutions.
Proposition 2.3
At the dynamical system (10) undergoes a dual cusp bifurcation.
In particular, when crossing one of the two lines (11) originating from a fold bifurcation takes place where a saddle and a node meet and vanish. That (10) undergoes a dual cusp bifurcation and not a cusp bifurcation means that the third mixed equilibrium (next to the two saddles) is an attractive equilibrium.
Proof. Dividing the cubic (8) by (recall that we restricted to ) and translating puts (8) into the standard form
of a cubic, with
For a negative coefficient of we would have the cusp bifurcation of (Guckenheimer & Holmes, Citation2002, Figure 7.1.1), but since , we have the dual cusp bifurcation instead. In particular, two of the equilibria meet and vanish in a fold bifurcation along paths crossing (11)□
Omitting the tolerance parameter in the picture of the bifurcation set (by putting ) allows us to draw a bifurcation diagram where we choose the majority as the vertical co-ordinate. This results in where every point on the green surface corresponds to a saddle and every point on the red surface corresponds to the mixed stable node. Where the two surfaces meet we have fold bifurcations and the dual cusp bifurcation takes place where the fold line has a vertical tangent. Note that the two bifurcation diagrams of (Haw & Hogan, Citation2018, ) can be obtained by taking sections with and , respectively.
To complement the representation of the dynamics in we give in the phase portraits along a –line with fixed. This clearly shows how the attracting equilibrium born in the first fold bifurcation moves over to the other saddle to vanish during the second fold bifurcation. At the apex of the cusp, the phase portrait is similar to the middle one in , but with a non-hyperbolic saddle.
3. The two-neighborhood model
A possible extension of Schelling’s BN model is to consider two BNs and study the movement of members of each population type between them. In 3.1, we introduce Haw and Hogan’s (Citation2020) dynamical system for the resulting TN model and we explain the inaccuracies of their mathematical formulation. We then correct this formulation to a non-smooth system of differential equations. In 3.2, we perform a qualitative analysis of the corrected system. We concentrate on Case I of (Haw and Hogan, Citation2020) where both BNs have the same –tolerance schedule and the same –tolerance schedule . The problems already become apparent in this case, which allows us to simplify the notation. Before summarizing our results we comment on Cases II, III and IV where or (in Case IV both) become dependent on the BN.
3.1. Model
The sociological model proposed in (Haw and Hogan, Citation2020) and the mathematical description they give in terms of a dynamical system do not coincide, as we lay out in 3.1.1. In 3.1.2, we therefore change the mathematical description to a dynamical system that does correspond to the sociological model of (Haw and Hogan, Citation2020). The corrected system is no longer smooth, but nevertheless has a unique solution for every initial condition.
3.1.1. Haw and Hogan’s (2020) formulation
Haw and Hogan (Citation2020) extended Schelling’s BN model by considering two BNs labeled and . For the – and –populations are denoted by and , respectively. They assume that any individual leaving one BN must enter the other BN, and there is no reservoir. The two BNs thus form a closed system and the total – and –populations are given by and , which yields
and in this way we decouple the dynamics of . Indeed, by differentiating (12) we get
for whence the initially –dimensional system is split into two –dimensional systems, of which one mirrors the dynamics of the other.
Haw and Hogan (Citation2020) make similar assumptions for their TN model as those made by Schelling (Citation1971) for the BN model. A new assumption made for the TN model, however, is that each individual residing in one of the two BNs is only concerned with the population ratio of the BN of residence when deciding whether to remain or leave (Haw and Hogan, Citation2020, p. 223). Using the same tolerance schedules and (in the present Case I for both BNs), they then present the system
governing the dynamics of . Here and are still the functions defined in (3), but now applied in both and . At first glance, the system (14) seems to reflect their described model. Upon closer inspection, however, one can observe that these equations are an incorrect mathematical formulation of the stated model.
Given the assumption that individuals only care about the population ratio of the BN that they currently reside in, the first term on the right-hand side of EquationEq. (14a)(14a) (14a) accounts for the movement of the –population in and out of , subject to the presence of the –population. For this reflects that the members of the –population remain in or leave according to the tolerance schedule. When , the size of the –population is tolerated by the whole –population and all –members remain in . The situation is most transparent if we additionally assume that , i.e. in the –population is in equilibrium. Then and thus results in an increase in . The only reasonable explanation is that the –population in is increasing as a result of –members moving in from elsewhere besides or . This, however, contradicts the assumption that we have a closed system with . Thus, when , Haw and Hogan’s (Citation2020) differential equation for fails to adequately represent their stated model. This similarly applies to and to the terms in (14b).
3.1.2. A new dynamical system
For the differential equations to correctly represent the model of (Haw and Hogan, Citation2020) the following conditions must be met.
When , we need to replace in (14a) by . Indeed, members of the –population in are satisfied with the population ratio and remain in . As individuals only look at the composition of their own BN and we have a closed system of two BNs, no members of the –population from outside enter and there is no contribution resulting from the population ratio in .
Similarly, when , we need to replace in (14a) by .
When , we need to replace in (14b) by .
When , we need to replace in (14b) by .
Hence, we have to adjust the differential equations (14) according to where the point lies with respect to the parabolas and , . Therefore, we define
The corrected dynamical system then reads as
where
The dynamics of can still be obtained from that of using (13), which yields
Before a qualitative analysis of the new system (16) can be conducted we must first confirm that (16) still has unique solutions (Blanchard et al., Citation1998, p. 66).
Proposition 3.1
At every point the system (16) is locally Lipschitz continuous and thus satisfies the condition for existence and unicity of solutions for all initial conditions .
A function is locally Lipschitz continuous if for every there exist constants and such that the inequality
is satisfied for all . This strong form of continuity is satisfied by every continuously differentiable system – take for the Lipschitz constant a local bound on the derivative.
Proof. As linear combinations of locally Lipschitz continuous functions are again locally Lipschitz continuous, we only treat (the arguments for and are similar). On and the function is continuously differentiable and therefore locally Lipschitz continuous with an upper bound of the length of the total derivative of on . At a point on the parabola , the function is still continuous, and we use that the ingredient of is continuously differentiable everywhere, hence with a Lipschitz constant in . The inequality
for becomes
when both and are in , while when both and are in the latter simply reads as . When and , we have that
as and (and similarly when and ).□
Note that during the proof we replaced by its closure to include the points on the parabola .
3.2. Analysis
Again the scaling (5) allows to be achieved in (16), at the cost of replacing by and multiplying the right-hand side in (16b) by . Haw and Hogan (Citation2020) scale time by instead – effectively this means an extra scaling of time by and correspondingly the vector field is multiplied by . The extra scaling of time by has the advantage that
and
have no in the denominator, so we keep it. For the nullclines, the parameters are again reduced from to . By taking the tolerance we have keeping its interpretation of the majority–minority ratio, while can be interpreted as the relative tolerance of the –population (with respect to the –population).
3.2.1. Equilibria
The segregated , have both right hand sides in (16) read as and are always equilibria, even though the vector field is not differentiable on the union of the four parabolas. On the vector field (16) coincides with (14) and the analysis of (Haw and Hogan, Citation2020) remains valid. In particular, the mixed is also always an equilibrium. Additional equilibria can be obtained from the roots of a polynomial of degree . These – equilibria are the intersections of the two cubic nullclines computed by Haw and Hogan (Citation2020).
The cubic –nullcline always stays inside — except if the parabolas and overlap (so and intersect) and the right-hand side of (16a) is identically zero on . Then, the part of the cubic –nullcline inside is a line of equilibria. Similarly, where the parabolas and overlap, the part of the cubic –nullcline inside is a line of equilibria. If both and , then the common intersection is even a “sea of equilibria” with its boundary formed by parts of the parabolas. All such infinite sets of equilibria are centered at the mixed equilibrium .
In particular, where valid the polynomial is never zero and where there are only finitely many equilibria, these are the segregated , , both attracting and the mixed , a saddle (see 3.2.2 for more details). Before the bifurcations found by Haw and Hogan (Citation2020) can occur, the saddle has already exploded into an infinite set of equilibria. Note that the – and –axes are no longer nullclines of the system whence the empty is no longer an equilibrium, unlike in the single BN model.
The existence of a line or a sea of equilibria immediately shows that for such parameter values, the system (16) is not structurally stable. Let us therefore compute the parameter values for which the parabolas start overlapping. Equating
yields
so the change occurs at . Similarly,
yields
and the change occurs at . We therefore have the bifurcation set shown in .
3.2.2. Phase portraits
The adjustment from (14a) to (16a) is most severe on , where the parabolas intersect. For the term is positive and gets adjusted to zero, but the term is also positive, so the right-hand side of (16a) remains positive. Similarly, on the right-hand side of (16a) remains negative. On the cubic –nullcline separates the positive values below from the negative values above. Thus, we have the minor adjustment that the positive values below the –nullcline become smaller (but remain positive) within and that the negative values above the –nullcline become larger (but remain negative) within . The major adjustment is that the –nullcline gets broadened to where the parabolas intersect. The same applies mutatis mutandis to the adjustment from (14b) to (16b).
Proposition 3.2
For the adjusted system (16) the segregated equilibria and are attractive.
Haw and Hogan (Citation2020) could simply compute the Jacobians in these points to prove that these are both stable nodes for (14). The system (16) is not differentiable at the segregated equilibria.
Proof. We focus on as the argument for is similar. Take so small that
and consider the vector field (16) restricted to . Between the two nullclines, the vector field points downward right, while on the two nullclines, the vector field points to the right and points downwards, respectively. Below the –nullcline the vector field points upward right and all trajectories reach the –nullcline in finite time. To the right of the –nullcline the vector field points downward left and all trajectories reach the –nullcline in finite time. Hence, all initial conditions have trajectories that tend to as .□
The basins of attraction of the segregated equilibria are clearly larger than a mere disk , see e.g. , left. Within the Jacobian of (16) reads as
where the global factor in the second row corrects the scaling error of Haw and Hogan (Citation2018, Citation2020). The determinant
is negative for whence the isolated mixed equilibrium is a saddle. This yields for with the phase portrait in , left. The basins of attraction of and are separated by the stable manifold of the mixed equilibrium. Recall that for we have that and is not isolated.
For on the boundary the vector field (16) is only Lipschitz continuous in . Taking for the limit shows that the isolated mixed equilibrium keeps behaving like a saddle when the limit is reached. This is also true when taking for the limit . In particular, for , the mixed equilibrium is isolated and behaves like a saddle, see , right.
When (and ) the vector field is differentiable on , but the entries in the second row of (17) have been replaced by . This yields a zero eigenvalue, corresponding to the line of equilibria. When (but ) the first row of (17) consists of zeroes, corresponding to the line of equilibria in . When , the equilibria in have zero Jacobian (both eigenvalues vanish). Note that on the boundaries of these lines and sea of equilibria, the vector field (16) still vanishes, but is not differentiable.
3.2.3. Different tolerance schedules
In (Haw and Hogan, Citation2020) the vector field (14) has and depend on the BN, with replaced by
and replaced by
Thus, and have different tolerance parameters and , respectively. A scaling like (5) cannot change their ratio, which is denoted by
Similarly, and get replaced by
with different tolerance parameters and , while the population ratio remains the same for both BNs. Scaling both and by but by leadsFootnote3 to
with again an extra factor in the –equation, but otherwise reducing parameters from to .
Generalizing (16) to (18) does not change the fundamental problem. While the symmetry line reflecting the parabola into the parabola may shift up or down, on their intersection the right-hand side of the –equation still has to vanish. The situation is similar for , and the –equation.
The bifurcations to further equilibria found in (Haw and Hogan, Citation2020) all require the cubic –nullcline to evolve extrema. However, before this can happen the –nullcline must have a horizontal inflection point, after which the parabolas and start to intersect. This similarly applies for the –nullcline. Thus, before the bifurcations detailed in (Haw and Hogan, Citation2020) can take place, the adjustment of the vector field has already led to lines and seas of equilibria. Note that an intersection of parabolas no longer necessarily leads to infinitely many equilibria, as the “other” nullcline is no longer enforced to pass through, but a bifurcation does require the “other” nullcline to pass through the intersection of parabolas.
Again, such dynamics is not structurally stable. Also, for tolerance parameters dependent on the BN, the only phase portraits with finitely many equilibria have the two attracting segregated equilibria and a mixed saddle. As already explained by Haw and Hogan (Citation2020), the saddle is no longer fixed at , though.
3.2.4. Results
The corrected system (16) fails to be everywhere differentiable, but in a mild way. It is still locally Lipschitz continuous, and hence for every initial condition, there is a unique solution – all these together form the flow of (16). Mutatis mutandis this applies to the system with right-hand sides (18) as well.
Our qualitative analysis yields a bifurcation set in , with three of the four open regions displaying dynamics that is not structurally stable. For the system (14), Haw and Hogan (Citation2020) were able to find bifurcations to attractive mixed equilibria, and we discuss in the concluding section 5 that it might be worthwhile to change the sociological model in such a way that (14) becomes a valid description.
A word of warning on the use of structural stability: for parameter values like deep inside the structurally stable region of , the smallness requirement on perturbations is not particularly restrictive. However, the closer is chosen to the boundary of that region, the smaller the perturbations are allowed to be. This phenomenon is not specific to the structurally stable part of the TN model, but is “built-in” to the definition of structural stability: for a perturbation to be sufficiently small may mean very small.
4. Discussion
We have achieved a complete understanding of the dynamics defined by (3) and by (16). We explained how the dynamics of (3) can be made structurally stable when the product of tolerances is above a threshold and detailed the occurring bifurcations. The system (16) is not structurally stable for outside the small wedge . For inside the wedge, the dynamics consists of two stable segregated equilibria, with basins of attraction separated by the stable manifold of a mixed saddle, see , left.
4.1. Interpretation
The BN model is very simple, and this is not only its weakness but also its strength. Among the weaknesses is the choice of the phase space
where the relative size of the minority population is built into the model in such a way that there are immediate further consequences. For instance, where the segregated equilibrium may grant one place per member of the –population, the segregated equilibrium then grants on average places to members of the –population. The capacity of the BN is simply not part of the model as the –slots and the –slots are filled independently; the interdependence in (3) solely stems from the tolerances.
Another weakness of the BN model is that (2) not leaving invariant is remedied by simple multiplication with and , respectively. This turns the empty into a degenerate equilibrium. The model still captures that the composition of a BN with a low number of residents does not exceed the tolerance of most individuals of either population. Consequently, individuals start moving in when the initial composition of the BN is close to the origin. The composition then changes over time as residents remain or leave, depending on whether their tolerances are exceeded. Eventually, the BN reaches a composition that corresponds to a stable node (or, exceptionally, to a saddle).
Two stable nodes present for all parameter values are and . This shows that regardless of how tolerant the two populations are, the BN can segregate. For parameter values outside the cuspy region, the initial composition merely decides whether the BN segregates to an all –population or an all –population – and the precise parameter values determine which basin of attraction is larger (but that is already a quantitative statement that might be out of the scope of the BN model).
For parameter values inside the cuspy region, there is a third stable node, with a mixed population. The precise position of the parameters determines the position of the two saddles (and of the mixed stable node) and thus determines the sizes of the three basins of attraction. The node the dynamics leads to depends on the basin of attraction the initial composition lies in. Where the trajectory passes close to the stable manifold of a saddle, BN tipping can be achieved by pushing the state of the system to the other side of that stable manifold.
An interesting prediction made by the BN model is that a stable node corresponding to a mixed BN only exists for parameter values inside the cuspy region. It is thus not sufficient to simply increase the values of the tolerance parameters to obtain a mixed BN. Instead, such an increase has to be achieved in such a way that the parameter values enter the cuspy region and one still has to ensure that the trajectory is in the basin of attraction of the mixed node. The phase portraits in convincingly show how increasing the tolerance of the minority — with population ratio and tolerance of the majority fixed – first leads to the creation of a mixed stable node and, as the tolerance is increased even further, to the annihilation of that mixed stable node. Ultimately, the basin of attraction for the segregated stable node of an all population is increased, while the basin of attraction for an all population has decreased. Note that the mathematical assumption that the tolerance remains fixed during the increase in the tolerance probably does not reflect real life and might increase as well, keeping the system in the cuspy region.
The TN model on with has dynamics rather similar (although not topologically equivalent) to the BN model on with parameter values outside the cuspy region (but satisfying ). The population trajectories lead to neighborhood compositions where is segregated with one type of population and is likewise segregated with the other type of population. The initial composition of the neighborhoods merely decides which particular population each BN is eventually composed of (unless we have an exceptional initial composition on the stable manifold of the saddle). The model is not well fitted to describe real-world scenarios that have parameter values with or .
4.2. Further results
We include in this discussion section two further results that concern both models. One result is that there are no periodic orbits; the only orbits for which the past completely stays inside the phase space are the or equilibria and the or heteroclinic orbits from saddles to attractive equilibria. All other orbits leave the phase space (for the BN model), respectively (for the TN model) in backward time. The second result concerns a two-neighborhood model with reservoir – but without direct movements between the neighborhoods. Such a model is brought back to two individual BN models.
4.2.1. Absence of periodic orbits
We can discard the possibility of periodic orbits discussed in (Haw & Hogan, Citation2018). The Poincaré index of a periodic orbit is and has to be equal to the sum of the Poincaré indices of the equilibria within the periodic orbit (Guckenheimer & Holmes, Citation2002, § 1.8). The occurring mixed equilibria are stable nodes (index ), saddles (index ), equilibria undergoing a fold bifurcation (index ), a non-hyperbolic saddle (index ) and continuous sets of infinitely many equilibria (such a set has the Poincaré index of a saddle).
This shows that the mixed stable node has to lie within a periodic orbit. A periodic orbit around the stable node has to intersect both nullclines twice. The direction at an –nullcline is counterclockwise, and the direction at a –nullcline is clockwise, a contradiction. Hence, there are no periodic orbits.
4.2.2. Two neighborhoods with reservoir
A possible extension of the TN model consists of two BNs along with the option of an individual to be in neither BN, but elsewhere (in the reservoir). Haw and Hogan (Citation2020) already presented this case and described the reservoir as two extra segregated BNs. Instead, we keep our interpretation of the reservoir standing for the rest of the city, the dynamics of which we are not interested Footnote4 in. To move between the BNs, all individuals go through the reservoir: there is no direct movement. We now argue that this model is equivalent to two independent single BN models.
Haw and Hogan (Citation2020) denote the populations in as , . They also denote the populations in the reservoir as and . Note that the numbers and then no longer give the total population in and . Instead, this opens the possibility that or and correspondingly Haw and Hogan (Citation2020) present the differential equations
with and given by (18). However, our interpretation of the reservoir standing for the rest of the city rules out the possibility of an empty reservoir. Then, the system (19) indeed reduces to (3).
The assumptions made in this model are the same as the assumptions made in the case of two independent single BN models. Individuals’ decisions to stay or move to the reservoir are based on whether the tolerances are exceeded. Accordingly, the TN model with reservoir is equivalent to two independent single BN models. These arguments generalize from two to multiple BN, proving the following result.
Proposition 4.1
In a system of several BNs where movement only occurs through the reservoir (that cannot be depleted), the dynamics in any of the BNs is that of an independent single BN.□
As we model the change in densities of the two populations and not the movement of individuals, the problem of one individual simultaneously moving to two different BNs does not arise.
5. Conclusion
We examined Schelling’s (Citation1971) and Haw and Hogan’s (Citation2018, Citation2020) BN and TN models for two populations, and . The two models demonstrate how individual discriminatory preferences can lead to segregation. We carried out a qualitative analysis of both the BN model and the corrected TN model, determining the equilibria and their stability. In the corrected TN model the only dynamical attraction is to segregated equilibria, but in the BN model there is a region of parameters for which a significant part of the phase space is attracted to a mixed equilibrium. Schelling and Haw and Hogan developed their models to understand how segregation can emerge without external intervention, even among tolerant populations. It is therefore critical that these models are free from mathematical problems that undermine modeling results.
Both models have areas that require improvement. The TN model does not have great predictive power as its systems may not be structurally stable, limiting the freedom to set parameter values. The primary cause is the assumption that individuals only consider the composition of their own BN when deciding whether to remain or leave. Future research should attempt to revise this assumption, e.g. by having all individuals taking the population ratio of both BNs into account. Where both and in (18), the mathematical difference can then be thought of as sending the –population to the greater attraction. However, this has serious consequences on the sociological interpretation of the model as individuals may then move even if the tolerance is not exceeded.
Future studies can improve the BN model by obtaining demographic data with different time points to construct reasonable and realistic tolerance schedules and parameter estimates. Haw and Hogan (Citation2018, Citation2020) study how replacing the linear tolerance schedules by polynomial or exponential tolerance schedules affects the dynamics. The parabolas (1) get replaced by functions that still have a unique maximum on and , respectively. One-bump functions were also obtained in an empirical study by Clark (Citation1991), but with functions that are not differentiable at the maximum. Schelling (Citation1971) already showed how such corners can be obtained with piecewise linear tolerance schedules. Monotonically decreasing tolerance schedules consisting of two linear pieces should keep the BN model simple enough to allow for a thorough study, while rendering the dynamical system more realistic.
Acknowledgments
We would like to thank Dr Özge Bilgili for guiding us toward a more-nuanced understanding of the complexity of the effects of residential segregation. We also thank the anonymous referee for improving the presentation of our results.
Disclosure statement
No potential conflict of interest was reported by the authors.
Notes
1 Also known as saddle-node bifurcations.
2 So small that making even smaller or larger does not change the shape of .
3 Next to as the scaling replaces the tolerance in (18b) by .
4 In particular, we do not exclude the possibility that there is a district in the city with only members of the –population or a district with only members of the –population.
References
- Blanchard, P., Devaney, R., & Hall, G. (1998). Differential Equations. Brookes/Cole.
- Bolt, G. (2009). Combating residential segregation of ethnic minorities in European cities. Journal of Housing and the Built Environment, 24(4), 397–405. https://doi.org/10.1007/s10901-009-9163-z
- Bolt, G., Özüekren, A. S., & Phillips, D. (2010). Linking integration and residential segregation. Journal of Ethnic and Migration Studies, 36(2), 169–186. https://doi.org/10.1080/13691830903387238
- Clark, W. A. V. (1991). Residential preferences and neighborhood racial segregation: A test of the Schelling segregation model. Demography, 28(1), 1–19. https://doi.org/10.2307/2061333
- Cutler, D. M., & Glaeser, E. L. (1997). Are ghettos good or bad? The Quarterly Journal of Economics, 112(3), 827–872. https://doi.org/10.1162/003355397555361
- De la Roca, J., Ellen, I. G., & O’Regan, K. M. (2014). Race and neighborhoods in the 21st century: What does segregation mean today? Regional Science and Urban Economics, 47, 138–151. https://doi.org/10.1016/j.regsciurbeco.2013.09.006
- Fajth, V., & Bilgili, Ö. (2020). Beyond the isolation thesis: Exploring the links between residential concentration and immigrant integration in the Netherlands. Journal of Ethnic and Migration Studies, 46(15), 3252–3276. https://doi.org/10.1080/1369183X.2018.1544067
- Guckenheimer, J., & Holmes, P. (2002). Nonlinear Oscillations, Dynamical Systems, and Bifurcations of Vector Fields (7th ed.). Springer.
- Haw, D. J., & Hogan, S. J. (2018). A dynamical systems model of unorganized segregation. The Journal of Mathematical Sociology, 42(3), 113–127. https://doi.org/10.1080/0022250X.2018.1427091
- Haw, D. J., & Hogan, S. J. (2020). A dynamical systems model of unorganized segregation in two neighborhoods. The Journal of Mathematical Sociology, 44(4), 221–248. https://doi.org/10.1080/0022250X.2019.1695608
- Iceland, J. (2004). Beyond Black and White: Metropolitan residential segregation in multi-ethnic America. Social Science Research, 33(2), 248–271. https://doi.org/10.1016/S0049-089X(03)00056-5
- Jones, R. (2019, April 26). Apartheid ended 29 years ago. How has South Africa changed? National geographic. Retrieved March 5, 2021, from https://www.nationalgeographic.com/culture/2019/04/how-south-africa-changed-since-apartheid-born-free-generation/
- Lewis, O. (1966). The culture of poverty. Scientific American, 215(4), 19–25. https://doi.org/10.1038/scientificamerican1066-19
- Musterd, S. (2003). Segregation and integration: A contested relationship. Journal of Ethnic and Migration Studies, 29(4), 623–641. https://doi.org/10.1080/1369183032000123422
- Schelling, T. C. (1971). Dynamic models of segregation. The Journal of Mathematical Sociology, 1(2), 143–186. https://doi.org/10.1080/0022250X.1971.9989794
- Williams, D. R., & Collins, C. (2001). Racial residential segregation: A fundamental cause of racial disparities in health. Public Health Reports, 116, 404–416. https://doi.org/10.1016/S0033-3549(04)50068-7