Theory and Methods

Randomness of Shapes and Statistical Inference on Shapes via the Smooth Euler Characteristic Transform

Received 29 Jan 2023, Accepted 22 Apr 2024, Published online: 31 May 2024

Abstract

In this article, we establish the mathematical foundations for modeling the randomness of shapes and conducting statistical inference on shapes using the smooth Euler characteristic transform. Based on these foundations, we propose two Chi-squared statistic-based algorithms for testing hypotheses on random shapes. Simulation studies are presented to validate our mathematical derivations and to compare our algorithms with state-of-the-art methods to demonstrate the utility of our proposed framework. As real applications, we analyze a dataset of mandibular molars from four genera of primates and show that our algorithms have the power to detect significant shape differences that recapitulate known morphological variation across suborders. Altogether, our discussions bridge the following fields: algebraic and computational topology, probability theory and stochastic processes, Sobolev spaces and functional analysis, analysis of variance for functional data, and geometric morphometrics. Supplementary materials for this article are available online, including a standardized description of the materials available for reproducing the work.

1 Introduction

The quantification of shapes has become an important research direction. It has brought advances to many fields including geometric morphometrics (Boyer et al. 2011; Gao, Kovalsky, and Daubechies 2019; Gao et al. 2019), biophysics and structural biology (Wang et al. 2021; Tang et al. 2022), and radiogenomics (Crawford et al. 2020). When shapes are considered as random variables, their corresponding quantitative summaries are also random, implying that such summaries of random shapes are statistics. The statistical inference on shapes based on these quantitative summaries has been of particular interest (Fasy et al. 2014; Roycraft, Krebs, and Polonik 2023).

In this article, we bring together mathematical and statistical approaches to make three significant contributions to shape statistics: (i) we provide mathematical foundations for the randomness of shapes encountered in applications, bridging algebraic topology (Hatcher 2002) and stochastic processes (Hairer 2009); (ii) we connect the statistical inference on shape-valued data to the well-studied analysis of variance for functional data (fdANOVA, Zhang 2013), bridging topological data analysis (TDA, Edelsbrunner and Harer 2010) and functional data analysis (FDA, Hsing and Eubank 2015); and (iii) our framework does not rely on any assumptions about diffeomorphisms or pre-specified landmarks.

1.1 A Motivating Scientific Question

Through modeling the randomness of shapes, we aim to address the following statistical inference question: Is the observed difference between two groups of shapes statistically significant? For example, the mandibular molars in Figure 1 are from four genera of primates. A pertinent question from a morphological perspective is: In Figure 1, do the molars from the genus Tarsius exhibit significant differences from those of the other genera?

Fig. 1 Left: Molars from two suborders of the primates: Haplorhini and Strepsirrhini. The Haplorhini suborder has genera Tarsius (yellow) and Saimiri (grey). The Strepsirrhini suborder has genera Microcebus (blue) and Mirza (green). Right: Relationship between the four primate genera. Tarsier molars exhibit additional high cusps (highlighted in red). A similar figure was published in Wang et al. (2021).

The primary objective of this article is to propose a powerful approach for testing hypotheses on random shapes. This would help address morphology-motivated statistical inference questions like the one raised above. In achieving this objective, we lay down the mathematical foundations that justify our hypothesis testing methods. We take two key steps: in Step 1, we find appropriate representations of shapes; and in Step 2, we test hypotheses on shapes using these representations. In Section 1.2, we provide a literature review on shape representations and introduce the topological summary employed in this article. Section 1.3 begins by presenting the main theme of our hypothesis testing approach, followed by an overview of our contributions. Since the molars in Figure 1 are diffeomorphic to the two-dimensional unit sphere, some existing diffeomorphism-related methods could be considered for representing the molars (e.g., parameterized surfaces; Kurtek et al. 2011). In contrast, we aim to propose an approach that does not rely on any diffeomorphic assumptions, allowing for a wider range of applications.

1.2 Overview of Shape and Topological Data Analysis

In classical geometric morphometrics, shapes are represented using prespecified points called landmarks (Kendall 1989). The manual landmarking of a collection of shapes requires domain knowledge, can be very labor intensive, and is subject to bias (Boyer et al. 2011). Furthermore, an equal number of landmarks must be selected for each shape in a study in order to make comparisons (e.g., the Procrustes framework discussed in sec. 2.1 of Gao et al. 2019). This necessitates comprehensive information about entire collections of shapes for consistency, which can be difficult to obtain (e.g., landmarking cancer tumors, which can have very different morphology across a population of patients). Unfortunately, many datasets do not come with prespecified landmarks (e.g., Goswami 2015). Although many algorithms can automatically sample reasonable landmarks on shapes when their parameters are fine-tuned (e.g., Gao et al. 2019; Gao, Kovalsky, and Daubechies 2019), using a finite number of landmarks extracted from a continuum inevitably results in a loss of information. Diffeomorphism-based approaches (Dupuis, Grenander, and Miller 1998; Gao et al. 2019) are part of the "computational anatomy" historically studied by the "pattern theory school" pioneered by Ulf Grenander (Grenander and Miller 1998). They enable the comparison of (dis-)similarity between shapes with the benefit of bypassing the need for landmarks. However, these approaches are based on the assumption that the shapes being compared are diffeomorphic to one another, making them unsuitable for many datasets (e.g., fruit fly wings in Miller 2015). Relatedly, parameterized curves and surfaces (PCS) provide a toolbox for assessing the heterogeneity of shapes with summary statistics that are invariant to reparameterizations (Kurtek et al. 2010, 2011, 2012). Despite their effectiveness in analyzing real data (e.g., DT-MRI brain fibers; Kurtek et al. 2012), PCS are based on assumptions about the diffeomorphism types of the shapes of interest. For example, Kurtek et al. (2011) focuses on surfaces that are diffeomorphic to the two-dimensional unit sphere.

TDA opens the door for landmark-free and diffeomorphism-free representations of shapes. Motivated by differential topology, Turner, Mukherjee, and Boyer (2014) proposed the persistent homology transform (PHT) with the capability to sufficiently encode all information within shapes (Ghrist, Levanger, and Mai 2018). To describe the PHT, we briefly provide some basics of TDA. One common statistical invariant in TDA is the persistence diagram (PD, Edelsbrunner and Harer 2010). When equipped with the Wasserstein distance, the collection of PDs, denoted as $\mathcal{D}$, is a Polish space (Mileyko, Mukherjee, and Harer 2011). Thus, probability measures can be applied, and the randomness of shapes can be represented using the probability measures on $\mathcal{D}$. The PHT takes values in $C(S^{d-1}; \mathcal{D}^d) = \{\text{continuous maps } F: S^{d-1} \to \mathcal{D}^d\}$, where $S^{d-1}$ denotes the sphere $\{x \in \mathbb{R}^d : \|x\| = 1\}$ and $\mathcal{D}^d$ is the $d$-fold Cartesian product of $\mathcal{D}$ (Turner, Mukherjee, and Boyer 2014, Lemma 2.1 and Definition 2.1). A single PD does not preserve all information of a shape (Crawford et al. 2020). In contrast, the PHT is injective, which means it preserves all the information of a shape. However, since $\mathcal{D}$ is not a vector space and the distances on $\mathcal{D}$ are abstract (e.g., the Wasserstein and bottleneck distances, Cohen-Steiner et al. 2007), many fundamental statistical concepts do not easily apply to summaries resulting from the PHT. For example, the definition of moments corresponding to probability measures on $\mathcal{D}$ (e.g., means) is highly nontrivial (Mileyko, Mukherjee, and Harer 2011). The difficulty in defining these concepts hinders the application of PHT-based statistical methods in $C(S^{d-1}; \mathcal{D}^d)$.

The smooth Euler characteristic transform (SECT, Crawford et al. 2020) offers an alternative summary statistic for shapes. The SECT not only preserves the information of shapes (Ghrist, Levanger, and Mai 2018, Corollary 1) but also represents shapes using continuous functions instead of PDs. More precisely, the values of the SECT are maps from the sphere $S^{d-1}$ to a separable Banach space $\mathcal{B} \overset{\text{def}}{=} C([0,T])$, the collection of continuous functions on a compact interval $[0,T]$ (values of $T$ will be given in (3.1)). Hence, for any shape $K$, its SECT, denoted as $\{\mathrm{SECT}(K)(\nu)\}_{\nu \in S^{d-1}}$, lies in $\mathcal{B}^{S^{d-1}} = \{\text{maps } F: S^{d-1} \to \mathcal{B}\}$. Specifically, $\mathrm{SECT}(K)(\nu)$ belongs to $\mathcal{B}$ for each $\nu \in S^{d-1}$. As a result, the randomness of shapes $K$ is represented via the SECT by a collection of $\mathcal{B}$-valued random variables. Probability theory in separable Banach spaces is better developed than in $\mathcal{D}$ (e.g., Hairer 2009). In particular, a $\mathcal{B}$-valued random variable is a stochastic process with its sample paths in $\mathcal{B}$. As we will demonstrate in Section 3, $\mathcal{B}$ here can be replaced with a reproducing kernel Hilbert space (RKHS). The theory of stochastic processes has evolved over a century, and FDA is a well-developed branch of statistics. Consequently, a myriad of tools are available to underpin both the randomness of shapes and the statistical inference on shapes.

From an application perspective, Crawford et al. (2020) applied the SECT to magnetic resonance images taken from tumors in a cohort of glioblastoma multiforme (GBM) patients. Using summary statistics derived from the SECT as predictors within Gaussian process regression, the authors demonstrated that the SECT can predict clinical outcomes more effectively than existing tumor shape quantification approaches and common molecular assays. The relative performance of the SECT in the GBM study suggests a promising future for its utility in medical imaging and broader statistical applications related to shape analyses. Similarly, Wang et al. (2021) used derivatives of the Euler characteristic transform (ECT) as predictors in statistical models for subimage analysis. This analysis is akin to variable selection, aiming to identify physical features that are important for distinguishing between two classes of shapes. Lastly, Marsh et al. (2022) highlighted that the SECT outperforms the standard measures employed in organoid morphology.

1.3 Overview of Contributions and Article Organization

Our goal is to address the hypothesis testing question posed in Section 1.1 by employing a landmark-free and diffeomorphism-free approach, which opens up possibilities for further applications in the future. We formulate the question more generically here. Let $\mathbb{P}^{(1)}$ and $\mathbb{P}^{(2)}$ be two distributions that generate two collections of random shapes, $\{K_i^{(1)}\}_{i=1}^n$ and $\{K_i^{(2)}\}_{i=1}^n$. Detecting whether there is a significant difference between $\{K_i^{(1)}\}_{i=1}^n$ and $\{K_i^{(2)}\}_{i=1}^n$ is equivalent to rejecting the hypothesis $\mathbb{P}^{(1)} = \mathbb{P}^{(2)}$. Since each shape $K_i^{(j)}$ is random, $\mathrm{SECT}(K_i^{(j)})$ is a random variable taking values in a vector space (as discussed in Section 1.2) and can be decomposed as follows (see Theorem 5.1 for a rigorous version):

(1.1) $\mathrm{SECT}(K_i^{(j)}) = m^{(j)} + \text{random terms}, \quad \text{for } j \in \{1,2\}$,

where $m^{(j)}$ denotes the mean of $\mathrm{SECT}(K_i^{(j)})$ with respect to the distribution $\mathbb{P}^{(j)}$. The random terms in (1.1) can be characterized by the Karhunen–Loève (KL) expansion (Hsing and Eubank 2015, sec. 7.3). To reject $\mathbb{P}^{(1)} = \mathbb{P}^{(2)}$, it suffices to reject $m^{(1)} = m^{(2)}$. That is, the question posed in Section 1.1 can be addressed by testing for the equality of two means. The important component of the test is the variance represented by the random terms in (1.1). In Section 5, we formulate this test as a two-sample problem in the fdANOVA literature (Zhang 2013, sec. 5.2). In addition, using the KL expansion, we provide a $\chi^2$-statistic in Section 5 to test the hypothesis. Throughout the article, our focus is on the two-sample problem. However, one may also consider employing the one-way fdANOVA to compare the means of three or more groups of shapes. The theoretical foundation and numerical experiments for this aspect are left for future research.

To develop our framework, we have to address the following questions related to mathematical foundations: (i) What underlying probability spaces allow the randomness of shapes and their corresponding SECT? and (ii) Are the conditions of the KL expansion satisfied in our setting? We answer these questions in Sections 3 and 4: we model the randomness of shapes via the SECT using RKHS-valued random fields. The "theory of random sets" is a well-established framework for characterizing set-valued random variables (Molchanov 2005). However, its application to persistent homology-based statistics (e.g., the SECT) remains underexplored. In this article, we introduce a new probability space to characterize the randomness of shapes in a manner compatible with the SECT.

We first propose a collection of shapes as our sample space on which the SECT is well-defined. We then demonstrate that every shape in this collection has its SECT in $C(S^{d-1}; \mathcal{H}) = \{\text{continuous maps } F: S^{d-1} \to \mathcal{H}\}$, where $\mathcal{H} = H_0^1([0,T])$ is not only a Sobolev space (Brezis 2011) but also an RKHS (reasons for using $[0,T]$ instead of $(0,T)$ for $H_0^1([0,T])$ are in Appendix A.1). Importantly, $C(S^{d-1}; \mathcal{H})$ is a separable Banach space (Theorem C.1) and, hence, a Polish space. It helps construct a probability space to characterize the distributions of shapes. Building on this probability space, we define the mean and covariance of the SECT. Using the Sobolev embedding, we present some properties of the mean and covariance, which pave the way for the KL expansion of the SECT.

Traditionally, the statistical inference on shapes in TDA is conducted in the persistence diagram space $\mathcal{D}$, which is unsuitable for exponential family-based distributions and requires any corresponding statistical inference to be highly nonparametric (Fasy et al. 2014; Robinson and Turner 2017). The PHT-based statistical inference in $C(S^{d-1}; \mathcal{D}^d)$ is even more difficult. With the KL expansion of the SECT, we propose a $\chi^2$-statistic for testing hypotheses on shapes. Beyond the mathematical foundations, we also provide simulation studies to illustrate the performance of our proposed hypothesis testing method. Lastly, we apply our proposed framework to answer the motivating question raised in Section 1.1.

We organize this article as follows. In Section 2, we provide the mathematical preparations. In Section 3, we define the SECT for a specific collection of shapes, highlighting its properties. In Section 4, we construct a probability space to model shape distributions. In Section 5, we propose the KL expansion of the SECT, leading to a statistic for hypothesis testing. In Section 6, we conduct simulation studies to evaluate our method. In Section 7, we apply our method to real data. In Section 8, we conclude the article. The Appendix provides the proofs of theorems, further data analysis, and future research topics.

2 Notations and Mathematical Preparations

To model the shapes discussed in our motivating question from Section 1.1, we need certain preparations regarding (i) topology and (ii) function spaces.

Topology

The first question we must address is: What are the "shapes" in our framework? Ghrist, Levanger, and Mai (2018) and Curry, Mukherjee, and Turner (2022) applied o-minimal structures (van den Dries 1998) to prove the injectivity of the PHT. Subsequent to this, o-minimal structures have been applied in many TDA studies to model shapes (e.g., Jiang, Kurtek, and Needham 2020; Kirveslahti and Mukherjee 2023). To stay consistent with the existing literature, we also model shapes using o-minimal structures. An o-minimal structure is a sequence $\mathcal{S} = \{\mathcal{S}_n\}_{n \ge 1}$ of subset collections $\mathcal{S}_n \subseteq 2^{\mathbb{R}^n}$ satisfying six set-theoretical axioms, where $2^{\mathbb{R}^n}$ denotes the power set of $\mathbb{R}^n$. The precise definition of o-minimal structures is available in van den Dries (1998) and is provided in Appendix A.3 for the reader's convenience.

A typical example of o-minimal structures is the collection of semialgebraic sets. Specifically, a set $K \subseteq \mathbb{R}^n$ is semialgebraic if it can be expressed as a finite union of sets of the form $\{x \in \mathbb{R}^n \,|\, p(x) = 0,\ q_1(x) > 0, \ldots, q_k(x) > 0\}$, where $p, q_1, \ldots, q_k$ are polynomial functions on $\mathbb{R}^n$. If we define $\mathcal{S}_n$ as the collection of semialgebraic subsets of $\mathbb{R}^n$, then $\mathcal{S} = \{\mathcal{S}_n\}_{n \ge 1}$ is an o-minimal structure (van den Dries 1998, chap. 2). The unit sphere $S^{d-1}$, the open ball $B(0,R) = \{x \in \mathbb{R}^d \,|\, \|x\|^2 < R^2\}$ for any $R > 0$, and all polyhedra (e.g., polygon meshes in computer graphics) are semialgebraic, given that they can be represented using either the polynomial $\|x\|^2$ or affine functions. We assume the following:

Assumption 1.

The o-minimal structure $\mathcal{S}$ of interest contains all semialgebraic sets.

Hereafter, a "shape" refers to a compact set $K \in \bigcup_{n \ge 1} \mathcal{S}_n$ for a prespecified o-minimal structure $\mathcal{S} = \{\mathcal{S}_n\}_{n \ge 1}$ satisfying Assumption 1. Assumption 1 incorporates many common shapes (e.g., balls and polyhedra) in our framework. More importantly, it implies the subsequent Theorem 2.1 through the "triangulation theorem" (van den Dries 1998, chap. 8). Although the definition of an o-minimal structure $\mathcal{S}$ is highly abstract (see Appendix A.3), each compact set in $\mathcal{S}$ resembles a polyhedron, which is precisely stated as follows.

Theorem 2.1.

Suppose $\mathcal{S} = \{\mathcal{S}_n\}_{n \ge 1}$ is an o-minimal structure satisfying Assumption 1 and $K \in \bigcup_{n \ge 1} \mathcal{S}_n$. If $K$ is compact, there exists a finite simplicial complex $S$ such that the polyhedron $|S| \overset{\text{def}}{=} \bigcup_{s \in S} s$ is homeomorphic to $K$, where each $s \in S$ denotes a simplex.

Herein, a finite simplicial complex $S$ is a finite collection of simplexes. Each face of a simplex $s \in S$ also belongs to $S$ (i.e., $S$ is a so-called "closed complex" referred to in chap. 8 of van den Dries 1998). Theorem 2.1 directly results from the "triangulation theorem" (van den Dries 1998); hence, its proof is omitted. For the $d$th component $\mathcal{S}_d$ of $\mathcal{S} = \{\mathcal{S}_n\}_{n \ge 1}$, Theorem 2.1 indicates that the compact sets $K \in \mathcal{S}_d$ are subsets of $\mathbb{R}^d$ that are homeomorphic to polyhedra. Theorem 2.1 also implies that the homology groups of each compact $K \in \mathcal{S}_d$ are well-defined and finitely generated; hence, the Betti numbers and Euler characteristic of $K$ are well-defined and finite (Hatcher 2002, chap. 2).

Function Spaces

We apply the following notations throughout this article:

  1. For any normed space $V$, let $\|\cdot\|_V$ denote its norm. Denote $\|\cdot\|_{\mathbb{R}^d}$ as $\|\cdot\|$ for succinctness.

  2. Let $X$ be a compact metric space equipped with metric $d_X$, and let $V$ denote a normed space. $C(X;V)$ is the collection of continuous maps from $X$ to $V$. Furthermore, $C(X;V)$ is a normed space equipped with $\|f\|_{C(X;V)} = \sup_{x \in X} \|f(x)\|_V$. The Hölder space $C^{0,\frac{1}{2}}(X;V)$ is defined as $\{f \in C(X;V) \,|\, \sup_{x,y \in X,\, x \ne y} (\|f(x) - f(y)\|_V / \sqrt{d_X(x,y)}) < \infty\}$. Here, $C^{0,\frac{1}{2}}(X;V)$ is a normed space equipped with $\|f\|_{C^{0,\frac{1}{2}}(X;V)} = \|f\|_{C(X;V)} + \sup_{x,y \in X,\, x \ne y} (\|f(x) - f(y)\|_V / \sqrt{d_X(x,y)})$. Obviously, $C^{0,\frac{1}{2}}(X;V) \subseteq C(X;V)$. For simplicity, we denote $C(X) = C(X;\mathbb{R})$ and $C^{0,\frac{1}{2}}(X) = C^{0,\frac{1}{2}}(X;\mathbb{R})$. For a given $T > 0$ (e.g., see (3.1)), we denote $C([0,T])$ as $\mathcal{B}$.

  3. The inner product of $\mathcal{H} = H_0^1([0,T]) = \{f \in L^2([0,T]) \,|\, f' \in L^2([0,T]) \text{ and } f(0) = f(T) = 0\}$ is defined as $\langle f, g \rangle_{\mathcal{H}} = \int_0^T f'(t)\, g'(t)\, dt$ (Brezis 2011, chap. 8.3, Remark 17).

  4. Suppose $(Y, d_Y)$ is a metric space (not necessarily compact). Both $\mathcal{B}(Y)$ and $\mathcal{B}(d_Y)$ denote the Borel algebra generated by the metric topology corresponding to $d_Y$.

  5. $\{F(z)\}_{z \in Z}$ denotes a function $F$ defined on the set $Z$.

The following inequalities are useful for deriving many results presented in this article:

(2.1) $\|f\|_{\mathcal{B}} \le \|f\|_{C^{0,\frac{1}{2}}([0,T])} \le \tilde{C}_T\, \|f\|_{\mathcal{H}}$, for all $f \in \mathcal{H}$,

where $\tilde{C}_T$ is a constant depending only on $T$. The first inequality in (2.1) results from the definition of $\|\cdot\|_{C^{0,\frac{1}{2}}([0,T])}$, while the second inequality is from Brezis (2011, Corollary 9.14; also see Appendix L.2). Equation (2.1) implies the following Sobolev embedding:

(2.2) $H_0^1([0,T]) \overset{\text{def}}{=} \mathcal{H} \hookrightarrow C^{0,\frac{1}{2}}([0,T]) \hookrightarrow \mathcal{B} \overset{\text{def}}{=} C([0,T])$.
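For intuition, here is a short sketch (not the article's proof, which is in Appendix L.2) of why a constant $\tilde{C}_T$ as in (2.1) exists, using only the fundamental theorem of calculus and the Cauchy–Schwarz inequality for $f \in \mathcal{H}$ (so that $f(0) = 0$):
\[
|f(t) - f(s)| = \Big| \int_s^t f'(\tau)\, d\tau \Big| \le |t - s|^{1/2}\, \|f'\|_{L^2([0,T])} = |t - s|^{1/2}\, \|f\|_{\mathcal{H}},
\]
so the $\frac{1}{2}$-Hölder seminorm of $f$ is at most $\|f\|_{\mathcal{H}}$; taking $s = 0$ gives $\|f\|_{\mathcal{B}} \le \sqrt{T}\, \|f\|_{\mathcal{H}}$, and hence $\|f\|_{C^{0,\frac{1}{2}}([0,T])} \le (1 + \sqrt{T})\, \|f\|_{\mathcal{H}}$, showing that $\tilde{C}_T = 1 + \sqrt{T}$ is one admissible choice.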

3 Smooth Euler Characteristic Transform

In this section, we give the background on the SECT and propose corresponding mathematical foundations. Notably, we specify the “sample space”—a collection of shapes on which the SECT is well-defined. The SECT of the shapes in this sample space has properties that are suitable for the probability theory developed in Section 4. The molars in the motivating question from Section 1.1 will be modeled as elements of the sample space.

Suppose an o-minimal structure $\mathcal{S} = \{\mathcal{S}_n\}_{n \ge 1}$ satisfying Assumption 1 is given, and we focus on shapes in $d$-dimensional space $\mathbb{R}^d$. We assume the shape $K \in \mathcal{S}_d$ is compact and $K \subseteq B(0,R) = \{x \in \mathbb{R}^d : \|x\| < R\}$; examples include the $K \subseteq \mathbb{R}^2$ in Figure 2 and the surfaces of the mandibular molars in $\mathbb{R}^3$ presented in Figure 1. For each direction $\nu \in S^{d-1}$, we define a filtration $\{K_t^\nu\}_{t \in [0,T]}$ of sublevel sets by the following (see Figure 2 for an illustration):

(3.1) $K_t^\nu \overset{\text{def}}{=} \{x \in K \,|\, x \cdot \nu \le t - R\}$, for all $t \in [0,T]$, where $T \overset{\text{def}}{=} 2R$.

Fig. 2 Consider the two-dimensional shape $K \in \mathcal{S}_2$ in the left panel. For each pair of $\nu$ and $t$, the equation $x \cdot \nu = t - R$ represents a straight line (or a hyperplane in a higher-dimensional space). The subset $K_t^\nu$ denotes the region below this line. Let $\phi_\nu(x) = x \cdot \nu + R$; then $K_t^\nu = \{x \in K \,|\, \phi_\nu(x) \le t\}$. The right panel presents the function $(\nu, t) \mapsto \mathrm{SECT}(K)(\nu, t)$, where $\nu \in S^1$ is identified with $\theta \in [0, 2\pi]$ through $\nu = (\cos\theta, \sin\theta)$. Procedures for generating the shape $K$ and the right panel are given in Appendix D.1.

We then have the following Euler characteristic curve (ECC, denoted $\chi_t^\nu$) in direction $\nu$:

(3.2) $\chi_t^\nu(K) \overset{\text{def}}{=}$ the Euler characteristic of $K_t^\nu = \chi(K_t^\nu) = \sum_{k=0}^{d-1} (-1)^k \cdot \beta_k(K_t^\nu)$,

for $t \in [0,T]$, where $\beta_k(K_t^\nu)$ is the $k$th Betti number of $K_t^\nu$. The sum in (3.2) ends at $d-1$ because higher homology groups are trivial (Curry, Mukherjee, and Turner 2022, sec. 4). If $K_t^\nu$ is a triangle mesh, $\chi(K_t^\nu) = \#V - \#E + \#F$, where $\#V$, $\#E$, and $\#F$ denote the number of vertices, edges, and faces of the mesh, respectively. Due to Theorem 2.1, the compactness of $K$ guarantees that the Betti numbers in (3.2) are well-defined and finite.

The Euler characteristic transform (ECT), defined as $\mathrm{ECT}(K): S^{d-1} \to \mathbb{Z}^{[0,T]},\ \nu \mapsto \{\chi_t^\nu(K)\}_{t \in [0,T]}$, was proposed by Turner, Mukherjee, and Boyer (2014) as an alternative to the PHT. Based on the ECT, Crawford et al. (2020) further proposed the SECT as follows:

(3.3) $\mathrm{SECT}(K): S^{d-1} \to \mathbb{R}^{[0,T]},\ \nu \mapsto \mathrm{SECT}(K)(\nu) = \{\mathrm{SECT}(K)(\nu, t)\}_{t \in [0,T]}$, where $\mathrm{SECT}(K)(\nu, t) \overset{\text{def}}{=} \int_0^t \chi_\tau^\nu(K)\, d\tau - \frac{t}{T} \int_0^T \chi_\tau^\nu(K)\, d\tau$.

A visualization of the function $(\nu, t) \mapsto \mathrm{SECT}(K)(\nu, t)$ is presented in Figure 2. The following lemma implies that the Lebesgue integrals in (3.3) are well-defined.

Lemma 3.1.

For any fixed $K \in \mathcal{S}_d$ and $\nu \in S^{d-1}$, the function $t \mapsto \chi(K_t^\nu)$ is piecewise constant with only finitely many discontinuities.

Through the "cell decomposition theorem" (van den Dries 1998, chap. 3), Lemma 3.1 directly follows from either Lemma 3.4 of Curry, Mukherjee, and Turner (2022) or "(2.10) Proposition" in Chapter 4 of van den Dries (1998). Hence, the proof of Lemma 3.1 is omitted.
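To make the objects in (3.1)–(3.3) concrete, the following is a minimal computational sketch for a shape given as a triangle mesh; the function and variable names (ecc_and_sect, V, E, F) are illustrative and are not taken from the article's released code. A simplex enters the sublevel set $K_t^\nu$ once all of its vertices $x$ satisfy $\phi_\nu(x) = x \cdot \nu + R \le t$, the ECC is the alternating count $\#V - \#E + \#F$ (piecewise constant in $t$ by Lemma 3.1), and the SECT is obtained by numerically integrating the ECC as in (3.3).

import numpy as np

def ecc_and_sect(V, E, F, nu, R, n_levels=100):
    """Sketch: Euler characteristic curve and SECT of a triangle mesh in one direction.

    V: (n, d) vertex coordinates; E: list of vertex-index pairs; F: list of index triples.
    nu: unit direction vector; R: radius of the bounding ball, so T = 2R.
    """
    T = 2.0 * R
    t_grid = np.linspace(0.0, T, n_levels)
    phi_V = V @ nu + R                                    # filtration value of each vertex
    phi_E = np.array([phi_V[list(e)].max() for e in E])   # a simplex enters at the max over its vertices
    phi_F = np.array([phi_V[list(f)].max() for f in F])

    # ECC: chi(K_t^nu) = #V_t - #E_t + #F_t for each level t on the grid
    chi = np.array([(phi_V <= t).sum() - (phi_E <= t).sum() + (phi_F <= t).sum()
                    for t in t_grid], dtype=float)

    # SECT(K)(nu, t) = int_0^t chi dtau - (t / T) * int_0^T chi dtau, via the trapezoid rule
    dt = t_grid[1] - t_grid[0]
    cum = np.concatenate([[0.0], np.cumsum(0.5 * (chi[1:] + chi[:-1]) * dt)])
    sect = cum - (t_grid / T) * cum[-1]
    return t_grid, chi, sect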

To investigate the distribution of $\mathrm{SECT}(K)$ over different shapes $K$, we introduce the following condition to restrict our attention to a subset of $\mathcal{S}_d$.

Condition 3.1.

Let $K \in \mathcal{S}_d$. The condition is that $K$ satisfies the following inequality:

(3.4) $\sup_{k \in \{0, \ldots, d-1\}} \left[ \sup_{\nu \in S^{d-1}} \left( \# \{\xi \in \mathrm{Dgm}_k(K; \phi_\nu) \,|\, \mathrm{pers}(\xi) > 0\} \right) \right] \le M$,

where $\mathrm{Dgm}_k(K; \phi_\nu)$ is the PD of $K$ associated with the function $\phi_\nu(x) = x \cdot \nu + R$ (also see Figure 2), $\mathrm{pers}(\xi)$ is the persistence of the homology feature $\xi$, $\#\{\cdot\}$ denotes the cardinality of a multiset, and $M > 0$ is a sufficiently large prespecified number.

Condition 3.1 involves technicalities from computational topology (Edelsbrunner and Harer 2010). To maintain the flow of the article, we relegate the details of this condition, as well as the definitions of $\mathrm{Dgm}_k(K; \phi_\nu)$ and $\mathrm{pers}(\xi)$, to Appendix B. Heuristically, Condition 3.1 implies the existence of a uniform upper bound on the number of nontrivial homology features of $K$ across all directions $\nu$. Hereafter, we focus on shapes in the following collection: $\mathcal{S}_{R,d}^M \overset{\text{def}}{=} \{K \in \mathcal{S}_d \,|\, K \subseteq B(0,R) \text{ is compact and satisfies Condition 3.1 with fixed } M > 0\}$.

Our proposed collection $\mathcal{S}_{R,d}^M$ is suitable for modeling shapes in many applications. For example, the surfaces of the molars in Figure 1 are compact subsets of $\mathbb{R}^3$, bounded by a common open ball, and can be approximately represented by triangle meshes (hence, modeled by an o-minimal structure satisfying Assumption 1). In addition, the four genera of primates in Figure 1 share a phylogenetic relationship, which implies that their molars have common baseline features and satisfy Condition 3.1 with a shared upper bound $M$. In each application, the dimension $d$ and radius $R$ of the ball $B(0,R)$ can easily be determined based on the observed shapes. Although our mathematical framework requires the existence of such an $M$ in (3.4), the value of $M$ is not needed for our statistical methodology (see Section 5). Thus, Condition 3.1 does not hinder our proposed statistical methodology.

Lemma 3.1 implies that the function $\{\chi_t^\nu(K)\}_{t \in [0,T]}$ of $t$ belongs to $L^1([0,T])$. Therefore, the function $\mathrm{SECT}(K)(\nu) = \{\mathrm{SECT}(K)(\nu, t)\}_{t \in [0,T]}$ of $t$ is absolutely continuous on $[0,T]$. Furthermore, we have the following regularity result of the Sobolev type.

Lemma 3.2.

For any $K \in \mathcal{S}_{R,d}^M$ and $\nu \in S^{d-1}$, the function $\mathrm{SECT}(K)(\nu)$ belongs to $\mathcal{H}$.

Lemma 3.2 is a special case of Lemma C.3. It indicates $\mathrm{SECT}(\mathcal{S}_{R,d}^M) \subseteq \mathcal{H}^{S^{d-1}} = \{\text{all maps } F: S^{d-1} \to \mathcal{H}\}$, which is enhanced by the following result.

Theorem 3.2.

For each $K \in \mathcal{S}_{R,d}^M$, we have: (i) There exists a constant $C^*_{M,R,d}$ depending only on $M$, $R$, and $d$ such that the following inequality holds for any directions $\nu_1, \nu_2 \in S^{d-1}$:

(3.5) $\|\mathrm{SECT}(K)(\nu_1) - \mathrm{SECT}(K)(\nu_2)\|_{\mathcal{H}} \le C^*_{M,R,d} \cdot \sqrt{\|\nu_1 - \nu_2\| + \|\nu_1 - \nu_2\|^2}$;

and (ii) $\mathrm{SECT}(K) \in C^{0,\frac{1}{2}}(S^{d-1}; \mathcal{H})$, where $S^{d-1}$ is equipped with the geodesic distance $d_{S^{d-1}}$.

Results complementary to Theorem 3.2 can be found in Theorem C.3, which implies that the function $(\nu, t) \mapsto \mathrm{SECT}(K)(\nu, t)$ belongs to $C^{0,\frac{1}{2}}(S^{d-1} \times [0,T])$. Theorem 3.2(i) is an analog of Lemma 2.1 in Turner, Mukherjee, and Boyer (2014). Theorem 3.2(ii) implies $\mathrm{SECT}(\mathcal{S}_{R,d}^M) \subseteq C^{0,\frac{1}{2}}(S^{d-1}; \mathcal{H}) \subseteq C(S^{d-1}; \mathcal{H}) \subseteq \mathcal{H}^{S^{d-1}}$. As a result, (3.3) defines the following map:

(3.6) $\mathrm{SECT}: \mathcal{S}_{R,d}^M \to C(S^{d-1}; \mathcal{H}), \quad K \mapsto \mathrm{SECT}(K)$.

In Appendix D.1, we provide detailed proof-of-concept examples (similar to Figure 2) to visually illustrate the SECT and support the regularity results in Theorems 3.2 and C.3.

Corollary 1 of Ghrist, Levanger, and Mai (2018) implies the following result, which shows that the SECT preserves all the information of shapes $K \in \mathcal{S}_{R,d}^M$.

Theorem 3.3.

The map SECT defined in (3.6) is injective for all dimensions d.

The map $\mathrm{SECT}: \mathcal{S}_{R,d}^M \to C(S^{d-1}; \mathcal{H})$ is injective, but not surjective. Specifically, Theorem 3.2 shows that the image of SECT lies within $C^{0,\frac{1}{2}}(S^{d-1}; \mathcal{H})$, a proper subset of $C(S^{d-1}; \mathcal{H})$. An explicit characterization of the image $\mathrm{SECT}(\mathcal{S}_{R,d}^M)$ remains a topic for future research.

Inspired by Theorem 3.3, one may consider reconstructing a shape $K$ from either $\mathrm{SECT}(K)$ or $\mathrm{ECT}(K)$. From a theoretical standpoint, a shape $K$ can be reconstructed using Schapira's inversion formula (Schapira 1995); further details are available in Ghrist, Levanger, and Mai (2018). From an algorithmic perspective, the proof of Theorem 3.1 in Turner, Mukherjee, and Boyer (2014) offers an algorithm to reconstruct low-dimensional meshes from their ECT. Nevertheless, effective algorithmic approaches to reconstructing shapes are still underdeveloped. Challenges in reconstructing shapes are extensively discussed in Fasy et al. (2018). A comprehensive exploration of reconstruction using the SECT is also left for future research.

Together with (3.6), Theorem 3.3 allows us to represent each $K \in \mathcal{S}_{R,d}^M$ by $\mathrm{SECT}(K) \in C(S^{d-1}; \mathcal{H})$. This perspective aids us in modeling the randomness of shapes using probability measures on the separable Banach space $C(S^{d-1}; \mathcal{H})$. Here, we prefer $C(S^{d-1}; \mathcal{H})$ over the $\frac{1}{2}$-Hölder space $C^{0,\frac{1}{2}}(S^{d-1}; \mathcal{H})$. This is because $\frac{1}{2}$-Hölder spaces are typically not separable (Hairer 2009, Remark 4.21). The separability condition is essential for probability measures on Banach spaces to exhibit non-pathological behavior (Hairer 2009, sec. 4).

4 Probabilistic Distributions over the SECT

To address the motivating question outlined in Section 1.1 using hypothesis testing, we need to view the observed shapes (e.g., the molars in Figure 1) as shape-valued random variables. In this section, we construct a probability space to model the randomness of shapes and make the SECT a random variable (in the measurable sense) taking values in $C(S^{d-1}; \mathcal{H})$. More importantly, this probability space helps justify the KL expansion of the SECT, which lays the foundations for our hypothesis testing method in Section 5.

Probability Space

Suppose $\mathcal{S}_{R,d}^M$ is equipped with a σ-algebra $\mathcal{F}$. A distribution of shapes $K$ across $\mathcal{S}_{R,d}^M$ is represented by a probability measure $\mathbb{P} = \mathbb{P}(dK)$ on $\mathcal{F}$. Then, $(\mathcal{S}_{R,d}^M, \mathcal{F}, \mathbb{P})$ is a probability space. For each fixed $(\nu, t)$, the integer-valued map $\chi_t^\nu: K \mapsto \chi_t^\nu(K)$ is defined on $\mathcal{S}_{R,d}^M$. Hereafter, we assume the following:

Assumption 2.

For each fixed $(\nu, t) \in S^{d-1} \times [0,T]$, the map $\chi_t^\nu: (\mathcal{S}_{R,d}^M, \mathcal{F}) \to (\mathbb{R}, \mathcal{B}(\mathbb{R}))$ is a measurable function and, hence, a real-valued random variable.

A σ-algebra $\mathcal{F}$ satisfying Assumption 2 exists. Here, we construct a metric $\rho$ on $\mathcal{S}_{R,d}^M$ and show that the Borel algebra $\mathcal{B}(\rho)$ induced by $\rho$ satisfies Assumption 2. We define

(4.1) $\rho(K_1, K_2) \overset{\text{def}}{=} \sup_{\nu \in S^{d-1}} \left\{ \left( \int_0^T |\chi_\tau^\nu(K_1) - \chi_\tau^\nu(K_2)|^2\, d\tau \right)^{1/2} \right\}$, for all $K_1, K_2 \in \mathcal{S}_{R,d}^M$.

Theorem 4.1.

The map $\rho$ is a metric on $\mathcal{S}_{R,d}^M$. Assumption 2 is satisfied if $\mathcal{F} = \mathcal{B}(\rho)$.
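As a practical aside, a discretized approximation of the metric in (4.1) is straightforward once the ECCs have been evaluated on a common grid of directions and levels. A minimal sketch is below; the function name rho_hat and the array layout are illustrative assumptions, with the sup over the sphere replaced by a maximum over the sampled directions.

import numpy as np

def rho_hat(chi1, chi2, dt):
    """Discretized version of the metric rho in (4.1).

    chi1, chi2: arrays of shape (n_directions, n_levels) holding the ECC values
    chi_t^nu(K_1) and chi_t^nu(K_2) on a common grid; dt: spacing of the level grid.
    """
    l2_per_direction = np.sqrt(np.sum((chi1 - chi2) ** 2, axis=1) * dt)
    return l2_per_direction.max()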

Under Assumption 2, the ECC $\{\chi_t^\nu\}_{t \in [0,T]}$, for each $\nu \in S^{d-1}$, is a stochastic process defined on the probability space $(\mathcal{S}_{R,d}^M, \mathcal{F}, \mathbb{P})$. Since each sample path $\{\chi_t^\nu(K)\}_{t \in [0,T]}$ has finitely many discontinuities (Lemma 3.1), $\int_0^t \chi_\tau^\nu(K)\, d\tau$ for each $t \in [0,T]$ is a Riemann integral, which is equal to the limit of a Riemann sum: $\int_0^t \chi_\tau^\nu(K)\, d\tau = \lim_{n \to \infty} \{\frac{t}{n} \sum_{l=1}^n \chi_{tl/n}^\nu(K)\}$. Given that each $\chi_{tl/n}^\nu$ is a random variable under Assumption 2, the limit of the Riemann sum for each $t \in [0,T]$ is a random variable as well. Therefore, for each $\nu \in S^{d-1}$, $\{\int_0^t \chi_\tau^\nu\, d\tau\}_{t \in [0,T]}$ with $\int_0^t \chi_\tau^\nu\, d\tau: K \mapsto \int_0^t \chi_\tau^\nu(K)\, d\tau$ is a stochastic process. Then, under Assumption 2, (3.3) defines the following stochastic process on $(\mathcal{S}_{R,d}^M, \mathcal{F}, \mathbb{P})$ for each $\nu \in S^{d-1}$:

(4.2) $\mathrm{SECT}(\nu) \overset{\text{def}}{=} \left\{ \int_0^t \chi_\tau^\nu\, d\tau - \frac{t}{T} \int_0^T \chi_\tau^\nu\, d\tau \overset{\text{def}}{=} \mathrm{SECT}(\nu, t) \right\}_{t \in [0,T]}$.

Precisely, for each fixed $\nu$, we have the stochastic process $\mathrm{SECT}(\nu): K \mapsto \mathrm{SECT}(K)(\nu) = \{\mathrm{SECT}(K)(\nu, t)\}_{t \in [0,T]}$ defined on $(\mathcal{S}_{R,d}^M, \mathcal{F}, \mathbb{P})$; and, for each fixed $(\nu, t)$, we have the real-valued random variable $\mathrm{SECT}(\nu, t): K \mapsto \mathrm{SECT}(K)(\nu, t)$ defined on $(\mathcal{S}_{R,d}^M, \mathcal{F}, \mathbb{P})$.

Since $\mathcal{H}$ is an RKHS (Appendix A.1), Lemma 3.2 and Theorem 4.2, together with Theorem 7.1.2 of Hsing and Eubank (2015), imply the following; its proof is omitted.

Theorem 4.2.

(i) For each $\nu \in S^{d-1}$, $\mathrm{SECT}(\nu)$ is a real-valued stochastic process with sample paths in $\mathcal{H}$. Equivalently, $\mathrm{SECT}(\nu)$ is a random variable taking values in $(\mathcal{H}, \mathcal{B}(\mathcal{H}))$. (ii) The map SECT defined in (3.6) is a random variable taking values in $C(S^{d-1}; \mathcal{H})$.

Using Theorem 4.2 in conjunction with Theorem 3.3, we can represent random shapes (which model, e.g., the surfaces of the mandibular molars in Figure 1) as $C(S^{d-1}; \mathcal{H})$-valued random variables. This representation through the SECT incurs no loss of information.

In Appendix D.2, we provide proof-of-concept examples to illustrate random shapes and their SECT representations visually. These examples relate the SECT to Fréchet regression (Petersen and Müller 2019), Wasserstein regression (Chen et al. 2023), and manifold learning (Dunson and Wu 2021; Meng and Eloyan 2021; Li, Mukhopadhyay, and Dunson 2022).

Mean and Covariance of the SECT

For deriving the KL expansion in Section 5, we define the mean and covariance of the SECT. To do so, we need the following lemma.

Lemma 4.1.

For any probability measure $\mathbb{P}$ defined on the measurable space $(\mathcal{S}_{R,d}^M, \mathcal{F})$, we have $\mathbb{E}\{\sup_{\nu \in S^{d-1}} \|\mathrm{SECT}(\nu)\|_{\mathcal{H}}^2\} = \int_{\mathcal{S}_{R,d}^M} \{\sup_{\nu \in S^{d-1}} \|\mathrm{SECT}(K)(\nu)\|_{\mathcal{H}}^2\}\, \mathbb{P}(dK) < \infty$.

Lemma 4.1, together with (2.1), implies that $\mathbb{E}|\mathrm{SECT}(\nu, t)|^2 \le \tilde{C}_T^2 \cdot \mathbb{E}\|\mathrm{SECT}(\nu)\|_{\mathcal{H}}^2 < \infty$ for all $(\nu, t) \in S^{d-1} \times [0,T]$. Then, we define the mean and covariance functions as follows:

(4.3) $m_\nu(t) = \mathbb{E}\{\mathrm{SECT}(\nu, t)\} = \int_{\mathcal{S}_{R,d}^M} \mathrm{SECT}(K)(\nu, t)\, \mathbb{P}(dK)$, $\quad \Xi_\nu(s, t) = \mathrm{cov}(\mathrm{SECT}(\nu, s), \mathrm{SECT}(\nu, t))$, for $s, t \in [0,T]$ and $\nu \in S^{d-1}$.

Lemma C.4 provides several properties of the mean $m_\nu$ and covariance $\Xi_\nu$ that validate our KL expansion of $\mathrm{SECT}(\nu)$ in Section 5. Additionally, Lemma C.4 demonstrates that the mean $m \overset{\text{def}}{=} \{m_\nu\}_{\nu \in S^{d-1}}$ of SECT belongs to $C(S^{d-1}; \mathcal{H})$. A tentative discussion on the "pseudo-inverse" $\mathrm{SECT}^{-1}(m)$ is provided after Lemma C.4 in Appendix C.
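In practice, the quantities in (4.3) are estimated pointwise from a sample of discretized SECT curves. The following is a minimal sketch under an assumed discretization (names are illustrative; the article's estimators appear in Appendix F):

import numpy as np

def mean_and_cov_hat(sect_samples):
    """Pointwise estimates of the mean m_nu(t) and covariance Xi_nu(s, t) in (4.3).

    sect_samples: array of shape (n, Delta); row i holds SECT(K_i)(nu, t_q) evaluated at
    Delta levels t_1 < ... < t_Delta in one fixed direction nu.
    """
    n = sect_samples.shape[0]
    m_hat = sect_samples.mean(axis=0)           # estimate of m_nu(t_q), q = 1, ..., Delta
    centered = sect_samples - m_hat
    xi_hat = centered.T @ centered / (n - 1)    # estimate of Xi_nu(t_q, t_q'), a (Delta, Delta) matrix
    return m_hat, xi_hat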

In most shape analysis studies, data are preprocessed by alignment. In Appendix E, we introduce the "ECT alignment" as a preprocessing step before any statistical inference. Throughout the article, we assume that the data have been aligned using this method. The ECT alignment exploits rigid motions, does not rely on landmarks, and is equivalent to the alignment approach outlined in Wang et al. (2021, Supplementary Section 4). The primary objective of the ECT alignment is to minimize the differences between two shapes that arise from rigid motions. For instance, the molars in Figure 1 were aligned using the ECT alignment. Furthermore, the ECT alignment is compatible with our SECT framework: Appendix E demonstrates that the ECT alignment does not alter the qualitative properties of the SECT (e.g., measurability, Sobolev regularity, and $\frac{1}{2}$-Hölder continuity).

In applications, it is infeasible to sample infinitely many directions $\nu \in S^{d-1}$ and levels $t \in [0,T]$. For given shapes $K$, we compute $\mathrm{SECT}(K)(\nu, t)$ for finitely many directions $\{\nu_1, \ldots, \nu_\Gamma\} \subseteq S^{d-1}$ and levels $\{t_1, \ldots, t_\Delta\} \subseteq [0,T]$. To retain information about the shapes $K$, one needs to properly set the numbers of directions and levels (i.e., $\Gamma$ and $\Delta$). From a theoretical viewpoint, Curry, Mukherjee, and Turner (2022) comprehensively discussed the number $\Gamma$ of directions needed to recover shapes $K$ from $\mathrm{ECT}(K)$ when $K$ are "piecewise linearly embedded shapes with plausible geometric bounds." From the numerical perspective, we note the following: (i) Wang et al. (2021) provided detailed simulation studies on the choices of $\Gamma$ and $\Delta$ in sub-image analysis, and a general guideline for setting $\Gamma$ and $\Delta$ in practice was presented in Supplementary Table 1 therein; and (ii) in our Appendix K, we provide detailed numerical experiments on the tradeoffs between the choices of $\Gamma$ and $\Delta$, the statistical power of our proposed algorithms (Algorithms 1 and 2), and computational cost.

5 Testing Hypotheses on Shapes

In this section, we apply the probabilistic formulation from Section 4 and Lemma C.4 to test hypotheses on shapes. Suppose $\mathbb{P}^{(1)}$ and $\mathbb{P}^{(2)}$ are two distributions on the measurable space $(\mathcal{S}_{R,d}^M, \mathcal{F})$. Let $\mathbb{P}^{(1)} \otimes \mathbb{P}^{(2)}$ be the product probability measure defined on the product σ-algebra $\mathcal{F} \otimes \mathcal{F}$, satisfying $\mathbb{P}^{(1)} \otimes \mathbb{P}^{(2)}(A \times B) = \mathbb{P}^{(1)}(A) \cdot \mathbb{P}^{(2)}(B)$ for all $A, B \in \mathcal{F}$. To address the motivating question from Section 1.1, we test the following hypotheses:

(5.1) $H_0^*: \mathbb{P}^{(1)} = \mathbb{P}^{(2)}$, versus $H_1^*: \mathbb{P}^{(1)} \ne \mathbb{P}^{(2)}$.

For example, suppose $\mathbb{P}^{(1)}$ and $\mathbb{P}^{(2)}$ model the distributions of molars from two genera of primates (Figure 1). Rejecting the $H_0^*$ in (5.1) helps distinguish the two genera of primates.

Define $m_\nu^{(j)}(t) = \int_{\mathcal{S}_{R,d}^M} \mathrm{SECT}(K)(\nu, t)\, \mathbb{P}^{(j)}(dK)$ for $j \in \{1, 2\}$ as the mean functions corresponding to $\mathbb{P}^{(1)}$ and $\mathbb{P}^{(2)}$. To reject the null $H_0^*$ in (5.1) (equivalently, to distinguish two collections of shapes), it suffices to reject the null hypothesis $H_0$ in the following:

(5.2) $H_0: m_\nu^{(1)}(t) = m_\nu^{(2)}(t)$ for all $(\nu, t)$, versus $H_1: m_\nu^{(1)}(t) \ne m_\nu^{(2)}(t)$ for some $(\nu, t)$.

Analysis of Variance for Functional Data (fdANOVA)

Considering the hypotheses in (5.2) for all directions $\nu \in S^{d-1}$ results in simultaneous multiple comparisons and inflation of the Type I error. To address this issue, we focus on a specific direction, motivated by the observation that the null hypothesis $H_0$ in (5.2) is equivalent to $\sup_{\nu \in S^{d-1}} \{\|m_\nu^{(1)} - m_\nu^{(2)}\|_{\mathcal{B}}\} = 0$. Hence, the direction of interest is defined as

(5.3) $\nu^* \overset{\text{def}}{=} \arg\max_{\nu \in S^{d-1}} \{\|m_\nu^{(1)} - m_\nu^{(2)}\|_{\mathcal{B}}\}$.

Lemma C.4 and (2.2) imply $\{m_\nu^{(j)}\}_{j=1}^2 \subseteq \mathcal{B}$ for all $\nu$. Lemma C.4, together with (2.1), confirms the existence of a maximizer in (5.3). The maximizer in (5.3) may not be unique. If there are multiple maximizers, we arbitrarily choose one, as this choice does not influence our framework. The null hypothesis $H_0$ in (5.2) is then equivalent to $\|m_{\nu^*}^{(1)} - m_{\nu^*}^{(2)}\|_{\mathcal{B}} = 0$, where the $\nu^*$ defined in (5.3) is called a distinguishing direction. Hereafter, we investigate the distribution of $\mathrm{SECT}(\nu^*)$.
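In practice, $\nu^*$ must be estimated from the finitely many sampled directions. A minimal sketch using the sample means of the two groups and the discretized sup norm on $\mathcal{B} = C([0,T])$ is below; the array layout and function name are illustrative assumptions, not the article's Algorithm 1 itself.

import numpy as np

def distinguishing_direction_hat(sect1, sect2):
    """Estimate the distinguishing direction nu* in (5.3) from two samples of shapes.

    sect1, sect2: arrays of shape (n1, Gamma, Delta) and (n2, Gamma, Delta) holding
    SECT(K_i^(j))(nu_p, t_q) over Gamma sampled directions and Delta levels.
    """
    diff_of_means = sect1.mean(axis=0) - sect2.mean(axis=0)        # (Gamma, Delta)
    # Discretized B-norm (sup over levels) of m_nu^(1) - m_nu^(2) in each sampled direction
    sup_norm_per_direction = np.abs(diff_of_means).max(axis=1)     # (Gamma,)
    return int(np.argmax(sup_norm_per_direction))                  # index p of the estimated nu*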

Based on the discussion above, testing the hypotheses in (5.2) is equivalent to testing $m_{\nu^*}^{(1)}(t) = m_{\nu^*}^{(2)}(t)$ for $t \in [0,T]$ using $\mathrm{SECT}(\nu^*)$, which is an fdANOVA problem that has been well studied in the literature (e.g., Zhang 2013, sec. 5.2). However, many state-of-the-art fdANOVA approaches are incompatible with $\mathrm{SECT}(\nu^*)$. For example, the Gaussianity of $\mathrm{SECT}(\nu^*)$ is not guaranteed (Remark C.1), and the "two-sample problem assumptions" in Section 5.2 of Zhang (2013) may not be satisfied. Besides, the $L^2$-norm-based test (Zhang and Chen 2007) and the F-type test (Shen and Faraway 2004) are not preferred when the functional data are not Gaussian (Zhang 2013, chap. 5). Additionally, many fdANOVA methods are time-consuming. For example, tests based on random projections (TRP, Cuesta-Albertos and Febrero-Bande 2010) require the computation of (at least 30) $L^2$-projections for each observed function, followed by the application of appropriate ANOVA tests to these projections. To address the Gaussianity issue and achieve computational efficiency, we propose a method for fdANOVA using the KL expansion. Our test has a foundation that aligns with the probabilistic framework of $\mathrm{SECT}(\nu^*)$ in Section 4; it is comparable with the existing methods in terms of size and power (see Appendix J); and it is also computationally efficient, which makes the permutation test used with our method feasible.

Karhunen–Loève Expansion

Let $\Xi_{\nu^*}^{(j)}(s, t)$ be the covariance function of the stochastic process $\mathrm{SECT}(\nu^*)$ corresponding to $\mathbb{P}^{(j)}$, for $j \in \{1,2\}$ (see (4.3)). Hereafter, we assume the following, which is true under the null hypothesis $H_0^*: \mathbb{P}^{(1)} = \mathbb{P}^{(2)}$ in (5.1).

Assumption 3

(Homoscedasticity). $\Xi_{\nu^*} \overset{\text{def}}{=} \Xi_{\nu^*}^{(1)} = \Xi_{\nu^*}^{(2)}$, where $\nu^*$ is defined in (5.3).

This is a standard assumption in the fdANOVA literature (e.g., Zhang 2013, sec. 5.2) and can be tested using the methods proposed by Jia Guo and Zhang (2019).

We define an integral operator on $L^2([0,T])$ by $f \mapsto \int_0^T f(s) \cdot \Xi_{\nu^*}(s, \cdot)\, ds$, whose kernel $\Xi_{\nu^*}$ belongs to $L^2([0,T]^2)$. This operator is compact and self-adjoint (Hsing and Eubank 2015, Theorem 4.6.2 and Example 3.3.4). Moreover, the Hilbert–Schmidt theorem (Reed and Simon 1972, Theorem VI.16) implies that there is a complete orthonormal basis $\{\phi_l\}_{l=1}^\infty$ for $L^2([0,T])$ such that (i) each $\phi_l$ is an eigenfunction with eigenvalue $\lambda_l$, (ii) $\lambda_1 \ge \lambda_2 \ge \cdots \ge 0$, and (iii) $\lim_{l \to \infty} \lambda_l = 0$. Lemma C.4 and Theorem 7.3.5 of Hsing and Eubank (2015) imply the following KL expansion:

Theorem 5.1

(Karhunen–Loève expansion). (i) For each fixed $j \in \{1,2\}$, we have

(5.4) $\lim_{L \to \infty} \sup_{t \in [0,T]} \mathbb{E}^{(j)} \left[ \mathrm{SECT}(\nu^*, t) - m_{\nu^*}^{(j)}(t) - \sum_{l=1}^L \sqrt{\lambda_l}\, Z_l^{(j)}\, \phi_l(t) \right]^2 = 0$,

where $Z_l^{(j)}(K) \overset{\text{def}}{=} \frac{1}{\sqrt{\lambda_l}} \int_0^T \{\mathrm{SECT}(K)(\nu^*, t) - m_{\nu^*}^{(j)}(t)\}\, \phi_l(t)\, dt$ for $l = 1, 2, \ldots$, and $\mathbb{E}^{(j)}$ is the expectation associated with $\mathbb{P}^{(j)}$. For each $j \in \{1,2\}$, the random variables $\{Z_l^{(j)}\}_{l=1}^\infty$ are defined on the probability space $(\mathcal{S}_{R,d}^M, \mathcal{F}, \mathbb{P}^{(j)})$, are mutually uncorrelated, and have mean 0 and variance 1. (ii) There exists $N \in \mathcal{F} \otimes \mathcal{F}$ so that $\mathbb{P}^{(1)} \otimes \mathbb{P}^{(2)}(N) = 0$ and

(5.5) $\delta_l(K^{(1)}, K^{(2)}) \overset{\text{def}}{=} \frac{1}{\sqrt{2\lambda_l}} \int_0^T \{\mathrm{SECT}(K^{(1)})(\nu^*, t) - \mathrm{SECT}(K^{(2)})(\nu^*, t)\}\, \phi_l(t)\, dt = \theta_l + \frac{Z_l^{(1)}(K^{(1)}) - Z_l^{(2)}(K^{(2)})}{\sqrt{2}}$, where $\theta_l \overset{\text{def}}{=} \frac{1}{\sqrt{2\lambda_l}} \int_0^T \{m_{\nu^*}^{(1)}(t) - m_{\nu^*}^{(2)}(t)\}\, \phi_l(t)\, dt$,

for any $(K^{(1)}, K^{(2)}) \notin N$ and each fixed $l = 1, 2, \ldots$. The null set $N$ is allowed to be empty.

Using the KL expansion in (5.4), the random sampling of shapes may be considered, which is discussed in Appendix M.1 and left for future research.
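Numerically, the eigenpairs $(\lambda_l, \phi_l)$ in Theorem 5.1 can be approximated by an eigendecomposition of an estimated covariance matrix on the level grid: discretizing the integral operator $f \mapsto \int_0^T \Xi_{\nu^*}(s, \cdot)\, f(s)\, ds$ with level spacing dt turns it into the matrix $\hat{\Xi}_{\nu^*}\, dt$. The sketch below (illustrative names; the article's full numerical procedure is Algorithm 1 in Appendix F) rescales the eigenvectors so that $\int_0^T \phi_l(t)^2\, dt \approx 1$.

import numpy as np

def kl_eigenpairs(xi_hat, dt):
    """Discretized eigenpairs (lambda_l, phi_l) of the covariance operator in Theorem 5.1.

    xi_hat: (Delta, Delta) estimated covariance matrix of SECT(nu*) on an equally spaced
    level grid with spacing dt.
    """
    evals, evecs = np.linalg.eigh(xi_hat * dt)     # symmetric eigendecomposition
    order = np.argsort(evals)[::-1]                # sort eigenvalues in decreasing order
    lambdas = np.clip(evals[order], 0.0, None)     # clip tiny negative values from sampling noise
    phis = evecs[:, order] / np.sqrt(dt)           # rescale so that sum_q phi_l(t_q)^2 * dt = 1
    return lambdas, phis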

Our Approach

Consider two independent collections of random shapes $\{K_i^{(j)}\}_{i=1}^n \overset{iid}{\sim} \mathbb{P}^{(j)}$, for $j \in \{1,2\}$ (i.e., $\{(K_i^{(1)}, K_i^{(2)})\}_{i=1}^n \overset{iid}{\sim} \mathbb{P}^{(1)} \otimes \mathbb{P}^{(2)}$). The pairing in $(K_i^{(1)}, K_i^{(2)})$ is arbitrary for the following reasons: (i) pairs $(K_i^{(1)}, K_i^{(2)})$ and $(K_i^{(1)}, K_{i'}^{(2)})$ with $i \ne i'$ have the same distribution $\mathbb{P}^{(1)} \otimes \mathbb{P}^{(2)}$, and (ii) numerical experiments in Sections 6 and 7 demonstrate that the performance of our proposed algorithms is numerically invariant to shuffling the index $i$ within each collection $\{K_i^{(j)}\}_{i=1}^n$. Without loss of generality, we assume that all the shapes have been aligned using the "ECT alignment" (Appendix E). Here, we present the theoretical foundation for employing $\{(K_i^{(1)}, K_i^{(2)})\}_{i=1}^n$ to test the hypotheses in (5.2). This foundation helps address the motivating question from Section 1.1.

Without loss of generality, we assume $(K_i^{(1)}, K_i^{(2)}) \notin N$ for all $i = 1, 2, \ldots, n$, where $N$ is the null set in Theorem 5.1 satisfying $\mathbb{P}^{(1)} \otimes \mathbb{P}^{(2)}(N) = 0$. Then, we have

(5.6) $\xi_{l,i} \overset{\text{def}}{=} \delta_l(K_i^{(1)}, K_i^{(2)}) = \theta_l + \frac{Z_l^{(1)}(K_i^{(1)}) - Z_l^{(2)}(K_i^{(2)})}{\sqrt{2}}$,

where $\delta_l$ and $\theta_l$ are defined in (5.5). Theorem 5.1 implies that, for each fixed $l$, the random variables $\{\xi_{l,i}\}_{i=1}^n$ are iid across $i = 1, \ldots, n$ with mean $\theta_l$ and variance 1; for each fixed $i$, the random variables $\{\xi_{l,i}\}_{l=1}^\infty$ are mutually uncorrelated across $l = 1, 2, 3, \ldots$. The following lemma represents the null $H_0$ in (5.2) using the means $\{\theta_l\}_{l=1}^\infty$.

Lemma 5.1.

The null $H_0$ in (5.2) is equivalent to $\theta_l = 0$ for all positive integers $l$.

Recall that $\lim_{l \to \infty} \lambda_l = 0$. When the eigenvalues $\lambda_l$ in the denominator of (5.5) are close to zero for large $l$, the estimated $\theta_l$ becomes unstable. Specifically, even if $m_{\nu^*}^{(1)}(t) \approx m_{\nu^*}^{(2)}(t)$, an extremely small $\lambda_l$ can move the corresponding estimated $\theta_l$ far away from zero. Using the standard approach in principal component analysis, we focus on $\{\theta_l\}_{l=1}^L$ with

(5.7) $L \overset{\text{def}}{=} \max\{1, \tilde{L}\}$, where $\tilde{L} \overset{\text{def}}{=} \min\left\{ l \in \mathbb{N} \,\Big|\, \frac{\sum_{l'=1}^{l} \lambda_{l'}}{\sum_{l'=1}^{\infty} \lambda_{l'}} > 0.95 \right\}$.

Hence, to test the hypotheses in (5.2) via Lemma 5.1, we test the following:

(5.8) $\hat{H}_0: \theta_1 = \cdots = \theta_L = 0$, versus $\hat{H}_1$: there exists $l \in \{1, \ldots, L\}$ such that $\theta_l \ne 0$.

Under the null $\hat{H}_0$ in (5.8), for each $l \in \{1, \ldots, L\}$, the central limit theorem indicates that $\frac{1}{\sqrt{n}} \sum_{i=1}^n \xi_{l,i}$ is asymptotically $N(0, 1)$ when $n$ is large. The mutual uncorrelation in Theorem 5.1 and the asymptotic normality of $\frac{1}{\sqrt{n}} \sum_{i=1}^n \xi_{l,i}$ provide the asymptotic independence of $\{\frac{1}{\sqrt{n}} \sum_{i=1}^n \xi_{l,i}\}_{l=1}^L$ across $l = 1, \ldots, L$. Then, $\sum_{l=1}^L \left( \frac{1}{\sqrt{n}} \sum_{i=1}^n \xi_{l,i} \right)^2$ is asymptotically $\chi_L^2$ under the $\hat{H}_0$ in (5.8). At the asymptotic significance level $\alpha \in (0,1)$, we reject $\hat{H}_0$ if

(5.9) $\sum_{l=1}^L \left( \frac{1}{\sqrt{n}} \sum_{i=1}^n \xi_{l,i} \right)^2 > \chi_{L, 1-\alpha}^2 =$ the $1-\alpha$ lower quantile of the $\chi_L^2$ distribution.

In applications, neither the mean $m_\nu^{(j)}(t)$ nor the covariance $\Xi_\nu(s, t)$ is known. Hence, the KL expansions in (5.5) cannot be directly used and must be estimated. In Appendix F, we propose a numerical foundation for conducting the asymptotic $\chi^2$-test in (5.9) and encapsulate the numerical procedures for the test in Algorithm 1. In all our analyses in Sections 6 and 7, the numerical estimates $\hat{L}$ (see (F.4) in Appendix F) of the $L$ in (5.7) are smaller than 10. When the $\hat{L}$ values are large (e.g., several hundred), one may also consider applying the adaptive Neyman test proposed by Fan (1996).
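Putting the pieces together, the following sketch outlines how the statistic in (5.9) can be computed from discretized SECT curves in the estimated distinguishing direction, reusing the kl_eigenpairs helper from the sketch above. It is a simplified illustration of the logic behind Algorithm 1 (Appendix F), with assumed array layouts and names, not the article's exact implementation.

import numpy as np
from scipy import stats

def chi2_test(sect1_star, sect2_star, dt, alpha=0.05, var_threshold=0.95):
    """Simplified sketch of the asymptotic chi-square test in (5.8)-(5.9).

    sect1_star, sect2_star: (n, Delta) arrays holding SECT(K_i^(1))(nu*, .) and
    SECT(K_i^(2))(nu*, .) on a common level grid with spacing dt.
    """
    n = sect1_star.shape[0]

    # Pooled covariance estimate of SECT(nu*), using Assumption 3 (homoscedasticity)
    pooled = np.vstack([sect1_star - sect1_star.mean(axis=0),
                        sect2_star - sect2_star.mean(axis=0)])
    xi_hat = pooled.T @ pooled / (pooled.shape[0] - 2)
    lambdas, phis = kl_eigenpairs(xi_hat, dt)

    # L: smallest number of eigenpairs explaining 95% of the total variance, as in (5.7)
    ratio = np.cumsum(lambdas) / lambdas.sum()
    L = max(1, int(np.searchsorted(ratio, var_threshold) + 1))

    # xi_{l,i} in (5.6): projections of the paired differences onto the eigenfunctions
    diff = sect1_star - sect2_star                                    # (n, Delta)
    xi_li = (diff @ phis[:, :L]) * dt / np.sqrt(2.0 * lambdas[:L])    # (n, L)

    # Reject H_0-hat when the statistic exceeds the (1 - alpha) quantile of chi^2_L, as in (5.9)
    statistic = np.sum((xi_li.sum(axis=0) / np.sqrt(n)) ** 2)
    return statistic, L, statistic > stats.chi2.ppf(1.0 - alpha, df=L)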

In the simulation studies presented in Table J.1 (Appendix J), our Algorithm 1 has performance comparable to that of more than ten existing state-of-the-art fdANOVA methods. Nonetheless, both Algorithm 1 and the existing methods exhibit Type I error inflation (e.g., the rejection rate of Algorithm 1 is 0.118, whereas the significance level is 0.05). To mitigate this inflation, we may consider applying the permutation test using one of these methods that is computationally efficient. For example, Górecki and Smaga (2015) proposed a permutation test based on an F-type statistic (FP). Specifically, Górecki and Smaga (2015) approximated each observed function by basis functions via information criteria, and the F-type statistic was approximated by a form conducive to efficiently computing permutation-based p-values. However, the FP also exhibits Type I error inflation (see Table J.1). Motivated by the FP, we apply the permutation test to the $\chi^2$-statistic defined in (5.9) in the following way: we first apply Algorithm 1 to our original shapes $K_i^{(j)}$ and then repeatedly reapply Algorithm 1 to the shapes with shuffled group labels $j$. The $\chi^2$-test statistic derived from the original data is then compared to those from the shuffled data. A detailed description of our permutation-based approach is presented in Algorithm 2 in Appendix F. Simulations in Section 6 demonstrate that our permutation-based approach eliminates the Type I error inflation encountered by Algorithm 1. The permutation nature of Algorithm 2 is also advantageous for small sample sizes. Note, however, that the power of Algorithm 2 under the alternative is moderately weaker than that of Algorithm 1. Lastly, the runtimes of Algorithms 1 and 2, when applied to simulations, are studied in Appendix K. We present the runtimes when applying the algorithms to real data in Table 2.
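For completeness, a minimal permutation wrapper in the spirit of Algorithm 2 is sketched below, reusing the chi2_test sketch above: the observed statistic is compared against statistics recomputed after shuffling the group labels. Names and details are illustrative rather than the article's exact procedure.

import numpy as np

def permutation_test(sect1_star, sect2_star, dt, n_perm=1000, seed=None):
    """Permutation version of the chi-square test: shuffle group labels and
    compare the observed statistic with the permutation distribution."""
    rng = np.random.default_rng(seed)
    observed, _, _ = chi2_test(sect1_star, sect2_star, dt)
    pooled = np.vstack([sect1_star, sect2_star])
    n = sect1_star.shape[0]
    count = 0
    for _ in range(n_perm):
        perm = rng.permutation(pooled.shape[0])                 # shuffled group labels
        stat, _, _ = chi2_test(pooled[perm[:n]], pooled[perm[n:]], dt)
        count += stat >= observed
    return (1 + count) / (1 + n_perm)                           # permutation p-value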

Table 1 Rejection rates (from 1000 experiments) for different indices ε (significance α=0.05).

Table 2 P-values of Algorithms 1, 2, and 4 for the dataset of mandibular molars.

6 Experiments Using Simulations

We present simulations showing the performance of our Algorithms 1 and 2. In addition, we compare our algorithms with the "randomization-style null hypothesis significance test (NHST)" (Robinson and Turner 2017), the TRP using the Wald-type permutation statistic (TRP-WTPS, Cuesta-Albertos and Febrero-Bande 2010; Pauly, Brunner, and Konietschke 2014), and the FP. Details of the randomization-style NHST are given in Appendix G and referred to as Algorithm 3. The application of the FP and TRP to the SECT is described in Section 5. We implement the FP and TRP-WTPS using the R package fdANOVA with its default parameters, as recommended by Górecki and Smaga (2019). Additional simulations comparing our proposed algorithms and other existing fdANOVA methods are presented in Appendix J.

We focus on a family of distributions $\{\mathbb{P}^{(\epsilon)}\}_{0 \le \epsilon \le 0.1}$ with shapes $\{K_i^{(\epsilon)}\}_{i=1}^n \overset{iid}{\sim} \mathbb{P}^{(\epsilon)}$ generated via

(6.1) $K_i^{(\epsilon)} \overset{\text{def}}{=} \{x \in \mathbb{R}^2 \,|\, \inf_{y \in S_i^{(\epsilon)}} \|x - y\| \le 0.2\}$, where

$S_i^{(\epsilon)} \overset{\text{def}}{=} \{(\tfrac{2}{5} + a_{1,i} \cos t,\ b_{1,i} \sin t) \,|\, \tfrac{1-\epsilon}{5}\pi \le t \le \tfrac{9+\epsilon}{5}\pi\} \cup \{(-\tfrac{2}{5} + a_{2,i} \cos t,\ b_{2,i} \sin t) \,|\, \tfrac{6\pi}{5} \le t \le \tfrac{14\pi}{5}\}$ and $\{a_{1,i}, a_{2,i}, b_{1,i}, b_{2,i}\}_{i=1}^n \overset{iid}{\sim} N(1, 0.05^2)$. The index $\epsilon$ quantifies the dissimilarity between $\mathbb{P}^{(\epsilon)}$ and $\mathbb{P}^{(0)}$. For each $\epsilon \in [0, 0.1]$, following the discussion in Section 5, we test the following hypotheses via the fdANOVA methods (i.e., FP, TRP-WTPS, and Algorithms 1 and 2): $H_0: m_\nu^{(0)}(t) = m_\nu^{(\epsilon)}(t)$ for all $(\nu, t) \in S^{d-1} \times [0,T]$, versus $H_1: m_\nu^{(0)}(t) \ne m_\nu^{(\epsilon)}(t)$ for some $(\nu, t)$, where the mean $m_\nu^{(\epsilon)}(t) \overset{\text{def}}{=} \int_{\mathcal{S}_{R,d}^M} \mathrm{SECT}(K)(\nu, t)\, \mathbb{P}^{(\epsilon)}(dK)$, and the null hypothesis $H_0$ is true when $\epsilon = 0$. We also test $H_0^*: \mathbb{P}^{(0)} = \mathbb{P}^{(\epsilon)}$ versus $H_1^*: \mathbb{P}^{(0)} \ne \mathbb{P}^{(\epsilon)}$ using Algorithm 3.

We set $T = 3$, directions $\nu_p = (\cos\frac{(p-1)\pi}{4}, \sin\frac{(p-1)\pi}{4})$ for $p \in \{1, 2, 3, 4\}$, levels $t_q = \frac{T}{50} q$ for $q \in \{1, \ldots, 50\}$ (i.e., $\Gamma = 4$ and $\Delta = 50$ in Algorithms 1, 2, and 3), the confidence level 95% (i.e., $\alpha = 0.05$), and the number of permutations $\Pi = 1000$. For each $\epsilon \in \{0, 0.01, 0.02, 0.03, 0.04, 0.05, 0.06, 0.08, 0.1\}$, we independently generate two collections $\{K_i^{(0)}\}_{i=1}^n \overset{iid}{\sim} \mathbb{P}^{(0)}$ and $\{K_i^{(\epsilon)}\}_{i=1}^n \overset{iid}{\sim} \mathbb{P}^{(\epsilon)}$ through (6.1) with the number of shape pairs set to $n = 100$, and we compute the SECT of each generated shape in directions $\{\nu_p\}_{p=1}^4$ and at levels $\{t_q\}_{q=1}^{50}$. We then apply the fdANOVA methods and Algorithm 3 to these computed SECT statistics and record the corresponding Accept/Reject outputs. We repeat this procedure 1000 times and report the rejection rates across all 1000 replicates for each $\epsilon$ in Table 1. The rejection rates are also visually presented in Figure 3. We choose $\Gamma = 4$ as the number of directions in our simulations based on the following observation: in Appendix K, we experiment with all combinations of $\Gamma \in \{2, 4, 8\}$, $\Delta \in \{25, 50, 100\}$, and $n \in \{25, 50, 100\}$. When $\Delta = 50$ and $n = 100$, the number $\Gamma = 4$ is sufficiently large for our Algorithms 1 and 2 to distinguish $\mathbb{P}^{(0)}$ from $\mathbb{P}^{(\epsilon)}$ with $\epsilon > 0$ at the significance level $\alpha = 0.05$. Moreover, this choice allows us to demonstrate that even a relatively small number of directions (e.g., $\Gamma = 4$) is sufficient for implementing our Algorithms 1 and 2.
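For reproducibility, a sketch of generating one simulated shape as a binary indicator image is given below. It follows the reconstruction of (6.1) and $S_i^{(\epsilon)}$ stated above (in particular the arc centers $\pm\frac{2}{5}$, which should be read as part of that reconstruction rather than as a verified specification), and the resulting shape can then be meshed or filtered directly to compute its SECT as in Section 3. Function and parameter names are illustrative.

import numpy as np
from scipy.spatial import cKDTree

def sample_simulation_shape(eps, rng, grid_size=200, tube_radius=0.2, extent=1.8):
    """Generate one shape K_i^(eps) from (6.1) as a binary indicator on a regular grid."""
    a1, a2, b1, b2 = rng.normal(1.0, 0.05, size=4)
    t1 = np.linspace((1 - eps) * np.pi / 5, (9 + eps) * np.pi / 5, 500)
    t2 = np.linspace(6 * np.pi / 5, 14 * np.pi / 5, 500)
    curve = np.vstack([
        np.column_stack([ 2 / 5 + a1 * np.cos(t1), b1 * np.sin(t1)]),   # first ellipse arc
        np.column_stack([-2 / 5 + a2 * np.cos(t2), b2 * np.sin(t2)]),   # second ellipse arc
    ])                                                                   # dense sample of S_i^(eps)
    xs = np.linspace(-extent, extent, grid_size)
    X, Y = np.meshgrid(xs, xs)
    pts = np.column_stack([X.ravel(), Y.ravel()])
    # K_i^(eps): grid points within distance 0.2 of the curve (nearest-neighbor query)
    dist, _ = cKDTree(curve).query(pts)
    return (dist <= tube_radius).reshape(grid_size, grid_size)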

Fig. 3 (Left panel) The relationship between ε and the rejection rates computed via Algorithms 1, 2, and 3 (see Table 1), and 12 existing fdANOVA methods (see Table J.1 in Appendix J for details on the existing fdANOVA methods). The (red) dashed line presents the significance level α = 0.05. (Right panel) The shapes in the first row are from $\mathbb{P}^{(0)}$, and the shapes in the second row are from $\mathbb{P}^{(0.08)}$.

The results in Table 1 and Figure 3 demonstrate that our proposed algorithms are effective at detecting the difference between $\mathbb{P}^{(\epsilon)}$ and $\mathbb{P}^{(0)}$ in terms of distinguishing their mean functions. Notably, our algorithms (especially Algorithm 2) tend to avoid falsely detecting differences between shape-generating distributions under the null hypothesis (i.e., $\epsilon = 0$). As $\epsilon$ increases, $\mathbb{P}^{(\epsilon)}$ deviates from $\mathbb{P}^{(0)}$, and the power of our algorithms in detecting the deviation increases. When $\epsilon \ge 0.08$, the power of Algorithms 1 and 2 exceeds 0.99. For all values of $\epsilon$ considered, it is difficult to see the deviation of $\mathbb{P}^{(\epsilon)}$ from $\mathbb{P}^{(0)}$ visually. For instance, by merely observing the shapes in the right panel of Figure 3, one might find it hard to differentiate between the shape collections generated by $\mathbb{P}^{(0)}$ (blue) and $\mathbb{P}^{(0.08)}$ (pink). However, in more than 99% of the simulations, our algorithms detect the difference between the two distributions. We also randomly shuffle the index $i$ within each collection $\{K_i^{(\epsilon)}\}_{i=1}^n$ and apply Algorithms 1 and 2 to the shuffled collections. The results obtained from the unshuffled and shuffled shape collections, respectively, are nearly identical. Algorithm 3 performs well in detecting the discrepancy between $\mathbb{P}^{(0)}$ and $\mathbb{P}^{(\epsilon)}$. However, its power under the alternative hypotheses (i.e., $\epsilon > 0$) is weaker than that of our Algorithms 1 and 2. Moreover, Algorithms 1 and 2 exhibit performance comparable to the twelve existing state-of-the-art fdANOVA methods (see Figure 3 and Table J.1 in Appendix J).

7 Applications

We first apply our proposed Algorithms 1 and 2 to the MPEG-7 shape silhouette database (Sikora 2001) as a toy example. Details of this are provided in Appendix I. This analysis shows that our proposed algorithms can distinguish between shape classes in the silhouette database and do not falsely identify signals when there are no differences between groups.

In this section, we apply our algorithms to address the motivating question in Section 1.1. Specifically, we use Algorithms 1 and 2 to distinguish between the four categories of mandibular molars in Figure 1 that are from four genera of primates. The shapes come from two suborders of primates: Haplorhini and Strepsirrhini (see Figure 1). In the haplorhine suborder collection, 29 molars came from the genus Tarsius (yellow panels in Figure 1), and 9 molars came from the genus Saimiri (grey panels in Figure 1). In the strepsirrhine collection, 11 molars came from the genus Microcebus (blue panels in Figure 1), and 6 molars came from the genus Mirza (green panels in Figure 1).

Before applying Algorithms 1 and 2, we preprocess the raw triangle mesh data of the surfaces of the molars by aligning them through the ECT alignment approach detailed in Appendix E. The aligned molars are presented in Figure 1. We apply our Algorithms 1 and 2 to the preprocessed molars. For each aligned molar, we compute its SECT for 2918 directions; in each direction, we use 200 sublevel sets. To compare any pair of molar groups, as a proof of concept, we select the smaller size of the two groups as the sample size input $n$ in our algorithms. For example, when comparing the Tarsius and Microcebus groups, we choose $n = 11$; that is, we compare the first 11 molars of the Tarsius group to all the molars in the Microcebus group. We apply our algorithms to the four groups of molars and present the results in Table 2. The p-values in Table 2 are either $\chi^2$-test p-values (Algorithm 1) or permutation-test p-values (Algorithm 2 with 1000 permutations). The small p-values ($P < 0.05$) in Table 2 show that our proposed algorithms can distinguish the four different genera of primates. Since the genera Microcebus and Mirza belong to the same suborder Strepsirrhini (see Figure 1), the p-value from Algorithm 2 is comparatively large when comparing molars from these two groups. In comparison, although Tarsius and Saimiri both belong to the suborder Haplorhini, the molars of the two genera look different. Specifically, the paraconids (i.e., the cusps highlighted in red in Figure 1) are retained only by the genus Tarsius and, thus, are a key reason for the small p-values ($P < 10^{-3}$) when comparing with molars from Saimiri. The other small p-values ($P < 10^{-3}$) in our analyses result from the corresponding genera belonging to different suborders.

In addition to testing the difference between genera, we apply our algorithms within the genus Tarsius. Specifically, we focus on the first 28 molars in the Tarsius group. We randomly split the 28 molars into two halves and apply Algorithms 1 and 2 to test the difference between the two halves. We repeat the random splitting procedure 100 times and present the corresponding p-values in Table 2, summarized by their mean and standard deviation (in parentheses). These p-values show that our proposed Algorithm 2 tends to avoid Type I errors for the molars from the genus Tarsius.

Landmark methods are widely used in geometric morphometrics. One state-of-the-art approach is the "Gaussian process landmarking (GPL)" algorithm (Gao et al. 2019; Gao, Kovalsky, and Daubechies 2019), which can automatically sample landmarks on the surfaces of the molars in Figure 1. Gao et al. (2019) showed that these sampled landmarks can induce a continuous Procrustes distance to measure the dissimilarity between molars. A permutation test can be derived using the Procrustes distance induced by the GPL algorithm. This test is detailed in Appendix H and is encapsulated by Algorithm 4. We use the GPL-based Algorithm 4 to differentiate the four collections of molars. For this, we use the MATLAB code from the GitHub repository provided by Gao et al. (2019) to compute the Procrustes distance. The performance of Algorithm 4 is reported in Table 2, which shows that the GPL-based method and our Algorithm 2 have comparable performance. However, repeatedly computing the Procrustes distance is time-consuming. Hence, Algorithm 2 is more computationally efficient than Algorithm 4 while achieving similar performance (see the last row of Table 2).

We want to note that, in addition to the GPL algorithm, many other existing methods can be applied to measure dissimilarity between molars, including parameterized surfaces (Kurtek et al. 2010, 2011) and approaches from computational anatomy (Grenander and Miller 1998). Similarly, parameterized curves (Kurtek et al. 2012) can also be used to analyze the silhouette database in Appendix I. An even more comprehensive comparison of our algorithms with the entire edifice of existing methods is left for future research.

8 Conclusions and Discussions

In this article, we established the mathematical foundations for the randomness of shapes via the SECT. Specifically, (i) $(\mathcal{S}_{R,d}^M, \mathcal{B}(\rho), \mathbb{P})$ was constructed as the underlying probability space; and (ii) the SECT was modeled as a $C(S^{d-1}; \mathcal{H})$-valued random variable. We further demonstrated several properties of the SECT ensuring its KL expansion, which led to a $\chi^2$-statistic for testing hypotheses on random shapes. In doing so, we bridged fdANOVA and TDA. Simulation studies corroborated our mathematical derivations and showed the performance of our hypothesis testing algorithms. Our approach was shown to be powerful in detecting the difference between two shape-generating distributions. We applied our proposed algorithms to silhouette and primate molar datasets. Importantly, our simulations with $\epsilon = 0$, together with the applications to the molars and the silhouette database, indicate that our algorithms tend to avoid falsely detecting differences between shape-generating distributions when there are none. Using the molars in Figure 1, we compared the performance of our algorithms to that of a permutation test based on a state-of-the-art landmarking algorithm (Gao et al. 2019; Gao, Kovalsky, and Daubechies 2019), underscoring the efficiency of our algorithms. We enumerate potential future research areas in Appendix M; for example, the fdANOVA methods can be used for studying brain connectivity (Chen et al. 2024; Meng and Eloyan 2024) via topological summaries.


Acknowledgments

We are grateful to the Editor, Associate Editor, and three Referees for their thorough review of our article and the insightful suggestions that have tremendously improved its quality. We want to thank Dr. Matthew T. Harrison from the Division of Applied Mathematics at Brown University for useful comments and suggestions. KM wants to thank Mattie Ji from the Department of Mathematics at Brown University for her insightful comments. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Supplementary Materials

The supplementary materials provide the proofs of theorems, further data analysis, and future research topics.

Data Availability Statement

The source code for implementing the simulation studies and applications is publicly available online at https://github.com/JinyuWang123/TDA.git.

Disclosure Statement

The authors report there are no competing interests to declare.

Additional information

Funding

LC would like to acknowledge the support of a David & Lucile Packard Fellowship for Science and Engineering. Research reported in this publication was partially supported by the National Institute on Aging of the National Institutes of Health under Award Number R01AG075511.

References

  • Boyer, D. M., Lipman, Y., Clair, E. S., Puente, J., Patel, B. A., Funkhouser, T., Jernvall, J., and Daubechies, I. (2011), “Algorithms to Automatically Quantify the Geometric Similarity of Anatomical Surfaces,” Proceedings of the National Academy of Sciences, 108, 18221–18226. Available at https://www.pnas.org/doi/abs/10.1073/pnas.1112822108. DOI: 10.1073/pnas.1112822108.
  • Brezis, H. (2011), Functional Analysis, Sobolev Spaces and Partial Differential Equations (Vol. 2), New York: Springer.
  • Chen, Y., Lin, Z., and Müller, H.-G. (2023), “Wasserstein Regression,” Journal of the American Statistical Association, 118, 869–882. DOI: 10.1080/01621459.2021.1956937.
  • Chen, Y., Lin, S.-C., Zhou, Y., Carmichael, O., Müller, H.-G., Wang, J.-L., and Alzheimer’s Disease Neuroimaging Initiative. (2024), “Gradient Synchronization for Multivariate Functional Data, with Application to Brain Connectivity,” Journal of the Royal Statistical Society, Series B. DOI: 10.1093/jrsssb/qkad140.
  • Cohen-Steiner, D., Edelsbrunner, H., and Harer, J. (2007), “Stability of Persistence Diagrams,” Discrete & Computational Geometry, 37, 103–120. DOI: 10.1007/s00454-006-1276-5.
  • Crawford, L., Monod, A., Chen, A. X., Mukherjee, S., and Rabadán, R. (2020), “Predicting Clinical Outcomes in Glioblastoma: An Application of Topological and Functional Data Analysis,” Journal of the American Statistical Association, 115, 1139–1150. DOI: 10.1080/01621459.2019.1671198.
  • Cuesta-Albertos, J., and Febrero-Bande, M. (2010), “A Simple Multiway ANOVA for Functional Data,” Test, 19, 537–557. DOI: 10.1007/s11749-010-0185-3.
  • Curry, J., Mukherjee, S., and Turner, K. (2022), “How Many Directions Determine a Shape and Other Sufficiency Results for Two Topological Transforms,” Transactions of the American Mathematical Society, Series B, 9, 1006–1043. DOI: 10.1090/btran/122.
  • Dunson, D. B., and Wu, N. (2021), “Inferring Manifolds from Noisy Data Using Gaussian Processes,” arXiv preprint arXiv:2110.07478.
  • Dupuis, P., Grenander, U., and Miller, M. I. (1998), “Variational Problems on Flows of Diffeomorphisms for Image Matching,” Quarterly of Applied Mathematics, LVI, 587–600. DOI: 10.1090/qam/1632326.
  • Edelsbrunner, H., and Harer, J. (2010), Computational Topology: An Introduction, Providence, RI: American Mathematical Society.
  • Fan, J. (1996), “Test of Significance Based on Wavelet Thresholding and Neyman’s Truncation,” Journal of the American Statistical Association, 91, 674–688. DOI: 10.1080/01621459.1996.10476936.
  • Fasy, B. T., Lecci, F., Rinaldo, A., Wasserman, L., Balakrishnan, S., and Singh, A. (2014), “Confidence Sets for Persistence Diagrams,” The Annals of Statistics, 42, 2301–2339. DOI: 10.1214/14-AOS1252.
  • Fasy, B. T., Micka, S., Millman, D. L., Schenfisch, A., and Williams, L. (2018), “Challenges in Reconstructing Shapes from Euler Characteristic Curves,” arXiv preprint arXiv:1811.11337.
  • Gao, T., Kovalsky, S. Z., Boyer, D. M., and Daubechies, I. (2019), “Gaussian Process Landmarking for Three-Dimensional Geometric Morphometrics,” SIAM Journal on Mathematics of Data Science, 1, 237–267. DOI: 10.1137/18M1203481.
  • Gao, T., Kovalsky, S. Z., and Daubechies, I. (2019), “Gaussian Process Landmarking on Manifolds,” SIAM Journal on Mathematics of Data Science, 1, 208–236. DOI: 10.1137/18M1184035.
  • Ghrist, R., Levanger, R., and Mai, H. (2018), “Persistent Homology and Euler Integral Transforms,” Journal of Applied and Computational Topology, 2, 55–60. DOI: 10.1007/s41468-018-0017-1.
  • Górecki, T., and Smaga, Ł. (2015), “A Comparison of Tests for the One-Way ANOVA Problem for Functional Data,” Computational Statistics, 30, 987–1010. DOI: 10.1007/s00180-015-0555-0.
  • ——— (2019), “fdANOVA: An R Software Package for Analysis of Variance for Univariate and Multivariate Functional Data,” Computational Statistics, 34, 571–597.
  • Goswami, A. (2015), “Phenome10K: A Free Online Repository for 3-D Scans of Biological and Palaeontological Specimens.”
  • Grenander, U., and Miller, M. I. (1998), “Computational Anatomy: An Emerging Discipline,” Quarterly of Applied Mathematics, 56, 617–694. DOI: 10.1090/qam/1668732.
  • Hairer, M. (2009), “An Introduction to Stochastic PDEs,” arXiv preprint arXiv:0907.4178.
  • Hatcher, A. (2002), Algebraic Topology, New York: Cambridge University Press.
  • Hsing, T., and Eubank, R. (2015), Theoretical Foundations of Functional Data Analysis, with an Introduction to Linear Operators (Vol. 997), Chichester: Wiley.
  • Guo, J., Zhou, B., and Zhang, J.-T. (2019), “New Tests for Equality of Several Covariance Functions for Functional Data,” Journal of the American Statistical Association, 114, 1251–1263. DOI: 10.1080/01621459.2018.1483827.
  • Jiang, Q., Kurtek, S., and Needham, T. (2020), “The Weighted Euler Curve Transform for Shape and Image Analysis,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp. 844–845.
  • Kendall, D. G. (1989), “A Survey of the Statistical Theory of Shape,” Statistical Science, 4, 87–99. DOI: 10.1214/ss/1177012582.
  • Kirveslahti, H., and Mukherjee, S. (2023), “Representing Fields Without Correspondences: The Lifted Euler Characteristic Transform,” Journal of Applied and Computational Topology, 8, 1–34. DOI: 10.1007/s41468-023-00133-w.
  • Kurtek, S., Klassen, E., Ding, Z., Jacobson, S. W., Jacobson, J. L., Avison, M. J., and Srivastava, A. (2010), “Parameterization-Invariant Shape Comparisons of Anatomical Surfaces,” IEEE Transactions on Medical Imaging, 30, 849–858. DOI: 10.1109/TMI.2010.2099130.
  • Kurtek, S., Klassen, E., Gore, J. C., Ding, Z., and Srivastava, A. (2011), “Elastic Geodesic Paths in Shape Space of Parameterized Surfaces,” IEEE Transactions on Pattern Analysis and Machine Intelligence, 34, 1717–1730. DOI: 10.1109/TPAMI.2011.233.
  • Kurtek, S., Srivastava, A., Klassen, E., and Ding, Z. (2012), “Statistical Modeling of Curves Using Shapes and Related Features,” Journal of the American Statistical Association, 107, 1152–1165. DOI: 10.1080/01621459.2012.699770.
  • Li, D., Mukhopadhyay, M., and Dunson, D. B. (2022), “Efficient Manifold Approximation with Spherelets,” Journal of the Royal Statistical Society, Series B, 84, 1129–1149. DOI: 10.1111/rssb.12508.
  • Marsh, L., Zhou, F. Y., Qin, X., Lu, X., Byrne, H. M., and Harrington, H. A. (2022), “Detecting Temporal Shape Changes with the Euler Characteristic Transform,” arXiv preprint arXiv:2212.10883.
  • Meng, K., and Eloyan, A. (2021), “Principal Manifold Estimation Via Model Complexity Selection,” Journal of the Royal Statistical Society, Series B, 83, 369–394. DOI: 10.1111/rssb.12416.
  • Meng, K., and Eloyan, A. (2024), “Population-Level Task-Evoked Functional Connectivity via Fourier Analysis,” Journal of the Royal Statistical Society, Series C. DOI: 10.1093/jrsssc/qlae015.
  • Mileyko, Y., Mukherjee, S., and Harer, J. (2011), “Probability Measures on the Space of Persistence Diagrams,” Inverse Problems, 27, 124007. DOI: 10.1088/0266-5611/27/12/124007.
  • Miller, E. (2015), “Fruit Flies and Moduli: Interactions between Biology and Mathematics,” Notices of the AMS, 62, 1178–1184.
  • Molchanov, I. (2005), Theory of Random Sets (Vol. 19), Dordrecht: Springer.
  • Pauly, M., Brunner, E., and Konietschke, F. (2014), “Asymptotic Permutation Tests in General Factorial Designs,” Journal of the Royal Statistical Society, Series B, 77, 461–473. DOI: 10.1111/rssb.12073.
  • Petersen, A., and Müller, H.-G. (2019), “Fréchet Regression for Random Objects with Euclidean Predictors,” The Annals of Statistics, 47, 691–719. DOI: 10.1214/17-AOS1624.
  • Reed, M., and Simon, B. (1972), Methods of Modern Mathematical Physics: Functional Analysis, New York: Academic Press.
  • Robinson, A., and Turner, K. (2017), “Hypothesis Testing for Topological Data Analysis,” Journal of Applied and Computational Topology, 1, 241–261. DOI: 10.1007/s41468-017-0008-7.
  • Roycraft, B., Krebs, J., and Polonik, W. (2023), “Bootstrapping Persistent Betti Numbers and Other Stabilizing Statistics,” The Annals of Statistics, 51, 1484–1509. DOI: 10.1214/23-AOS2277.
  • Schapira, P. (1995), “Tomography of Constructible Functions,” in International Symposium on Applied Algebra, Algebraic Algorithms, and Error-Correcting Codes, pp. 427–435, Springer.
  • Shen, Q., and Faraway, J. (2004), “An F Test for Linear Models with Functional Responses,” Statistica Sinica, 14, 1239–1257.
  • Sikora, T. (2001), “The MPEG-7 Visual Standard for Content Description-An Overview,” IEEE Transactions on Circuits and Systems for Video Technology, 11, 696–702. DOI: 10.1109/76.927422.
  • Tang, W. S., da Silva, G. M., Kirveslahti, H., Skeens, E., Feng, B., Sudijono, T., Yang, K. K., Mukherjee, S., Rubenstein, B., and Crawford, L. (2022), “A Topological Data Analytic Approach for Discovering Biophysical Signatures in Protein Dynamics,” PLoS Computational Biology, 18, e1010045. DOI: 10.1371/journal.pcbi.1010045.
  • Turner, K., Mukherjee, S., and Boyer, D. M. (2014), “Persistent Homology Transform for Modeling Shapes and Surfaces,” Information and Inference: A Journal of the IMA, 3, 310–344. DOI: 10.1093/imaiai/iau011.
  • van den Dries, L. (1998), Tame Topology and o-Minimal Structures (Vol. 248), Cambridge: Cambridge University Press.
  • Wang, B., Sudijono, T., Kirveslahti, H., Gao, T., Boyer, D. M., Mukherjee, S., and Crawford, L. (2021), “A Statistical Pipeline for Identifying Physical Features that Differentiate Classes of 3D Shapes,” The Annals of Applied Statistics, 15, 638–661. DOI: 10.1214/20-AOAS1430.
  • Zhang, J.-T. (2013), Analysis of Variance for Functional Data, Boca Raton, FL: CRC Press.
  • Zhang, J.-T., and Chen, J. (2007), “Statistical Inferences for Functional Data,” The Annals of Statistics, 35, 1052–1079. DOI: 10.1214/009053606000001505.