Full article: Multivariate temperature-series analysis of stress-induced ferroelectricity in SrTiO3: a machine learning approach with K-shape clustering and hierarchical Bayesian estimation

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

ABSTRACT

A new machine learning approach that transforms time-series analysis into temperature-series analysis is introduced to analyze stress-induced ferroelectricity in SrTiO₃ at 231 MPa using birefringence images observed at successive temperatures. The spatial distribution of the temperature-series data for each of the 42,280 pixels was clustered using the multivariate $K$ -shape clustering method based on the shape similarity of the temperature dependence. In addition, to obtain the structural and ferroelectric phase transition temperatures, $T_{c}$ and $T_{F}$ , hierarchical Bayesian temperature-series estimation was performed at each pixel (as a lower level) constrained over the entire cluster (as a higher level) considering the measurement error. Consequently, the K-shape clustering method revealed four clusters corresponding to elongated ferroelectric domains, explained by slight differences in retardance and fast-axis direction. Statistical analysis of the Bayesian posterior probability distribution showed a uniform distribution of $T_{c}$ over the sample, but an inhomogeneous distribution of $T_{F}$ . The higher $T_{F}$ regions exhibited a concentration of stress and/or strain. The Pearson correlation coefficient calculations suggested a strong to moderate relationship between the distribution of T_F and the ferroelectric state, while the correlation between T_c and the ferroelectric state was weak or nonexistent. The combination of machine learning and statistics provides a more reliable and less arbitrary approach to analyzing temperature-series data. These multilevel analyses are particularly useful in studying critical phenomena near phase transition temperatures in condensed matter physics.

GRAPHICAL ABSTRACT

IMPACT STATEMENT

Temperature-series analysis, combined with statistics, provides deeper insights into numerous imaging data observed at successive temperatures. This method will bring innovation to study of critical phenomena in condensed matter physics.

KEYWORDS:

1. Introduction

In condensed matter physics, one of the most important research topics is investigation of the complex relationship between temperature ( $T$ ) and physical properties, particularly critical phenomena occurring near phase transition temperatures [Citation1]. These phenomena gain an additional layer of complexity when intertwined with changes in external parameters, such as magnetic field, electric field, and pressure. An increase in the number of external parameters generates complex, multivariate data structures that escalate in volume and complexity [Citation2–4]. Traditional analysis methods, which are typically limited to examining variables in isolation, struggle to unravel these complex interactions. To bridge this gap, we have proposed an innovative machine learning approach for analyzing $T$ -dependent data, which is optimized for the study of critical phenomena in condensed matter physics.

Since the 2000s, remarkable progress has been made in imaging techniques for the detection of spatial information regarding physical properties [Citation5–10]. These developments are essential for the study of functional materials and focus on the formation and dynamics of macroscopic structures, such as domains or domain walls that affect magnetic, electrical, and transport properties. Therefore, advanced imaging techniques have become a fundamental part of materials research; however, such techniques have triggered an era of data explosion [Citation6–8]. For example, spectral imaging collects the wavelength ( $λ$ ) or energy dependence at each pixel to obtain a single image [Citation2,Citation4,Citation5]. Extracting meaningful insights from a large dataset remains challenging despite advances in measurement techniques. Traditional analysis methods, based on averaging data from selective regions of interest, are limited in scope and prone to arbitrary bias. They often fail to analyze the overall trends of complex distributions and may miss local but important details. This generates an opportunity for machine learning, i.e. by applying advanced algorithms, we can extract deeper insights from a large dataset that previously remained inaccessible [Citation11]. Pioneering research in condensed matter physics has involved the use of multivariate analysis methods based on statistics, i.e. principal component analysis [Citation12,Citation13], a multivariate curve resolution technique [Citation14], and nonnegative matrix factorization [Citation15], to analyze spectral imaging data. Although these methods are adept at extracting features by reducing multivariate data to a lower dimension, they prove insufficient at learning from data to make predictions or decisions for new data. Our machine learning approach is designed to analyze $T$ -dependent imaging data, thus providing more accurate and flexible analysis than before.

We developed a novel machine learning approach for analyzing $T$ -dependent birefringence imaging data, focusing on stress-induced ferroelectricity in SrTiO $_{3}$ [Citation16–18]. Although such phase transition phenomena have been known for a long time [Citation19–22], the high electrical conductivity caused by strain-induced ferroelectricity at the interface between SrTiO $_{3}$ and LaAlO $_{3}$ has attracted renewed attention [Citation23–26]. Birefringence is a technique that uses the polarization properties of light to reveal both the internal structure and dielectric properties of materials. In this study, to effectively manage and interpret numerous birefringence images at different temperatures, we employ machine learning techniques that focus on unsupervised learning methods such as clustering algorithms [Citation27]. These techniques are important for identifying underlying patterns and correlations not readily apparent in a large dataset. Pioneering research has been conducted on the application of machine learning to the analysis of reflection high-energy electron diffraction images [Citation28–31]. Our previous study reported the application of a $K$ -means clustering technique to the analysis of a birefringence image [Citation18]. However, the analysis of $T$ dependence over numerous images remains a substantial challenge. In this study, we focus on the similarity between time-series and $T$ -series data for efficient analysis of successive images.

Because $T$ -series analysis methods have not yet been developed, our approach attempts to simply convert the time variables into the $T$ variables. Time-series analysis is a major topic in machine learning with enormous related knowledge available [Citation32,Citation33]; therefore, the application of these methods to $T$ -series analysis can lead to considerable development in condensed matter physics. First, we recognize that series data have the following unique properties: (1) a distinct sequential order, which is essential for accurate interpretation; (2) a relational dynamic, where each data point depends on its neighbors; (3) a contextual essence, where the true value of the series is unlocked when viewed as a cohesive whole. With these insights, the $T$ -dependent birefringence images of the stress-induced ferroelectricity in SrTiO $_{3}$ are analyzed. For spatial distribution analysis, we use the $K$ -shape clustering method to identify the ferroelectric domain structures [Citation34]. In traditional time-series analysis, Euclidean distance and dynamic time warping are commonly used although they are sensitive to variations in the data scale and can overlook similarities in shapes [Citation32,Citation33]. The $K$ -shape clustering method is a specialized technique for clustering based on the shape of time-series data. Shape similarity is one of the most important features that forms the scaling law in condensed matter physics [Citation1]. Furthermore, for the pixel-by-pixel $T$ -series analysis, we use Bayesian $T$ -series estimation to identify the structural and ferroelectric phase transition temperatures ( $T_{c}$ and $T_{F}$ ) [Citation35,Citation36]. The Bayesian approach has an important advantage of directly incorporating prior knowledge and uncertainties such as measurement error directly into the modeling process, allowing us to perform comprehensive and probabilistic parameter inference based on the posterior probability distribution. Our combined machine learning approach toward spatial distributions and $T$ variations provides a deeper understanding of complex data structures in earlier impossible ways. This approach overcomes the limitations of traditional analysis methods in condensed matter physics and opens new avenues for exploring various critical phenomena, potentially revolutionizing the way researchers approach a large dataset.

2. Dataset for analysis

2.1. Birefringence images in SrTiO₃

Machine learning was applied to $T$ -series birefringence imaging data for the stress-induced ferroelectricity of SrTiO $_{3}$ . Under stress-free conditions, SrTiO $_{3}$ undergoes a structural phase transition at $T_{c} = 105$ K from the cubic paraelectric state to the tetragonal quantum paraelectric state, where the ferroelectric state is suppressed by quantum fluctuations [Citation37–39]. The quantum paraelectric state undergoes ferroelectric phase transition upon the application of stress or strain [Citation19–22,Citation24–26,Citation40–42]. For a SrTiO $_{3}$ thin film, strain-induced ferroelectricity appears at room temperature because of the lattice mismatch between the substrate and the thin film [Citation24–26,Citation40–42]. Similarly, for a SrTiO $_{3}$ bulk sample, when the external force ( $σ$ ) is applied along $[001]_{c}$ , whose direction corresponds to the cubic phase, $T_{c}$ increases from 105 K with increasing $σ$ [Citation20–22]. At $T \leq 30$ K, ferroelectric phase transition occurs, resulting in spontaneous polarization along $[010]_{c}$ ; moreover, $T_{F}$ increases with increasing $σ$ [Citation20–22]. In the previous experimental studies of the stress-induced ferroelectricity in SrTiO $_{3}$ , the value of $σ$ has been considered as stress because the external and internal forces ( $=$ stress) are balanced. Unlike hydrostatic pressure experiments wherein $σ$ is applied uniformly in all directions, the stress caused by $σ$ is unevenly distributed throughout the sample. This is because $σ$ is applied from only one axial direction, and stress tends to be distributed perpendicularly to $σ$ . Therefore, the stress-induced ferroelectric state may also be inhomogeneous because of the stress distribution. According to a study conducted by Müller et al. in Ref [Citation19], when $σ ∥ [111]_{c}$ , the relationship between the converted stress ( $σ_{Cv}$ ) and $T_{c}$ is given by

(1)

σ_{Cv} = 30 \times (T_{c} - 105) (MPa) .

(1)

Reportedly, this relationship also holds when $σ ∥ [001]_{c}$ [Citation16]. In this study, we used imaging techniques to evaluate the local stress and ferroelectric domain structure from the distributions of $T_{c}$ and $T_{F}$ on a pixel-by-pixel basis.

We obtained $T$ -dependent birefringence images of the SrTiO $_{3} (110)$ substrate at $σ = 231$ MPa applied along $[001]_{c}$ when $T$ decreases from 300.0 to 14.1 K, as already reported in Ref [Citation16]. To fully understand the stress-induced ferroelectric phase transition, our analysis focuses on capturing the inhomogeneous distribution of critical phenomena in the substrate. Because the SrTiO $_{3} (110)$ substrate at $σ = 231$ MPa belongs to the plastic region, the linear relationship between stress and strain disappears [Citation17]. According to theoretical studies on SrTiO $_{3}$ thin films [Citation43–47], the lattice distortion functions as a driving force for ferroelectricity and $T_{F}$ is expected to increase with increasing strain. However, because our experimental method cannot help determine whether the observed birefringence is more influenced by stress or strain, we do not distinguish between stress and strain.

The measurement principles required for machine learning with respect to analyzing the $T$ -series birefringence imaging data are described as follows. Incident white light with circular polarization expressed as the Stokes parameters ( $S_{1}^{I n}, S_{2}^{I n}, S_{3}^{I n}) ≃ (0, 0, - 1$ ) is passed through the SrTiO $_{3}$ (110) substrate along $[110]_{c}$ , and the polarization state of the transmitted light is obtained as ( $S_{1}^{O u t}, S_{2}^{O u t}, S_{3}^{O u t}$ ) [Citation48,Citation49]. The polarization state at each point of $384 \times 288$ pixels can be obtained through a single measurement using a CCD camera with polarizing plates. When the refractive indices along the orthogonal principal optical axes are defined as $n_{1}$ and $n_{2}$ for the slow and fast axes with $n_{1} \geq n_{2} \geq 1$ , respectively, the birefringence ( $Δ n$ ) and retardance ( $δ$ ) are expressed as follows:

(2)

Δ n \equiv n_{1} - n_{2},

(2)

(3)

δ \equiv Δ n \times t,

(3)

where $t$ represents the sample thickness ( $t = 0.333$ mm), and the fast-axis direction ( $ψ$ ) corresponds to the direction for $n_{2}$ . To elucidate the large value of $δ$ exceeding $λ / 2$ , we observed the transmitted light separated into three different $λ$ s at 523, 543, and 575 nm using bandpass filters [Citation50]. In case of the SrTiO $_{3} (110)$ substrate at $σ = 231$ MPa, the value of $δ$ helps evaluate the amount of spontaneous polarization due to the Pockels effect [Citation51,Citation52] and stress and/or strain due to the photoelastic effect [Citation16,Citation17]. In addition, the $ψ$ direction reflects the crystal orientation; therefore, its distribution image can be used to infer the distribution of the spontaneous polarization direction [Citation53–56]. As reported in Ref [Citation16], show images of $δ$ and $ψ$ at 130.9 K, corresponding to the cubic paraelectric state. Notably, these images are converted at $λ = 543$ nm. Stripe patterns due to the slip planes appear even at 300.0 K, and their directions coincide with the directions of the projection of $⟨ 111 ⟩_{c}$ onto $(110)$ . shows an image of the $⟨ 111 ⟩_{c}$ stripe pattern; the origin of this image can be attributed to the fact that the fast-axis directions indicated by the arrows differ by a few degrees between the elongated domains. These images show considerably similar distributions to those observed at 30.0 (paraelectric tetragonal state) and 14.1 K (ferroelectric state) shown in of Ref [Citation18]. Because the $δ$ and $ψ$ values correspond to a certain cross-section of the Poincaré sphere comprising the Stokes parameters [Citation18,Citation48,Citation49], the analysis results from different approaches may appear different. Furthermore, it is challenging to quantitatively assess the amount and direction of spontaneous polarization by simultaneously examining the $δ$ and $ψ$ distribution images. Therefore, a multivariate analysis using machine learning must be developed to mine the hidden information in $T$ -series birefringence imaging data.

Figure 1. In SrTiO $_{3}$ (110) substrate at an external force of $σ = 231$ MPa applied along $[001]_{c}$ , images at 130.9 K of (a) retardance $δ$ and (b) fast-axis direction $ψ$ . The large open arrows schematically indicate $σ$ . In (a), the cross marks indicate the analysis position, as shown in . The order is defined from the left end to the right end as a1 to a7. (c) Schematic illustration of the origin of the stripe pattern along $⟨ 111 ⟩_{c}$ . The arrows are the fast-axis directions and differ by a few degrees between the elongated domains.

In our previous research, reported in Ref [Citation18], we applied the $K$ -means clustering method to the birefringence imaging data to observe the ferroelectric domains in SrTiO $_{3}$ at 231 MPa. This approach involved subtraction between the 30.0-K and 14.1-K datasets on a pixel-by-pixel basis, and it effectively simplified the $T$ -series data for spatial clustering. Our results identified three distinct clusters (D1–D3, see Supplementary materials), each with unique characteristics. Cluster D1 exhibited large spontaneous polarization and disordered crystal orientation, indicating stress and/or strain concentration. Conversely, cluster D3 showed small spontaneous polarization but a more uniform distribution and less disordered crystal orientation. Furthermore, cluster D2 showed intermediate properties. This difference between clusters highlights the complex state of the stress-induced ferroelectricity in SrTiO $_{3}$ . However, our method could not adequately visualize the domain structures along $⟨ 111 ⟩_{c}$ ; this limitation is attributed to the reduction of $T$ -series information in the clustering process. Therefore, there is an urgent need for advanced analysis techniques that comprehensively integrate spatial information and $T$ -dependent data. The development of such methods could considerably improve the accuracy of visualization of ferroelectric domain structures and provide deeper insights into the complex $T$ dependence of birefringence in case of SrTiO $_{3}$ . This approach will provide a new perspective for the analysis of critical phenomena in condensed matter physics.

2.2. Preprocessing of birefringence imaging data

We preprocessed our data sets to efficiently perform machine learning without sacrificing accuracy. Focusing on the SrTiO $_{3} (110)$ substrate, birefringence imaging data at each $T$ were extracted at 42,280 pixels (302 $\times$ 140 pixels). As previously described in Ref [Citation18], the Stokes parameters ( $S_{1}^{I n}, S_{2}^{I n}, S_{3}^{I n}$ ) and ( $S_{1}^{O u t}, S_{2}^{O u t}, S_{3}^{O u t}$ ) at each $T$ were averaged over a 3 $\times$ 3 pixel area using the moving average method. Because the sample position shifts with decreasing $T$ , the analysis locations must be adjusted by one pixel unit and averaging is most effective in suppressing the effect of deviations of less than one pixel. Using these averaged Stokes parameters, we subsequently defined the important variables $θ$ and $ϕ$ , setting the stage for our subsequent analysis on the Poincaré sphere [Citation18]. The angle $θ$ , corresponding to $δ$ , was defined within the range $[0^{\circ}, 180^{\circ}]$ with a periodicity of $360^{\circ}$ . The axial direction $ϕ$ , reflecting the direction of the fast or slow axis, was restricted to $[0^{\circ}, 180^{\circ})$ , also with a periodicity of $180^{\circ}$ . With these definitions, we generally expressed the retardance as $δ = | l \pm θ \div 360^{\circ} | \times λ$ , where $l = 0, 1, 2, \dots$ , and the fast-axis direction as $ψ = (ϕ \pm 90^{\circ}) / 2$ . Whenever $δ$ exceeds $λ / 2, λ, 3 λ / 2, \dots$ , the angle $θ$ folds back at 0 $^{\circ}$ or 180 $^{\circ}$ ; therefore, the variables for $l$ and $\pm$ change concurrently. The accuracy of machine learning predictions can be improved using the $T$ -series data for the three $λ$ s without corrections for these variables on a pixel-by-pixel basis [Citation18].

Because the variables $θ$ and $ϕ$ are angular measures, we adopted circular statistics for our analysis [Citation57,Citation58]. This approach allows us to consider angles as vectors, which simplifies their computation and interpretation. From Ref [Citation18], the mean (or maximum likelihood estimate) of $θ$ and $ϕ$ is expressed as follows:

(4)

cos \overline{θ} = \sum_{i = 1}^{m} (\frac{cos θ_{i}}{{\overline{R}}_{θ}}) \times \frac{1}{m},

(4)

(5)

sin \overline{θ} = \sum_{i = 1}^{m} (\frac{sin θ_{i}}{{\overline{R}}_{θ}}) \times \frac{1}{m},

(5)

(6)

cos 2 \overline{ϕ} = \sum_{i = 1}^{m} (\frac{cos 2 ϕ_{i}}{{\overline{R}}_{ϕ}}) \times \frac{1}{m},

(6)

(7)

sin 2 \overline{ϕ} = \sum_{i = 1}^{m} (\frac{sin 2 ϕ_{i}}{{\overline{R}}_{ϕ}}) \times \frac{1}{m},

(7)

(8)

{\overline{R}}_{θ} = \frac{\sqrt{{(\sum_{i = 1}^{m} cos θ_{i})}^{2} + {(\sum_{i = 1}^{m} sin θ_{i})}^{2}}}{m},

(8)

(9)

{\overline{R}}_{ϕ} = \frac{\sqrt{{(\sum_{i = 1}^{m} cos 2 ϕ_{i})}^{2} + {(\sum_{i = 1}^{m} sin 2 ϕ_{i})}^{2}}}{m},

(9)

where $m$ denotes the number of data points and $\overline{R}$ represents the mean resultant length. The value of $\overline{R}$ varies between 0 and 1 and indicates the alignment of the distribution, i.e. values closer to 1 signify a well-aligned distribution. The maximum likelihood estimation of $θ$ and $ϕ$ can be reformulated as follows:

(10)

tan \overline{θ} = \frac{\sum_{i = 1}^{m} sin θ_{i}}{\sum_{i = 1}^{m} cos θ_{i}},

(10)

(11)

tan 2 \overline{ϕ} = \frac{\sum_{i = 1}^{m} sin 2 ϕ_{i}}{\sum_{i = 1}^{m} cos 2 ϕ_{i}} .

(11)

This allows angles to be treated as linear statistics [Citation57,Citation58]. Consequently, the four variables ( $cos θ_{i} / {\overline{R}}_{θ}$ , $sin θ_{i} / {\overline{R}}_{θ}$ , $cos 2 ϕ_{i} / {\overline{R}}_{ϕ}$ , and $sin 2 ϕ_{i} / {\overline{R}}_{ϕ}$ ) are derived from $θ_{i}$ and $ϕ_{i}$ at the $i$ -th measurement point. After successfully applying circular statistics, we extended this procedure to our datasets for three different $λ$ s, creating a 42,280 $\times$ 12 matrix for a birefringence image. In this study, it is necessary to further develop a method for analyzing the numerous images observed at successive temperatures. Because all $T$ -series data are standardized for each variable before machine learning, as discussed later, ${\overline{R}}_{θ}$ and ${\overline{R}}_{ϕ}$ are not necessary for analyzing the $T$ dependence only at a pixel. However, the $K$ -shape clustering would be more reliable by preserving information regarding the distribution within the substrate. Therefore, as a preprocessing of the 12-variable $T$ -series data for 42,280 pixels, we calculated ${\overline{R}}_{θ}$ and ${\overline{R}}_{ϕ}$ for each birefringence image. These careful preprocessing steps help accurately analyze our $T$ -series birefringence imaging data and set the stage for insightful machine learning.

The stress apparatus made of Cu thermally shrinks with decreasing $T$ , causing a shift in the sample position [Citation59]. With 300.0 K as reference, the birefringence image at 14.1 K showed shifts of 23 pixels to the left and 4 pixels to the bottom. Therefore, we manually corrected the sample position by one pixel unit in all images. Previously, we selected as homogeneous a region of the sample as possible and averaged the data within a 10–30 $\times$ 10–30-pixel area to ensure that the shift of less than one pixel was negligible [Citation16]. However, since the $T$ -series data is analyzed on a pixel-by-pixel basis, the 3 $\times$ 3 pixel averaging preprocessing proved insufficient in regions where the amount of birefringence changes abruptly. In these regions, because small jumps appear in $δ (T)$ and $ψ (T)$ curves, conventional time-series analysis methods, which mainly focus on the temporal differences of the changes, would cause serious problems [Citation32,Citation33]. In this study, we adopted the multivariate $K$ -shape clustering method [Citation34] because it is robust to changes in data scale and time deviations, effectively classifying data into the same cluster based on shape similarity. It also maximizes the cross-correlation within a cluster, increasing the likelihood of grouping similar data. Furthermore, for the $T$ -series analysis using the $δ (T)$ and $ψ (T)$ curves, such jumps will inevitably have a negative impact on the estimation of $T_{c}$ and $T_{F}$ values. Because it is impossible to check the $T$ -series data for all 42,280 pixels, the Bayesian $T$ -series estimation should be used, considering error factors based on certain criteria [Citation35,Citation36]. Using the Bayesian posterior probability distribution, the confidence and accuracy of the estimates can be quantitatively understood, allowing discussion of the relationship between the stress and/or strain concentration and the ferroelectric state.

3. Data analysis procedure

Our computer was equipped with an Intel(R) Core(TM) i9-13900K CPU running up to 5.80 GHz and implemented 128 GB of memory. To meet the demands of datasets larger than 20 GB, we avoided data input/output bottlenecks on hard disk drives and increased data throughput many times by installing a solid-state drive. For the $K$ -shape clustering analysis, we used the tslearn package (version 0.5.3.2), a common tool for time-series analysis in the Python environment (version 3.11.1). To determine the optimal number of clusters $k$ , a combination of methods was used, including the elbow method, silhouette method, and gap statistic [Citation60], all of which are integrated in the tslearn package. The computation time, inherently dependent on $k$ , was approximately half a day for computing nine different cluster types ranging from $k = 2$ to 10. For Bayesian $T$ -series estimation, we used the computational power of the Hamiltonian Monte Carlo (HMC) algorithm, a sophisticated technique for Markov chain Monte Carlo (MCMC) simulations, run through the rstan package (version 2.26.23) in the R environment (version 4.1.2). To efficiently process the Bayesian $T$ -series estimation for each of the 42,280 pixels, we employed a strategy that used the parallel package (version 4.1.2), foreach package (version 1.5.2), and doParallel package (version 1.0.17) for parallel computations. This methodology helped us take advantage of parallel processing on up to 32 cores. The computational time depended not only on the number of iterations and chains in the MCMC simulations, but also on the settings of the prior distribution. After extensive optimization, we could estimate the fast-axis directions in a few minutes, as well as $T_{F}$ and $T_{c}$ in approximately a day and a week, respectively. By applying $T$ -series analysis using hierarchical Bayesian $T$ -series estimation, we aim to complement our spatial data analysis using $K$ -shape clustering and provide a more holistic understanding of stress-induced ferroelectricity.

Our approach to $T$ -series analysis, which relates changes in sample $T$ to changes in time, presents unique challenges and opportunities for deeper insight. To address these challenges, we focus on statistical significance in time-series analysis. In time-series analysis, direct tests of statistical significance often require complex procedures tailored to the unique characteristics of the data. Nevertheless, in our approach, we demonstrate the utility of the exploratory values obtained from the MCMC simulations within the Bayesian framework. These values provide an indirect but an effective approach to assess differences between groups by providing insights into posterior distributions. Because of the large sample size in the MCMC simulations, the $t$ test and the Wilcoxon rank sum test are prone to type 1 errors, but Cohen’s $d$ remains robust to them [Citation61,Citation62]. Cohen’s $d$ with Hedges’ correction was calculated using the effsize package (version 0.8.1) in the R environment. Cohen classified effect sizes as small ( $| d | = 0.2$ ), medium ( $| d | = 0.5$ ), and large ( $| d | \geq 0.8$ ). According to Cohen’s guidelines, $0.2 \leq | d | < 0.5$ indicates a subtle but noticeable difference, $0.5 \leq | d | < 0.8$ indicates a moderate and observable difference, and $| d | \geq 0.8$ indicates a substantial and practically significant difference between two groups [Citation61,Citation63]. Statistical treatment using Cohen’s $d$ with Hedges’ correction is important for drawing robust conclusions from $T$ -dependent patterns. The normal-based 95% confidence interval calculated using the normal distribution method is based on the standard error and the assumption that our estimates follow a normal distribution. However, because the data may not follow a parametric distribution, we used the bootstrap 95% confidence interval, which is derived by bootstrap resampling and does not assume a specific distribution. This analysis was performed in the R environment using the boot package (version 1.3.28).

4. Results and discussion

shows the time and measuring point evolution of sample $T$ . As described in Ref [Citation16], our measurements continuously recorded 3,362 sheets $\times$ 3 $λ$ s of the birefringence imaging data from 300.0 to 14.1 K while carefully controlling the decrease of $T$ at a fixed rate of $- 0.25$ K/min. Because of the consistent relationship between $T$ and time, which accurately reflects the order and interval characteristics of the series data, it is feasible to apply the $T$ -series birefringence imaging data to machine learning using $K$ -shape clustering and hierarchical Bayesian $T$ -series estimation.

Figure 2. (a) Sample temperature variation with time and measurement points. Regions of insufficient temperature control, (b) affecting the determination of the structural phase transition temperature, and (c) affecting the determination of ferroelectric phase transition temperature.

4.1. K-shape clustering

To comprehensively characterize the spatial information, the $T$ -series birefringence imaging data were analyzed using the $K$ -shape clustering method. shows the $T$ dependence of ${\overline{R}}_{θ}$ and ${\overline{R}}_{ϕ}$ calculated at each $T$ . The ${\overline{R}}_{θ} (T)$ curves indicate that the distribution of $δ$ becomes large below 30 K because ferroelectric phase transition occurs and spontaneous polarization is inhomogeneously distributed over the substrate, as reported in Ref [Citation18]. The ${\overline{R}}_{ϕ} (T)$ curves for different $λ$ s show a remarkable difference near $T_{c}$ and $T_{F}$ . Furthermore, the ${\overline{R}}_{θ} (T)$ curves also seem to slightly differ in the same $T$ regions. This is attributed to the $θ$ folding back at 180 $^{\circ}$ when $δ$ exceeds $λ / 2$ at 523 and 543 nm. shows the $T$ dependence of the 4-variable $\times$ 3- $λ$ for each pixel, specifically, the seven pixels shown in , selected from a total of 42,280 pixels. These curves show numerous small jumps due to the continuous shift of the sample position. Based on a comparison of with , owing to the considerably small variation in ${\overline{R}}_{θ}$ and ${\overline{R}}_{ϕ}$ below 30 K, the shape of the $T$ -dependent curves of 12 variables divided by their coefficients remains almost unchanged. Furthermore, around $T_{c}$ , ${\overline{R}}_{ϕ}$ changes considerably because of the folding back of $θ$ . These coefficients tend to effectively expand the difference between the pixels in the $T$ -series data. The ${\overline{R}}_{θ}$ and ${\overline{R}}_{ϕ}$ estimates are expected to further improve the accuracy of the $K$ -shape clustering process. However, as shown in ), it is difficult to completely control sample $T$ owing to the limitations of experimental techniques, resulting in small fluctuations within certain $T$ ranges. Such fluctuations temporarily impose a nonintrinsic external force on the substrate, resulting in small birefringence anomalies that can be observed across all pixels. This is a considerable difference between the $T$ -series data and the time-series data. In conventional time-series analysis, the order can easily affect the results; therefore, the $K$ -shape clustering method, which focuses on shape similarity, is preferred in condensed matter physics [Citation32–34]. Although there is no intrinsic thermal hysteresis in SrTiO $_{3}$ , we entered machine learning in its measured order in this study.

Figure 3. Temperature dependence of the mean resultant length (a) ${\overline{R}}_{θ}$ and (b) ${\overline{R}}_{ϕ}$ for $λ = 523$ , 543, and 575 nm.

Figure 3. Temperature dependence of the mean resultant length (a) R‾θ and (b) R‾ϕ for λ=523, 543, and 575 nm.

Figure 4. Four variables ( $cos θ_{i} / {\overline{R}}_{θ}$ , $sin θ_{i} / {\overline{R}}_{θ}$ , $cos 2 ϕ_{i} / {\overline{R}}_{ϕ}$ , and $sin 2 ϕ_{i} / {\overline{R}}_{ϕ}$ ) as a function of temperature at (a–d) $λ = 523$ nm, (e–h) 543 nm, and (i–l) 575 nm. Analysis positions a1 to a7 and the corresponding cluster numbers to which they belong.

Figure 4. Four variables (cosθi/R‾θ, sinθi/R‾θ, cos2ϕi/R‾ϕ, and sin2ϕi/R‾ϕ) as a function of temperature at (a–d) λ=523 nm, (e–h) 543 nm, and (i–l) 575 nm. Analysis positions a1 to a7 and the corresponding cluster numbers to which they belong.

To optimize computational efficiency, we preprocessed the 42,280-pixel $\times$ 12-variable $T$ -series data between 130.9 and 14.1 K by averaging every four $T$ points, resulting in each $T$ -series containing 346 points with an approximate step of 0.34 K. For the multivariate $K$ -shape clustering method, determining the number of clusters $k$ in advance is imperative. shows the results of identifying the optimal values of $k$ using the elbow method, silhouette method, and gap statistic [Citation60]. Typically, these methods rarely yield consistent results because each emphasizes different aspects of cluster cohesion and separation. In other words, the elbow method focuses on the variance of the data in the clusters, the silhouette method focuses on the high cohesion in the clusters and the large separation between the clusters, and the gap statistic technique focuses on the difference between the random and real datasets. In this study, because the elbow and silhouette methods indicate $k = 4$ , as shown in , we selected this value. Although these indicators shown in serve as guidelines [Citation60], re-evaluating the validity of $k = 4$ after clustering is essential. shows the division of the 12-variable $T$ -series data into four groups (E1–E4), revealing the elongated clusters along $⟨ 111 ⟩_{c}$ corresponding to the dislocation. Comparing the multivariate $K$ -shape clustering shown in with the multivariate $K$ -means clustering (D1-D3, see Supplementary materials) [Citation18], we found that clusters E1 and E2 coincide with clusters D1 and D2, indicating large spontaneous polarization in these regions due to stress and/or strain concentration. Conversely, clusters E3 and E4, which correspond to cluster D3, could be regions of low spontaneous polarization and less disordered crystal orientation. shows the number of pixels for each cluster obtained from the $K$ -means and $K$ -shape clustering analyses. This result also corroborates that E1 and E2 correspond to D1 and D2, respectively, E3 and E4 correspond to D3.

Figure 5. Evaluation results for the optimal value of $k$ using the (a) elbow method, (b) silhouette method, and (c) gap statistic.

Figure 6. Result of the K-shape clustering obtained from 12 variables of temperature-series data between 130.9 and 14.1 K.

Table 1. Number of pixels in respective clusters determined by K-means and K-shape clustering methods. The K-means clustering data are referenced in Ref. [Citation18].

Download CSV Display Table

4.2. Analysis for each cluster

In , it appears that domains E1 and E3 along $⟨ 111 ⟩_{c}$ form one group, while domains E2 and E4 form another. The $T$ dependence of $δ$ and $ψ$ was summed of each cluster to confirm the characteristics for each cluster. The averaged values of the retardance, $δ^{Av}$ , and the fast-axis direction, $ψ^{A v}$ , for each cluster were obtained by averaging the difference between the incident and transmitted light of the Stokes parameters at each $T$ , defined as $(S_{1}^{A v}, S_{2}^{A v}, S_{3}^{A v})$ , as follows:

S_{1}^{A v} \equiv \frac{\sum_{i = 1}^{m} {(S_{1}^{O u t} - S_{1}^{I n})}_{i}}{m},

(12)

S_{2}^{A v} \equiv \frac{\sum_{i = 1}^{m} {(S_{2}^{O u t} - S_{2}^{I n})}_{i}}{m},

(12)

S_{3}^{A v} \equiv \frac{\sum_{i = 1}^{m} {(S_{3}^{O u t} - S_{3}^{I n})}_{i}}{m},

where $(\dots)_{i}$ represents the difference between the Stokes parameters at the $i$ -th pixel. shows the $T$ dependence of $δ^{Av}$ and $ψ^{Av}$ at 575 nm when $θ$ and $ϕ$ are rewritten as follows:

(13)

δ = (1 - \frac{θ}{360^{\circ}}) \times λ,

(13)

(14)

ψ = \frac{ϕ + 90^{\circ}}{2} .

(14)

Figure 7. Temperature dependence of the (a) averaged retardance, $δ^{Av}$ , and (b) averaged fast-axis direction, $ψ^{Av}$ , for $λ = 575$ nm. In (a), the $δ^{Av} (T)$ curves for E1 and E2 overlap, and the curves for E3 and E4 overlap.

These curves are almost identical to those for 523 and 543 nm, except for the $\sim$ $T_{c}$ and $\sim$ $T_{F}$ regions. In these regions, it is difficult to correctly estimate the value of $δ^{Av}$ from $(S_{1}^{A v}, S_{2}^{A v}, S_{3}^{A v})$ because we cannot separate the two components of $θ$ that fold back at 180 $^{\circ}$ and do not fold back on a pixel-by-pixel basis. In this study, we focused our analysis on $T$ -series data at 575 nm.

In , the $δ^{Av} (T)$ curves exhibit a similar shape regardless of the clusters, with two inflection points appearing at $\sim$ $T_{c}$ and $\sim$ $T_{F}$ . In addition, the values of $δ^{Av}$ are overall higher for E1 and E2 than for E3 and E4 because of stress and/or strain concentration. However, for E1 and E2, the direction of spontaneous polarization may be disturbed because the crystal orientations are disturbed, as discussed in Ref [Citation18]. The slopes of the $δ^{Av} (T)$ curves at $T < T_{F}$ (the ferroelectric state) appear to be steeper for E3 and E4 than for E1 and E2 because the increase in $δ$ due to the Pockels effect may be proportional to the superposition of spontaneous polarizations resulting from disturbed crystal orientations. In , the $ψ^{Av} (T)$ curves show numerous small jumps due to shifts in the sample position. In particular, the jumps of the $ψ^{Av} (T)$ curves for E1 and E2 exhibit opposite trends with almost identical magnitudes in the same $T$ range. As shown in , the effect of less than one pixel misalignment is pronounced because the vertically elongated domains for E1 and E2 alternate along the horizontal direction where the displacement is large. From a crystallographic viewpoint, it is unlikely that the optical principal axis rotates by a few degrees in the cubic or tetragonal phase with decreasing $T$ . However, if the crystal orientations are disturbed by stress and/or strain concentration, the averaged direction may rotate slightly with decreasing $T$ . Consequently, it remains inconclusive whether the $ψ^{Av} (T)$ curves vary with decreasing $T$ . In this study, assuming that the values of $ψ^{Av}$ remain constant with respect to $T$ between 130.9 and 14.1 K, we performed a Bayesian $T$ -series estimation of the $T$ -independent value of $ψ^{Av}$ for each cluster considering the measurement error.

The HMC algorithm for the MCMC simulations was used to derive the $T$ -independent value of $ψ^{Av}$ for each cluster. To ensure a comprehensive exploration of the parameter space, 16 independent MCMC chains were run, each with different random initial values. A burn-in period of 1,000 iterations was used, followed by the collection of 100,000 samples for analysis. The prior probability and model Hamiltonian for $ψ^{Av}$ are expressed as follows:

(15)

tan 2 ψ_{n}^{A v} \sim N ({\hat{t}}_{2 ψ}, {σ_{obs}}^{2}),

(15)

{\hat{t}}_{2 ψ} \sim N (0, 1^{2}),

σ_{obs} \sim |N {(0, 1}^{2})|,

where the normal distribution $N (\overset{ˉ}{x}, σ^{2})$ in linear statistics is a mean $\overset{ˉ}{x}$ and a standard deviation $σ \geq 0$ . The variable $ψ_{n}^{A v}$ denotes the experimental data of $ψ^{Av}$ at the $n$ -th $T$ point, while ${\hat{t}}_{2 ψ}$ serves as a parameter in the MCMC simulations. Using circular statistics, the mean (maximum likelihood estimate) of the angles is expressed as shown in EquationEquations (10)(10) $tan \overline{θ} = \frac{\sum_{i = 1}^{m} sin θ_{i}}{\sum_{i = 1}^{m} cos θ_{i}},$ (10) and (Equation11(11) $tan 2 \overline{ϕ} = \frac{\sum_{i = 1}^{m} sin 2 ϕ_{i}}{\sum_{i = 1}^{m} cos 2 ϕ_{i}} .$ (11) ). Because of the summation of trigonometric functions, it is not easy for the Bayesian approach to successively evaluate the error of each measurement.

To incorporate the error in each measurement into the Bayesian $T$ -series estimation, the derivation process of $ϕ$ and $ψ$ should be considered. As explained in Ref [Citation18], the angle $ϕ$ is represented by a periodicity of 180 $^{\circ}$ in the $S_{1} S_{2}$ -plane. Because all data between 130.9 and 14.1 K, regardless of the clusters, are concentrated around $ϕ ≃ 90^{\circ}$ in the third and fourth quadrants, as shown schematically in , the angles of $ϕ_{+}$ and $ϕ_{-}$ can be calculated as follows: if $S_{1} \geq 0$ and $S_{2} < 0$ ,

(16)

ϕ_{+} = 180^{\circ} + arctan (\frac{S_{2}^{O u t} - S_{2}^{I n}}{S_{1}^{O u t} - S_{1}^{I n}}),

(16)

Figure 8. Schematic drawing of the Stokes parameters $S_{1}^{O u t}$ and $S_{2}^{O u t}$ in the $S_{1} S_{2}$ -plane. All our experimental data for $λ = 575$ nm between 130.9 and 14.1 K show $ϕ ≃ 90^{\circ}$ in the third and fourth quadrants. The definitions of $ϕ_{+}$ and $ϕ_{-}$ differ depending on the sign of $S_{1}^{O u t}$ as in Equations (16) and (17).

while if $S_{1} < 0$ and $S_{2} < 0$ ,

(17)

ϕ_{-} = arctan (\frac{S_{2}^{O u t} - S_{2}^{I n}}{S_{1}^{O u t} - S_{1}^{I n}}) .

(17)

However, since “ $\arctan$ ’’ is a multivalued function, it is necessary to adjust $\pm 180^{\circ}$ to match the angle as shown in . Using EquationEquation (14)(14) $ψ = \frac{ϕ + 90^{\circ}}{2} .$ (14) , EquationEquations (16)(16) $ϕ_{+} = 180^{\circ} + arctan (\frac{S_{2}^{O u t} - S_{2}^{I n}}{S_{1}^{O u t} - S_{1}^{I n}}),$ (16) and (Equation17(17) $ϕ_{-} = arctan (\frac{S_{2}^{O u t} - S_{2}^{I n}}{S_{1}^{O u t} - S_{1}^{I n}}) .$ (17) ) can be rewritten as follows:

(18)

tan ϕ_{\pm} = - \frac{1}{tan 2 ψ} .

(18)

Because the angle $ϕ$ has the $180^{\circ}$ periodicity, to avoid “ $tangent$ ’’ singularities, the relationship between the Stokes parameters and $ψ$ expressed as

(19)

tan 2 ψ_{n}^{A v} = - \frac{{[\sum_{i} {(S_{1}^{O u t} - S_{1}^{I n})}_{i}]}_{n}}{{[\sum_{i} {(S_{2}^{O u t} - S_{2}^{I n})}_{i}]}_{n}} ≃ 0,

(19)

where $[\dots]_{n}$ is the sum of the differences in the Stokes parameters at the $n$ -th $T$ point. In our birefringence imaging measurements, the variables $(S_{1}^{O u t}, S_{2}^{O u t}, S_{3}^{O u t})$ at all pixels are identified simultaneously in a single measurement, assuming the condition $(S_{1}^{O u t})^{2} + (S_{2}^{O u t})^{2} + (S_{3}^{O u t})^{2} = 1$ for the respective pixels. Furthermore, as mentioned in $§$ 2.1, the incident light was set to $(S_{1}^{I n}, S_{2}^{I n}, S_{2}^{I n}) ≃ (0, 0, - 1)$ [Citation16]. Therefore, the measurement error of $(S_{1}^{O u t} / S_{2}^{O u t})$ in a single measurement at a certain $T$ is important, and the Bayesian $T$ -series estimation based on linear statistics was performed in a simplified manner using EquationEquation (19)(19) $tan 2 ψ_{n}^{A v} = - \frac{{[\sum_{i} {(S_{1}^{O u t} - S_{1}^{I n})}_{i}]}_{n}}{{[\sum_{i} {(S_{2}^{O u t} - S_{2}^{I n})}_{i}]}_{n}} ≃ 0,$ (19) . The convergence of the MCMC simulations was assessed using trace plots and Gelman – Rubin statistics [Citation64]. The trace plots showed effective mixing and overlap between the chains. As detailed in Supplementary materials, the effective sample size ( $n_{eff}$ ) derived from the MCMC simulations exceeded $\sim$ 40% of the total sample size, which consisted of 1,584,000 iterations. An $n_{eff}$ lower than 10% of the total iterations typically indicates a high degree of autocorrelation, suggesting that the MCMC chain may not be exploring the parameter space efficiently. Conversely, our results with $n_{eff}$ well above this threshold indicate that the MCMC simulations were sufficiently efficient, with lower autocorrelation, ensuring a robust exploration of the parameter space. The Gelman – Rubin diagnostic value ( $\hat{R}$ ) was 1.00 for all three parameters including the log probability of the unconstrained parameters, indicating satisfactory convergence between the chains (see Supplementary materials).

shows the results of the analysis for $ψ^{Av}$ to visually represent the posterior distributions for the MCMC simulations of ${\hat{t}}_{2 ψ}$ . shows a box and whisker plot of the exploration values in the fast-axis direction, expressed as ${\hat{ψ}}^{Av}$ $(= (arctan {\hat{t}}_{2 ψ}) \div 2 ≃ 90^{\circ}$ ) [Citation65]. shows the probability density histogram for ${\hat{ψ}}^{Av}$ for E3 and E4. The scales of the angular measures in these figures are very narrow, ranging from ${89.4}^{\circ}$ to ${91.7}^{\circ}$ ; thus, they can be treated as linear statistics. Therefore, these figures allow a direct comparison to detect any significant differences between the four clusters. shows the five-number summary, i.e. minimum, first quartile $(\equiv Q_{1})$ , median, third quartile $(\equiv Q_{3})$ , and maximum, and mean for ${\hat{ψ}}^{Av}$ of each cluster needed to construct the plot in . In our analysis, outliers in the box and whisker plots are identified based on a common statistical criterion involving the interquartile range (IQR) defined as $(Q_{3} - Q_{1})$ . Exploration values outside the range from $(Q_{1} - 1.5 \times I QR)$ to $(Q_{3} + 1.5 \times I QR)$ are considered outliers [Citation65]. This approach is widely accepted for outlier detection because it provides a balance between sensitivity to extreme values and robustness to the natural variability of the data. Although the exploration of the parameter space for ${\hat{ψ}}^{Av}$ was set wide, i.e. $2 {\hat{ψ}}^{Av} \sim (180 \pm arctan 1.96) =$ 117–243 $^{\circ}$ , both the minimum and maximum values in fell within a narrower range than the variation in the $ψ^{Av} (T)$ curves shown in . In Bayesian estimation, the prior probability distribution is used to reduce the computational cost by narrowing the range of variables. However, if this range is extremely restrictive, it can limit the exploration of the parameter space, potentially leading to incomplete or biased results. Thus, it is crucial to balance the reduction in computational cost with the need for sufficient exploration. The relationship between exploration values and range limits is a key consideration in the Bayesian approach (see Supplementary materials). The difference in the mean of ${\hat{ψ}}^{Av}$ between E1 and E2 is about 1.9 $^{\circ}$ , and this becomes the origin of the stripe pattern along $⟨ 111 ⟩_{c}$ , as shown in . To further assess the significant difference in ${\hat{t}}_{2 ψ}$ between E3 and E4, Cohen’s $d$ with Hedges’ correction was calculated as $| d | = 5.603$ [Citation61,Citation62]. In addition, after 100,000 resamplings of the $| d |$ value, the bootstrap 95% confidence interval is $[5.549, 5.659]$ . This indicates a significant difference between E3 and E4, and agrees well with the probability density histogram shown in . From ${\hat{ψ}}^{Av}$ , it can be quantitatively seen that E1 and E3 form one pair and E2 and E4 form another pair in the elongated domains along $⟨ 111 ⟩_{c}$ . These results confirm the successful separation of $K$ -shape clustering with $k = 4$ . This Bayesian approach, although indirect, simplifies the assessment of significant differences between clusters. In addition, the stripe pattern along $⟨ 111 ⟩_{c}$ , which remained ambiguous in the $K$ -means clustering, was separated by the $K$ -shape clustering. This may be because the $K$ -means clustering used the absolute value of the difference between the 30.0 and 14.1 K datasets to reduce the computational cost, possibly eliminating some information in $ψ$ [Citation18].

Figure 9. (a) A box and whisker plot of the exploration values of the averaged fast-axis direction, expressed as ${\hat{ψ}}^{Av}$ . (b) Probability density histogram for ${\hat{ψ}}^{Av}$ for E3 and E4. The scales of the angular measures in these figures are highly narrow; thus, they can be treated as linear statistics.

Figure 9. (a) A box and whisker plot of the exploration values of the averaged fast-axis direction, expressed as ψˆAv. (b) Probability density histogram for ψˆAv for E3 and E4. The scales of the angular measures in these figures are highly narrow; thus, they can be treated as linear statistics.

Table 2. Five-number summary (minimum, first quartile, median, third quartile, and maximum) and mean of exploration values of fast-axis direction for respective clusters expressed as ${\hat{ψ}}^{Av}$ .

Display Table

4.3. Hierarchical Bayesian temperature-series estimation

From previous studies of stress-induced ferroelectricity in SrTiO $_{3}$ , it was found that both $T_{c}$ and $T_{F}$ increase with increasing stress [Citation19–22]. As reported in Ref [Citation16], the value of $δ$ also increases due to increased stress and/or strain due to the photoelastic effect. The purpose of this study is to clarify the distribution of stress and/or strain from $T_{c}$ and the distribution of the ferroelectric state from $T_{F}$ on a pixel-by-pixel basis. The previous method typically estimates the phase transition temperature from the intersection of two straight lines on the $δ (T)$ curve using the least squares method [Citation16–18]. However, this method is sensitive to outliers and cannot account for measurement errors due to its point estimation methodology. To ensure consistent quality of analysis across all 42,280 different $T$ -series data, we used hierarchical Bayesian $T$ -series estimation. This fits individual models at lower levels, guided by constraints from the higher-level model. In other words, as shown in , the phase transition temperatures were estimated for each pixel while simultaneously estimating the overall phase transition temperature within the cluster.

Figure 10. (a) Representation of the hierarchical Bayesian $T$ -series estimation for phase transition temperatures at each pixel ( $T_{c}^{i}$ and $T_{F}^{i}$ for $i = 1$ –42,280) and for the entire cluster ( $T_{c}^{E j}$ and $T_{F}^{E j}$ for $j = 1$ –4). Schematic representation of the model Hamiltonian used for the hierarchical Bayesian $T$ -series estimation of (b) $T_{F}$ and (c) $T_{c}$ from the $δ (T)$ curve at position a3.

Because the estimation of $T_{F}$ is a more intuitive treatment than that of $T_{c}$ , the hierarchical Bayesian $T$ -series estimation method for $T_{F}$ is explained first. For the $T_{F}$ estimation, we used the intersection of two straight lines, as shown schematically in . The notations $T_{F}^{i}$ for the pixels ( $i = 1 \sim 42, 280$ ) and $T_{F}^{E j}$ for the clusters ( $j = 1 \sim 4$ ) denote the ferroelectric transition temperature at the $i$ -th pixel and as a whole in the E $j$ cluster, respectively. However, because the number $i$ is a sequential number unrelated to the cluster number $j$ , it is necessary to count only the skipped number $i$ belonging to each cluster based on the $K$ -shape clustering result shown in . As shown in , a positive slope is assumed for $T \geq T_{F}^{i}$ (the paraelectric region), and a negative slope is assumed for $T < T_{F}^{i}$ (the ferroelectric region). For this analysis, the $δ (T)$ data at 575 nm from 40.0 to 14.1 K were entered into the machine learning in their measured order. To reduce the computational cost, the $δ (T)$ curves were smoothed by averaging every two $T$ points for the $δ$ values, resulting in 159 points with an interval of approximately 0.17 K between 40.0 and 14.1 K. In addition, the smoothed $δ (T)$ curve at the $i$ -th pixel was standardized, denoted as $Y^{i} (n)$ , where $n$ is the $n$ -th $T$ point. Standardization makes it easier to define a more appropriate prior probability distribution because all data can be treated on the same scale without affecting the estimation of $T_{F}^{E j}$ and $T_{F}^{i}$ . After extensive optimization, the prior probability distribution was defined as follows:

T_{F}^{E j} \sim f (T_{F}^{E j}; 22.0, 4.0)

(20)

\equiv \frac{1}{π} \frac{4.0}{{(T_{F}^{E j} - 22.0)}^{2} + {4.0}^{2}},

(20)

(21)

T_{F}^{i} \sim N (T_{F}^{E j}, {0.3}^{2}),

(21)

where $f (T; x_{0}, γ)$ is the Cauchy distribution varying $T$ with median $x_{0}$ and scale parameter $γ$ . Compared with the normal distribution, the heavier tails of the Cauchy distribution are more robust to outliers, thereby providing a more flexible and tolerant approach to modeling errors in data with potential anomalies. The range limits for $T_{F}^{E j}$ and $T_{F}^{i}$ are set to

(22)

16.0 K < T_{F}^{E j} < 35.0 K,

(22)

(23)

16.0 K < T_{F}^{i} < 35.0 K .

(23)

The probability distributions for $T_{F}^{E j}$ and $T_{F}^{i}$ are broad, but $T_{F}^{i}$ is closely related to $T_{F}^{E j}$ in EquationEquation (21)(21) $T_{F}^{i} \sim N (T_{F}^{E j}, {0.3}^{2}),$ (21) . Under these conditions, the model Hamiltonian is as follows: if $T \geq T_{F}^{i}$ (the paraelectric state),

(24)

Y^{i} (n) \sim N (\hat{a} + {\hat{b}}_{1} \times (X (n) - T_{F}^{i}), {σ_{obs}}^{2}),

(24)

σ_{obs} \sim |f (σ_{obs}; 0, 2)|,

while if $T < T_{F}^{i}$ (the ferroelectric state),

(25)

Y^{i} (n) \sim N (\hat{a} + {\hat{b}}_{2} \times (X (n) - T_{F}^{i}), {σ_{obs}}^{2}),

(25)

σ_{obs} \sim |f (σ_{obs}; 0, 2)|,

where $X (n)$ is the temperature at the $n$ -th $T$ point, i.e. $X (1) = 40.0$ K, $\dots$ , $X (159) = 14.1$ K. Unlike EquationEquation (13)(13) $δ = (1 - \frac{θ}{360^{\circ}}) \times λ,$ (13) , EquationEquations (24)(24) $Y^{i} (n) \sim N (\hat{a} + {\hat{b}}_{1} \times (X (n) - T_{F}^{i}), {σ_{obs}}^{2}),$ (24) and (Equation25(25) $Y^{i} (n) \sim N (\hat{a} + {\hat{b}}_{2} \times (X (n) - T_{F}^{i}), {σ_{obs}}^{2}),$ (25) ) use linear statistics to address the errors in $Y^{i} (n)$ because we estimate the intersection of two straight lines on the linear graph.

Below 40.0 K, the displacement of the sample position is negligible because the stress apparatus underwent sufficient thermal shrinkage. Therefore, as shown in , the measurement error corresponding to $σ_{obs}$ could be small. The parameters ${\hat{b}}_{1}$ and ${\hat{b}}_{2}$ in EquationEquations (24)(24) $Y^{i} (n) \sim N (\hat{a} + {\hat{b}}_{1} \times (X (n) - T_{F}^{i}), {σ_{obs}}^{2}),$ (24) and (Equation25(25) $Y^{i} (n) \sim N (\hat{a} + {\hat{b}}_{2} \times (X (n) - T_{F}^{i}), {σ_{obs}}^{2}),$ (25) ) were constrained to ${\hat{b}}_{1} > 0$ (a positive slope) and ${\hat{b}}_{2} < 0$ (a negative slope), respectively, but the parameter $\hat{a}$ was not constrained. The MCMC simulations were performed with different random initial values for 20,000 iterations each over eight chains, with the first 1,000 being the warm-up period. As detailed in the Supplementary materials, the $n_{eff}$ derived from the MCMC simulations exceeded $\sim$ 40% of the total sample size, which consisted of 152,000 iterations. For convergence diagnostics, the Gelman – Rubin statistics were computed for seven parameters, including the log probability of the unconstrained parameters associated with the estimation of $T_{F}^{E j}$ and $T_{F}^{i}$ , and $\hat{R}$ of all variables and pixels remained below 1.05, indicating satisfactory convergence (see Supplementary materials).

To estimate $T_{c}$ , the structural phase transition temperature at each pixel, $T_{c}^{i}$ ( $i = 1 \sim 42, 280$ ), and in the E $j$ cluster, $T_{c}^{E j}$ ( $j = 1 \sim 4$ ), were determined similar to $T_{F}$ . shows that the $δ (T)$ peak at $\sim$ 120 K has a large width and significant jump and bend, making it difficult to use two straight lines to identify the intersection. Despite the computational burden, we could detect a trend change from increasing to decreasing by comparing the values of $δ$ at the current $T$ and a previous $T$ . This approach could be called ‘Bayesian temperature-series change point detection’ as it is derived from the time-series analysis [Citation33]. In this method, the order of the $T$ -series data and the uniformity of the $T$ intervals may affect the estimation results; however, the data were entered in their measured order without any correction. In , several types of slopes can be observed around the peak top. In this study, by setting a wide distribution of measurement error, we detected temperatures that entered a strongly decreasing trend (see Supplementary materials). The $δ (T)$ curve between 149.7 and 100.1 K for each pixel was smoothed by averaging every eight $T$ points, resulting in a series of 74 points and an interval of approximately 0.68 K. Although the $T$ interval of the $T_{c}$ estimate is larger than that of the $T_{F}$ estimate, this discrepancy is not a problem because the $T$ intervals are approximately the same when normalized by the respective phase transition temperatures based on the scaling laws [Citation1]. Furthermore, the smoothed $δ (T)$ curve between 149.7 and 100.1 K at the $i$ -th pixel was standardized and denoted as $Y^{i} (n)$ . To explore the parameter space more independently, we defined the structural phase transition temperatures as follows:

(26)

T_{c}^{i} = T_{c}^{E j} + T_{c}^{o f f i},

(26)

where $T_{c}^{o f f i}$ is the deviation between $T_{c}^{i}$ and $T_{c}^{E j}$ . Post optimization, the prior probability for the MCMC simulations was defined as follows:

(27)

107.0 K < T_{c}^{E j} < 130.0 K,

(27)

T_{c}^{E j} = N (117.0, {1.0}^{2}),

(28)

- 0.5 K < T_{c}^{o f f i} < + 0.5 K,

(28)

T_{c}^{o f f i} = N (0, 5^{2}) .

We set a wider range for $T_{c}^{E j}$ but a narrower range for $T_{c}^{o f f i}$ . Preliminary runs indicated that exploring ${\hat{T}}_{c}^{off i}$ does not significantly affect the estimation of $T_{c}^{i}$ , as ${\hat{T}}_{c}^{off i}$ is uniformly distributed over the substrate (see Supplementary materials). To minimize the risk of converging to local minima, we systematically reduced the range limit while closely monitoring the posterior probability distribution. The final exploration range was then determined using EquationEquation (28)(28) $- 0.5 K < T_{c}^{o f f i} < + 0.5 K,$ (28) , considering the balance between the computational time and accuracy. Under these conditions, the model Hamiltonian is as follows: if $T \geq T_{c}^{i}$ (the cubic phase),

(29)

Y^{i} (n) = N (Y^{i} (n - 1) + {\hat{c}}_{1}, {σ_{obs}}^{2}),

(29)

σ_{obs} = |f (σ_{obs}; 0, 5)|,

while if $T < T_{c}^{i}$ (the tetragonal phase),

(30)

Y^{i} (n) = N (Y^{i} (n - 1) + {\hat{c}}_{2}, {σ_{obs}}^{2}),

(30)

σ_{obs} = |f (σ_{obs}; 0, 5)|,

where $Y^{i} (n)$ and $Y^{i} (n - 1)$ are the standardized $δ (T)$ curves at the $i$ -th pixel for the $n$ -th and ( $n - 1$ )-th $T$ points, respectively. The measurement error $σ_{obs}$ was set to a distribution that was sufficiently wide to be regarded as an essentially uniform distribution. The parameters ${\hat{c}}_{1}$ and ${\hat{c}}_{2}$ in EquationEquations (29)(29) $Y^{i} (n) = N (Y^{i} (n - 1) + {\hat{c}}_{1}, {σ_{obs}}^{2}),$ (29) and (Equation30(30) $Y^{i} (n) = N (Y^{i} (n - 1) + {\hat{c}}_{2}, {σ_{obs}}^{2}),$ (30) ) are assumed as ${\hat{c}}_{1} > 0$ (an increasing trend) and ${\hat{c}}_{2} < 0$ (a decreasing trend), respectively. The MCMC simulations were performed with different random initial values for 20,000 iterations each over four chains, with the first 1,000 as the warm-up period. As detailed in the Supplementary materials, the $n_{eff}$ derived from the MCMC simulations exceeded $\sim$ 20% of the total sample size, which consisted of 76,000 iterations. The convergence diagnostics used the Gelman – Rubin statistics for 81 parameters, including the log probability of the unconstrained parameters associated with the estimation of $T_{c}^{E j}$ and $T_{c}^{o f f i}$ , and $\hat{R}$ of all variables and pixels were below 1.05, indicating satisfactory convergence (see Supplementary materials).

4.4. Analysis for each pixel

shows the results of the exploration values for $T_{c}$ , expressed as ${\hat{T}}_{c}^{E j}$ and ${\hat{T}}_{c}^{i}$ , to visually represent the posterior distributions for the MCMC simulations. shows a box and whisker plot of ${\hat{T}}_{c}^{E j}$ ( $j = 1 \sim 4$ ). presents a five-number summary and the mean required for constructing this plot. The exploration of the parameter space for $T_{c}^{E j}$ is set wide in EquationEquation (27)(27) $107.0 K < T_{c}^{E j} < 130.0 K,$ (27) ; therefore, the minimum and maximum values are widely scattered, but their ranges are sufficiently narrower than their limits. No significant differences in ${\hat{T}}_{c}^{E i}$ were observed between clusters. shows a probability density histogram for ${\hat{T}}_{c}^{i}$ , summed over the exploration values at all pixels in each cluster. Although this plot is unsuitable for understanding the characteristics of ${\hat{T}}_{c}^{i}$ on a pixel-by-pixel basis, the distribution of ${\hat{T}}_{c}^{i}$ showed no significant differences between clusters. We calculated the Pearson correlation coefficient ( $r$ ) to examine the relationship between ${\hat{T}}_{c}^{E j}$ (the higher level) and ${\hat{T}}_{c}^{i}$ (the lower level). Because the coefficient $r$ measures the relative relationship between two groups, changes in the scale or position of the variables do not affect the strength or direction of the correlation. In general, a linear relationship is considered strong when $| r | > 0.7$ , moderate when $0.3 < | r | \leq 0.7$ , and weak or indicative of almost no correlation when $| r | \leq 0.3$ [Citation61]. For the correlation between ${\hat{T}}_{c}^{E j}$ and ${\hat{T}}_{c}^{i}$ , the coefficient $r$ exceeds 0.94 for all pixels, because the exploration of ${\hat{T}}_{c}^{off i}$ did not affect the estimation of ${\hat{T}}_{c}^{i}$ . This indicates a uniform distribution of ${\hat{T}}_{c}^{i}$ over the substrate (see Supplementary materials). shows the distribution of the mean of ${\hat{T}}_{c}^{i}$ for all pixels, which appears almost uniform. Although the standard deviation is often used when discussing the degree of distribution, it does not accurately represent the characteristics of the distribution if the dataset has an asymmetric distribution or numerous outliers. Furthermore, to compare the distribution of $T_{c}$ and $T_{F}$ , we must use indicators that can be analyzed at different scales. Therefore, a quantitative assessment of the variation in the mean of ${\hat{T}}_{c}^{i}$ is facilitated by , which presents the calculations of median absolute deviation (MAD) and coefficient of variation (CV) [Citation65]. In addition, the bootstrap 95% confidence interval is also shown after 100,000 resamplings. The MAD metric, which indicates the median deviation from the median, provides a robust assessment of data scatter that is less sensitive to outliers. The CV metric, which is the standard deviation divided by the mean, reflects the relative variation. These metrics are useful for comparing datasets of different scales. From , the MAD shows that the variation is smaller for E1 and E2 than for E3 and E4, but the difference is small. Furthermore, the CV shows that the standard deviation is estimated to be approximately 0.1 $\sim$ 0.2% of the mean; thus, the overall variation in the mean of ${\hat{T}}_{c}^{i}$ is almost quantitatively uniform.

Figure 11. (a) A box and whisker plot of exploration values of structural phase transition temperatures in clusters, expressed as ${\hat{T}}_{c}^{E 1}$ , ${\hat{T}}_{c}^{E 2}$ , ${\hat{T}}_{c}^{E 3}$ , and ${\hat{T}}_{c}^{E 4}$ . (b) Probability density histogram of the sum of the exploration values of ${\hat{T}}_{c}^{i}$ for each pixel when structural phase transition temperatures at a pixel are expressed as $T_{c}^{i}$ . All histograms overlap. (c) Distribution plot of the mean of ${\hat{T}}_{c}^{i}$ .

Figure 11. (a) A box and whisker plot of exploration values of structural phase transition temperatures in clusters, expressed as TˆcE1, TˆcE2, TˆcE3, and TˆcE4. (b) Probability density histogram of the sum of the exploration values of Tˆci for each pixel when structural phase transition temperatures at a pixel are expressed as Tci. All histograms overlap. (c) Distribution plot of the mean of Tˆci.

Table 3. Five-number summary and mean of exploration values of the structural phase transition temperatures for respective clusters expressed as ${\hat{T}}_{c}^{E 1}$ , ${\hat{T}}_{c}^{E 2}$ , ${\hat{T}}_{c}^{E 3}$ , and ${\hat{T}}_{c}^{E 4}$ .

Display Table

Table 4. Median absolute deviation (MAD) and coefficient of variation (CV) in the distribution of the mean of ${\hat{T}}_{c}^{i}$ and ${\hat{T}}_{F}^{i}$ for respective clusters. The upper panel of each column displays the values of MAD and CV, whereas the lower panel provides the bootstrap 95% confidence intervals for them.

Display Table

shows the results of the exploration values for $T_{F}$ , represented as ${\hat{T}}_{F}^{E j}$ and ${\hat{T}}_{F}^{i}$ . shows a box and whisker plot of ${\hat{T}}_{F}^{E j}$ ( $j = 1 \sim 4$ ). provides a five-number summary and the mean required for constructing this plot. For E1 and E2, ${\hat{T}}_{F}^{E 1}$ and ${\hat{T}}_{F}^{E 2}$ are scattered on the high- $T$ side; thus, the mean is more than 0.5 K above the median and the IQR is more than twice as large as for E3 and E4. shows a probability density histogram for ${\hat{T}}_{F}^{i}$ , summed over the exploration values at all pixels in each cluster. In E1 and E2, the histograms show 3–4 peaks, indicating a mixture of several groups. Conversely, for E3 and E4, the histograms are concentrated on the lower- $T$ side. For the correlation between ${\hat{T}}_{F}^{E j}$ and ${\hat{T}}_{F}^{i}$ in each cluster, the coefficients $r$ are widely scattered within $0.1 < r < 0.6$ , although the means of $r$ in the respective clusters are 0.25–0.31 (see Supplementary materials). This indicates that the lower-level estimation is successfully performed for each pixel, while being restricted by the higher-level estimation of the entire cluster. shows the distribution of the mean of ${\hat{T}}_{F}^{i}$ . The MAD and CV, and their bootstrap 95% confidence intervals after 100,000 resamplings were calculated to evaluate the distributions of the mean of ${\hat{T}}_{F}^{i}$ and are shown in . The MAD results show that the distributions of the mean of ${\hat{T}}_{F}^{i}$ from the median are more than twice as wide for E1 and E2 than for E3 and E4. The CV results show that the standard deviation of the mean of ${\hat{T}}_{F}^{i}$ is approximately 8% of the mean for E1 and E2, and approximately 4% of the mean for E3 and E4. For both metrics, E1 and E2 are a pair and E3 and E4 are another pair. shows Cohen’s $d$ with Hedges’ correction of distributions of ${\hat{T}}_{F}^{i}$ and the bootstrap 95% confidence interval after 10,000 resamplings of $| d |$ for each pair of clusters to assess significant differences. Consequently, significant differences exist between the clusters, except for the pair E1 and E2 and pair E3 and E4. This is consistent with the $δ^{A v} (T)$ curves shown in .

Figure 12. (a) A box and whisker plot of exploration values of ferroelectric phase transition temperatures in clusters, expressed as ${\hat{T}}_{F}^{E 1}$ , ${\hat{T}}_{F}^{E 2}$ , ${\hat{T}}_{F}^{E 3}$ , and ${\hat{T}}_{F}^{E 4}$ . (b) Probability density histogram of the sum of the exploration values of ${\hat{T}}_{F}^{i}$ for each pixel when structural phase transition temperatures at a pixel are expressed as $T_{F}^{i}$ . The histograms for E1 and E2 overlap, and those for E3 and E4 overlap. (c) Distribution plot of the mean of ${\hat{T}}_{F}^{i}$ .

Figure 12. (a) A box and whisker plot of exploration values of ferroelectric phase transition temperatures in clusters, expressed as TˆFE1, TˆFE2, TˆFE3, and TˆFE4. (b) Probability density histogram of the sum of the exploration values of TˆFi for each pixel when structural phase transition temperatures at a pixel are expressed as TFi. The histograms for E1 and E2 overlap, and those for E3 and E4 overlap. (c) Distribution plot of the mean of TˆFi.

Table 5. Five-number summary and mean of exploration values of the ferroelectric phase transition temperatures for respective clusters expressed as ${\hat{T}}_{F}^{E 1}$ , ${\hat{T}}_{F}^{E 2}$ , ${\hat{T}}_{F}^{E 3}$ , and ${\hat{T}}_{F}^{E 4}$ .

Display Table

Table 6. Cohen’s $d$ with Hedges’ correction of the distributions of the exploration values ${\hat{T}}_{F}^{i}$ in two combinations within E1, E2, E3, and E4. Because the sign of the $d$ value is insignificant, the upper panel of each column displays the absolute value of $d$ , whereas the lower panel details the bootstrap 95% confidence intervals for $| d |$ .

Display Table

From Ref [Citation18]. the larger $T_{F}$ regions indicate the stress and/or strain concentration. On the basis of linear relationship between stress and $T_{c}$ expressed as EquationEquation (1)(1) $σ_{Cv} = 30 \times (T_{c} - 105) (MPa) .$ (1) , the stress distribution was also considered. By uniformly subtracting 105 K from the means of ${\hat{T}}_{c}^{i}$ in clusters, the standard deviations remain unchanged, while the mean and median are changed. Therefore, the MADs remain constant, but the CVs change, and the relative relationship between the CVs could change. However, if the means of ${\hat{T}}_{c}^{i}$ in clusters are multiplied by the same factor, the relationship between the MADs does not change as the same coefficients are multiplied for them. Comparing the MADs, the relationship of the means of ${\hat{T}}_{c}^{i}$ between clusters is opposite to that of the means of ${\hat{T}}_{F}^{i}$ . In Ref [Citation16], we have already proposed evaluating the stress and/or strain distribution using $δ$ caused by the photoelastic effect. The distributions of $δ (T)$ and $ψ (T)$ at 130.9 K in and those at 14.1 K in of Ref [Citation16]. are similar to the distribution of the mean of ${\hat{T}}_{F}^{i}$ in . To evaluate the similarity quantitatively, shows the heat maps of the coefficient $r$ between $δ (T)$ , $ψ (T)$ , means of ${\hat{T}}_{c}^{i}$ , and means of ${\hat{T}}_{F}^{i}$ . Concerning the correlations between the means of ${\hat{T}}_{c}^{i}$ and ${\hat{T}}_{F}^{i}$ , there are moderate correlations for E1 but weak or no correlations for E2–E4. Furthermore, for the relationship between $δ (T)$ and the means of ${\hat{T}}_{F}^{i}$ , there are moderate correlations for E1 and E2, but almost no correlations for E3 and E4 at 130.9 K, whereas there are strong correlations for E1 and E2, and moderate correlations for E3 and E4 at 14.1 K. Contrary to our expectation, we could not find any pair of variables showing a strong correlation for E3 and E4, despite almost uniform stress and/or strain. Consequently, the stress and/or strain distribution could be better estimated from $δ (14.1 K)$ or the mean of ${\hat{T}}_{F}^{i}$ than from the mean of ${\hat{T}}_{c}^{i}$ .

Figure 13. Heat maps for each cluster of Pearson correlation coefficients ( $r$ ) between $δ$ , $ψ$ , mean of ${\hat{T}}_{c}^{i}$ and mean of ${\hat{T}}_{F}^{i}$ at (a) 130.9 K and (b) 14.1 K.

Figure 13. Heat maps for each cluster of Pearson correlation coefficients (r) between δ, ψ, mean of Tˆci and mean of TˆFi at (a) 130.9 K and (b) 14.1 K.

Finally, we speculate why the correlation is stronger for E1 and E2 than for E3 and E4. If the Pockels effect causes an increase in the $δ (T)$ curve below $T_{F}$ , the coefficient $r$ should be stronger for E3 and E4 than for E1 and E2 because of the less disordered crystal orientation. Meanwhile, considering the photoelastic effect, we can assume a strong linear relationship between the local stress and/or strain and $T_{F}$ . Although it is not easy to distinguish between these effects experimentally, we hypothesize that flexoferroelectricity is induced by local strain, which is an interesting concept [Citation66], but it borders on speculative territory in this study. At zero electric field, the ensemble average of the normal spontaneous polarization within a pixel ( $\geq$ 10 $\times$ 10 $μ$ m $^{2}$ ) is very small, whereas that of the flexoferroelectric polarization may be large because the strain has already biased it in a certain direction. If this scenario is true, flexoferroelectric polarization would be generated locally for E1 and E2 in addition to the normal ferroelectric polarization distributed throughout the substrate. This hypothesis can be tested by applying electric fields at various $σ$ . For example, the flexoferroelectricity is bound to the lattice distortions; therefore, different electric field dependencies may emerge within the substrate. In the future, to confirm the nonlinear relationship between ${\hat{T}}_{c}^{i}$ , ${\hat{T}}_{F}^{i}$ , and stress and/or strain, we will also apply machine learning using nonlinear models such as decision trees and random forests to unravel the complex dynamics governing the flexoferroelectricity and normal ferroelectricity.

5. Conclusion

In this study, the stress-induced ferroelectric domains elongated along $⟨ 111 ⟩_{c}$ in SrTiO $_{3}$ were investigated by combining machine learning and statistics to extract deep insights from large $T$ -series birefringence imaging data. After applying the advanced preprocessing to characterize the spatial information of the $T$ -series data, we first performed the multivariate $K$ -shape clustering method focused on the shape similarity of the $T$ dependencies. This result showed that a large spontaneous polarization appears for E1 and E2 owing to stress and/or strain concentration. Furthermore, hierarchical Bayesian $T$ -series estimation was used to ensure consistent analysis of 42,280-pixel $T$ -series data to estimate $T_{c}^{i}$ and $T_{F}^{i}$ for each pixel as constrained by the overall $T_{c}^{E j}$ and $T_{F}^{E j}$ in the cluster. For the means of ${\hat{T}}_{c}^{i}$ , the uniform distribution over the substrate is strongly supported. Furthermore, the means of ${\hat{T}}_{F}^{i}$ for E1 and E2 are widely scattered to the high- $T$ side. The higher region coincides well with the stress and/or stress concentration regions. Consequently, the hierarchical Bayesian $T$ -series estimation strongly supports the $K$ -shape clustering results. Based on the correlation coefficients for E1 and E2, a strong correlation exists between $δ (14.1 K)$ and the means of ${\hat{T}}_{F}^{i}$ , but the correlations become moderate for E3 and E4. Because of the weak or no correlation between the means of ${\hat{T}}_{c}^{i}$ and ${\hat{T}}_{F}^{i}$ , it is difficult to deduce the distribution of stress and/or strain from these means. Our multistep $T$ -series analysis method has proven considerably useful for studying critical phenomena near phase transition temperatures. Further development of $T$ -series analysis is essential for the widespread use of machine learning in condensed matter physics.

Supplemental material

Supplemental Material

Download PDF (33.8 MB)

Disclosure statement

No potential conflict of interest was reported by the author(s).

Supplementary material

Supplemental data for this article can be accessed online at https://doi.org/10.1080/27660400.2024.2342234

Additional information

Funding

This research was partially supported by a Grant-in-Aid for Scientific Research (KAKENHI) Grant Numbers [JP21K04897] and [JP23K03283] from the Japan Society for the Promotion of Science.

References

Yeomans YM. Statistical mechanics of phase transitions. Oxford Science Publications; 1992.
Google Scholar
Gautam R, Vanga S, Ariese F, et al. Review of multidimensional data processing approaches for Raman and infrared spectroscopy. EPJ Techn Instrum. 2015;2(1):8. doi: 10.1140/epjti/s40485-015-0018-6
Google Scholar
Mueller T, Kusne AG, Ramprasad R. Machine learning in materials science: recent progress and emerging applications. Rev Comput Chem. 2016;29:186–19. doi: 10.1002/9781119148739
Web of Science ®Google Scholar
Rodrigues JF Jr, Florea L, Oliveira MCF. Big data and machine learning for materials science. Discover Materials. 2021;1(1):12. doi: 10.1007/s43939-021-00012-0
PubMedGoogle Scholar
Turrell G, Corset J. Raman microscopy: developments and applications. London: Academic Press; 1996.
Google Scholar
Lieber CM. Nanoscale science and technology: building a big future from small things. Mater Res Bull. 2003;28(7):486–491. doi: 10.1557/mrs2003.144
Google Scholar
Meyer E. Scanning probe microscopy and related methods. Beilstein J Nanotechnol. 2010;1:155–157. doi: 10.3762/bjnano.1.18
PubMed Web of Science ®Google Scholar
Fleming A. Dual-stage vertical feedback for high-speed scanning probe microscopy. IEEE Trans Control Syst Technol. 2011;19(1):156–165. doi: 10.1109/TCST.2010.2040282
Web of Science ®Google Scholar
Casola F, Sar T, Yacoby A. Probing condensed matter physics with magnetometry based on nitrogen-vacancy centres in diamond. Nat Rev Mater. 2018;3(1):17088. doi: 10.1038/natrevmats.2017.88
Web of Science ®Google Scholar
Uemura Y, Matsuoka S, Tsutsumi J, et al. Birefringent field-modulation imaging of transparent ferroelectrics. Phys Rev Appl. 2020;14(2):024060. doi: 10.1103/PhysRevApplied.14.024060
Web of Science ®Google Scholar
Laskar MDAR, Celano U. Scanning probe microscopy in the age of machine learning. APL Mach Learn. 2023;1(4):041501. doi: 10.1063/5.0160568
Google Scholar
Trebbia P, Bonnet N. EELS elemental mapping with unconventional methods I. theoretical basis: image analysis with multivariate statistics and entropy concepts. Ultramicroscopy. 1990;34(3):165–178. doi: 10.1016/0304-3991(90)90070-3
PubMed Web of Science ®Google Scholar
Bonnet N, Brun N, Colliex C. Extracting information from sequences of spatially resolved EELS spectra using multivariate statistical analysis. Ultramicroscopy. 1999;77(3–4):97–112. doi: 10.1016/S0304-3991(99)00042-X
Web of Science ®Google Scholar
Muto S, Yoshida T, Tatsumi K. Diagnostic nano-analysis of materials properties by multivariate curve resolution applied to spectrum images by S/TEM-EELS. Mater Trans. 2009;50(5):964–969. doi: 10.2320/matertrans.MC200805
Web of Science ®Google Scholar
Shiga M, Tatsumi K, Muto S. Sparse modeling of EELS and EDX spectral imaging data by nonnegative matrix factorization. Ultramicroscopy. 2016;170:43–59. doi: 10.1016/j.ultramic.2016.08.006
PubMed Web of Science ®Google Scholar
Manaka H, Uetsubara K, Korogi S, et al. Microscopic observation of ferroelectric and structural phase transitions in SrTiO3 under uniaxial stress using birefringence imaging techniques. J Phys Soc Jpn. 2022;91(8):084704. doi: 10.7566/JPSJ.91.084704
Google Scholar
Manaka H, Uetsubara K, Miura Y. Stress-induced ferroelectricity in quantum paraelectric SrTiO3 observed by birefringence imaging. JPS Conf Proc. 2023;38:011112.
Google Scholar
Toyoda K, Manaka H, Miura Y. Improvements of birefringence imaging techniques to observe stress-induced ferroelectricity in SrTiO3 based on K-means clustering with circular statistics. Sci Technol Adv Mater Meth. 2023;3(1):2278322. doi: 10.1080/27660400.2023.2278322
Google Scholar
Müller KA, Berlinger W, Slonczewski JC. Order parameter and phase transitions of stressed SrTiO3. Phys Rev Lett. 1970;25(11):734. doi: 10.1103/PhysRevLett.25.734
Web of Science ®Google Scholar
Burke WJ, Pressley RJ. Stress induced ferroelectricity in SrTiO3. Solid State Commun. 1971;9(3):191–195. doi: 10.1016/0038-1098(71)90115-3
Web of Science ®Google Scholar
Uwe H, Sakudo T. Stress-induced ferroelectricity and soft phonon modes in SrTiO3. Phys Rev B. 1976;13(1):271–286. doi: 10.1103/PhysRevB.13.271
Web of Science ®Google Scholar
Fujii Y, Uwe H, Sakudo T. Stress-induced quantum ferroelectricity in SrTiO3. J Phys Soc Jpn. 1987;56(6):1940–1942. doi: 10.1143/JPSJ.56.1940
Google Scholar
Ohtomo A, Hwang HY. A high-mobility electron gas at the LaAlO3/SrTiO3 heterointerface. Nature. 2004;427(6973):423–426. doi: 10.1038/nature02308
PubMed Web of Science ®Google Scholar
Honig M, Sulpizio JA, Drori J, et al. Local electrostatic imaging of striped domain order in LaAlO3/SrTiO3. Nat Mater. 2013;12(12):1112. doi: 10.1038/nmat3810
PubMed Web of Science ®Google Scholar
Lee PW, Singh VN, Guo GY, et al. Hidden lattice instabilities as origin of the conductive interface between insulating LaAlO3 and SrTiO3. Nat Commun. 2016;7(1):12773. doi: 10.1038/ncomms12773
PubMedGoogle Scholar
Su CP, Singh AK, Wu TC, et al. Impact of strain-field interference on the coexistence of electron and hole gases in SrTiO3/LaAlO3/SrTiO3. Phys Rev Mat. 2019;3:075003. doi: 10.1103/PhysRevMaterials.3.075003
Web of Science ®Google Scholar
Bishop CM. Pattern recognition and machine learning (information science and statistics). Springer; 2006.
Google Scholar
Vasudevan RK, Tselev A, Baddorf AP, et al. Big-data reflection high energy electron diffraction analysis for understanding epitaxial film growth processes. ACS Nano. 2014;8(10):10899–10908. doi: 10.1021/nn504730n
PubMed Web of Science ®Google Scholar
Provence SR, Thapa S, Paudel R, et al. Machine learning analysis of perovskite oxides grown by molecular beam epitaxy. Phys Rev Mater. 2020;4(8):083807. doi: 10.1103/PhysRevMaterials.4.083807
Web of Science ®Google Scholar
Gliebe K, Sehirlioglu A. Distinct thin film growth characteristics determined through comparative dimension reduction techniques. J Appl Phys. 2021;130(12):125301. doi: 10.1063/5.0059655
Web of Science ®Google Scholar
Yoshinari A, Iwasaki Y, Kotsugi M, et al. Skill-agnostic analysis of reflection high-energy electron diffraction patterns for Si(111) surface superstructures using machine learning. Sci Technol Adv Mater Meth. 2022;2(1):162–174. doi: 10.1080/27660400.2022.2079942
Google Scholar
Senin P. Dynamic time warping algorithm review. Inf Comput Sci Dep Univ Hawaii Manoa Honol. USA. 2008;855(1–23):40.
Google Scholar
Box GEP, Jenkins GM, Reinsel GC, et al. Time series analysis: forecasting and control. 5th ed. Wiley; 2015.
Google Scholar
Paparrizos J, Gravano L. K-shape: efficient and accurate clustering of time series. SIGMOD Rec. 2016;45(1):69–76. doi: 10.1145/2949741.2949758
Web of Science ®Google Scholar
Mohammad-Djafari A, Féro O. Bayesian approach to change points detection in time series. Int J Imaging Syst Tech. 2007;16(5):215–221. doi: 10.1002/ima.20080
Web of Science ®Google Scholar
Iwamitsu K, Aihara S, Okada M, et al. Bayesian analysis of an excitonic absorption spectrum in a Cu2O thin film sandwiched by paired MgO plates. J Phys Soc Jpn. 2016;85(9):094716. doi: 10.7566/JPSJ.85.094716
Google Scholar
Cowley RA. Lattice dynamics and phase transitions of strontium titanate. Phys Rev. 1964;134(4A):A981. doi: 10.1103/PhysRev.134.A981
Web of Science ®Google Scholar
Courtens E. Birefringence of SrTiO3 produced by the 105°K structural phase transition. Phys Rev Lett. 1972;29(20):1380. doi: 10.1103/PhysRevLett.29.1380
Web of Science ®Google Scholar
Cowley RA. The phase transition of strontium titanate. Phil Trans R Soc A. 1996;354(1720):2799–2814. doi: 10.1098/rsta.1996.0130
Google Scholar
Haeni JH, Irvin P, Chang W. Room-temperature ferroelectricity in strained SrTiO3. Nature. 2004;430(7001):758. doi: 10.1038/nature02773
PubMed Web of Science ®Google Scholar
Kim YS, Kim DJ, Kim TH, et al. Localized electronic states induced by defects and possible origin of ferroelectricity in strontium titanate thin films. Appl Phys Lett. 2007;91(4):042908. doi: 10.1063/1.3139767
Google Scholar
Iglesias L, Sarantopoulos A, Magén C, et al. Oxygen vacancies in strained SrTiO3 thin films: formation enthalpy and manipulation. Phys Rev B. 2017;95(16):165138. doi: 10.1103/PhysRevB.95.165138
Web of Science ®Google Scholar
Hashimoto T, Nishimatsu T, Mizuseki H, et al. Ab initio study of strain-induced ferroelectricity in SrTiO3. Jpn J Appl Phys. 2005;44(9S):7134. doi: 10.1143/JJAP.44.7134
Google Scholar
Antons A, Neaton JB, Rabe KM, et al. Tunability of the dielectric response of epitaxially strained SrTiO3 from first principles. Phys Rev B. 2005;71(2):024102. doi: 10.1103/PhysRevB.71.024102
Web of Science ®Google Scholar
Ni L, Liu Y, Song C, et al. First-principle study of strain-driven phase transition in incipient ferroelectric SrTiO3. Physica B. 2011;406(21):4145. doi: 10.1016/j.physb.2011.08.018
Web of Science ®Google Scholar
Ong PV, Lee J. Strain dependent polarization and dielectric properties of epitaxial BaTiO3 from first-principles. J Appl Phys. 2012;112(1):014109. doi: 10.1063/1.4736375
Web of Science ®Google Scholar
Watanabe Y. Ferroelectricity of stress-free and strained pure SrTiO3 revealed by ab initio calculations with hybrid and density functionals. Phys Rev B. 2019;99(6):064107. doi: 10.1103/PhysRevB.99.064107
Web of Science ®Google Scholar
Azzam RM, Bashara NM. Ellipsometry and polarized light. North Holland: Amsterdam; 1977.
Google Scholar
Manaka H, Yagi G, Miura Y. Development of birefringence imaging analysis method for observing cubic crystals in various phase transitions. Rev Sci Instrum. 2016;87(7):073704. doi: 10.1063/1.4958662
PubMed Web of Science ®Google Scholar
Manaka H, Fukuda T, Miura Y. Birefringence imaging measurements on various structural phase transitions in (CnH2n+1NH3)2MnCl4 with n=1,2, and 3 using multiple wavelengths. J Phys Soc Jpn. 2016;85(12):124701. doi: 10.7566/JPSJ.85.124701
Google Scholar
Manaka H, Nozaki H, Miura Y. Microscopic observation of ferroelectric domains in SrTiO3 using birefringence imaging techniques under high electric fields. J Phys Soc Jpn. 2017;86(11):114702. doi: 10.7566/JPSJ.86.114702
Google Scholar
Manaka H, Nozaki H, Miura Y. Development of birefringence imaging techniques under high electric fields. J Phys Conf Ser. 2018;969:012119. doi: 10.1088/1742-6596/969/1/012119
Google Scholar
Manaka H, Sasaki Y, Miura Y. Re-examination of successive structural phase transitions in (C3H7NH3)2CuCl4 using birefringence imaging and electron paramagnetic resonance spectroscopy. J Phys Soc Jpn. 2017;86(11):114710. doi: 10.7566/JPSJ.86.114710
Google Scholar
Miura Y, Okumura K, Fukuda T, et al. Observation of ferroelastic domains in layered magnetic compounds using birefringence imaging. J Phys Conf Ser. 2018;969:012153. doi: 10.1088/1742-6596/969/1/012153
Google Scholar
Manaka H, Okumura K, Tokunaga K, et al. Observations of successive local-structure and ferroelectric phase transitions in (C2H5NH3)2CuCl4 using birefringence imaging and electron paramagnetic resonance spectroscopy. J Phys Soc Jpn. 2022;91(11):114701. doi: 10.7566/JPSJ.91.114701
Google Scholar
Miura Y, Ibushi R, Manaka H. Observation of ferroelectricity in two-dimensional antiferromagnet (C2H5NH3)2CuCl4 using birefringence imaging techniques. JPS Conf Proc. 2023;38:011142. doi: 10.7566/JPSCP.38.011142
Google Scholar
Fisher NI. Statistical analysis of circular data. Cambridge University Press; 1993.
Google Scholar
Pewsey A, Neuhauser M, Ruxton GD. Circular statistics in R. Oxford University Press; 2014.
Google Scholar
Manaka H, Tateishi K, Miura Y. Real-space imaging by magnetic birefringence for KNiF3 under inhomogeneous stress. J Phys Soc Jpn. 2019;88(12):124702. doi: 10.7566/JPSJ.88.124702
Google Scholar
Schubert E. Stop using the elbow criterion for K-means and how to choose the number of clusters instead. ACM SIGKDD Explorations Newsl. 2023;2(1):36–42. doi: 10.1145/3606274.3606278
Google Scholar
Cohen J. Statistical power analysis for the behavioral sciences. 2nd ed. Routledge; 1988.
Google Scholar
Hedges LV, Olkin I. Statistical methods for meta-analysis. Academic Press; 1985.
Google Scholar
Durlak JA. How to select, calculate, and interpret effect sizes. J Pediatr Psychol. 2009;34(9):917–928. doi: 10.1093/jpepsy/jsp004
PubMed Web of Science ®Google Scholar
Gelman A, Carlin JB, Stern HS. Bayesian data analysis. Chapman and Hall/CRC; 2013.
Google Scholar
Tukey JW. Exploratory data analysis. Pearson Education, Ltd.; 1977.
Google Scholar
Das S, Wang B, Paudel T, et al. Enhanced flexoelectricity at reduced dimensions revealed by mechanically tunable quantum tunnelling. Nature Commun. 2019;10(1):537. doi: 10.1038/s41467-019-08462-0
PubMedGoogle Scholar

Multivariate temperature-series analysis of stress-induced ferroelectricity in SrTiO₃: a machine learning approach with K-shape clustering and hierarchical Bayesian estimation

ABSTRACT

GRAPHICAL ABSTRACT

IMPACT STATEMENT

1. Introduction

2. Dataset for analysis

2.1. Birefringence images in SrTiO₃

2.2. Preprocessing of birefringence imaging data

3. Data analysis procedure

4. Results and discussion

4.1. K-shape clustering

Table 1. Number of pixels in respective clusters determined by K-means and K-shape clustering methods. The K-means clustering data are referenced in Ref. [Citation18].

4.2. Analysis for each cluster

Table 2. Five-number summary (minimum, first quartile, median, third quartile, and maximum) and mean of exploration values of fast-axis direction for respective clusters expressed as ${\hat{ψ}}^{Av}$ .

4.3. Hierarchical Bayesian temperature-series estimation

4.4. Analysis for each pixel

Table 3. Five-number summary and mean of exploration values of the structural phase transition temperatures for respective clusters expressed as ${\hat{T}}_{c}^{E 1}$ , ${\hat{T}}_{c}^{E 2}$ , ${\hat{T}}_{c}^{E 3}$ , and ${\hat{T}}_{c}^{E 4}$ .

Table 5. Five-number summary and mean of exploration values of the ferroelectric phase transition temperatures for respective clusters expressed as ${\hat{T}}_{F}^{E 1}$ , ${\hat{T}}_{F}^{E 2}$ , ${\hat{T}}_{F}^{E 3}$ , and ${\hat{T}}_{F}^{E 4}$ .

5. Conclusion

Supplemental Material

Disclosure statement

Supplementary material

References

Information for

Open access

Opportunities

Help and information

Multivariate temperature-series analysis of stress-induced ferroelectricity in SrTiO3: a machine learning approach with K-shape clustering and hierarchical Bayesian estimation

ABSTRACT

GRAPHICAL ABSTRACT

IMPACT STATEMENT

1. Introduction

2. Dataset for analysis

2.1. Birefringence images in SrTiO3

2.2. Preprocessing of birefringence imaging data

3. Data analysis procedure

4. Results and discussion

4.1. K-shape clustering

Table 1. Number of pixels in respective clusters determined by K-means and K-shape clustering methods. The K-means clustering data are referenced in Ref. [Citation18].

4.2. Analysis for each cluster

Table 2. Five-number summary (minimum, first quartile, median, third quartile, and maximum) and mean of exploration values of fast-axis direction for respective clusters expressed as ψˆAv.

4.3. Hierarchical Bayesian temperature-series estimation

4.4. Analysis for each pixel

Table 3. Five-number summary and mean of exploration values of the structural phase transition temperatures for respective clusters expressed as TˆcE1, TˆcE2, TˆcE3, and TˆcE4.

Table 4. Median absolute deviation (MAD) and coefficient of variation (CV) in the distribution of the mean of Tˆci and TˆFi for respective clusters. The upper panel of each column displays the values of MAD and CV, whereas the lower panel provides the bootstrap 95% confidence intervals for them.

Table 5. Five-number summary and mean of exploration values of the ferroelectric phase transition temperatures for respective clusters expressed as TˆFE1, TˆFE2, TˆFE3, and TˆFE4.

5. Conclusion

Supplemental Material

Disclosure statement

Supplementary material

Additional information

Funding

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date

Multivariate temperature-series analysis of stress-induced ferroelectricity in SrTiO₃: a machine learning approach with K-shape clustering and hierarchical Bayesian estimation

2.1. Birefringence images in SrTiO₃

Table 2. Five-number summary (minimum, first quartile, median, third quartile, and maximum) and mean of exploration values of fast-axis direction for respective clusters expressed as ${\hat{ψ}}^{Av}$ .

Table 3. Five-number summary and mean of exploration values of the structural phase transition temperatures for respective clusters expressed as ${\hat{T}}_{c}^{E 1}$ , ${\hat{T}}_{c}^{E 2}$ , ${\hat{T}}_{c}^{E 3}$ , and ${\hat{T}}_{c}^{E 4}$ .

Table 5. Five-number summary and mean of exploration values of the ferroelectric phase transition temperatures for respective clusters expressed as ${\hat{T}}_{F}^{E 1}$ , ${\hat{T}}_{F}^{E 2}$ , ${\hat{T}}_{F}^{E 3}$ , and ${\hat{T}}_{F}^{E 4}$ .