1,186
Views
0
CrossRef citations to date
0
Altmetric
Materials Data Analysis and Utilization

Alloys innovation through machine learning: a statistical literature review

ORCID Icon, ORCID Icon & ORCID Icon
Article: 2326305 | Received 26 Jun 2023, Accepted 28 Feb 2024, Published online: 11 Apr 2024

ABSTRACT

This review systematically analyzes over 200 publications to explore the growing role of data-driven methods and their potential benefits in accelerating alloy development. The review presents a comprehensive overview of different aspects of alloy innovation by machine learning and other computational approaches used in recent years. These methods harness the power of advanced simulation techniques and data analytics to expedite materials’ discovery, predict properties, and optimize performance. Through analysis, significant trends and disparities within the data discerned, while highlighting previously overlooked research gaps, thus underscoring areas that require further exploration. Machine Learning techniques are widely applied across various alloys, with a pronounced emphasis on steel and High Entropy Alloys. Notably, researchers primarily investigate the physical, mechanical, and catalytic properties of materials. In terms of methodology, while 68% of the examined papers rely on a single machine learning model, the remainder employ a range of 2 to 12 models, with Neural Network being the most prevalent choice. However, a notable concern arises as 53% of these papers do not share their dataset, and a staggering 81% do not provide access to their code. Paramount importance of adopting a systematic approach when scrutinizing machine learning methodologies is underscored. Analysis shows lack of consistency and diversity in the methods employed by researchers in the field of alloy development, highlighting the potential for improvement through standardization. The critical analysis of the literature not only reveals prevailing trends and patterns but also shines a light on the inherent limitations within the traditional trial-and-error paradigm.

GRAPHICAL ABSTRACT

IMPACT STATEMENT

Through statistical analysis of 200+ papers, this research identifies trends, patterns and gaps to highlight areas for further exploration in using machine learning for alloy development.

1. Introduction

Owing to high-speed computers, networks, and massive storage cloud spaces, nowadays artificial intelligence (AI), and machine learning (ML) have proven that they could considerably speed up fundamental and applied materials science research [Citation1,Citation2]. This paper explores the profound impact of AI and ML algorithms and techniques, which have used in an era of unprecedented materials discovery and development across a wide spectrum of material types, including structural materials, catalysts, high-entropy alloys (HEA), superalloys, shape-memory alloys, magnetic materials, and energy storage materials that their study can be complicated and complex.

For instance, when it comes to the mechanical properties of an alloy, achieving the desired properties necessitates providing a balance between accurate prediction and occasional envisioning. An example of such studies is reported by Olson et al. [Citation3]. Using a classic materials search approach, they looked to design a new alloy that has less nickel and copper to reduce its cost while preserving the look of the alloy, its excellent corrosion properties, and its electrical conductivity [Citation3]. The pursuit of optimal trade-offs within these complex and complicated systems is facilitated by ML. While an increasing number of scholars have harnessed ML techniques to ascertain the optimal trade-off, several key inquiries persist, including the identification of the most suitable ML technologies and models, which remain unaddressed.

This paper represents a comprehensive synthesis of insights drawn from a meticulous statistical analysis of over 200 publications. Specifically, the study focuses on peer-reviewed journals that employ rigorous methodologies, featuring a diverse array of ML models. Each publication presents its optimal model, contributing to the broader understanding of the field. All the papers under study are succinctly summarized within the main text and are visually represented in the accompanying graphs. Consequently, these papers are duly cited in the reference section of this research.

Chapter 2 offers a global perspective on the field’s evolution and tracing its development over the years. Chapter 3 provides a comprehensive overview of the alloys under examination. Chapter 4 delves into the application of ML in alloy development by covering predicted parameters, dataset sizes, ML optimization, and commonly used models. Mechanical property prediction, model accuracy, and alignment with first principles and CALPHAD are explored. We also address data generation challenges and presented the best suggested ML models. Chapter 5 reviews various ML deployment strategies, while Chapter 6 focuses on data and code availability in publications. Lastly, Chapter 7 concludes our journey by summarizing findings and providing an outlook for the field’s future.

The present review offers a specific perspective on ML-driven research within the domain of alloy development. It elucidates critical methodologies, applications, theoretical predictions, and experimental validations. Furthermore, it presents the results of various studies, comparing and discussing discrepancies and contradictions while identifying gaps in the existing literature. The primary objectives of this research endeavor are as follows:

  1. To gain a comprehensive understanding of the prevailing ML models and technologies employed by researchers to address multifaceted challenges in materials science research.

  2. To uncover underlying patterns and correlations among diverse material features.

  3. To analyze the structures and dynamics of ML approaches, as well as the intricate interconnections between models, structures, and properties.

  4. To discern the impacts and implications of various compositional elements, considering experimental procedures, with potential implications for decision-making and policy.

  5. To determine the optimal scenarios for utilizing experimental research data versus computational data.

  6. To identify the ML models recommended by researchers for predicting desired alloy properties.

Throughout this study, data visualization techniques, such as bar charts, pie charts, and Sankey graphs, are employed to enhance the clarity of the narrative. These visual aids provide a holistic perspective on the landscape of ML research within the materials science domain. It is worth noting that this review exclusively encompasses publications in the English language, with publications in other languages excluded from consideration.

In summary, this paper addresses a critical juncture in materials science research, where the convergence of AI and ML is poised to reshape the landscape of materials discovery, development, and innovation. The insights presented herein have profound implications for researchers, policymakers, and practitioners seeking to harness the power of AI and ML in the pursuit of advanced materials with tailored properties and applications.

2. Current ML development in the world

Diverse national contingents of researchers have manifested distinct methodological inclinations in their research to harness ML for alloy development. This study has undertaken a comprehensive analysis, systematically extracting the first author’s country of origin from each publication, thereby elucidating the myriad national strategies and approaches employed in the application of ML for advancing alloy development. shows a bar chart detailing publications analyzed in this review from 1995 to 2022, including the first authors’ countries. The graph shows that before 2012, a few papers used ML for alloy development while the number increased by time. It is worths to mention that ML studies regained scientific interest in late 2014 and the beginning of 2015 when the Google speech recognition program was already launched [Citation4]. Publications within the domain exhibited a substantial upswing, notably in the years 2016 and 2017, with the U.S.A. at the forefront of this surge. Nevertheless, commencing from 2020, China surpassed the U.S.A. in terms of the volume of papers published in this field. It is worth mentioning that in this research, the affiliated country of the first author of every publication is considered the country of that publication.

Figure 1. Chart shows the quantity of research publications studied in this paper, organized by the year of publication and the first author’s country.

Figure 1. Chart shows the quantity of research publications studied in this paper, organized by the year of publication and the first author’s country.

shows the distribution of publications across different journals in the field of using ML in alloy development. Notably, a significant number of journals cover this topic, with 67% of the papers distributed among various journals, demonstrating a wide dissemination of research. It is evident that publication in this field is not confined to a limited number of journals. Computational Materials Science, in particular, has a notable 10% share of publications, highlighting its significant contribution to the literature in this domain.

Figure 2. Pie chart displaying the proportion of publications in different peer-review journals that were studied in this research.

Figure 2. Pie chart displaying the proportion of publications in different peer-review journals that were studied in this research.

3. Alloys

The allocation of scholarly interest among countries exhibits intriguing patterns, emphasizing the diverse approaches of distinct national research bodies. shows a Sankey diagram illustrating the various alloys and materials studied by ML by every country. Most of cited researchers in this work have studied steel, HEA, and Aluminum (Al) alloys. The vast majority of research contribution are held by China and U.S.A., followed by the UK, India and the rest of the world. Steels, HEA and Al-based alloys consist of large number of publications due to their obvious industrial importance. Range of alloys, Catalysts, Ti-based alloys, and Metallic glasses are also of great interest for most of these countries. The term ‘Range of Alloys’ indicates the publications that looked at a range of alloys instead of focusing on one alloy with a single base metal. It is noticeable that China’s publications are focused more on steel, HEA, metallic glass, and magnesium alloys, while the U.S.A. focuses on steel, catalysts, and HEA.

Figure 3. Sankey diagram with countries on the left side and studied alloys by ML on the right side.

Figure 3. Sankey diagram with countries on the left side and studied alloys by ML on the right side.

In their publications, researchers from the U.S.A. have frequently explored a wide range of alloys, a practice less commonly observed among researchers from other nations. This highlights a notable difference in the approach to alloy consideration, with U.S.A.-based researchers demonstrating a penchant for encompassing multiple alloys within a single publication, a tendency not as commonly shared by their counterparts from other countries. The sankey diagram offers a comprehensive visual representation of how different countries engage in ML research pertaining to various alloys and highlights distinctive research among these nations. Further elaboration on the research about each specific alloy is presented in this chapter.

This chapter examines the alloys studied in this research and demonstrates the required properties of these alloys, as well as presenting the approaches researchers have employed to address these challenges.

3.1. Aluminum alloys

Aluminum-based alloys are widely used in many applications, such as the aerospace and automotive industry, because of their low density, high strength, and low production-maintenance costs. Compared to steel, aluminum-based alloys have a strength disadvantage. Al-alloys are heat-treated to attain high strength, with experimental procedures often needing to be optimized or difficult to control [Citation5]. Modeling is valuable for reducing experimental procedures; however, it bears its own inherent limitations. So, researchers have employed ML techniques to address this challenge.

For instance, the modeling of crystal nucleation in aluminum or amorphization, requires the construction of interatomic interactions that can reproduce the properties of both the solid and the liquid states. The simulation necessary to capture such a mechanism may take much larger capacity beyond the capability of first-principles calculations. Marchand et al. [Citation6] have presented the first family of NN potentials to describe the binary Al-Cu system with comparable accuracy to the first-principles method. The newly developed potentials could be used to predict elastic constants, solid solution of dilute systems, interfaces, stacking faults, intermetallic compounds, and defect energies. This demonstrated the clear advantage of using the ML approach. Also, Jakse et al. [Citation7] have studied the solidification process in aluminum by simulations on the atomic scale by coupling classical molecular dynamics simulations and a NN. A set of configurations relevant to solidification phenomena were generated.

Özhavak et al. [Citation8] have used a Convolutional NN algorithm, ANN, and Random Forest (RF) regression to determine the change in mechanical properties of AA2024 aluminum produced with different manufacturing methods after aging at various temperatures and times. The Convolutional NN obtained the best accuracy of 93%.

Kumar et al. [Citation9] studied the performance of decision tree, RF, and XGBoost ML models to determine the best classifier model for a friction stir welded AA 6061-T6 aluminum alloy. The XGBoost model showed a maximum accuracy of about 96% among the three models, which shows promising results that help lower the time and cost of manufacturing these alloys. Yao et al. [Citation10] have proposed a parameter identification method based on semi-coupled damage mechanics and ML to describe the fracture behavior of 6061 aluminum alloy. A set of reliable parameters were generated to circumvent extracting the stress state evolution from the calibrated specimens. Due to the large demand for its wide applications, modern parts based on 6XXX series aluminum are recently produced by additive manufacturing techniques, where developing ML predictions is essential for 3D printing parameter optimization. Based on Gaussian process regression, Liu et al. [Citation11] have developed a ML approach to identify the optimized processing window for laser powder bed fusion. The newly predicted processing parameters helped to print dense new AlSi10Mg alloys with a higher ultimate tensile strength between 297 and 389 MPa, and an elongation to failure from 6% to 10%. Yanase et al. [Citation12] have examined the densification behavior and microstructural formation of the Al–10 mass %Si–0.35 mass %Mg alloy fabricated using the Selective Laser Melting method based on experimental work and NN ML approach. The combined techniques have identified the required processing factor which quantitatively influences the behavior. Using a back-propagation NN algorithm, Chun et al. [Citation13] could accurately predict manufacturing parameters for aluminum alloys, such as flow stress, roll force, and roll torque.

Li et al. [Citation14] studied the relationship between alloy composition, process parameters, and mechanical properties, using a genetic algorithm combined with backpropagation and radial basis function NN to develop the properties of 7XXX aluminum alloys. The accuracy of combining the NN with the genetic algorithm was superior to that of those predicted by the NN and radial function. Predicted fine and dispersed spherical precipitates formed after aging heat treatment were observed experimentally by SEM and HRTEM, confirming that their mixed approach may greatly assist metallurgists in developing 7XXX aluminum alloys. Aydin [Citation15] has studied the volume loss of AA7075 with 5 wt% Al2O3 composites under load, using linear regression, support vector regression (SVR), ANN, and extreme learning machine for the prediction. The extreme learning machine approach showed the highest R2 coefficient compared to other methods, which shows the model’s remarkable ability to predict the wear behavior of aluminum composites. Decke et al. [Citation16] have focused on predicting the hot deformation behavior of thermo-mechanically processed precipitation-hardenable aluminum alloy AA7075 based on a comparative study of modeling techniques based on ML and the Zerilli – Armstrong modeling. They could predict stress–strain curves using the gradient boosting model as their best model with an accuracy of 91%.

Haghdadi et al. [Citation17] have estimated the high-temperature flow behavior of a cast A356 aluminum alloy using a feed-forward back propagation NN with a single hidden layer composed of 20 neurons for a series of isothermal compression tests. The results demonstrate that the NN model is a robust tool for predicting the high-temperature flow behavior of cast A356 aluminum alloy.

Shang et al. [Citation18] reported the characterization of the dynamic hardening behavior of an aluminum alloy, sheet of 5182-O (wrought alloy), for strain rate and temperature coupling effect. Polynomial algorithm and ANN were used to describe the highly non-linearity and coupling of strain hardening. The study found that the network structure, parameter setting, and optimization algorithm have affected the ANN prediction accuracy. Using ML, Soofi et al. [Citation19] have estimated the properties of commercial wrought aluminum alloys. Starting from a small dataset of 236 features, six mechanical and nine technological properties, they could capture composition/processing-property relations.

Takara et al. [Citation20] used multiple regression with supervised learning to predict the linear relationship between voids and their corresponding elemental compound composition.

Lian et al. [Citation21] have predicted the fatigue life of aluminum alloy using a ML framework combined with empirical formulas and data-driven models. They used gradient boost regression to predict the fatigue life of seven different series of aluminum alloys, achieving a great improvement compared to conventional models. Their findings successfully demonstrated the advantages of the ML approach, which provides a unique way to predict fatigue life to reduce the time and cost of experiments. Similarly, Chun et al. [Citation22] have proposed an RF ML method to predict structural damage from vibrational measurements in aluminum alloys.

Several reports about the ML prediction of equilibrium phases, composition, and thermal transport were published. Jiang et al. [Citation23] have developed a new ML model to predict the quasi-phase equilibrium equations for multicomponent alloys to reduce the long time required to solve such equations. A NN model of three inputs, four outputs, and a hidden layer of 150 nodes was tested to study the quasi-phase equilibrium during the isothermal solidification of Al-Cu-Mg alloy. Dai et al. [Citation24] classified five types of aluminum alloys using laser-induced breakdown spectroscopy combined with principal component analysis and least-squares support vector machine (SVM) algorithm. An accuracy of 100% was achieved, suggesting the model is an efficient approach to identifying aluminum alloys. In a similar way, Wang et al. [Citation25] have studied thermal transport in aluminum with precipitate morphology. They developed a Finite Element Method to create a database. A correlation analysis was conducted from the obtained data to evaluate the importance of various precipitate features. Thermal conductivity was predicted by linear regression, Bayesian Ridge, Kernel Ridge, Nearest Neighbors, and RF. In similar way, ML can uncover hidden patterns and correlations that elude traditional approaches. Thus, the heightened understanding enables the precise manipulation of alloy characteristics to achieve desired outcomes.

3.2. Titanium alloys

Researchers published a range of publications on adopted ML and data-driven strategies to further develop titanium alloys. Malinov and Sha [Citation26,Citation27] have proposed an ML model based on ANNs for modeling correlations between processing parameters and properties in titanium alloys and γ-TiAl-based alloys. Combined with computer programs for optimization of the inputs, the ANN models could be powerful tools for practical applications in solving various problems in titanium alloys.

Zhu et al. [Citation28] have proposed a titanium alloy design method based on experiments and ML. The work discusses the effect of Mo and Cr on the microstructure and mechanical properties of newly developed titanium alloys (Ti–3Al–2Nb-1.2 V–1Zr–1Sn-xCr-yMo). The predicted results agree with experimental values. The mechanical test result shows an excellent balance of strength (Yield Strength ~1200 MPa) and plasticity (Elongation ~ 12%) can be achieved after the solution treatment at 750°C and ageing at 550°C for 6 h. Using heat assisted single-point incremental forming, Bautista-Monsalve et al. [Citation29] have reported ML-based procedure for the determination of the surface finish quality of titanium alloy parts.

Currently, additive manufacturing technology is widely used in fabricating titanium alloy parts for aerospace applications. Zhan et al. [Citation30] have reported a Data-driven fatigue life prediction in additive manufactured titanium alloy. The paper presents a damage mechanics-based ML framework for additive manufacturing titanium alloy’s data-driven fatigue life prediction. The work presents a theoretical framework for the damage mechanics-based fatigue models and a strategy to compute the damage and the construction process. An RF model was used, and the predicted results were compared with the experimental data to verify the proposed method. Li et al. [Citation31] have presented ML-based very-high-cycle fatigue life prediction of Ti-6Al-4 V alloy fabricated by selective laser melting. Sun et al. [Citation32] have reported developing a constitutive relationship model of Ti600 alloy using an ANN.

To capture the effect of alloying elements and the influence of the heat-treatment on the stability of titanium alloys, an ML approach based on an ANN modeling was proposed by Reddy et al. [Citation33].

Kwak et al. [Citation34] have predicted the mechanical properties of γ-TiAl alloys produced using an RF regression model. The accuracy of the prediction results was evaluated using the R2 value. The R2 values were 0.9336, 0.9902, 0.8104, and 0.9810, for the tensile strength, elongation, nanoindentation hardness, and interlamellar space, respectively.

Ti-6Al-4 V titanium is a difficult-to-cut material used in critical applications in the aerospace industry requiring high-reliability levels. Selecting the cutting conditions can improve the machinability of the alloy, including the generation of compressive residual stresses. The work by Outeiro et al. [Citation35] proposed orthogonal cutting tests of Ti-6Al-4 V titanium were performed, and a designed experiment was used to investigate the influence of the cutting conditions on residual stresses, the forces, and the chip compression ratio. The residual stress was only measured for selected cutting conditions due to the time consumed and the high cost. An ML regression model was applied to predict the residual stresses for other cutting conditions. Wu et al. [Citation36] have proposed an ensemble NNs model that is developed for targeting the effects of alloying elements on phase stability and low Young’s modulus of Ti alloys. The work also provided guidelines for future material designs considering the stability of β phase and discussed the effects of some alloying elements, such as Nb, Zr, Mo, Sn, and Ta.

3.3. Magnesium alloys

Improving the strength of magnesium alloys is still a challenge that is limiting its potential applications as a lightweight metal. ML can help in the development of high-strength Mg alloys. Liu et al. [Citation37] have used a surrogate model to optimize the composition and heat treatment conditions of Mg cast alloys. A new alloy with the composition of Mg-10.0Al-2.0Sn-2.0Zn-0.1Ca-0.1Mn (at. %) was identified, aged at 200°C for 96 h. The alloy shows a Vickers hardness value superior to the conventional alloy.

Because of many variances from different suppliers, the mechanical properties of AZ31 magnesium alloys are hard to control. Xu et al. [Citation38] proposed a ML approach using the ANN and the SVM algorithms to understand better the relationship between alloy compositions, processing parameters, and mechanical properties. The yield strength, ultimate tensile strength, and tensile elongation were predicted by both models, and a new AZ31 extruded alloy was fabricated.

Ibarra-Hernández et al. [Citation39] have used a combination of a NN algorithm with metaheuristic structural global search algorithms to screen the Mg – Ca binary system for new (meta)stable alloys. This allows for efficient exploration of the potential energy surface beyond the possibility of the traditional searches based on ab initio calculations. Some low-enthalpy structures were identified at different stoichiometries.

Gui et al. [Citation40] have used a novel Mg-4Y-3Nd-2Sm-0.5Zr alloy prepared to explore the twin nucleation behavior of Mg alloys by combining ML and electron backscattered diffraction techniques. Three ML algorithms, including gradient boosting, ANN, and the proposed relevance-based ensemble scheme, are used to model to predict the twin nucleation. The AUC score was 0.880, 0.879, and 0.756 for the proposed relevance-based ensemble scheme, ANN, and gradient boosting, respectively.

Wei et al. [Citation41] have collected 150 pieces of the work function of Mg-based solid solutions and Mg-containing second phases to build an initial dataset estimated by density functional theory. Fourteen features were used as input variables. Four ML models, including multiple linear regression, support vector regression (SVR), gradient boosting regression tree, and extreme gradient boosting tree, were used to predict the work function of the second phase. The result shows that the SVR model has the highest accuracy and best generalization ability.

Fang et al. [Citation42] have used a linear regression model to investigate the relationship between the widths of supercooled liquid regions of Mg-based bulk metallic glasses and bond parameters. They deduced an equation for the first time to predict whether Mg-based multicomponent alloys can form metallic glasses.

To analyze the solution behavior of a second element in the primary phase of α-Mg, Chen et al. [Citation43] have collected the interaction features of solutes and Mg obtained from first-principles calculation, the intrinsic physical properties of the pure elements and structural features. They have used RF and decision tree algorithm methods and found that the solution behavior of an element in α-Mg can be affected mainly by four features: formation energy, electronegativity, non-bonded atomic radius, and work function.

3.4. Copper, cobalt, and additive manufactured alloys

Qiao et al. [Citation44], Wang et al. [Citation45], Rovinelli et al. [Citation46], Wang et al. [Citation47], Zhang et al. [Citation48], Seko et al. [Citation49] used various ML methods, such as ANNs, GAs, Bayesian networks, RFs, etc. to predict and optimize the properties of copper alloys with different alloying elements, such as Co, Ni, Si, Cr, Zr, etc. These studies used different features to represent the alloys, such as elemental composition, thermodynamic data, microstructural parameters, etc., and different ML models to capture the non-linear relationships between the features and the properties.

Another application of ML methods for copper alloys is to design alloys with specific colors for art and decoration purposes. For example, Reitz and Barojas [Citation50], Oh et al. [Citation51], Chintakindi et al. [Citation52], Yang et al. [Citation53] used various ML methods, such as Monte Carlo simulation, feature-assisted ML algorithms, principal component analysis (PCA), hyper-parameter optimization, etc., to predict and optimize the color values of copper alloys with different alloying elements, such as Al, Zn, Sn, etc. These studies used different features to represent the alloys, such as elemental composition, annealing temperature, color coordinates, etc., and different ML models to capture the complex color phenomena.

A third application of ML methods for copper alloys is to model the mechanical behavior and microstructural evolution during deformation and heat treatment processes. For example, Mayeur et al. [Citation54], van Nieuwenburg et al. [Citation55], Sun et al. [Citation56], Fu et al. [Citation57], Kautz et al. [Citation58], Grabowski et al. [Citation59] used various ML methods, such as crystal plasticity model with dislocation transport (CPDT), confusion matrix method (CMM), DNN, sensitivity analysis and ML surrogate model (SMLSM), image-driven ML approach (IDMLA), ab initio vibrational free energies including anharmonicity (AFEA), etc., to predict and optimize the mechanical properties and microstructural features of copper alloys with different alloying elements, such as Ni, Co, Si, Mo, etc.

3.5. High-entropy alloys

HEA are the vast majority of alloys treated by ML before aluminum alloys and steels. This top place is gained because of the complex nature of the alloys and a considerable number of possible configurations that could never be all calculated or fabricated.

Chang et al. [Citation60] have utilized an ANN to predict the composition of HEAs applied to a non-equimolar AlCoCrFeMnNi to achieve high hardness. The work successfully predicted new compositions of AlCoCrFeMnNi-based HEAs, exhibiting hardness values higher than those available in the literature.

George et al. [Citation61] have reported the implementation of gradient boosting trees ML algorithm trained on nearly 7000 ordered inorganic HEAs, with a combined experimental and computational approach based on in-situ neutron-diffraction characterizations and ab initio calculations, to explore and predict the elastic moduli of single-phase face-centered-cubic Al0.3CoCrFeNi HEAs. ML models trained on a large dataset could accurately predict the elastic properties of the alloys. The ML models demonstrated the dependence of bulk and shear moduli on several material features, which can help tune elastic properties in HEAs.

Qingfenget al. [Citation62] have studied the eutectic formation of the Al-Co-Cr-Fe-Ni system as an example representative of HEA via an ANN algorithm. Essential elements and strongly associated elements were identified. Cr is strongly associated with Al, Ni, Co, and Fe. These findings can accelerate the employment of the algorithm in designing more complex alloys.

Menou et al. [Citation63] have integrated a computational HEA design. Their strategy combines a multi-objective genetic algorithm with statistical criteria to guide the formation of a single phase, supplemented by computational thermodynamics (Thermo-Calc) to study solid solution-strengthening models. Many of non-equimolar face-centered-cubic single-phase alloys are designed. Best designed alloys are synthesized experimentally.

Also, the Menou et al. [Citation64] used a multi-objective optimization genetic algorithm combining solid solution hardening and thermodynamic modeling (CALPHAD) with data mining to design HEA. The approach found the best stable alloys and designed several optimal HEA. Al35Cr35Mn8Mo5Ti17 (at.%) is chosen for experimental validation.

Zhuang et al. [Citation65] have developed a method integrating a multicomponent diffusion-multiple and ML to study the effects of alloying elements, such as Ni, Al, W, Ti, Ta, Cr, Mo, and Nb, on the microstructures.

A class of ML potential called Moment Tensor Potential (MTP) has been developed and used by Jafary-Zadeh et al. [Citation66] to predict the structure and elastic properties of the multi-principal element alloy.

Rickman et al. [Citation67] have proposed an ML approach using supervised data fed to two complementary algorithms: multiple regression analysis and a genetic algorithm. The methodology is validated after identifying promising multi-principal elements with high hardness.

A study by Ozerdem and Kolukisa [Citation68] has employed an ANN approach to predict the mechanical properties of Cu-Sn-Pb-Zn-Ni cast alloys. The ANN used multi-layer perceptron architecture with a back-propagation algorithm. The compositions were used as input, and yield strength, tensile strength, and elongation were employed as outputs to be predicted.

Zhang et al. [Citation69] used molecular dynamics simulation combined with ML methods to study the mechanical properties of non-equiatomic Fe-Cr-Ni-Co-Mn HEAs. Molecular Dynamic simulation provided a database describing the relationship between material composition and mechanical properties. These properties are trained and investigated using three ML models, including SVM, kernel-based extreme learning machine, and DNN, for predicting the yield stress. The DNN model outperformed other models for the classification of yield stress.

Kostiuchenko et al. [Citation70] have proposed a computational method for studying HEAs based on applying ML potentials derived from ab initio data in combination with Monte Carlo simulations. The approach is employed to study phase stability, phase transitions, and short-range chemical order.

Rao et al. [Citation71] have integrated ML with density-functional theory, thermodynamic calculations, and experiments to predict high-entropy Invar alloys. The method led to the characterization of 17 new alloys, where two were identified as high-entropy Invar alloys with low thermal expansion.

Machaka et al. [Citation72] have constructed a dataset based on 430 peer-reviewed experimental publications, including 40 metallurgy-specific predictor features. This study comprised eight ML classifiers: regression tree, linear discriminant analysis, naїve Bayes, generalized linear regression, RF, ANN, k-nearest neighbors, and SVMs. These classifiers were trained and evaluated to classify HEA solid solution phases across feature ensemble sizes.

Zhang et al. [Citation73] proposed a systematic methodology based on a genetic algorithm to select the ML model and materials descriptors from many alternative HEAs. Several HEAs were selected and experimentally synthesized.

Batchelor et al. [Citation74] have presented a theoretical method to find new active alloy electrocatalysts. Using the multicomponent Ir-Pd-Pt-Rh-Ru alloy, the alloy composition with the best predicted catalytic activity was found.

Using the concepts of Louvain and modified particle swarm optimization (PSO) algorithms, Nia et al. [Citation75] tried to find correlations between HEA descriptors to predict HEA candidates with similar functionality. The proposed method predicted the HEA phase composition with an accuracy of 93%.

The gradient-boosting regression model was used by Wan et al. [Citation76] to map the oxygen reduction reaction catalytic activity of millions of reactive sites on HEA surfaces. The model showed high accuracy and generalization, accurately predicting the catalytic activities of numerous reactive sites on HEA surfaces.

Clausen et al. [Citation77] developed a methodology to predict the oxygen reduction reaction’s net catalytic activity in Ag-Ir-Pd-Pt-Ru alloy. Based on ab initio calculations, a graph convolution NN is used to predict adsorption energies of *OH and *O.

Mishra et al. [Citation78] used ML models on design parameters using classification for HEA phase prediction. The ML models are RF, Stacked Ensemble, and SVM. The training dataset comprising 601 cast alloys is used with cross-validation for test data. The stacked ensemble was the best model, overperforming the SVM and RF.

Wen et al. [Citation79] described a relationship derived from the electronegative difference of elements to characterize solid solution strengthening in the four HEAs: Al-Co-Cr-Fe-Ni, Co-Cr-Fe-Ni-Mn, Hf-Nb-Ta-Ti-Zr, and Mo-Nb-Ta-W-V. Their findings are based on ML methods involving feature construction and selection, which are essential to determining the most important descriptors.

Aided by ML-based interatomic potentials, Zhou et al. [Citation80] developed an approach based on density-functional theory and computational thermodynamic integration to perform extensive screening in vast compositional space on an equiatomic HEA, Ta-V-Cr-W, and discovered the most stable ones, after comparing their free energies.

Numerous other studies covering many applications in HEAs, have included predictions of new alloys [Citation81], phase formations [Citation82,Citation83], heterogeneous Grain Structures [Citation84], large and enhanced hardness [Citation85,Citation86], strengthening mechanisms [Citation87], formation energies [Citation88], sublattice formation in Co-Cr-Fe-Ni [Citation89], phase formations through considering the formation enthalpy and atom-size difference [Citation90], configurational energy [Citation91], quality prediction of additively manufactured alloys [Citation45], stability and structure prediction of cubic phases [Citation92], efficient exploration of the composition-phase space [Citation93], the discovery of single-phase refractory [Citation94], multi-objective optimization [Citation95], selective CO2 reduction to methanol [Citation96] and discovery of high-entropy ceramics [Citation97].

Different methods to design HEA were employed, such as thermodynamics and Gaussian process statistical analysis [Citation98], mined from binary phase diagrams [Citation99], statistical approaches [Citation100] and feature engineering to characterize limited material datasets.

3.6. Steel

Similar to the developments and advances in aluminum and HEAs, a breakthrough in multi-component steel design also requires a comprehensive understanding of complex multi-phases. This necessitates the systematic exploration and screening of numerous prospective configurations across extensive composition ranges, where ML methodologies have the potential to provide valuable assistance.

Deng et al. [Citation101], Wang et al. [Citation47], Shin et al. [Citation102], Geng et al. [Citation103], Peng et al. [Citation104], and Yang et al. [Citation105] used ML methods to develop steel alloys and optimize their properties. These studies performed similar activities, such as predicting steel alloys’ elastic properties, creep life, and atmospheric corrosion resistance. However, they differ in the specific alloys and properties studied and the ML methods used.

Rakhshkhorshid and Sendesi [Citation106], Zhan and Li [Citation107], Thanush, et al. [Citation108], Wang et al. [Citation109], and Wang et al. [Citation110] also used ML methods to develop steel alloys and predict their properties. These studies focused on predicting austenite formation temperatures, fatigue life, atmospheric corrosion rate, creep life, and molten salt corrosion resistance of steel alloys.

ML methods have been used to predict various properties of steel alloys such as austenite formation temperatures, rolling force in cold rolling, and optimal design of alloy steels [Citation26,Citation111–115]. ML also used to predict deformation behavior, high-temperature deformation behavior, ductile damage in sheet metal forming, stacking faulting energy in austenitic steels, materials defects [Citation116–126], porosity in metal-based additive manufacturing, flow stress in hot deformation, tensile strength of alloy steels and high cycle fatigue performance of polycrystalline microstructures [Citation127–134].

Fu et al. [Citation57] and Rahnama et al. [Citation135] used ML methods to predict various properties of steel alloys, such as grain structure after thermomechanical processing, kinetic modeling of a discontinuous precipitation reaction, vibrational free energies including anharmonicity for multicomponent alloys and the occurrence of interphase precipitation in High Strength Low Alloy (HSLA) steels.

One of the main applications of ML methods for maraging steels is to predict the tensile strength and hardness based on the composition and aging temperature. For example, Guo and Sha [Citation131] used an ANN to model the correlation between processing parameters and properties of maraging steels. They found that the aging temperature significantly influenced tensile strength and hardness. Rao and Prasad [Citation132] used a NN approach to evaluate the flow stress of maraging steels during hot deformation. They found that the flow stress increased with increasing strain rate and decreasing temperature. Gocheva-llieva and Dobrev [Citation133,Citation134] used polynomial regression to study the tensile strength of alloy steels and found that the tensile strength increased with increasing carbon and manganese content.

Another application of ML methods for maraging steels is to predict fatigue life and crack growth based on the microstructural features and loading conditions. For example, Paulson et al. [Citation134] used data-driven reduced-order models to rank-order polycrystalline microstructures’ high cycle fatigue performance. They found that the grain size, grain orientation spread, texture coefficient, etc., influenced the fatigue life. Fu et al. [Citation57] used sensitivity analysis and a ML surrogate model to predict the grain structure of U-10Mo alloy after thermomechanical processing. They found that the grain size distribution was affected by the rolling temperature, reduction ratio, annealing temperature and etc.

A third application of ML methods for maraging steels is to predict the precipitation kinetics and phase transformation behavior based on the thermodynamic data and kinetic models. For example, Kautz et al. [Citation136] used an image-driven ML approach to model the kinetic behavior of a discontinuous precipitation reaction in a Fe-Ni-Cr alloy. They found that the precipitation rate was controlled by diffusion and interface migration. Grabowski et al. [Citation59] used ab initio vibrational free energies, including anharmonicity, for multicomponent alloys and found that the vibrational entropy significantly affected the phase stability of Fe-Ni-Cr alloys. Rahnama et al. [Citation135] used ML to predict the occurrence of interphase precipitation in HSLA steels. They found that the composition, cooling rate, austenite grain size, etc, influenced the interphase precipitation.

Deng et al. [Citation101] have designed oxide dispersion-strengthened steel with a composition of Fe-12%Cr-4.5%Al-2.0%W-0.3%Y2O3 (wt%) by a radial basis function ANN and prepared by hot isostatic pressing. They found a small amount of Y-Al-O composite oxide, and its average size is significantly larger than Y2O3. The tensile strength and elongation measured at room temperature and 700°C are consistent with the predicted values of ML.

Wang et al. [Citation47] have reported a ML approach using randomized trees and deep NNs to cluster expansion-generated data to predict the bulk and shear moduli of Fe-Cr-Al alloys. They found that the compositions and temperature are critical features in bulk modulus, while compositions, temperature, and the ordering effect are essential in shear modulus.

By combining measured creep data of a high-temperature alloy and alloy features derived from a computational thermodynamic approach, Shin et al. [Citation102] have proposed a methodology to predict new creep-resistant alloys. The method has highlighted important features essential to better understanding new alloy designs.

A feedforward ANN model has been developed by Geng et al. [Citation103] using relevant material descriptors, including chemical composition, to predict the hardenability of boron steel. The model has successfully predicted new steel with a good hardenability of 0.04% V-added boron steel.

3.7. Amorphous, shape memory, metallic glass, and catalytic materials

In the past, scientists and researchers have relied on domain knowledge, trial and error, and luck to obtain materials with optimized composition and process. Ling et al. [Citation137] have proposed a methodology, called RFs with Uncertainty Estimates for Learning Sequentially (FUELS), built on an RF model, that can accelerate getting desirable materials by fitting data-driven models to experimental data.

Li et al. [Citation138] have successfully generated and optimized ML models to predict the elastic properties of amorphous alloys by implementing six different algorithms, including linear algorithms (Lasso and Ridge) and nonlinear algorithms (RF, Extreme Gradient Boosting, SVR, ANN). The predictions agreed with glass forming ability and plasticity prediction ternary amorphous alloy systems.

Bulk-metallic glasses are a unique class of alloy materials utilized in various applications due to their attractive physical properties. Xiong et al. [Citation139] have tested over 20 ML models to predict the properties of metallic glasses and amorphous metallic alloys. RF gave the best prediction for their study. The model predictions for a newly introduced dataset were in agree with the experimental results.

Tang et al. [Citation140] have used ML to accelerate the design of Fe – based amorphous alloys with six magnetic desired properties. Linear Regression, Support Vector Regression, Decision Tree Regression, ANN, and RF Regression are employed to build prediction models. The ANN algorithm showed the best predictability with an accuracy of over 90% while the prediction yield into the design of Fe83-B9-P3-C4-Nb1, which was verified experimentally.

Artrith and Kolpak [Citation141] have proposed an approach that combines site-based Monte-Carlo simulations to sample the composition space and molecular dynamics simulations for electrocatalysts suitable for sustainable energy applications. To describe the atomic interactions, they used ANN potentials to interpolate density-functional theory calculations. The equilibrium compositions at finite temperatures: 300 K and 500 K, are compared to the ground state configurations for Cu–Au nanoalloys.

Mueller [Citation142] presented a theoretical study of Au-Pd nanoparticles. Using a Bayesian approach to cluster expansions, he has calculated the equilibrium atomic structures, energies, and the electronic surface d-band centers across the entire range of compositions at different temperatures. The prediction error of the estimated formation energies was lower than the error relative to density functional theory (1 meV/atom).

Cao et al. [Citation143] have developed a method based on cluster expansions and density functional theory that may facilitate the rational design of new catalysts. The method is allowing scientists to quickly predict the near-surface atomic arrangement and catalytic properties, giving them the possibility to identify the synthesizing conditions of the best alloys.

Tran and Ulissi [Citation144] have presented an automated computational screening approach, combining ML and optimization to guide density functional theory to predict electrocatalyst performance. They demonstrated the feasibility of this method by screening various alloys with different elements.

Gao et al. [Citation145] have built a model based on an overlap of ML and atomic positions to accelerate the estimation of formation energies of amorphous alloys and considerably reduce the cost of the required DFT calculations. The results with high accuracy could help to develop the Pd40-Ni10-Cu30-P20 catalyst.

Yang et al. [Citation53] have simulated a NN model based on DFT calculations to predict the potential energies across the Cu–Pd–Au ternary space. With reasonable accuracy, the simulation results qualitatively agree with the experimental data for PdAu and CuAu. Other important studies reporting advanced ML models used in predicting properties of Ni-P based amorphous alloys [Citation146], and shape memory alloys [Citation147–152].

Cai et al. [Citation153] have reported the progress of component design methods for bulk metallic glass. The group reported extensive studies on the development of glass forming alloys by ANN, to reduce transition temperatures [Citation154], modeling for undercooled liquid region [Citation155], and predicting of critical cooling rate [Citation156].

Among other methodologies of using ML in predicting forming metallic glasses, a two-step fused ML approach was used by Zhang et al. [Citation157]. They used multivariate analysis and classification using principal component analysis. Tripathi et al. [Citation158], a support vector classification method for predicting the glass-forming ability of binary metallic alloys from compositions, randomly selected. Sun et al. [Citation159] were employed the same method to predict elastic properties and glass-forming ability of bulk metallic glasses. Xiong et al. [Citation160] predicted the ability to exist in an amorphous state, critical casting diameter and supercooled liquid range by using decision trees and RF. Ward et al. [Citation161] and Ren et al. [Citation162] developed accelerated discovery of metallic glasses using ML iteration and high-throughput experiments.

In case of catalytic materials, ML used for a range of applications including the development of platinum-modified amorphous alloy surface catalysts [Citation163], optimizing ternary catalytic alloy electrode for efficiency improvement of semiconductor performance [Citation164], electrocatalytic for oxygen reduction [Citation165], composition and activity of electrocatalytic nanoalloys in aqueous solvents [Citation166], catalytic activity of nanoparticles [Citation167], generated feature engineering of ML chemisorption modeling [Citation168], bimetallic catalysts [Citation169], scaling relations for the description of catalytic material and beyond [Citation170], predicting binding energies by building convolutional NN for atomic surface structures [Citation171], CO2 electrocatalysts [Citation172], and nanocatalysts for oxygen reduction reaction [Citation173].

Other studies have covered vast scientific research areas naming: Inconel 718 alloys [Citation174], thermoelectrics [Citation175], CO2 electroreduction catalyst screening [Citation176], electronic density states [Citation177], nanoclusters [Citation178,Citation179], casting parts [Citation180], electrochemical curve simulation of corrosion and its application [Citation181] and copper alloys [Citation182]. The search for stable Cu-Pd-Ag nanoparticles [Citation183] predicting stress hotspots in face centered cubic materials [Citation184] search for rare-earth-free permanent magnets [Citation185] were other research in the field. Predicted the Curie temperature of ferromagnets Nelson and Sanvito [Citation186], the discovery of new half-heusler [Citation187], low-thermal-conductivity of half-heusler semiconductors [Citation188,Citation189], elastic moduli of k-nary inorganic polycrystalline compounds [Citation190], properties of metastable polymorph structures [Citation191] amorphous LixSi [Citation192] low-melting alloys [Citation193] predicting properties of inorganic materials [Citation194] and processing-structure relationships in soft magnetic alloys [Citation195] were the next groups of research in the field.

3.8. Ni-based superalloy

Distinct groups could be identified upon careful examination of the abstracts of the selected papers on ML methods for developing nickel alloys. The first group comprises Tancret [Citation196], Menou, et al. [Citation197], and Conduit et al. [Citation198], who shared a common focus on the design and optimization of nickel alloys. These papers employed various computational tools, optimization algorithms, and techniques such as computational thermodynamics, genetic algorithms, and data mining to optimize alloy compositions and achieve desired mechanical and microstructural properties.

The second group includes Jiang et al. [Citation199], Chandran et al. [Citation200], and Schooling et al. [Citation201], who rather focused on the application of ML for a target property prediction. These papers explore lattice misfit prediction, estimation of interface energy, and modeling of fatigue thresholds in nickel alloys. They leverage ML algorithms, such as SVR, NNs, and neuro-fuzzy networks, to accurately predict and analyze several material properties. Taylor and Conduit [Citation202] have used Gaussian process regression ML to predict good properties for laboratory and commercial Ni-based superalloys. Wu et al. [Citation203] proposed a novel methodology for classifying multi-principal element alloys with excellent strength – ductility. They predicted a yield strength of more than 1 GPa and a ductility of over 20%. Islam et al. [Citation204] have employed a NN in the ML framework to gain insights into designing multi-principal element alloys (MPEA) to recognize the underlying data pattern using an experimental dataset to classify the corresponding phase selection in MPEAs.

4. ML and alloy

This chapter presents the approaches of researchers using ML algorithms to address the challenges in alloy research. Their strategies for selecting predictive parameters, dataset size, and ML model optimization are examined. The most commonly employed ML models, including ensemble techniques and ANN, are also investigated. Furthermore, the chapter explores methodologies for designing and predicting mechanical properties and assesses the accuracy of ML models. Additionally, it delves into the utilization of first-principles, CALPHAD, and thermodynamics in related research. The challenges of data generation, high-throughput approaches, and image processing are discussed in detail. Finally, the most effective ML models are presented.

4.1. Predicted parameter

4.1.1. Predicted parameter, country and alloy

Choosing the right predicted parameter is crucial because ML is utilized to optimize specific alloy properties, a consistent focus in reviewed papers. The Sankey diagram of is structured with distinct columns for countries, predicted parameters, and the studied alloys. Notably, China emerges as a preeminent contributor, accounting for the highest volume of publications, dedicating 40% to the prediction of physical properties, 45% to mechanical properties and 15% on other parameters. In contrast, research publications from the U.S.A. allocate 35% toward physical properties, 26% toward mechanical properties, and 39% toward various other parameters. Significantly, a substantial proportion of research pertaining to catalytic properties (46%), crystal properties (77%), and process parameters (75%) originates from U.S.A. researchers. In the case of the United Kingdom, 50% of their research endeavors are centered on the prediction of physical properties, while 33% are focused on mechanical properties.

Figure 4. The Sankey diagram with the countries on the left side, predicted parameters in the middle and studied alloys on the right side.

Figure 4. The Sankey diagram with the countries on the left side, predicted parameters in the middle and studied alloys on the right side.

In addition to the conventional single-parameter prediction approach, an alternative strategy has surfaced, wherein a combined and integrated approach is employed to simultaneously predict both physical and mechanical properties. This innovative methodology is exemplified in the references [Citation12,Citation37,Citation44].

This collective analysis suggests that while researchers from diverse countries engage in predicting a spectrum of alloy properties, the U.S.A. exhibits a proclivity for a more extensive range of parameter prediction when employing ML techniques compared to their international counterparts. This implies a certain level of diversification and specialization in the global research landscape, with implications for the unique research priorities and strengths of each contributing nation.

reveals a comprehensive analysis of predicted parameters, encompassing a range of properties including mechanical and physical properties across a diverse array of alloys. Notably, the research landscape displays distinct tendencies, with 57% of studies focusing on steel, 58% on aluminum alloys, and 57% on titanium alloys to predict the mechanical properties. In contrast, the prediction of physical properties is notably prevalent in specific domains, with 25% of steel research, 65% of HEA investigations, 17% of aluminum alloys’ research, and 67% of metallic glass studies concentrating on this aspect. A salient observation pertains to the strong inclination among studies that encompass multiple alloys, where 60% of such research endeavors prioritize the prediction of physical properties, exemplifying the multifaceted nature of the research paradigm.

4.1.2. Predicted parameter and ML model

Various ML models are employed for predicting specific alloy properties and researchers are divided in choosing ML models to predict alloy properties. The Sankey diagram in , shows the associations between predicted parameters and the corresponding ML models employed for prediction. 38% of researchers utilize NN for predicting mechanical properties, and 22% for predicting physical properties. An observed preference is noted for Linear Regression, where 60% of researchers employing this model for mechanical property prediction. Moreover, 40% of researchers resort to NN for predicting catalytic properties. Interestingly, 66% of researchers harness Gaussian Process to predict physical properties.

Figure 5. The Sankey diagram represents the associations between predicted properties (on the left side) and the corresponding ML models employed in the analysis, as evident from the collected publications (on the right side). The figure provides insights into the distribution and utilization of ML models for predicting specific properties in the context of the collected publications.

Figure 5. The Sankey diagram represents the associations between predicted properties (on the left side) and the corresponding ML models employed in the analysis, as evident from the collected publications (on the right side). The figure provides insights into the distribution and utilization of ML models for predicting specific properties in the context of the collected publications.

Further analysis reveals that predominantly, NN, RF, and SVM constitute the preferred ML models that collectively representing 20% of all models employed. Notably, NN is frequently utilized, with 40% of its applications dedicated to predicting both mechanical and physical properties. In the case of RF, 40% of its applications focus on predicting physical properties, while 40% are oriented toward mechanical properties. SVM allocations reveal 38% for mechanical property prediction and 34% for physical property prediction.

The co-utilization of ML models for predicting both mechanical and physical properties is evident that is highlighting the adaptability and utility of these models in alloy research. Researchers exhibit diverse preferences for ML models, and while certain models are specialized, others, like Linear Regression, have specific applications in the eyes of researchers.

4.2. Large/Small datasets, and raw data collection

The size of the dataset has a significant impact on the accuracy of ML models. A larger dataset can improve model accuracy by reducing overfitting and increasing generalization. However, most of dataset provided from metallurgy for ML applications are small and limited to few samples. Overfitting occurs when a model is trained too well on a small dataset and performs poorly on new data. A larger dataset can also enhance the robustness of a model by making it more resistant to noise and outliers. However, collecting large datasets of alloys can be expensive and time-consuming. One of most robust methods in materials search adapted for small dataset is based on a trade-off between an exploitation (ML model) and exploration (uncertainties) approaches. In this regard, Xue et al. [Citation205] have developed an adaptive design strategy, coupled with experiments, that can accelerate the discovery process of NiTi-based shape memory alloys.

Several initiatives have emerged to build materials databases due to the Materials Genome Initiative, data-management policies enforced by funding agencies, and the recent awareness within the materials community of the positive impact of data sharing and dissemination. A wide range of databases, both empirical and computational, have become available that cover a diverse set of materials properties such as structures, formation energetics, thermodynamic phase diagrams, and electrical and mechanical properties across different material classes, including metals, ceramics, alloys, glasses, 2D materials and nanocomposites [Citation62]. shows some notable data repositories and their application in the studied publications. Although a few of these data repositories are frequently used by researchers, a significant number of them remain largely unnoticed. It is intriguing to note that researchers often struggle to create more data to make their ML models better, even though there is a lot of unused data in existing databases. We’ll delve deeper into this in Chapter 7, which titled ‘Data and Code Availability’.

Table 1. Known data repositories of metals and used in the collected papers.

Researchers are facing a significant challenge arises in forming the libraries used for ML predictions. The most likely scenario is to consider multiple databases specialized around a specific type of desired data, e.g. first-principles calculations, crystallography, mechanical properties, Scanning Transmission Electron Microscopy (STEM) imaging, etc. Having open, well-documented, and standardized dataset models is essential in all cases to enable better integration [Citation206].

Despite presenting intriguing research findings, some attempts in this review did not yield successful outcomes. One such endeavor was to compare the sizes of datasets used in constructing ML models. However, making such comparisons among published papers often proved challenging, as several studies did not provide information about the size of their datasets. In cases where data sizes were reported, there was considerable variability. For example, Tang et al. [Citation140] utilized a dataset comprising 2083 inputs representing 36 different elements, while Yan et al. [Citation94] mentioned 1807 entries. This is while some studies indicated just the number of rows and columns of the datasets [Citation207,Citation208]. These disparities made it difficult to establish clear dataset size comparisons. As a result, the dataset sizes often remained unreported. It would be beneficial for future research to standardize and report dataset sizes consistently for better comparisons.

4.3. ML optimization: feature selection and hyperparameter tuning

Feature selection is a process where the most relevant features in a data are automatically selected. Hyperparameter tuning is the process of selecting the optimal set of hyperparameters for a learning algorithm [Citation209]. Evaluating a model means analyze how it will perform on the test dataset. For instance, ML data scientists train their models and compare how well it generalizes utilizing the test set. It is supposed that they selected the best model configuration, but now they must apply some regularization to avoid model overfitting. The main question is, how do they choose the value of the regularization hyperparameter? One option is to train models using different values for this hyperparameter, by running hyperparameter tuning. Most publications do not apply feature selection or processing and hyperparameter tuning to their models. Some only highlighted their feature selection [Citation24,Citation29,Citation31,Citation34,Citation140], and others only tuned their model hyperparameters [Citation38,Citation40] or used default settings [Citation25,Citation102]. Studies considering both feature selection and hyperparameter tuning are rare [Citation2,Citation11,Citation16,Citation19]. The most used hyperparameter search approaches are the grid search method with cross-validation [Citation21] or the random search method with cross-validation [Citation36]. These all mean the research community could benefit from more comprehensive and systematic approaches to model refinement.

4.4. Ensemble techniques

Suppose a complex question presented to thousands of random persons, then their answers are collected. Generally, their collective answer is better than an answer from an expert. This is called the wisdom of the crowd. Similarly, if a material property is predicted by using a group of predictors (such as classifiers or regressors), often better predictions are carried out than the best individual predictor. A group of predictors is called an ensemble, and the technique is called ensemble learning. An ensemble learning algorithm is also called an ensemble technique. For example, Yan et al. [Citation128] used 3 ensemble models to predict hardness of steel. A typical example of ensemble learning is RFs which perform better than individual decision trees [Citation209]. RF is employed in a variety of alloy development research that is discussed later sections in this chapter.

4.5. Artificial neural network

ANNs are ML models inspired by biological neuron networks in our brains [Citation209]. ANNs have gradually become quite different from their biological cousins. They are versatile, robust, and scalable, ideal for tackling large and highly complex ML tasks such as classifying billions of images, powering speech recognition services, or recommending the best videos to watch to hundreds of millions of daily users.

A massive quantity of data is nowadays available to train ANNs in different fields, and ANNs frequently outperform other ML techniques on vast and complex problems. The significant increase in computing power since the end of the 1990s and the beginning of the 2000s, also thanks to the gaming industry, which has stimulated producing powerful GPU cards, made it possible to train large ANNs in a reasonable amount of time [Citation209].

A required large dataset, which most cases not available for particular materials or alloys, is one of the main theoretical limitations of ANNs. Also, many researchers thought that ANN training algorithms would likely get stuck in local optima. Still, this is not a big problem in practice, especially for more extensive ANNs: the local optima often perform almost as well as the global optimum [Citation209]. ANNs had slow progress because of their complexity and lack of interpretability of their results. Nevertheless, this did not stop more attention toward them, resulting in more and more improvement and even more unique predictions.

It’s essential to note that, the terms ‘neural network’ (NN), ‘artificial neural network’ (ANN), and ‘deep neural network’ (DNN) are not used interchangeably in this research because it is important to use the terminology that accurately describes the type of network you are discussing.

4.6. Design and prediction of mechanical properties

Researchers used ML to enhance design of experiments including developing high-throughput rapid experimental alloy development [Citation210], proposing a general experimental design [Citation211], maximin and centroid design strategies [Citation212], improving alloy design efficiency [Citation213,Citation214] and catalyst design [Citation215].

Estimating the mechanical properties (e.g. Yield strength) of an indented material is often challenging. Conventional optimization techniques are generally unable to yield accurate values. Fernandez-Zelaia et al. [Citation216] have proposed combining a Gaussian Process surrogate model using finite element models of spherical indentation, a Bayesian optimization and Markov Chain Monte Carlo sampling. The efficiency of the approach was tested against the experiment with good agreement. In a separate instance, ML models were applied to anticipate the tensile strength of metallic materials subjected to hydrogen exposure [Citation217]. Additionally, ML was employed to classify stress hotspots in another case [Citation218].

4.7. ML models and accuracy

In , a sankey diagram illustrates the relationship between ML models and the alloys under study. Notably, NN, RF, and SVM, which are the most frequently employed ML models, are used to predict parameters across a diverse array of alloys. This indicates that researchers do not confine themselves to a particular ML model when investigating specific alloys. Notably, steel, HEA, aluminum, nickel, and range of alloys (researches that encompassing a range of alloys within a single paper), collectively account for 50% of the papers employing ML in alloy development. Furthermore, it is observed that 18% of steel research and 33% of aluminum research employ NN as their preferred ML model. The graphical representation underscores that researchers do not impose limitations on the choice of ML models for specific alloys, resulting in a relatively equitable distribution of models across the various alloys studied.

Figure 6. Sankey diagram with ML models on the left side and the studied alloys on the right side. The diagram is an illustration of the relationship between the ML model and predicted alloys.

Figure 6. Sankey diagram with ML models on the left side and the studied alloys on the right side. The diagram is an illustration of the relationship between the ML model and predicted alloys.

Various ML models such as ANNs, SVMs, and decision trees have been applied to the alloy development process. These models can be used to predict the properties of an alloy based on an optimal composition for a given set of desired properties.

4.8. ML, first-principles, CALPHAD and thermodynamic approaches

Inputs derived from first-principles calculations [Citation59,Citation150] and thermodynamic modeling methods such as CALPHAD [Citation210,Citation213] have been heavily used in ML applications in alloy research. First, theoretical modeling can generate big data sets for ML models. Theoretical modeling calculations are heavy and slow for computing machines, while similar calculations of ML can be performed faster in similar computing machines. Secondly, traditional modeling methods can be used to validate the outcome of the ML models. Both advantages show how traditional modeling and ML can balance each other.

One approach to using ML in alloy development is first to gather a large dataset of alloy compositions and their related properties. This dataset can then be used to train an ML model, which can then be used to make predictions about the properties of new alloy compositions. The accuracy of the model can be improved by increasing the size and diversity of the training dataset and by hyperparameter tuning of the ML model.

Another approach is to use an evolutionary model, such as genetic models [Citation196,Citation197], to search for the optimal alloy composition. This approach generates relevant libraries. Therefore and evaluates an initial population of alloy compositions based on their predicted properties. The compositions that perform best are then used to generate a new population, which is again evaluated, and the process is repeated until the optimal composition is found.

4.9. The challenge to generate data

Research shows that the accuracy of ML models can differ even with the same data set [Citation219–221]. Each ML model has its unique structure with underlying mathematical concepts and relevant libraries. Therefore, it is essential to identify the best-fit model by critically analyzing performance parameters before applying it to real-world scenarios. Such an approach in ML model selection is a promising methodology with minimal effort to discover novel alloy compositions compared to conventional alloy development procedures.

Results of the published papers show that the accuracy of ML models can differ even with the same data set. So, different accuracies are reported in a single article. shows the pros and cons of using some of the ML models. Researchers are encouraged to make informed ML model selections by weighing these factors. Highlighting models that outperform their counterparts, this paper recommends top ML models for alloy development, with further discussion in the ‘ML Models and Accuracy’ section.

4.10. High throughput approach

In total, 28 of the studied papers used high throughput methods, e.g [Citation102]. used High Throughput CALPHAD approach [Citation187], and [Citation189] used high throughput ab initio method, and [Citation69] used high throughput molecular dynamics, the rest of the papers studied experimental High Throughput methods. While U.S.A. and China are the frontiers in using High Throughput, Carrete, et al. in France are using the method to predict the physical properties of Half-Heusler [Citation187–189]. It is important to highlight that in High Throughput studies, RF, SVM, and XGBoost have consistently emerged as the ML models with the highest reported accuracy. Interestingly, even though the high throughput approach is employed to generate extensive data in materials science, NN have not been widely utilized for modeling this type of data. This stands in contrast to the broader field of alloy development, where on average, NNs have been the researchers’ preferred choice for ML applications.

4.11. Computer vision in image processing

Computer vision and Convolutional neural networks (CNNs) emerged to solve problems related to visual tasks. Power image search services, self-driving cars, and automatic video classification systems are among their applications and more [Citation209].

By using image processing techniques, researchers can analyze the microstructure of alloys at various scales, from the atomic to the macroscopic levels. A small portion of the studied publications used image processing in alloy exploration and development [Citation29,Citation58,Citation125,Citation184,Citation222]. Image processing is mainly used for microstructure evaluation, such as identifying stress hot spots [Citation184].

4.12. The challenge to find the best ML approach

The Pie chart of reveals that the majority of publications (68%) rely on a single ML model for their research while others used more than one ML model to predict parameters. It is notable that one publication authored by C. Wen et al. [Citation79] stands out as an exception, employing an extensive array of 12 ML models in a single research paper. The remaining publications utilize a varying number of ML models, ranging from 2 to 11, demonstrating diversity in the approaches taken by researchers in selecting and applying ML models to their work. This is showing at least 32% of the researchers may have encountered uncertainty or difficulty in determining one ML model as the most appropriate model for their specific application.

Figure 7. The pie chart illustrates the frequency of ML model usage within a single published paper. For example, one paper used 12 ML models in one published paper while the majority of papers used only one ML model.

Figure 7. The pie chart illustrates the frequency of ML model usage within a single published paper. For example, one paper used 12 ML models in one published paper while the majority of papers used only one ML model.

Following uncertainty for the best ML model for the application, researchers compare the results from every ML model to find the best. shows the frequency of using other ML models in every publication. This statistical result reflects the popularity of the ML models and not necessarily the accuracy of the predictions or the advantage of using two or more ML models. Those researchers who used more than 1 ML model compared the accuracy of the models and presented their best ML model. A review of the best suggested ML models presented earlier in 4.7, ML models and accuracy.

As previously discussed, some researchers adopted a multi-model approach to identify the most accurate solution for their specific applications within the realm of ML. They conducted assessments of accuracy and reported their findings, while others chose not to share accuracy information [Citation110]. shows predicted parameters and their correspondent best-reported ML models. The connections between the predicted properties and ML models reveal which ML model was reported as the best performer for predicting a particular alloy property. For instance, if mechanical property is connected to RF, it suggests that RF was found to be the best ML model for predicting mechanical properties. The suggested model was selected the best in comparison with the other models that used in the same paper. The graph is extracted only from the papers that used more than one ML model, compared those models and reported the best. RF, SVM, XGBoost, and SVR are introduced as the best ML models for all the predicted parameters. It is interesting to learn that while NN is the most used ML model (section 4.1 Predicted Parameters), however among more than 200 studied papers, just two researchers reported NN as the best ML model [Citation18,Citation140]. Furthermore, it is noteworthy that 12 distinct ML models were reported as the best for predicting mechanical properties, and 7 ML models excelled in predicting physical properties. With one exception, the ML models ideal for predicting mechanical properties were not the same as those for predicting physical properties.

Figure 8. Sankey diagram with predicted parameters on the left side and the best-suggested ML models on the right side. The diagram is an illustration of the relationship between the predicted property and the alloy of the best-suggested ML model for predicted properties. This graph is not showing all of the studied papers. It is only showing the results of those papers that compared different ML models and reported the best ML model.

Figure 8. Sankey diagram with predicted parameters on the left side and the best-suggested ML models on the right side. The diagram is an illustration of the relationship between the predicted property and the alloy of the best-suggested ML model for predicted properties. This graph is not showing all of the studied papers. It is only showing the results of those papers that compared different ML models and reported the best ML model.

5. ML deployment

Suppose a model is launched into production, but unfortunately, it does not perform as well as expected and produces large errors. The problem lies in the fact that the generalization error was measured multiple times on the test set, and the model and hyperparameters were adapted to yield the best model for that dataset. This means the model is unlikely to perform as well on new data. Therefore, ML deployment task is often missed or ignored in the journey of building the model in the vast majority of the ML publications. illustrates how data flow is used to improve alloy development. The data flow begins with data collection and concludes with data re-examination and re-assessment. This data flow aids researchers in enhancing their processes by reviewing the steps.

Figure 9. Building a data flywheel for ML modelling: (1) data collection, (2) Exploratory Data Analysis (EDA), (3) data cleaning/preprocessing, (4) feature engineering, (5) training an ML model, (6) optimization through hyperparameter tuning, (7) testing and validation of the best ML model, (8) Model deployment for production, (9) evaluation of the accuracy and the quality of final predictions, (10) data re-checking and re-evaluation. Steps 8–10 are usually missed or neglected in most of the published works.

Figure 9. Building a data flywheel for ML modelling: (1) data collection, (2) Exploratory Data Analysis (EDA), (3) data cleaning/preprocessing, (4) feature engineering, (5) training an ML model, (6) optimization through hyperparameter tuning, (7) testing and validation of the best ML model, (8) Model deployment for production, (9) evaluation of the accuracy and the quality of final predictions, (10) data re-checking and re-evaluation. Steps 8–10 are usually missed or neglected in most of the published works.

To deploy ML algorithms, it is possible to use a cloud-based environment (e.g. AWS, Google Cloud and Microsoft Azure) to train, deploy, automate, manage, and track ML models. It is often used with cloud storage where potentially large amounts of data are also considered in the ML process. Using cloud-base environment is not expanded in this research because it is not used by majority of the researchers in field of alloy development.

6. Data and code availability

It is crucial to present the primary data used to construct ML models as it enables other researchers to reproduce the results and validate the findings. This practice also ensures that the data utilized in the model is precise and impartial. Moreover, it allows other researchers to reproduce the results, build upon the work, and develop novel models based on the same data. shows the availability of the primary dataset in the studied publications. In some cases, the dataset was presented in the text [Citation180], some in a supplementary file [Citation7], in some cases, an external link [Citation166] was given; some said it would be available on request [Citation67], some said they could not share the data [Citation57] and some referred to their github.com repository [Citation21]. This is while most of the researchers did not present the primary dataset at all. This research shows that recent publications have more tendency to share their dataset. It should be mentioned that sharing the dataset and code in some journals is mandatory.

Figure 10. Pie charts showing (a) data availability and (b) code availability, in the studied publications. The figures provide insights into how often data and code are shared within the research community have implications for the transparency and reproducibility of the research.

Figure 10. Pie charts showing (a) data availability and (b) code availability, in the studied publications. The figures provide insights into how often data and code are shared within the research community have implications for the transparency and reproducibility of the research.

Sharing the code of ML models is crucial as it enables other researchers to reproduce the results and validate the findings. This practice also ensures that the code utilized in the model is precise and impartial. Moreover, it allows other researchers to build upon the work and develop novel models based on the same code. shows abundance of publications that shared the code of their ML model. Surprisingly, 81% of the researchers did not share their code while 14% said the code will be available on request.

7. Conclusion and outlook

In recent years, ML approaches have gained significant attention in alloy development. This statistical literature review highlighted advanced ML methods used in alloy design worldwide, where the U.S.A. and China are the leading countries in the selected publications.

It is found that steel, HEA, and aluminum were the most extensively studied alloys in the field. Investigators from the U.S.A. have extensively examined a range of alloys, while researchers of other nations mostly focused on a single alloy system in one publication.

Computational Materials Science is the journal with the highest publication rate in the field, with 10% of the published papers.

The use of ML approaches in predicting physical, mechanical, and catalytic properties in several alloys ranging from light and HEA to amorphous, Ni-based alloys and steels were thoroughly. The fact that researchers explored and utilized a wide spectrum of ML tools and approaches to improve and advance the development of alloys is indicating the versatility and comprehensiveness of their approach. This research indicates that the U.S.A. leads in diverse alloy property prediction using ML compared to other countries.

This comprehensive analysis underscores the diverse ML models adopted in property prediction and reveals a desire of researchers for NN, RF, and SVM that collectively representing 20% of all models employed. These models are used to predict parameters across a diverse array of alloys. This suggests that researchers do not limit their research to a specific ML model when studying a particular alloy. So, the models distributed equally for the alloys.

Researchers display a wide range of preferences when it comes to ML models, and while some models are tailored for specific tasks, others, such as Linear Regression, are seen as having distinct applications by researchers.

This research shows most publications neglect feature selection, hyperparameter tuning, or both. Only a few studies address both aspects, primarily employing grid or random search methods with cross-validation. The materials’ research community could benefit from more comprehensive and systematic approaches to model refinement.

The dataset size and structure essential to the model accuracy were discussed. Most used ML approaches, such as ensemble techniques, NNs, and computer image processing, were also revisited. Those which underwent careful feature selection and hyperparameter tuning and reported the best optimal models were highlighted. Often available data useful for alloy property prediction is too small, with few features. This creates a challenge of generating the extra necessary data from other resources, such as first-principles calculations and thermodynamic approaches, which can benefit some studies.

It is worth noting that 68% of the publications used only one ML model, 9% used two ML models, and the maximum number of ML models used in a single paper was 12. Most used ML models were in the following order: NN, RF, SVM, SVR, Gaussian process, and Linear Regression.

While NN is the most used ML model, only two papers presented NN as the best model, in comparison with the other models. Notably, the ML models that had been the best for predicting mechanical properties were not the same as those for predicting physical properties.

Linear regression and XGBoost were the most frequently used models for predicting mechanical properties, while the Gaussian process and Bayesian optimizations were the preferred models for predicting physical properties. Steel, aluminum, and titanium were the most studied alloys in terms of their mechanical properties, whereas in the case of HEA and metallic glasses, the focus of the studies was on their physical properties.

Researchers reported that ensemble techniques with SVM provided the highest accuracy in predicting the properties of alloys, while NN was the most used model. High Throughput methods were used in 12% of the publications. It is alarming that 53% of the publications did not share their datasets, and 81% did not share the code used to build their ML models. So, sharing datasets and codes used to build and deploy ML models should be encouraged to promote reproducibility and facilitate future research in this field. Also, only 2% of the publications used image-processing techniques for alloy development.

Overcoming obstacles requires standardization, enhanced collaboration, and innovative algorithms that can handle the complexities of alloy systems. Continuous interdisciplinary collaboration will be vital in pushing the boundaries of what is possible, ensuring that the insights gained from computational methods are effectively translated into real-world applications.

This research demonstrated how standardization is the lost part of using ML in alloy development research. Standardized formats and protocols for data collection, storage, and sharing make it easier to combine datasets from various sources including experiment and CALPHAD results. Standardization can help ensure data quality and consistency. It provides guidelines for data validation, calibration, and error correction, reducing the chances of erroneous results. This statistical literature review has aimed to compare models, but it becomes evident that the best benchmarking can only be achieved through the use of standardized datasets and evaluation metrics, enabling equitable model comparisons and performance assessments among peers.

Authorship contribution statement

Alireza Valizadeh: Investigation, Data curation, Conceptualization, Methodology, Writing – original draft, Visualization, Writing – review & editing. Ryoji Sahara: Conceptualization, Writing – review & editing. Maaouia Souissi: Investigation, Conceptualization, Methodology, Writing – original draft, Supervision, Project administration, Writing – review & editing.

Acknowledgements

We acknowledge Dr Taylor D.Sparks from The University of Utah for many useful discussions.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Notes on contributors

Alireza Valizadeh

Alireza Valizadeh, is a distinguished scientist in materials informatics, and specializes in energy materials. With a Ph.D. from Brunel University London, he currently researches at NEX Power Ltd, utilizing AI and machine learning for materials development. Focused on the energy sector, his goal is to innovate sustainable energy solutions. In 2018, he was awarded the Frank Fitzgerald medal by the Institute of Materials, Mines and Minerals for his extensive work in the iron and steel industry. Through his pursuit of scientific excellence, Alireza shapes a sustainable energy landscape for societal and environmental benefits.

Ryoji Sahara

Ryoji Sahara, graduated in Materials Science from Tohoku University, Japan, in 2000. He worked as an assistant professor at the Institute for Materials Research (IMR) and later joined the National Institute for Materials Science (NIMS) as a principal researcher in 2003. At NIMS, he supervised postdoctoral researchers, leading projects on carbide formation in heat-resistant creep steels, oxidation of titanium alloys, and materials design using machine learning. He collaborates with researchers from Saveetha University, TU-Deft, and Ohio State University. As chairman and member of ACCMS, he contributes to computational materials science. With 60+ publications, he currently leads the Computational Structural Materials Group at NIMS’s Research Center for Structural Materials.

Maaouia Souissi

Maaouia Souissi, graduated from the National School of Engineering of Sfax in 2005 and worked as a research assistant in Tunisia until 2007. He obtained his Ph.D. in Materials Science from Tohoku University, Japan, in 2011. Following that, he conducted postdoctoral research at Osaka Prefecture University for five years, focusing on defect modeling in steels using density functional theory. In 2016, he joined NIMS, Japan, investigating structural materials modeling for high-strength heat-resistant applications. Later, he joined Brunel University London as a research fellow, collaborating with Constellium partnership on aluminum alloy development for aerospace and automotive purposes. He has over 15 years of research experience, contributed to prestigious journals, and actively participated in international conferences.

References

  • Tanaka I, Rajan K, Wolverton C. Data-centric science for materials innovation. MRS Bull. 2018;43(9):659–31. doi: 10.1557/mrs.2018.205
  • Schmidt J, Marques MRG, Botti S, et al. Recent advances and applications of machine learning in solid-state materials science. Npj Comput Mater. 2019;5. doi: 10.1038/s41524-019-0221-0
  • Olson GB. Computational design of hierarchically structured materials. Science. 1997;277(5330):1237–1242. doi: 10.1126/science.277.5330.1237
  • Kingsmore KM, Puglisi CE, Grammer AC, et al. An introduction to machine learning and analysis of its use in rheumatic diseases. Nat Rev Rheumatol. 2021;17(12):710–730. doi: 10.1038/s41584-021-00708-w
  • Pfeiler, W. Alloy physics: a comprehensive reference. Wiley; 2007.
  • Marchand D, Jain A, Glensk A, et al. Machine learning for metallurgy I. A neural-network potential for al-cu. Phys Rev Mater. 2020;4(10):1–21. doi: 10.1103/PhysRevMaterials.4.103601
  • Jakse N, Sandberg J, Granz LF, et al. Machine learning interatomic potentials for aluminium: application to solidification phenomena. J Phys Condens Matter. 2023;35(3):035402. doi: 10.1088/1361-648X/ac9d7d
  • Varol ÖH, İnce M, Bıçaklı EE. Prediction of mechanical properties of the 2024 aluminum alloy by using machine learning methods. Arab J Sci Eng. 2023;48:2841–2850. doi: 10.1007/s13369-022-07009-8
  • Kumar AK, Surya MS, Venkataramaiah P. Performance evaluation of machine learning based-classifiers in friction stir welding of Aa6061-T6 alloy. Int J Interact Des Manuf. 2023;17(1):469–472. doi: 10.1007/s12008-022-00904-2
  • Yao D, Pu S, Li M, et al. Parameter identification method of the semi-coupled fracture model for 6061 aluminium alloy sheet based on machine learning assistance. Int J Solids Struct. 2022;254–255:111823. doi: 10.1016/j.ijsolstr.2022.111823
  • Liu Q, Wu H, Paul MJ, et al. Machine-learning assisted laser powder bed fusion process optimization for AlSi10Mg: new microstructure description indices and fracture mechanisms. Acta Mater. 2020;201:316–328. doi: 10.1016/j.actamat.2020.10.010
  • Yanase Y, Miyauchi H, Matsumoto H, et al. Densification behavior and microstructures of the Al-10%Si-0.35Mg alloy fabricated by selective laser melting: from experimental observation to machine learning. Mater Trans. 2022;63:176–184. doi: 10.2320/matertrans.MT-M2021215
  • Chun MS, Biglou J, Lenard JG, et al. Using neural networks to predict parameters in the hot working of aluminum alloys. J Mater Process Technol. 1999;86(1–3):245–251. doi: 10.1016/S0924-0136(98)00318-5
  • Li B, Du Y, Zheng ZS, et al. Manipulation of mechanical properties of 7xxx aluminum alloy via a hybrid approach of machine learning and key experiments. J Mater Res Technol. 2022;19:2483–2496. doi: 10.1016/j.jmrt.2022.06.015
  • Aydin F. The investigation of the effect of particle size on wear performance of AA7075/Al2O3 composites using statistical analysis and different machine learning methods. Adv Powder Technol. 2021;32:445–463. doi: 10.1016/j.apt.2020.12.024
  • Decke J, Engelhardt A, Rauch L, et al. Predicting flow stress behavior of an AA7075 alloy using machine learning methods. Crystals. 2022;12(9):1–19. doi: 10.3390/cryst12091281
  • Haghdadi N, Zarei-Hanzaki A, Khalesian AR, et al. Artificial neural network modeling to predict the hot deformation behavior of an A356 aluminum alloy. Mater Des. 2013;49:386–391. doi: 10.1016/j.matdes.2012.12.082
  • Shang H, Wu P, Lou Y, et al. Machine learning-based modeling of the coupling effect of strain rate and temperature on strain hardening for 5182-O aluminum alloy. J Mater Process Technol. 2022;302:117501. doi: 10.1016/j.jmatprotec.2022.117501
  • Soofi Y J, Rahman MA, Gu Y, et al. A feasibility study of machine learning-assisted alloy design using wrought aluminum alloys as an example. Comput Mater Sci. 2022;215:111783. doi: 10.1016/j.commatsci.2022.111783
  • Takara Y, Ozawa T, Yamaguchi M. Analysis of the elemental effects on the surface potential of aluminum alloy using machine learning. Jpn J Appl Phys. 2022;61:SL1008. doi: 10.35848/1347-4065/ac5a2a
  • Lian Z, Li M, Lu W. Fatigue life prediction of aluminum alloy via knowledge-based machine learning. Int J Fatigue. 2022;157:106716. doi: 10.1016/j.ijfatigue.2021.106716
  • Chun PJ, Yamane T, Izumi S, et al. Development of a machine learning-based damage identification method using multi-point simultaneous acceleration measurement results. Sensors (Switzerland). 2020;20(10):20. doi: 10.3390/s20102780
  • Jiang X, Zhang R, Zhang C, et al. Fast prediction of the quasi phase equilibrium in phase field model for multicomponent alloys based on machine learning method. Calphad. 2019;66:101644. https://linkinghub.elsevier.com/retrieve/pii/S0364591619300860
  • Dai Y, Zhao S, Song C, et al. Identification of aluminum alloy by laser‐induced breakdown spectroscopy combined with machine algorithm. Microw Opt Technol Lett. 2021;63(6):1629–1634. doi: 10.1002/mop.32810
  • Wang J, Yousefzadi Nobakht A, Blanks JD, et al. Machine learning for thermal transport analysis of aluminum alloys with precipitate morphology. Adv Theory Simul. 2019;2(4). doi: 10.1002/adts.201800196
  • Malinov S, Sha W, McKeown J. Modelling the correlation between processing parameters and properties in titanium alloys using artificial neural network. Comput Mater Sci. 2001;21(3):375–394. doi: 10.1016/S0927-0256(01)00160-4
  • Malinov S, Sha W. Application of artificial neural networks for modelling correlations in titanium alloys. Mater Sci Eng A. 2004;365(1–2):202–211. doi: 10.1016/j.msea.2003.09.029
  • Zhu C, Li C, Wu D, et al. A titanium alloys design method based on high-throughput experiments and machine learning. J Mater Res Technol. 2021;11:2336–2353. doi: 10.1016/j.jmrt.2021.02.055
  • Bautista-Monsalve F, García-Sevilla F, Miguel V, et al. A novel machine-learning-based procedure to determine the surface finish quality of titanium alloy parts obtained by heat assisted single point incremental forming. Metals (Basel). 2021;11(8):11. doi: 10.3390/met11081287
  • Zhan Z, Hu W, Meng Q. Data-driven fatigue life prediction in additive manufactured titanium alloy: a damage mechanics based machine learning framework. Eng Fract Mech. 2021;252:107850. doi: 10.1016/j.engfracmech.2021.107850
  • Li J, Yang Z, Qian G, et al. Machine learning based very-high-cycle fatigue life prediction of Ti-6Al-4V alloy fabricated by selective laser melting. Int J Fatigue. 2022;158:106764. doi: 10.1016/j.ijfatigue.2022.106764
  • Sun Y, Zeng WD, Zhao YQ, et al. Development of constitutive relationship model of Ti600 alloy using artificial neural network. Comput Mater Sci. 2010;48:686–691. doi: 10.1016/j.commatsci.2010.03.007
  • Reddy NS, Panigrahi BB, Ho CM, et al. Artificial neural network modeling on the relative importance of alloying elements and heat treatment temperature to the stability of α and β phase in titanium alloys. Comput Mater Sci. 2015;107:175–183. doi: 10.1016/j.commatsci.2015.05.026
  • Kwak S, Kim J, Ding H, et al. Machine learning prediction of the mechanical properties of γ-TiAl alloys produced using random forest regression model. J Mater Res Technol. 2022;18:520–530. doi: 10.1016/j.jmrt.2022.02.108
  • Outeiro J, Cheng W, Chinesta F, et al. Modelling and optimization of machining of ti-6Al-4V titanium alloy using machine learning and design of experiments methods. J Manuf Mater Process. 2022;6(3):6. doi: 10.3390/jmmp6030058
  • Wu CT, Lin PH, Huang SY, et al. Revisiting alloy design of low-modulus biomedical β-Ti alloys using an artificial neural network. Materialia. 2022;21:101313. doi: 10.1016/j.mtla.2021.101313
  • Liu Y, Wang L, Zhang H, et al. Accelerated development of high-strength magnesium alloys by machine learning. Metall Mater Trans A Phys Metall Mater Sci. 2021;52:943–954. doi: 10.1007/s11661-020-06132-1
  • Xu X, Wang L, Zhu G, et al. Predicting tensile properties of AZ31 magnesium alloys by machine learning. JOM. 2020;72:3935–3942. doi: 10.1007/s11837-020-04343-w
  • Ibarra-Hernández W, Hajinazar S, Avendaño-Franco G, et al. Structural search for stable Mg-ca alloys accelerated with a neural network interatomic model. Phys Chem Chem Phys. 2018;20:27545–27557. doi: 10.1039/C8CP05314F
  • Gui Y, Li Q, Zhu K, et al. A combined machine learning and EBSD approach for the prediction of {10-12} twin nucleation in an Mg-RE alloy. Mater Today Commun. 2021;27:102282. doi: 10.1016/j.mtcomm.2021.102282
  • Wei X, Wang J, Wang C, et al. Prediction of electronic work function of the second phase in binary magnesium alloy based on machine learning method. J Mater Res. 2022;37(21):3792–3802. doi: 10.1557/s43578-022-00752-6
  • Fang S, Xiao X, Xia L, et al. Relationship between the widths of supercooled liquid regions and bond parameters of Mg-based bulk metallic glasses. J Non Cryst Solids. 2003;321(1–2):120–125. doi: 10.1016/S0022-3093(03)00155-8
  • Chen T, Gao Q, Yuan Y, et al. Coupling physics in machine learning to investigate the solution behavior of binary Mg alloys. J Magnes Alloy. 2022;10:2817–2832. https://linkinghub.elsevier.com/retrieve/pii/S2213956721001584
  • Qiao L, Ramanujan RV, Zhu J. Machine learning discovery of a new cobalt free multi-principal-element alloy with excellent mechanical properties. Mater Sci Eng A. 2022;845:143198. doi: 10.1016/j.msea.2022.143198
  • Wang H, Li B, Xuan FZ. A dimensionally augmented and physics-informed machine learning for quality prediction of additively manufactured high-entropy alloy. J Mater Process Technol. 2022;307:117637. doi: 10.1016/j.jmatprotec.2022.117637
  • Rovinelli A, Sangid MD, Proudhon H, et al. Using machine learning and a data-driven approach to identify the small fatigue crack driving force in polycrystalline materials. Npj Comput Mater. 2018;4:1–10. doi: 10.1038/s41524-018-0094-7
  • Wang R, Zeng S, Wang X, et al. Machine learning for hierarchical prediction of elastic properties in Fe-cr-al system. Comput Mater Sci. 2019;166:119–123. doi: 10.1016/j.commatsci.2019.04.051
  • Zhang YM, Yang S, Evans JRG. Revisiting Hume-Rothery’s rules with artificial neural networks. Acta Mater. 2008;56(5):1094–1105. doi: 10.1016/j.actamat.2007.10.059
  • Seko A, Maekawa T, Tsuda K, et al. Machine learning with systematic density-functional theory calculations: application to melting temperatures of single- and binary-component solids. Phys Rev B. 2014;89(5):054303. doi: 10.1103/PhysRevB.89.054303
  • Reitz DM, Blaisten-Barojas E. Simulating the NaK eutectic alloy with Monte Carlo and machine learning. Sci Rep. 2019;9:1–11. doi: 10.1038/s41598-018-36574-y
  • Oh SV, Hwang W, Kim K, et al. Using feature‐assisted machine learning algorithms to boost polarity in lead‐free multicomponent niobate alloys for high‐performance ferroelectrics. Adv Sci. 2022;9. doi: 10.1002/advs.202104569
  • Chintakindi S, Alsamhan A, Abidi MH, et al. Annealing of monel 400 alloy using principal component analysis, hyper-parameter optimization, machine learning techniques, and multi-objective particle swarm optimization. Int J Comput Intell Syst. 2022;15(1):18. doi: 10.1007/s44196-022-00070-z
  • Yang Y, Guo Z, Gellman AJ, et al. Simulating segregation in a ternary Cu-pd-au alloy with density functional theory, machine learning, and monte carlo simulations. J Phys Chem C. 2022;126:1800–1808. doi: 10.1021/acs.jpcc.1c09647
  • Mayeur JR, Mourad HM, Luscher DJ, et al. Numerical implementation of a crystal plasticity model with dislocation transport for high strain rate applications. Model Simul Mater Sci Eng. 2016;24(4):045013. doi: 10.1088/0965-0393/24/4/045013
  • van Nieuwenburg EPL, Liu Y-H, Huber SD. Learning phase transitions by confusion. Nat Phys. 2017;13:435–439. https://www.nature.com/articles/nphys4037
  • Sun H, Zhang H, Ren G, et al. A knowledge transfer framework for general alloy materials properties prediction. Materials. 2022;15(21):7442. doi: 10.3390/ma15217442
  • Fu Y, Frazier WE, Choi KS, et al. Prediction of grain structure after thermomechanical processing of U-10Mo alloy using sensitivity analysis and machine learning surrogate model. Sci Rep. 2022;12:1–11. doi: 10.1038/s41598-022-14731-8
  • Kautz E, Ma W, Jana S, et al. An image-driven machine learning approach to kinetic modeling of a discontinuous precipitation reaction. Mater Charact. 2020;166:110379. doi: 10.1016/j.matchar.2020.110379
  • Grabowski B, Ikeda Y, Srinivasan P, et al. Ab initio vibrational free energies including anharmonicity for multicomponent alloys. Npj Comput Mater. 2019;5. 10.1038/s41524-019-0218-8
  • Chang YJ, Jui CY, Lee WJ, et al. Prediction of the composition and hardness of high-entropy alloys by machine learning. JOM. 2019;71:3433–3442. doi: 10.1007/s11837-019-03704-4
  • Kim G, Diao H, Lee C, et al. First-principles and machine learning predictions of elasticity in severely lattice-distorted high-entropy alloys with experimental validation. Acta Mater. 2019;181:124–138. doi: 10.1016/j.actamat.2019.09.026
  • Wu Q, Wang Z, Hu X, et al. Uncovering the eutectics design by machine learning in the Al–Co–Cr–Fe–Ni high entropy system. Acta Mater. 2020;182:278–286. doi: 10.1016/j.actamat.2019.10.043
  • Menou E, Toda-Caraballo I, Rivera-Díaz-Del-Castillo PEJ, et al. Evolutionary design of strong and stable high entropy alloys using multi-objective optimisation based on physical models, statistics and thermodynamics. Mater Des. 2018;143:185–195. doi: 10.1016/j.matdes.2018.01.045
  • Menou E, Tancret F, Toda-Caraballo I, et al. Computational design of light and strong high entropy alloys (HEA): obtainment of an extremely high specific solid solution hardening. Scr Mater. 2018;156:120–123. doi: 10.1016/j.scriptamat.2018.07.024
  • Zhuang X, Antonov S, Li W, et al. Alloying effects and effective alloy design of high-Cr CoNi-based superalloys via a high-throughput experiments and machine learning framework. Acta Mater. 2022;243:118525. doi: 10.1016/j.actamat.2022.118525
  • Jafary-Zadeh M, Khoo KH, Laskowski R, et al. Applying a machine learning interatomic potential to unravel the effects of local lattice distortion on the elastic properties of multi-principal element alloys. J Alloys Compd. 2019;803:1054–1062. doi: 10.1016/j.jallcom.2019.06.318
  • Rickman JM, Chan HM, Harmer MP, et al. Materials informatics for the screening of multi-principal elements and high-entropy alloys. Nat Commun. 2019;10:1–10. doi: 10.1038/s41467-019-10533-1
  • Ozerdem MS, Kolukisa S. Artificial neural network approach to predict the mechanical properties of Cu-Sn-Pb-Zn-Ni cast alloys. Mater Des. 2009;30:764–769. doi: 10.1016/j.matdes.2008.05.019
  • Zhang L, Qian K, Huang J, et al. Molecular dynamics simulation and machine learning of mechanical response in non-equiatomic FeCrNiCoMn high-entropy alloy. J Mater Res Technol. 2021;13:2043–2054. doi: 10.1016/j.jmrt.2021.06.021
  • Kostiuchenko T, Körmann F, Neugebauer J, et al. Impact of lattice relaxations on phase transitions in a high-entropy alloy studied by machine-learning potentials. Npj Comput Mater. 2019;5:1–7. doi: 10.1038/s41524-019-0195-y
  • Rao Z, Tung P-Y, Xie R, et al. Machine learning–enabled high-entropy alloy discovery. Science. 2022;378:78–85. doi: 10.1126/science.abo4940
  • Machaka R. Machine learning-based prediction of phases in high-entropy alloys. Comput Mater Sci. 2021;188:110244. doi: 10.1016/j.commatsci.2020.110244
  • Zhang Y, Wen C, Wang C, et al. Phase prediction in high entropy alloys with a rational selection of materials descriptors and machine learning models. Acta Mater. 2020;185:528–539. doi: 10.1016/j.actamat.2019.11.067
  • Batchelor TAA, Pedersen JK, Winther SH, et al. High-entropy alloys as a discovery platform for electrocatalysis. Joule. 2019;3(3):834–845. doi: 10.1016/j.joule.2018.12.015
  • Ghouchan Nezhad Noor Nia R, Jalali M, Mail M, et al. Machine Learning Approach to Community Detection in a high-entropy alloy interaction network. ACS Omega. 2022;7:12978–12992. doi: 10.1021/acsomega.2c00317
  • Wan X, Zhang Z, Yu W, et al. Machine-learning-assisted discovery of highly efficient high-entropy alloy catalysts for the oxygen reduction reaction. Patterns. 2022;3:100553. doi: 10.1016/j.patter.2022.100553
  • Clausen CM, Nielsen MLS, Pedersen JK, et al. Ab Initio to activity: machine learning-assisted optimization of high-entropy alloy catalytic activity. High Entropy Alloy Mater. 2022;1:0–3.
  • Mishra A, Kompella L, Sanagavarapu LM, et al. Ensemble-based machine learning models for phase prediction in high entropy alloys. Comput Mater Sci. 2022;210:111025. doi: 10.1016/j.commatsci.2021.111025
  • Wen C, Wang C, Zhang Y, et al. Modeling solid solution strengthening in high entropy alloys using machine learning. Acta Mater. 2021;212:116917. doi: 10.1016/j.actamat.2021.116917
  • Zhou Y, Srinivasan P, Körmann F, et al. Thermodynamics up to the melting point in a TaVCrW high entropy alloy: systematic ab initio study aided by machine learni. Phys Rev B. 2022;105:214302. doi: 10.1103/PhysRevB.105.214302
  • Krishna YV, Jaiswal UK, RM R. Machine learning approach to predict new multiphase high entropy alloys. Scr Mater. 2021;197:113804. doi: 10.1016/j.scriptamat.2021.113804
  • Li Y, Guo W. Machine-learning model for predicting phase formations of high-entropy alloys. Phys Rev Mater. 2019;3:95005. doi: 10.1103/PhysRevMaterials.3.095005
  • Huang W, Martin P, Zhuang HL. Machine-learning phase prediction of high-entropy alloys. Acta Mater. 2019;169:225–236. doi: 10.1016/j.actamat.2019.03.012
  • Li L, Xie B, Fang Q, et al. Machine learning approach to design high entropy alloys with heterogeneous grain structures. Metall Mater Trans A Phys Metall Mater Sci. 2021;52(2):439–448. doi: 10.1007/s11661-020-06099-z
  • Wen C, Zhang Y, Wang C, et al. Machine learning assisted design of high entropy alloys with desired property. Acta Mater. 2019;170:109–117. doi: 10.1016/j.actamat.2019.03.010
  • Yang C, Ren C, Jia Y, et al. A machine learning-based alloy design system to facilitate the rational design of high entropy alloys with enhanced hardness. Acta Mater. 2022;222:117431. doi: 10.1016/j.actamat.2021.117431
  • Li XG, Chen C, Zheng H, et al. Complex strengthening mechanisms in the NbMoTaW multi-principal element alloy. Npj Comput Mater. 2020;6:1–32. doi: 10.1038/s41524-020-0339-0
  • Shapeev A. Accurate representation of formation energies of crystalline alloys with many components. Comput Mater Sci. 2017;139:26–30. doi: 10.1016/j.commatsci.2017.07.010
  • Meshkov EA, Novoselov II, Shapeev AV, et al. Sublattice formation in CoCrFeNi high-entropy alloy. Intermetallics. 2019;112:106542. doi: 10.1016/j.intermet.2019.106542
  • Zhang L, Chen H, Tao X, et al. Machine learning reveals the importance of the formation enthalpy and atom-size difference in forming phases of high entropy alloys. Mater Des. 2020;193:108835. doi: 10.1016/j.matdes.2020.108835
  • Zhang J, Liu X, Bi S, et al. Robust data-driven approach for predicting the configurational energy of high entropy alloys. Mater Des. 2020;185:108247. doi: 10.1016/j.matdes.2019.108247
  • Nong Z-S, Zhu J-C, Cao Y, et al. Stability and structure prediction of cubic phase in as cast high entropy alloys. Mater Sci Technol. 2014;30(3):363–369. doi: 10.1179/1743284713Y.0000000368
  • Abu-Odeh A, Galvan E, Kirk T, et al. Efficient exploration of the high entropy alloy composition-phase space. Acta Mater. 2018;152:41–57. doi: 10.1016/j.actamat.2018.04.012
  • Yan Y, Lu D, Wang K. Accelerated discovery of single-phase refractory high entropy alloys assisted by machine learning. Comput Mater Sci. 2021;199:110723. doi: 10.1016/j.commatsci.2021.110723
  • Guo T, Wu L, Li T. Machine learning accelerated, high throughput, multi‐objective optimization of multiprincipal element alloys. Small. 2021;17(42). doi: 10.1002/smll.202102972
  • Roy D, Mandal SC, Pathak B. Machine learning assisted exploration of high entropy alloy-based catalysts for selective CO2 reduction to methanol. J Phys Chem Lett. 2022;13(25):5991–6002. doi: 10.1021/acs.jpclett.2c00929
  • Kaufmann K, Maryanovsky D, Mellor WM, et al. Discovery of high-entropy ceramics via machine learning. Npj Comput Mater. 2020;6:1–9. doi: 10.1038/s41524-020-0317-6
  • Tancret F, Toda-Caraballo I, Menou E, et al. Designing high entropy alloys employing thermodynamics and Gaussian process statistical analysis. Mater Des. 2017;115:486–497. doi: 10.1016/j.matdes.2016.11.049
  • Qi J, Cheung AM, Poon SJ. High entropy alloys mined from binary phase diagrams. Sci Rep. 2019;9(1):15501. doi: 10.1038/s41598-019-50015-4
  • Domínguez LA, Goodall R, Todd I, et al. Prediction and validation of four component high entropy alloys using principal component analysis. Adv Eng Mater. 2015;210:259–268. doi: 10.1016/j.matchemphys.2017.09.001
  • Deng L, Wang C, Luo J, et al. Preparation and property optimization of FeCrAl-based ODS alloy by machine learning combined with wedge-shaped hot-rolling. Mater Charact. 2022;188:111894. doi: 10.1016/j.matchar.2022.111894
  • Shin D, Yamamoto Y, Brady MP, et al. Modern data analytics approach to predict creep of high-temperature alloys. Acta Mater. 2019;168:321–330. doi: 10.1016/j.actamat.2019.02.017
  • Geng X, Cheng Z, Wang S, et al. A data-driven machine learning approach to predict the hardenability curve of boron steels and assist alloy design. J Mater Sci. 2022;57:10755–10768. doi: 10.1007/s10853-022-07132-9
  • Peng G, Sun Y, Zhang Q, et al. A collaborative design platform for new alloy material development. Adv Eng Informatics. 2022;51:101488. doi: 10.1016/j.aei.2021.101488
  • Yang X, Yang J, Yang Y, et al. Data-mining and atmospheric corrosion resistance evaluation of Sn- and Sb-additional low alloy steel based on big data technology. Int J Miner Metall Mater. 2022;29(4):825–835. doi: 10.1007/s12613-022-2457-9
  • Rakhshkhorshid M, Teimouri Sendesi SA. Bayesian regularization neural networks for prediction of austenite formation temperatures (Ac1 and Ac3). J Iron Steel Res Int. 2014;21(2):246–251. doi: 10.1016/S1006-706X(14)60038-8
  • Zhan Z, Li H. Machine learning based fatigue life prediction with effects of additive manufacturing process parameters for printed SS 316L. Int J Fatigue. 2021;142:105941. doi: 10.1016/j.ijfatigue.2020.105941
  • Thanush AA, Chitra P, Kasinath J, et al. Atmospheric corrosion rate prediction of low-alloy steel using machine learning models. IOP Conf Ser Mater Sci Eng. 2022;1248(1):012050. doi: 10.1088/1757-899X/1248/1/012050
  • Wang C, Wei X, Ren D, et al. High-throughput map design of creep life in low-alloy steels by integrating machine learning with a genetic algorithm. Mater Des. 2022;213:110326. doi: 10.1016/j.matdes.2021.110326
  • Wang Y, Goh B, Nelaturu P, et al. Integrated high-throughput and machine learning methods to accelerate discovery of molten salt corrosion-resistant alloys. Adv Sci. 2022;9(20):1–13. doi: 10.1002/advs.202200370
  • Gavard L, Bhadeshia HKDH, MacKay DJC, et al. Bayesian neural network model for austenite formation in steels. Mater Sci Technol. 1996;12(6):453–463. doi: 10.1179/mst.1996.12.6.453
  • Mahfouf M, Jamei M, Linkens DA. Optimal design of alloy steels using multiobjective genetic algorithms. Mater Manuf Process. 2005;20(3):553–567. doi: 10.1081/AMP-200053580
  • Bailer-Jones CAL, Bhadeshia HKDH, MacKay DJC. Gaussian process modelling of austenite formation in steel. Mater Sci Technol. 1999;15(3):287–294. doi: 10.1179/026708399101505851
  • Capdevila C, Caballero FG, de AC. Determination of ms temperature in steels: a bayesian neural network model. ISIJ Int. 2002;42:894–902. http://www.jstage.jst.go.jp/article/isijinternational1989/42/8/42_8_894/_article
  • Capdevila C, Caballero FG, de Andrés C G. Analysis of effect of alloying elements on martensite start temperature of steels. Mater Sci Technol. 2003;19:581–586. doi: 10.1179/026708303225001902
  • Behler J, Parrinello M. Generalized neural-network representation of high-dimensional potential-energy surfaces. Phys Rev Lett. 2007;98(14):146401. doi: 10.1103/PhysRevLett.98.146401
  • Chatterjee S, Murugananth M, HKDH B. δ TRIP steel. Mater Sci Technol. 2007;23:819–827. doi: 10.1179/174328407X179746
  • Brun F, Yoshida T, Robson JD, et al. Theoretical design of ferritic creep resistant steels using neural network, kinetic, and thermodynamic models. Mater Sci Technol. 1999;15(5):547–554. doi: 10.1179/026708399101506085
  • Mandal S, Sivaprasad PV, Venugopal S, et al. Artificial neural network modeling to evaluate and predict the deformation behavior of stainless steel type AISI 304L during hot torsion. Appl Soft Comput. 2009;9(1):237–244. doi: 10.1016/j.asoc.2008.03.016
  • Ji G, Li F, Li Q, et al. A comparative study on Arrhenius-type constitutive model and artificial neural network model to predict high-temperature deformation behaviour in Aermet100 steel. Mater Sci Eng A. 2011;528:4774–4782. doi: 10.1016/j.msea.2011.03.017
  • Abbassi F, Belhadj T, Mistou S, et al. Parameter identification of a mechanical ductile damage using artificial neural networks in sheet metal forming. Mater Des. 2013;45:605–615. doi: 10.1016/j.matdes.2012.09.032
  • Wilk-Kolodziejczyk D, Regulski K, Gumienny G. Comparative analysis of the properties of the nodular cast iron with carbides and the austempered ductile iron with use of the machine learning and the support vector machine. Int J Adv Manuf Technol. 2016;87(1–4):1077–1093. doi: 10.1007/s00170-016-8510-y
  • Chaudhary N, Abu-Odeh A, Karaman I, et al. A data-driven machine learning approach to predicting stacking faulting energy in austenitic steels. J Mater Sci. 2017;52(18):11048–11076. doi: 10.1007/s10853-017-1252-x
  • Tapia G, Khairallah S, Matthews M, et al. Gaussian process-based surrogate modeling framework for process planning in laser powder-bed fusion additive manufacturing of 316L stainless steel. Int J Adv Manuf Technol. 2018;94(9–12):3591–3603. doi: 10.1007/s00170-017-1045-z
  • DeCost BL, Lei B, Francis T, et al. High throughput quantitative metallography for complex microstructures using deep learning: a case study in ultrahigh carbon steel. Microsc Microanal. 2019;25(1):21–29. doi: 10.1017/S1431927618015635
  • Feng S, Zhou H, Dong H. Using deep neural network with small dataset to predict material defects. Mater Des. 2019;162:300–310. doi: 10.1016/j.matdes.2018.11.060
  • Meng L, Zhang J. Process design of laser powder bed fusion of stainless steel using a Gaussian process-based machine learning model. JOM. 2020;72:420–428. doi: 10.1007/s11837-019-03792-2
  • Yan F, Song K, Gao L, et al. DCLF: a divide-and-conquer learning framework for the predictions of steel hardness using multiple alloy datasets. Mater Today Commun. 2022;30:103195. doi: 10.1016/j.mtcomm.2022.103195
  • Jeon J, Seo N, Jung J-G, et al. Prediction and mechanism explain of austenite-grain growth during reheating of alloy steel using XAI. J Mater Res Technol. 2022;21:1408–1418. doi: 10.1016/j.jmrt.2022.09.119
  • Tapia G, Elwany AH, Sang H. Prediction of porosity in metal-based additive manufacturing using spatial Gaussian process models. Addit Manuf. 2016;12:282–290. doi: 10.1016/j.addma.2016.05.009
  • Guo Z, Sha W. Modelling the correlation between processing parameters and properties of maraging steels using artificial neural network. Comput Mater Sci. 2004;29(1):12–28. doi: 10.1016/S0927-0256(03)00092-2
  • Rao KP, YKDV P. Neural network approach to flow stress evaluation in hot deformation. J Mater Process Technol. 1995;53:552–566. https://linkinghub.elsevier.com/retrieve/pii/092401369401744L
  • Gocheva-Ilieva S, Dobrev G study of the tensile strength of alloy steels using polynomial regression. 2022. p. 050005. https://pubs.aip.org/aip/acp/article/2828445.
  • Paulson NH, Priddy MW, McDowell DL, et al. Data-driven reduced-order models for rank-ordering the high cycle fatigue performance of polycrystalline microstructures. Mater Des. 2018;154:170–183. doi: 10.1016/j.matdes.2018.05.009
  • Rahnama A, Clark S, Sridhar S. Machine learning for predicting occurrence of interphase precipitation in HSLA steels. Comput Mater Sci. 2018;154:169–177. doi: 10.1016/j.commatsci.2018.07.055
  • Ma W, Kautz EJ, Baskaran A, et al. Image-driven discriminative and generative machine learning algorithms for establishing microstructure–processing relationships. J Appl Phys. 2020;128:128. doi: 10.1063/5.0013720
  • Ling J, Hutchinson M, Antono E, et al. High-dimensional materials and process optimization using data-driven experimental design with well-calibrated uncertainty estimates. Integr Mater Manuf Innov. 2017;6(3):207–217. doi: 10.1007/s40192-017-0098-z
  • Li Z, Long Z, Lei S, et al. Machine learning driven rationally design of amorphous alloy with improved elastic models. Mater Des. 2022;220:110881. doi: 10.1016/j.matdes.2022.110881
  • Xiong J, Shi SQ, Zhang TY. A machine-learning approach to predicting and understanding the properties of amorphous metallic alloys. Mater Des. 2020;187:108378. doi: 10.1016/j.matdes.2019.108378
  • Tang Y, Wan Y, Wang Z, et al. Machine learning and Python assisted design and verification of Fe–based amorphous/nanocrystalline alloy. Mater Des. 2022;219:110726. doi: 10.1016/j.matdes.2022.110726
  • Artrith N, Kolpak AM. Grand canonical molecular dynamics simulations of Cu-Au nanoalloys in thermal equilibrium using reactive ANN potentials. Comput Mater Sci. 2015;110:20–28. doi: 10.1016/j.commatsci.2015.07.046
  • Mueller T. Ab initio determination of structure-property relationships in alloy nanoparticles. Phys Rev B. 2012;86(14):144201. doi: 10.1103/PhysRevB.86.144201
  • Cao L, Niu L, Mueller T. Computationally generated maps of surface structures and catalytic activities for alloy phase diagrams. Proc Natl Acad Sci. 2019;116(44):22044–22051. doi: 10.1073/pnas.1910724116
  • Tran K, Ulissi ZW. Active learning across intermetallics to guide discovery of electrocatalysts for CO2 reduction and H2 evolution. Nat Catal. 2018;1(9):696–703. doi: 10.1038/s41929-018-0142-1
  • Gao S, Zhen H, Wen B, et al. Exploring the physical origin of the electrocatalytic performance of an amorphous alloy catalyst via machine learning accelerated DFT study. Nanoscale. 2022;14(7):2660–2667. doi: 10.1039/D1NR07661B
  • Keong KG, Sha W, Malinov S. Artificial neural network modelling of crystallization temperatures of the Ni–P based amorphous alloys. Mater Sci Eng A. 2004;365(1–2):212–218. doi: 10.1016/j.msea.2003.09.030
  • Kankanamge UMHU, Reiner J, Ma X, et al. Machine learning guided alloy design of high-temperature NiTiHf shape memory alloys. J Mater Sci. 2022;57:19447–19465. doi: 10.1007/s10853-022-07793-6
  • Solomou A, Zhao G, Boluki S, et al. Multi-objective Bayesian materials discovery: application on the discovery of precipitation strengthened NiTi shape memory alloys through micromechanical modeling. Mater Des. 2018;160:810–827. doi: 10.1016/j.matdes.2018.10.014
  • Liu S, Kappes BB, Amin-Ahmadi B, et al. Physics-informed machine learning for composition – process – property design: shape memory alloy demonstration. Appl Mater Today. 2021;22:100898. doi: 10.1016/j.apmt.2020.100898
  • Xue D, Xue D, Yuan R, et al. An informatics approach to transformation temperatures of NiTi-based shape memory alloys. Acta Mater. 2017;125:532–541. doi: 10.1016/j.actamat.2016.12.009
  • Lenzen N, Altay O. Machine learning enhanced dynamic response modelling of superelastic shape memory alloy wires. Materials. 2022;15(1):15. doi: 10.3390/ma15010304
  • Trehern W, Ortiz-Ayala R, Atli KC, et al. Data-driven shape memory alloy discovery using artificial intelligence materials selection (AIMS) framework. Acta Mater. 2022;228:117751. doi: 10.1016/j.actamat.2022.117751
  • Cai A, Wang H, Li X, et al. Progress of component design methods for bulk metallic glass. Mater Des. 2007;28(10):2694–2697. doi: 10.1016/j.matdes.2006.10.002
  • Cai AH, Xiong X, Liu Y, et al. Artificial neural network modeling of reduced glass transition temperature of glass forming alloys. Appl Phys Lett. 2008;92(11):12–15. doi: 10.1063/1.2899633
  • Hui CA, Xiong X, Liu Y, et al. Artificial neural network modeling for undercooled liquid region of glass forming alloys. Comput Mater Sci. 2010;48:109–114. doi: 10.1016/j.commatsci.2009.12.012
  • Cai AH, Liu Y, An WK, et al. Prediction of critical cooling rate for glass forming alloys by artificial neural network. Mater Des. 2013;52:671–676. doi: 10.1016/j.matdes.2013.06.012
  • Zhang YX, Xing GC, Sha ZD, et al. A two-step fused machine learning approach for the prediction of glass-forming ability of metallic glasses. J Alloys Compd. 2021;875:160040. doi: 10.1016/j.jallcom.2021.160040
  • Tripathi MK, Chattopadhyay PP, Ganguly S. Multivariate analysis and classification of bulk metallic glasses using principal component analysis. Comput Mater Sci. 2015;107:79–87. doi: 10.1016/j.commatsci.2015.05.010
  • Sun YT, Bai HY, Li MZ, et al. Machine learning approach for prediction and understanding of glass-forming ability. J Phys Chem Lett. 2017;8(14):3434–3439. doi: 10.1021/acs.jpclett.7b01046
  • Xiong J, Zhang TY, Shi SQ. Machine learning prediction of elastic properties and glass-forming ability of bulk metallic glasses. MRS Commun. 2019;9(2):576–585. doi: 10.1557/mrc.2019.44
  • Ward L, O’Keeffe SC, Stevick J, et al. A machine learning approach for engineering bulk metallic glass alloys. Acta Mater. 2018;159:102–111. doi: 10.1016/j.actamat.2018.08.002
  • Ren F, Ward L, Williams T, et al. Accelerated discovery of metallic glasses through iteration of machine learning and high-throughput experiments. Sci Adv. 2018;4(4):4. doi: 10.1126/sciadv.aaq1566
  • Zhang X, Li K, Wen B, et al. Machine learning accelerated DFT research on platinum-modified amorphous alloy surface catalysts. Chinese Chem Lett. 2023;34(5):107833. doi: 10.1016/j.cclet.2022.107833
  • Ghosal S, Dey S, Chattopadhyay PP, et al. Designing optimized ternary catalytic alloy electrode for efficiency improvement of semiconductor gas sensors using a machine learning approach. Decis Mak Appl Manag Eng. 2021;4:126–139. doi: 10.31181/dmame210402126g
  • Zhang J-Y, Xia C, Wang H-F, et al. Recent advances in electrocatalytic oxygen reduction for on-site hydrogen peroxide synthesis in acidic media. J Energy Chem. 2022;67:432–450. doi: 10.1016/j.jechem.2021.10.013
  • Artrith N, Kolpak AM. Understanding the composition and activity of electrocatalytic nanoalloys in aqueous solvents: a combination of DFT and accurate neural network potentials. Nano Lett. 2014;14(5):2670–2676. doi: 10.1021/nl5005674
  • Jinnouchi R, Asahi R. Predicting catalytic activity of nanoparticles by a DFT-Aided machine-learning algorithm. J Phys Chem Lett. 2017;8(17):4279–4283. doi: 10.1021/acs.jpclett.7b02010
  • Li Z, Ma X, Xin H. Feature engineering of machine-learning chemisorption models for catalyst design. Catal Today. 2017;280:232–238. doi: 10.1016/j.cattod.2016.04.013
  • Li Z, Wang S, Chin WS, et al. High-throughput screening of bimetallic catalysts enabled by machine learning. J Mater Chem A. 2017;5(46):24131–24138. doi: 10.1039/C7TA01812F
  • Andersen M, Levchenko SV, Scheffler M, et al. Beyond scaling relations for the description of catalytic materials. ACS Catal. 2019;9:2752–2759. doi: 10.1021/acscatal.8b04478
  • Back S, Yoon J, Tian N, et al. Convolutional neural network of atomic surface structures to predict binding energies for high-throughput screening of catalysts. J Phys Chem Lett. 2019;10(15):4401–4408. doi: 10.1021/acs.jpclett.9b01428
  • Zhong M, Tran K, Min Y, et al. Accelerated discovery of CO2 electrocatalysts using active machine learning. Nature. 2020;581(7807):178–183. doi: 10.1038/s41586-020-2242-8
  • Kang J, Noh SH, Hwang J, et al. First-principles database driven computational neural network approach to the discovery of active ternary nanocatalysts for oxygen reduction reaction. Phys Chem Chem Phys. 2018;20(38):24539–24544. doi: 10.1039/C8CP03801E
  • Sah AK, Agilan M, Dineshraj S, et al. Machine learning-enabled prediction of density and defects in additively manufactured inconel 718 alloy. Mater Today Commun. 2022;30:103193. doi: 10.1016/j.mtcomm.2022.103193
  • Sparks TD, Gaultois MW, Oliynyk A, et al. Data mining our way to the next generation of thermoelectrics. Scr Mater. 2016;111:10–15. doi: 10.1016/j.scriptamat.2015.04.026
  • Ma X, Li Z, Achenie LEK, et al. Machine-learning-augmented chemisorption model for CO 2 electroreduction catalyst screening. J Phys Chem Lett. 2015;6(18):3528–3533. doi: 10.1021/acs.jpclett.5b01660
  • Fung V, Ganesh P, Sumpter BG. Physically informed machine learning prediction of electronic density of states. Chem Mater. 2022;34(11):4848–4855. doi: 10.1021/acs.chemmater.1c04252
  • Jäger MOJ, Morooka EV, Federici Canova F, et al. Machine learning hydrogen adsorption on nanoclusters through structural descriptors. Npj Comput Mater. 2018;4:37. https://www.nature.com/articles/s41524-018-0096-5
  • Mao X, Wang L, Xu Y, et al. Computational high-throughput screening of alloy nanoclusters for electrocatalytic hydrogen evolution. Npj Comput Mater. 2021;7:1–9. doi: 10.1038/s41524-021-00514-8
  • Slavkovic R, Jugovic Z, Dragicevic S, et al. An application of learning machine methods in prediction of wear rate of wear resistant casting parts. Comput Ind Eng. 2013;64(3):850–857. doi: 10.1016/j.cie.2012.12.021
  • Gong X, Dong C, Xu J, et al. Machine learning assistance for electrochemical curve simulation of corrosion and its application. Mater Corros. 2020;71(3):474–484. doi: 10.1002/maco.201911224
  • Zhang H, Fu H, Zhu S, et al. Machine learning assisted composition effective design for precipitation strengthened copper alloys. Acta Mater. 2021;215:117118. doi: 10.1016/j.actamat.2021.117118
  • Hajinazar S, Sandoval ED, Cullo AJ, et al. Multitribe evolutionary search for stable Cu-Pd-Ag nanoparticles using neural network models. Phys Chem Chem Phys. 2019;21:8729–8742. doi: 10.1039/C9CP00837C
  • Mangal A, Holm EA. Applied machine learning to predict stress hotspots I: face centered cubic materials. Int J Plast. 2018;111:122–134. doi: 10.1016/j.ijplas.2018.07.013
  • Kusne AG, Gao T, Mehta A, et al. On-the-fly machine-learning for high-throughput experiments: search for rare-earth-free permanent magnets. Sci Rep. 2014;4(1):6367. doi: 10.1038/srep06367
  • Nelson J, Sanvito S. Predicting the curie temperature of ferromagnets using machine learning. Phys Rev Mater. 2019;3(10):104405. doi: 10.1103/PhysRevMaterials.3.104405
  • Legrain F, Carrete J, Van Roekeghem A, et al. Materials screening for the discovery of new half-heuslers: machine learning versus ab initio methods. J Phys Chem B. 2018;122(2):625–632. doi: 10.1021/acs.jpcb.7b05296
  • Carrete J, Li W, Mingo N, et al. Finding unprecedentedly low-thermal-conductivity half-heusler semiconductors via high-throughput materials modeling. Phys Rev X. 2014;4(1):011019. doi: 10.1103/PhysRevX.4.011019
  • Carrete J, Mingo N, Wang S, et al. Nanograined half-heusler semiconductors as advanced thermoelectrics: an Ab Initio high-throughput statistical study. Adv Funct Mater. 2014;24(47):7427–7432. doi: 10.1002/adfm.201401201
  • de Jong M, Chen W, Notestine R, et al. A statistical learning framework for materials science: application to elastic moduli of k-nary inorganic polycrystalline compounds. Sci Rep. 2016;6(1):34256. doi: 10.1038/srep34256
  • Legrain F, van Roekeghem A, Curtarolo S, et al. Vibrational properties of metastable polymorph structures by machine learning. J Chem Inf Model. 2018;58(12):2460–2466. doi: 10.1021/acs.jcim.8b00279
  • Artrith N, Urban A, Ceder G, Constructing first-principles phase diagrams of amorphous LixSi using machine-learning-assisted sampling with an evolutionary algorithm. J Chem Phys. 2018;148. doi: 10.1063/1.5017661
  • Chen H, Shang Z, Lu W, et al. A Property‐Driven stepwise design strategy for multiple low‐melting alloys via machine learning. Adv Eng Mater. 2021;23(12). doi: 10.1002/adem.202100612
  • Ward L, Agrawal A, Choudhary A, et al. A general-purpose machine learning framework for predicting properties of inorganic materials. Npj Comput Mater. 2016;2(1):16028. doi: 10.1038/npjcompumats.2016.28
  • Jha R, Chakraborti N, Diercks DR, et al. Combined machine learning and CALPHAD approach for discovering processing-structure relationships in soft magnetic alloys. Comput Mater Sci. 2018;150:202–211. doi: 10.1016/j.commatsci.2018.04.008
  • Tancret F. Computational thermodynamics, Gaussian processes and genetic algorithms: combined tools to design new alloys. Model Simul Mater Sci Eng. 2013;21(4):21. doi: 10.1088/0965-0393/21/4/045013
  • Menou E, Ramstein G, Bertrand E, et al. Multi-objective constrained design of nickel-base superalloys using data mining- and thermodynamics-driven genetic algorithms. Model Simul Mater Sci Eng. 2016;24(5):055001. doi: 10.1088/0965-0393/24/5/055001
  • Conduit BD, Jones NG, Stone HJ, et al. Design of a nickel-base superalloy using a neural network. Mater Des. 2017;131:358–365. doi: 10.1016/j.matdes.2017.06.007
  • Jiang X, Yin HQ, Zhang C, et al. An materials informatics approach to Ni-based single crystal superalloys lattice misfit prediction. Comput Mater Sci. 2018;143:295–300. doi: 10.1016/j.commatsci.2017.09.061
  • Chandran M, Lee SC, Shim J-H. Machine learning assisted first-principles calculation of multicomponent solid solutions: estimation of interface energy in Ni-based superalloys. Model Simul Mater Sci Eng. 2018;26(2):025010. doi: 10.1088/1361-651X/aa9f37
  • Schooling J, Brown M, Reed PA. An example of the use of neural computing techniques in materials science—the modelling of fatigue thresholds in Ni-base superalloys. Mater Sci Eng A. 1999;260(1–2):222–239. doi: 10.1016/S0921-5093(98)00957-5
  • Taylor PL, Conduit G. Machine learning predictions of superalloy microstructure. Comput Mater Sci. 2022;201:201. doi: 10.1016/j.commatsci.2021.110916
  • Wu L, Wei G, Wang G, et al. Creating win-wins from strength–ductility trade-off in multi-principal element alloys by machine learning. Mater Today Commun. 2022;32:104010. doi: 10.1016/j.mtcomm.2022.104010
  • Islam N, Huang W, Zhuang HL. Machine learning for phase selection in multi-principal element alloys. Comput Mater Sci. 2018;150:230–235. doi: 10.1016/j.commatsci.2018.04.003
  • Xue D, Balachandran PV, Hogden J, et al. Accelerated search for materials with targeted properties by adaptive design. Nat Commun. 2016;7:1–9. doi: 10.1038/ncomms11241
  • Vasudevan RK, Choudhary K, Mehta A, et al. Materials science in the artificial intelligence age: high-throughput library generation, machine learning, and a pathway from correlations to the underpinning physics. MRS Commun. 2019;9(3):821–838. doi: 10.1557/mrc.2019.95
  • Hu X, Wang J, Wang Y, et al. Two-way design of alloys for advanced ultra supercritical plants based on machine learning. Comput Mater Sci. 2018;155:331–339. doi: 10.1016/j.commatsci.2018.09.003
  • Viswanadhapalli B, BR VK, Nagaraju KC. Experimental study and machine learning model to predict formability of magnesium alloy sheet. F1000Res. 2022;11:1118. doi: 10.12688/f1000research.124085.1
  • Géron A. Hands-on machine learning with Scikit-Learn, Keras, and Tensorflow. 2nd ed. O’Reilly Media. Advanced theory and simulations; 2022.
  • Vecchio KS, Dippo OF, Kaufmann KR, et al. High-throughput rapid experimental alloy development (HT-READ). Acta Mater. 2021;221:117352. doi: 10.1016/j.actamat.2021.117352
  • Dehghannasiri R, Xue D, Balachandran PV, et al. Optimal experimental design for materials discovery. Comput Mater Sci. 2017;129:311–322. doi: 10.1016/j.commatsci.2016.11.041
  • Gopakumar AM, Balachandran PV, Xue D, et al. Multi-objective optimization for materials discovery via adaptive design. Sci Rep. 2018;8:1–12. doi: 10.1038/s41598-018-21936-3
  • Gao J, Zhong J, Liu G, et al. A machine learning accelerated distributed task management system (malac-distmas) and its application in high-throughput CALPHAD computation aiming at efficient alloy design. Adv Powder Mater. 2022;1:100005. doi: 10.1016/j.apmate.2021.09.005
  • Takahashi K, Tanaka Y. Material synthesis and design from first principle calculations and machine learning. Comput Mater Sci. 2016;112:364–367. doi: 10.1016/j.commatsci.2015.11.013
  • Toyao T, Suzuki K, Kikuchi S, et al. Toward effective utilization of methane: machine learning prediction of adsorption energies on metal alloys. J Phys Chem C. 2018;122(15):8315–8326. doi: 10.1021/acs.jpcc.7b12670
  • Fernandez-Zelaia P, Roshan Joseph V, Kalidindi SR, et al. Estimating mechanical properties from spherical indentation using Bayesian approaches. Mater Des. 2018;147:92–105. doi: 10.1016/j.matdes.2018.03.037
  • Thankachan T, Prakash KS, David Pleass C, et al. Artificial neural network to predict the degraded mechanical properties of metallic materials due to the presence of hydrogen. Int J Hydrogen Energy. 2017;42(47):28612–28621. doi: 10.1016/j.ijhydene.2017.09.149
  • Mangal A, Holm EA. A comparative study of feature selection methods for stress hotspot classification in materials. Integr Mater Manuf Innov. 2018;7(3):87–95. doi: 10.1007/s40192-018-0109-8
  • Bustillo A, Reis R, Machado AR, et al. Improving the accuracy of machine-learning models with data from machine test repetitions. J Intell Manuf. 2022;33(1):203–221. doi: 10.1007/s10845-020-01661-3
  • Huang X, Jin C, Zhang C, et al. Machine learning assisted modelling and design of solid solution hardened high entropy alloys. Mater Des. 2021;211:110177. doi: 10.1016/j.matdes.2021.110177
  • Huang DJ, Li H. A machine learning guided investigation of quality repeatability in metal laser powder bed fusion additive manufacturing. Mater Des. 2021;203:109606. doi: 10.1016/j.matdes.2021.109606
  • Godaliyadda GMD, Ye DH, Uchic MD, et al. A supervised learning approach for dynamic sampling. Electron Imaging. 2016;28:1–8. https://library.imaging.org/ei/articles/28/19/art00020