214
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Simultaneous stochastic optimisation of mining complexes: Integrating progressive reclamation and waste management with contextual bandits

ORCID Icon & ORCID Icon
Received 15 Jun 2023, Accepted 08 Apr 2024, Published online: 10 May 2024

ABSTRACT

Managing environmental performance of waste dump facilities in mining complexes is an integral part of long-term production planning. Sustainable long-term production scheduling solutions are desired to mitigate risk and return the environment to a productive post-mining state. A simultaneous stochastic optimisation framework for long-term production scheduling in mining complexes is developed that integrates waste management and progressive reclamation. The waste dump placement schedule is jointly optimised with the extraction sequence, destination policy, and stockpiling decisions in a single stochastic mathematical programming framework. This includes the timing of progressive reclamation activities in parallel with production to enhance waste dump rehabilitation. Uncertainty related to the production of acid rock drainage is quantified by simulating geochemical properties of waste and managing the blending of uncertain waste properties within the optimisation framework. With respect to the framework for simultaneous stochastic optimisation, contextual bandits are explored to improve the metaheuristic solution approach and solve the corresponding large-scale optimisation model. The framework is tested in a multi-mine copper-gold mining complex leading to improved environmental performance. Risk of acid rock drainage is decreased by 52.5% in the waste dump facilities. Reclamation planning activities for meeting environmental requirements are scheduled prior to closure. The solution approach more effectively improves the objective function with contextual bandits leading to a 24% improvement in the study presented.

1. Introduction

Waste management and reclamation are essential operational aspects to be considered when planning and optimising the long-term production schedule of a mining complex [Citation1]. An integrative planning approach is desired to satisfy the diverse set of environmental constraints associated with operating a mining complex, while providing an economically viable long-term production schedule. For successful reclamation and rehabilitation, a progressive approach is taken to accelerate the reclamation of mining waste dump facilities, eliminate liabilities and improve environmental performance [Citation2,Citation3]. Furthermore, strategic waste management practices that mitigate acid rock drainage (ARD) and prevent metal leaching are considered to meet water-quality standards and reduce remediation costs [Citation4]. Environmental management is vital for mitigating long-term complications that develop during and after production ceases [Citation5]. Directly considering these components when optimising long-term production schedules reduces environmental risk and is expected to enhance the financial and environmental performance of a mining complex over its lifetime.

Depending on the processes required to recover valuable products that are sold to customers and the market, mining complexes may contain several mines, stockpiles, preconcentration, processing, and waste dump facilities [Citation6]. Research related to the simultaneous stochastic optimisation of mining complexes aims to globally optimise all critical operational aspects in a single stochastic mathematical programming formulation [Citation7–11]. This includes optimising the extraction sequence, destination policy, capital investments, stockpiling, ore blending, processing, transportation and operating alternative decisions, while managing uncertainty related to quantity and quantity of the material in the ground [Citation7–10,Citation12–18]. Decisions are optimised to maximise net present value, by enhancing the value of the products sold, and minimising technical risk related to the uncertain material supply. However, the management, treatment and reclamation of mining waste dump facilities has received limited attention to date within these frameworks.

Waste management and rehabilitation scheduling decisions are typically considered following the optimisation of the long-term production schedule [Citation19–22], thus, advantageous synergies between the extraction sequence, destination policy and waste production schedule may not be found. Fu et al. [Citation23] formulate a mixed integer programming model that jointly optimises an open pit mine production schedule and waste dump placement schedule. Additionally, the encapsulation of potentially acid generating (PAG) waste is included to mitigate ARD, an important waste management consideration. There are several limitations to this integrative approach that are addressed in this work. First, the method ignores material uncertainty and variability of the material properties and uses a deterministic (average type) model of the material attributes. Second, PAG and non-acid generating (NAG) waste materials are classified based on the geochemical properties within a selective mining unit (a mining block), which ignores the potential to blend waste materials of different qualities to mitigate ARD. Lastly, the progressive reclamation of waste dump facilities is not considered within the optimisation formulation. Lin et al. [Citation24] develop a model for optimising waste dump schedules simultaneously with production. Levinson and Dimitrakopoulos [Citation25] optimise the long-term production schedule with waste management considerations in an open-pit gold mining complex under uncertainty to manage the production of PAG waste material. However, this approach does not provide an executable waste dump production schedule for material placement. Ben-Awuah and Askari-Nasab [Citation26] consider scheduling mining cuts and waste dumps jointly; however, this method ignores material uncertainty, aggregates mining blocks to reduce the number of decisions variables and does not produce a waste dump production schedule.

ARD and associated metal leaching have large impacts on water resources and are a serious concern for mining complexes producing wastes with sulphide bearing minerals [Citation27]. Following exposure to oxidising conditions, sulphide waste materials can produce sulphuric acid. Sulphuric acid can be neutralised by contacting certain minerals in the waste material such as carbonates. A recent field evaluation that monitors the geochemical blending of waste materials demonstrates an opportunity for permanent prevention of ARD under certain conditions [Citation28]. More common practices for mitigating ARD aim to eliminate or reduce the rate of oxidation with water covers, treatment methods that add chemicals to acidic water, and various other techniques [Citation29,Citation30]. These conventional approaches can cause long-term risk. Treatment-based approaches extend long-term recurring costs that effect the economics of a mining complex. Water covers and constructed impoundments are high risk due to the potential of failures and the extended duration which the materials must remain covered [Citation4,Citation31–33]. Instead, geochemical blending techniques mix waste with different geochemical characteristics to overcome these limitations by preventing the formation of ARD at the source – the waste dump facilities – without the need for water covers. Blending effectively manages the net producing ratio (NPR) or the ratio of neutralisation potential (NP) to acid potential (AP) of the mixed waste product at the final destination [Citation34,Citation35].

Integrating geochemical blending of waste materials in the optimisation of mining complexes increases the complexity of placing waste material. Geochemical properties of waste are required to blend materials and manage the risk of ARD. Therefore, geostatistical simulation techniques are applied to quantify the quality and quantity of the relevant attributes within the mineral deposits in a mining complex [Citation36,Citation37]. After extraction, the geochemical attributes are tracked across the mining complex to blend waste materials of different qualities and generate a waste product that performs as if it is non-acid generating. A waste dump production schedule or sequence of material placement is determined by optimising the mining complex to generate safe products for placement in waste dump facilities, which makes these decisions manageable in large-scale industry applications. Simultaneously optimising the long-term production schedule with the waste dump placement schedule is critical for these reasons. This work develops a long-term stochastic optimisation formulation that manages waste material blending to prevent ARD. Waste materials are mixed within waste dump cells to create a safe waste product by managing the NPR ratio. Considering the blending of waste materials to mitigate ARD can directly change the response of other production scheduling decisions as the schedule adapts to find waste with the correct geochemical properties, while still aiming to maximise net present value.

Other critical environmental management considerations related to planning and scheduling of an operating mining complex are reclamation and closure requirements outlined by the local government and stakeholders of a region [Citation38]. Waste dump facilities cover large areas of a mining complex and must be returned to a natural and productive state to minimise negative environmental effects. This includes re-establishing tree cover, native species and ensuring landforms are returned to suitable environmental conditions for the region [Citation39–41]. Progressive reclamation provides opportunities to monitor the results of reclamation during production and reduces environmental risk as the requirements for closure are meet [Citation38]. Additionally, progressive reclamation allows for the direct placement of topsoil/overburden material in reclamation areas. Direct placement improves soil development at waste dump facilities leading to more productive reclamation of native species [Citation42–44]. Integrating these critical planning components into the optimisation of the long-term production schedule provides opportunities to eliminate environmental risk by reclaiming segments of the waste dump facilities during production. This is crucial to consider in the simultaneous stochastic optimisation of mining complexes.

This manuscript presents an innovative stochastic mathematical programming formulation and optimisation framework that directly incorporates waste management and progressive reclamation considerations. Unlike previous stochastic programming formulations, the model integrates waste management through the blending and scheduling of waste materials to mitigate ARD. In addition, the production schedule for placing topsoil/overburden material is included to accelerate progressive reclamation at the waste dump facilities to minimise long-term liabilities and satisfy regulatory requirements. Directly considering these aspects within the simultaneous stochastic optimisation framework improves the overarching long-term plan but also increases the size of the stochastic mathematical program.

Intelligent solvers are required to optimise large-scale optimisation models. Contextual bandits are explored in this work for selecting heuristics within a metaheuristic solution approach to optimise the proposed stochastic programming formulation. Models that consider the flow of materials after extraction lead to difficult to solve non-linear optimisation models. This often arises due to the non-linearities in the objectives and constraints caused by blending and stockpiling materials. Bley et al. [Citation45] explain several major challenges caused by non-linear formulations in relation to mine production scheduling with stockpiles. Zhang and Dimitrakopoulos [Citation46] address non-linearities in recovery processes and forward contracts. Metaheuristics are advantageous for optimising large-scale stochastic programming models in reasonable timeframes while handling non-linear constraints and have been widely applied in long-term production scheduling [Citation7–10,Citation12,Citation18,Citation47–54]. Goodfellow and Dimitrakopoulos [Citation9] adapt the acceptance criterion with a cumulative distributive function for each decision neighbourhood to improve the simulation annealing optimisation framework. Instead of modifying the acceptance criterion, a different optimisation strategy is considered in this work that utilises learnings from a contextual bandit algorithm [Citation55]. The algorithm selects an action that it expects will lead to larger improvements in the objective function of a stochastic mathematical program, while exploring less visited actions with an ε-greedy policy. During optimisation, the learner or agent adapts the actions selected given the context of the mining complex environment. Therefore, as the agent learns, the actions selected during the optimisation are targeting changes to the production schedule that are expected to provide the largest improvements to the long-term production schedule.

In the next sections, the stochastic programming formulation is introduced followed by the optimisation framework that utilises contextual bandits. Subsequently, the proposed formulation and solution approach are tested with a case study in a large-scale mining complex. Lastly, conclusions and future work follow.

2. Simultaneous stochastic optimisation of mining complexes with progressive reclamation and waste management

2.1. Stochastic mathematical programming optimization formulation

A two-stage non-linear stochastic programming model is outlined in this section [Citation56]. The proposed formulation provides a simultaneous stochastic optimisation model for jointly optimising the extraction sequence, destination policy, stockpiling, waste dump production schedule, waste blending and progressive reclamation. The waste dump production schedule considers the placement of rock and overburden/topsoil material, while blending waste of different qualities to mitigate the risk of ARD. The objective function of the stochastic mathematical program aims to maximise net present value and minimise deviations from production targets. provides examples of the waste dump considerations in the proposed model. A set of equiprobable stochastic realisations of the key material attributes and the geochemical characteristics of interest are used to account for uncertainty and local variability of ore and waste materials [Citation37]. The stochastic mathematical programming formulation is described next.

Figure 1. Blending strategy to manage NPR at waste dump facilities and precedence requirements in a simplified 2-D example of a waste dump model.

Figure 1. Blending strategy to manage NPR at waste dump facilities and precedence requirements in a simplified 2-D example of a waste dump model.

2.1.1. Notations and definitions

This section defines the notations and their definitions for the stochastic mathematical programming model. and include the sets and parameters. includes the decision variables.

Table 1. Sets.

Table 2. Parameters.

Table 3. Decision variables.

2.1.2. Objective function

The objective function is presented in Equation (1). Part I and II of the objective function maximise the revenues obtained from processing valuable attributes to sell to the market and minimise the costs to produce them. Part III of the objective function minimises the reclamation costs per a unit area of reclaimed land at the waste facilities. This can include the placement, re-sloping, water management and planting costs for reclamation. Part IV minimises deviations from production targets related to the capacities of each location in the mining complex. Part V and VI minimise deviations from the blend of materials within the waste dump cell to prevent the production of ARD by managing the NPR ratio and metal leaching requirements. Part VII encourages progressive reclamation to place materials in locations that are ready for reclamation. Maximising the NPV by including the discounted revenues and costs associated with mining, processing and stockpiling material within the objective function has been considered in previous stochastic programming formulations while managing risk related to capacities [Citation7,Citation9]. However, the contribution of this optimisation model is that reclamation costs are directly considered in the objective function and the balancing of NPR and metal leaching material attributes are included to prevent ARD. Lastly, progressive reclamation is encouraged to mitigate liabilities and improve environmental performance of the mining complex.

(1) max1SsStTiPaApa,i,tmassa,i,t,sPartIsStTiMDci,tmassi,t,sPartIIsStTwWcCwOBgGOB(rcw,t/ψc)sfg,t,ssvg,t,smˉg,w,c,t,sPartIIIsStTiMDpci,tδi,t,s+pci,t+δi,t,s+PartIVsStTwWcCwpcw,tNPRδw,c,t,sNPRPartVsStTwWcCwRaApca,w,tδa,w,c,t,s+pca,w,t+δa,w,c,t,s+PartVIsStTpctRecδt,sRecPartVII(1)

The objective function is maximised subject to the following constraints.

2.1.3. Constraints

Block precedence and reserve constraints ensure that geotechnical slope constraints are enforced and that a single mining block can only be mined once:

(2) xb,tk=1txb ,k∀bBm,b Ob,tT.(2)
(3) tTxb,t1∀bBm,mM(3)

Mass and attribute tracking constraints calculate material quantities and qualities throughout the mining complex.

Constraints (4)–(5) compute the mass and attribute quantity extracted from the mines in each period.

(4) λg,t,s=mMbBmθb,g,swb,sxb,t∀gG,tT,sS.(4)
(5) γa,g,t,s=mMbBmθb,g,swb,sga,b,sxb,t∀aA,gG,tT,sS.(5)

Constraints (6) track the total amount of material mined at each mine in the mining complex:

(6) massm,t,s=bBmwb,sxb,t∀mM,tT,sS.(6)

Constraints (7)–(8) track the mass and the recovered quantity of each attribute at the various destinations in mining complex. Constraint (9) ensures that only a fraction of materials within a stockpile can be transported to a subsequent downstream destination in a single period and constraint (10) ensures that for all non-stockpiling destinations iDS that all materials are transferred to the subsequent location j. The constraints now follow:

(7) massj,t+1,s=gGλg,t,szg,j,t+1+iJjSmassi,t,syi,j,t+1+massi,t,s1kOjyi,k,t∀jD,tT,sS.(7)
(8) massa,j,t+1,s=gGra,j,t+1γa,g,t,szg,j,t+1+iIjSra,j,t+1massa,i,t,syi,j,t+1+massa,j,t,s1kOjyi,k,t∀aA,jD,tT,sS.(8)
(9) iOiyi,j,t1∀iS,tT.(9)
(10) iOiyi,j,t=1∀iDS,tT.(10)

Constraints (11)–(13) calculate the swell factor, specific volume and volume of material sent from each group to the waste dump facility. The swell factor and specific volume of a group are calculated based on the input simulated block parameters and the extraction sequence selected such that:

(11) sfg,t,sλg,t,s=mMbBmsfb,sθb,g,swb,sxb,t∀gG,tT,sS.(11)
(12) svg,t,sλg,t,s=mMbBmsvb,sθb,g,swb,sxb,t∀gG,tT,sS.(12)
(13) vˉg,w,t,s=sfg,t,ssvg,t,szg,w,tmMbBmθb,g,swb,sxb,t∀gG,wW,tT,sS.(13)

Based on the material sent to the waste dump facilities the materials are distributed to different cells to blend materials of different qualities, as shown in . Constraints (14)–(15) ensure that the volume of material extracted and sent to the waste dump facility are distributed to the waste dump cells that accept materials of specific types (rock and overburden):

(14) vˉg,w,t,ssfg,t,ssvg,t,scCwRmg,w,c,t,s=0∀gGR,wW,tT,sS.(14)
(15) vˉg,w,t,ssfg,t,ssvg,t,scCwOBmg,w,c,t,s=0∀gGOB,wW,tT,sS.(15)

Constraints (16)–(17) ensure that the volume of material placed in each cell does not violate the available volume of material within each waste dump facility cell:

(16) gGOBsfg,t,ssvg,t,smg,w,c,t,sVw,cαw,c,t,s∀wW,cCwOB,tT,sS.(16)
(17) gGRsfg,t,ssvg,t,smg,w,c,t,sVw,cαw,c,t,s∀wW,cCwR,tT,sS.(17)

Constraints (18)–(19) update the fill level of a waste dump cell:

(18) gGOBsfg,t,ssvg,t,smg,w,c,t,s=Vw,cκw,c,t,s∀wW,cCwOB,tT,sS.(18)
(19) gGRsfg,t,ssvg,t,smˉg,w,c,t,s=Vw,cκw,c,t,s∀wW,cCwR,tT,sS.(19)

Constraints (20) enforce precedence of the waste dump facility cells directly. The precedence constraints consider vertical and horizontal precedence relationships directly (a simplified example is shown in ):

(20) αw,c,t,sκw,c ,t,s∀wW,cCw,c PDw,c,tT,sS.(20)

Constraints (21)–(23) compute the grade, quality or concentration of an attribute within a group which is used to determine the blended concentration of an attribute in the waste dump facility cells for overburden and rock material placement, respectively:

(21) ga,g,t,s=γa,g,t,s/λg,t,s.(21)
(22) massa,w,c,t,s=gGOBga,g,t,smg,w,c,t,s∀aA,wW,cCwOB,tT,sS.(22)
(23) massa,w,c,t,s=gGRga,g,t,smg,w,c,t,s∀aA,wW,cCwR,tT,sS.(23)

Constraints (24) manage grade blending to meet the NPR threshold target within each waste dump cell with a linear function f of the indicating geochemical attributes NP and AP. By enforcing, the following constraints it is possible to mitigate ARD in perpetuity by ensuring the ongoing prevention with a blended waste product that is non-acid generating:

(24) massNP,w,c,t,sδw,c,t,sNPRLTargetNPRfmassAP,w,c,t,s∀wW,cCw,tT,sS(24)

Constraints (25)–(26) manage the risk of the concentration of geochemical attributes related to metal leaching exceeding the upper and lower concentration limits at the cells in a waste dump facility:

(25) massa,w,c,t,sδa,w,c,t,s+UTargetagGRτ=1tmˉg,w,c,t,s∀aA,wW,cCwR,tT,sS.(25)
(26) massa,w,c,t,s+δa,w,c,t,sLTargetagGRτ=1tmg,w,c,t,s∀aA,wW,cCwR,tT,sS.(26)

Constraints (27) aim manage the risk of meeting progressive reclamation targets for completed areas in each waste dump facilities:

(27) ccwOBκw,c,t,s+δt,sRecRt∀wW,tT,sS.(27)

Constraints (28)–(29) manage the risk of deviating from the capacity constraints at each location in the mining complex:

(28) massi,t,sδi,t,s+Ui,t∀iM\scriptD\nolimitsD,tT,sS.(28)
(29) massi,t,s+δi,t,sLi,t∀iM\scriptD\nolimitsD,tT,sS.(29)

Constraints (30) only allow a group of materials to be sent to a single destination in each production period:

(30) jDκg,j,t=1∀gG,tTSTTLT.(30)

Lastly, non-negativity and binary constraints (31)–(32) are enforced, indices omitted for clarity.

(31) m¯,y,γ,δ,κ,λ0 (31)
(32) x,z,α0,1(32)

The following section will outline the optimisation approach for the proposed stochastic programming formulation. The optimisation approach manages non-linearities in the objective function and constraints by decoupling the optimisation of the extraction sequence and downstream decision variables, which is explained subsequently.

2.1.4. Simulated annealing, contextual bandits and stochastic programming

The two-stage stochastic programming formulation is solved with a hybrid framework that utilises a metaheuristic with embedded contextual bandits. The metaheuristic applied is a simulated annealing-based approach that allows the model to handle non-linear objective function and constraints since the flow of materials are dependent on the product of several key decision variables. Simulated annealing explores the neighbouring solutions by modifying the extraction sequence decision variables xb,t [Citation9,Citation57]. Similar to past work, the extraction sequence decisions are modified within the simulated annealing framework with a set of perturbations that ensure feasibility of the resulting extraction sequence [Citation9,Citation12,Citation58]. Then, the remaining downstream decision variables are optimised with the branch-and-cut algorithm implemented in CPLEX v20.1 given the temporarily fixed extraction sequence decisions to obtain the objective function value for a given iteration of the simulated annealing algorithm. This ensures that each extraction sequence evaluated within the simulated annealing algorithm has optimal downstream decisions. The algorithm occasionally accepts solutions that decrease the objective function to escape local optima based on an acceptance criterion. The probability of accepting a deteriorating solution is adapted throughout the optimisation by using a cooling schedule and annealing temperature parameter. The implementation of simulated annealing in the optimisation of mining complexes has been applied in several previous works [Citation7,Citation9].

In this work, contextual bandits are integrated into the simulated annealing algorithm as a learning framework that aims to determine which perturbations (production schedule modifications) provide the largest improvements in the resulting objective function in given some context of the modification. The actions and contexts that are used in this work are denoted by the sets K and C, respectively. Actions or perturbations that lead to larger improvements in the resulting production schedule of a mining complex in a given context are visited more frequently. This provides an effective way to improve the objective function by utilising past learning to select more impactful perturbations during the optimisation. Given the improvement and context of the optimisation the resulting estimates of each action are updated for subsequent action selection. These estimates are referred to as estimates of the action-value function Qc,k which provides the expected value of taking an action kK in a given context cC. Within the simulated annealing framework, each action is selected with an ε-greedy policy which allows for exploration of different actions (ε0,1 denotes the degree of exploration). The contextual bandit approach for selecting modifications to the production schedule is outlined in Algorithm 1 [Citation55].

Given that the optimisation of mining complexes is a difficult learning task, the agent or learner must learn to track the best actions to take over time. This is because certain actions that perform better early in the optimisation process (modifications to a larger number of decision variables) may not be the top performing modifications once the solution has been improved. For instance, smaller fine-tuning actions may become more beneficial at this time. Therefore, the goal is to learn a policy that maps a context to an action which is expected to improve the objective function by the greatest amount. The learned policy is constantly adapting to new information as the optimisation algorithm proceeds. The algorithm proposed works as follows. First, the state-value function Qc,k estimates are initialised to an optimistic value O. This encourages early exploration of all actions. Similarly, the visit count of each context-action pair Nc,k is initialised to zero. During the optimisation process, each time a mining block is selected to modify the extraction sequence decision variables, the context is retrieved from the mining complex. Then, an action is taken with ε-greedy policy where the maximum action for the given context is taken with probability 1ε. Otherwise, an action is selected uniformly at random. The action is applied to modify the extraction sequence decisions variables and the objective function value is updated after modifying these decisions. The difference between the current objective and the previous objective is calculated and used to update the action-value function. Lastly, the visit count for that action in this context is updated, which is used as a tie-breaking rule. The proposed formulation and solution approach are tested at a large copper-gold mining complex in the following section.

3. Application in a large copper-gold mining complex

3.1. Overview of the copper-gold mining complex

The stochastic optimisation formulation and solution approach are applied at a large-scale copper-gold mining complex to optimise the long-term production schedule. The mining complex under study has been operating for 10 years and has approximately 5 years of operational life remaining prior to closure where reclamation must be completed. The mining complex is presented in . There are three open-pit mines, a run-of-mine stockpile, a processing facility, and three waste dump facilities that require mitigation of ARD and progressive reclamation. The open-pit mines supply materials to the processor within the mining complex to produce copper and gold products for delivery to customers and the market. Waste materials are separated into two material classes rock and overburden/topsoil. The geochemical characteristics of the materials in the deposit include simulations of the sulphur and total inorganic carbon content (assay samples are retrieved with an element scan with ICP-MS and aqua regia digestion and coulometry, respectively), which are used to calculate the NPR ratio in waste dump cells. In addition, the overburden/topsoil quantities are simulated to represent the uncertainty and variability in the quantity of material available for reclamation purposes [Citation59].

Figure 2. Mining complex with three open-pit mines, three waste dump facilities, stockpile, process plant that generates two valuable products.

Figure 2. Mining complex with three open-pit mines, three waste dump facilities, stockpile, process plant that generates two valuable products.

The three open-pit mines that supply material within the mining complex have a joint mining capacity of 42 M tonnes per year and the process plant has a maximum capacity of 13 M tonnes with a recovery of 90% and 80.4% for copper and gold, respectively. Mining blocks are not aggregated and a set of 1.84 million blocks are considered for extraction with dimensions 10 m × 10 m × 16 m. Fifteen stochastically simulated orebody models with the pertinent attributes and geochemical properties are used as input to the optimisation formulation to directly manage risk [Citation12,Citation60]. The stochastic simulations of the orebody are simulated directly at block support and consider the correlations between attributes using an efficient method for quantifying the uncertainty and local variability of the material attributes in a large deposit [Citation37]. The slope angle considered for mining block precedence is 47 degrees based on geotechnical studies completed by the operators of the mining complex.

The mining complex production schedule and waste dump placement schedule are optimised for the remaining five production years in this mining complex to minimise financial losses incurred to satisfy closure requirements. A waste dump model of the areas available for material placement in the waste dump facilities is constructed, similar to a block model, where the cell locations represent the volumes available for dumping materials in different areas. The ultimate goal of the long-term production schedule is to maximise net present value, while mitigating ARD in the waste dump facilities to prevent additional long-term monitoring costs. Additionally, a progressive reclamation strategy is implemented to start rehabilitating completed waste dump areas and return them to their original environmental state during production. This helps reduce liabilities and satisfy regulatory requirements. In addition, the progressive reclamation approach provides opportunities to improve reclamation practices if certain sections are unsuccessful. To begin reclamation in the waste dump facilities, underlying waste dump cells that are below reclamation cells must be completed to ensure the full utilisation of areas available for waste disposal. A progressive reclamation goal of 50 hectares (ha) is targeted during each production year to reduce liabilities and satisfy regulatory requirements prior to closure.

Waste dump cells are 100 × 100 × 15 m3. An assumption made here is that the waste dump cell size allows for the model to adequately blend waste materials of different quality to prevent ARD. Considering the development of waste dumps forms in thin slices of rock, as trucks dump off the waste dump facility lifts, it is expected that adequate blending would occur with cell sizes of these dimensions. A NPR ratio, which measures the neutralisation potential of mining wastes, is measured in each waste dump cell by accounting for the blended geochemical properties [Citation34]. NPR ratios greater than 2 are targeted in the waste dump cells to prevent the production of ARD. Geotechnical and operational constraints are considered when building the precedence relationships between waste-dump cells, which includes satisfying a 37.5-degree angle of repose and ensuring access is present from available roads. This is completed in a preprocessing step to define the precedence relationship in EquationEquation (20).

3.2. Simultaneous stochastic optimization with waste management and progressive reclamation

In the remainder of this section, the results from applying the proposed simultaneous stochastic optimisation framework for long-term production are discussed and compared to a base case production schedule that does not consider integrated waste management and reclamation. The p-10, p-50, and p-90 represent a 10%, 50% and 90% chance of obtaining quantities below the values in the risk profiles that are presented throughout this section. The lower (LB) and upper (UB) bounds are drawn in the following figures when a constraint is applied. To begin with, the impacts of integrating waste management and progressive reclamation will be discussed to highlight the additional production scheduling considerations addressed in this case study. Then, the remaining production schedule decisions are discussed.

In and , the risk profiles for the NPR ratio at 51 randomly selected waste dump cells in the base case and integrated production schedule are shown. In the base case production schedule, there is significant risk of producing ARD as many waste dump cells have a blended NPR ratio that is less than the lower limit of 2. For example, the p-50 in waste dump cells 5–7 show that there is a large risk of producing acid in those areas of the waste dump due to the geochemical properties of waste placed. The integrated production schedule that considers waste management directly in the formulation is able to resolve these issues by blending materials to mitigate ARD. Although the method significantly reduces the risk of ARD and the frequency of occurrence, there are still deviations from the target NPR ratio, see cell 13 and 22 in . Alternative methods such as sourcing additional materials or placing limestone for neutralising materials may also need to be considered to completely prevent ARD in the waste dump facilities. The other cells in the waste dump facilities show similar improvements when considering the integrated approach and reduce the number of cells with unsafe NPR ratios by 52.5%. This significantly reduces risk of ARD in the integrated production schedule with the proposed simultaneous stochastic optimisation framework. The waste placement schedule is impacted by blending materials with different geochemical properties, which leads to distinct changes in the waste dump placement schedules in the base case versus the integrated production schedule shown in .

Figure 3. Risk profiles for cell NPR ratios in the waste dump cells in the base-case production schedule.

Figure 3. Risk profiles for cell NPR ratios in the waste dump cells in the base-case production schedule.

Figure 4. Risk profiles for cell NPR ratios in the waste dump cells in the integrated production schedule.

Figure 4. Risk profiles for cell NPR ratios in the waste dump cells in the integrated production schedule.

Figure 5. Base case (left) and integrated (right) waste dump placement schedule for waste dump facility (WD) 1 and 2.

Figure 5. Base case (left) and integrated (right) waste dump placement schedule for waste dump facility (WD) 1 and 2.

Progressive reclamation and rehabilitation of waste dump facilities is desired to accelerate the closure of the mining complex and return these areas to a productive environmental state. Overburden/topsoil material covers were to be directly placed at the waste dump facilities to enhance plant growth and improve rehabilitation. However, the waste dump facility design must be considered, and full utilisation of the available waste dump cells is required before starting the reclamation process to efficiently use the existing waste dump footprints. shows that the progressive reclamation process in the mining complex is unable to meet targets in years 1–2 in both production schedules. Progressive reclamation was ignored in previous long-term optimisation studies leading to many areas of the waste dump being used and a lack of focus on completing areas to begin reclamation. The proposed optimisation model targets areas of the waste dump to place waste material that provide opportunities to begin reclamation and satisfy the target 50 ha per a year. As the waste dump cells are filled to capacity, reclamation can begin in the areas that reach the final design and targets can be more easily meet in both production schedules in years 3–5. The advantage of integrating progressive reclamation into the optimisation of mining complexes demonstrated in this application is that it provides engineering support for advancing the waste dump facilities to fulfil reclamation activities. These costs are accounted for directly in the integrated model and are not considered in the base-case causing changes to the resulting production schedule. In addition, demonstrates that greater then 60% of reclamation activities on waste dumps 1 and 2 can be completed in parallel with production, if scheduled appropriately.

Figure 6. Risk profiles for the integrated (blue) and base-case (green) progressive reclamation progress at the waste dump facilities.

Figure 6. Risk profiles for the integrated (blue) and base-case (green) progressive reclamation progress at the waste dump facilities.

Figure 7. Progressive reclamation progress in the integrated production schedule (left) waste dump 1 and (right) waste dump 2 from a top view.

Figure 7. Progressive reclamation progress in the integrated production schedule (left) waste dump 1 and (right) waste dump 2 from a top view.

As a result of introducing these additional production scheduling constraints into the stochastic programming formulation, there are impacts to the optimised production scheduling decisions. In , the risk profiles for the processing facility throughput, mining production rate, copper head grade and copper production at the processing facility are shown. There are larger changes to optimised mining rate that occur in years 4 and 5. In addition, the processing facility throughput varies in each of the production schedules presented. A higher throughput with lower head grade is found at the processing plant with the integrated production schedule that considers waste management and reclamation. However, with the change in production scheduling decisions, the forecasted cashflows and metal production remain similar between the two production schedules. The cumulative discounted cashflows shown in demonstrate that the mining complex can break-even over the 5 production years while, preventing ARD and preparing for closure with additional waste management and reclamation costs incurred. Similar production cashflows are obtained, however, the integrated schedule improves waste management () and accounts directly for progressive reclamation completion and associated costs ().

Figure 8. Risk profiles for the integrated and base-case for the processing plant annual throughput, mining production rate, copper head grade, and copper production.

Figure 8. Risk profiles for the integrated and base-case for the processing plant annual throughput, mining production rate, copper head grade, and copper production.

Figure 9. Risk profiles for the integrated and base-case discounted and cumulative discounted cashflows.

Figure 9. Risk profiles for the integrated and base-case discounted and cumulative discounted cashflows.

4. Discussion

The proposed optimisation framework simultaneously optimises several major components in a mining complex considering uncertainty. This leads to critical differences in the resulting production schedules and also the mining extraction sequences in each mine. Cross sections for Mines 1–3 are shown in and , which highlight major differences between the extraction sequences produced when comparing the base case to the integrated schedule. The small arrows identify some of the visual differences in the schedule for Mine 1 and Mine 2. These changes in the extraction sequence impact the subsequent components in the mining complex and allow for improved reclamation and waste management by maximising the value of the products sold and accounting for the impact of critical attributes across the entire mining complex.

Figure 10. Comparison of the differences between the base case and integrated extraction sequence for Mine 1, an east-west cross-section.

Figure 10. Comparison of the differences between the base case and integrated extraction sequence for Mine 1, an east-west cross-section.

Figure 11. Comparison of the differences between the base case and integrated extraction sequence for Mine 2 and 3, an east-west cross section.

Figure 11. Comparison of the differences between the base case and integrated extraction sequence for Mine 2 and 3, an east-west cross section.

Optimising long-term production schedules in industry scale mining complexes is a challenging optimisation problem especially when considering several additional components for integrating waste management and progressive reclamation. The mining complex optimised in this work required 2.4 M continuous decision variables and 9.2 M binary decision variables. The optimised solution was reached in 80 hours. Contextual bandits were applied to optimise large-scale mining complexes more effectively, as demonstrated in this work. The results of the optimisation using contextual bandits with an ε-greedy policy was compared to those obtained by employing a random approach for selecting perturbations within the simulated annealing framework. shows that a 24% higher objective function value is reached after 10,000 iterations and that the contextual bandit learnings are shown to improve the heuristic selection approach after only 2,000 iterations of the simulated annealing algorithm. Context related to the areas to be mined allows for improved efficiency in the optimisation approach by understanding which areas are more likely to improve the production schedule by delaying them or advancing them in the extraction sequence.

Figure 12. Contextual bandit performance compared to random perturbation selection.

Figure 12. Contextual bandit performance compared to random perturbation selection.

5. Conclusions and future work

A simultaneous stochastic optimisation framework for optimising large-scale mining complexes is proposed that integrates waste management and progressive reclamation considerations into the stochastic programming model. The framework directly manages supply uncertainty. A simulated annealing metaheuristic framework is applied to solve the optimisation model with contextual bandits, which is tested at an open-pit copper-gold mining complex. The case study compares the simultaneous stochastic optimisation with integrated waste management and progressive reclamation to a base case production schedule that does not consider these additional components in the optimisation. The comparisons show that similar net present value can be achieved while, accounting for waste management and mitigating ARD by blending of materials with different geochemical properties. The integrated approach reduces the number of cells at risk of ARD by 52.5%. Progressive reclamation considerations allow for 60% of reclamation activities to be completed on waste dumps 1 and 2 during production. In addition, a reclamation and waste dump placement schedule are obtained that improve environmental performance. Lastly, contextual bandits are shown to increase the efficiency of the solution approach by selecting heuristics that are expected to improve the long-term production schedule based on context taken directly from the optimisation of the case-study presented. Future work should investigate the integration of production cycle times particularly with regards to the load and haul fleet in open-pit mining complexes to accurately understand the impact of waste management and reclamation on fleet performance. In addition, further developments for improving metaheuristics through learning-based methods could help improve the solution approach.

Acknowledgments

This work is funded by the National Science and Engineering Research Council of Canada (NSERC) CRD Grant 500414-16, NSERC Discovery Grant 239019, the COSMO Stochastic Mine Planning Laboratory and mining industry consortium (AngloGold Ashanti, Anglo American, BHP, De Beers, IAMGOLD, Kinross, Newmont Mining, and Vale), and the Canada Research Chairs Program.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

The work was supported by the Natural Sciences and Engineering Research Council of Canada [CRD Grant 500414-16; Discovery Grant 239019].

References

  • D.M. McHaina, Environmental planning considerations for the decommissioning, closure and reclamation of a mine site, Int. J. Min. Reclam. Environ. 15 (3) (2001), pp. 163–176. doi:10.1076/ijsm.15.3.163.3412.
  • L. Sawatsky, G. McKenna, M.-J. Keys, and D. Long, Towards minimising the long-term liability of reclaimed mine sites, in Reclaimed Land: Erosion Control, Soils and Ecology, M. Haigh and A.A. Balkema, eds., Brookfield, Rotterdam, 2000.
  • N. Bolan, M.B. Kirkham, and Ok Y-S, Spoil to Soil: Mine Site Rehabilitation and Revegetation, Taylor & Francis, Boca Raton, FL, 2017, pp. 1–391.
  • Price, W.A. Challenges posed by metal leaching and acid rock drainage at closed mines, British Columbia Mine Reclamation Symposium, 2003.
  • Z. Asif and Z. Chen, Environmental management in North American mining sector, Environ. Sci. Pollut. Res. 23 (1) (2016), pp. 167–179. doi:10.1007/s11356-015-5651-8.
  • B.S. Pimentel, G.R. Mateus, and F.A. Almeida, Mathematical models for optimizing the global mining supply Chain, in Intelligent Systems in Operations: Methods, Models and Applications in the Supply Chain, B. Nag, ed., IGI Global, 2010, pp. 133–163. doi:10.4018/978-1-61520-605-6.ch008.
  • L. Montiel and R. Dimitrakopoulos, Optimizing mining complexes with multiple processing and transportation alternatives: An uncertainty-based approach, Eur. J. Oper. Res. 247 (1) (2015), pp. 166–178. doi:10.1016/j.ejor.2015.05.002.
  • L. Montiel and R. Dimitrakopoulos, Simultaneous stochastic optimization of production scheduling at twin creeks mining complex, Nevada, Min. Eng. 70 (12) (2018), pp. 48–56. doi:10.19150/me.8645.
  • R. Goodfellow and R. Dimitrakopoulos, Global optimization of open pit mining complexes with uncertainty, Appl. Soft Comput. 40 (3) (2016), pp. 292–304. doi:10.1016/j.asoc.2015.11.038.
  • R. Goodfellow and R. Dimitrakopoulos, Simultaneous stochastic optimization of mining complexes and mineral value chains, Math. Geosci. 49 (3) (2017), pp. 341–360. doi:10.1007/s11004-017-9680-3.
  • R. Dimitrakopoulos and A. Lamghari, Simultaneous stochastic optimization of mining complexes-mineral value chains: An overview of concepts, examples and comparisons, Int. J. Min. Reclam. Environ. 36 (6) (2022), pp. 1–18. doi:10.1080/17480930.2022.2065730.
  • L. Montiel and R. Dimitrakopoulos, A heuristic approach for the stochastic optimization of mine production schedules, J. Heuristics 23 (5) (2017), pp. 397–415. doi:10.1007/s10732-017-9349-6.
  • M.F. Del Castillo and R. Dimitrakopoulos, Dynamically optimizing the strategic plan of mining complexes under supply uncertainty, Resour. Policy 60 (3) (2019), pp. 83–93. doi:10.1016/j.resourpol.2018.11.019.
  • A. Paithankar, S. Chatterjee, R. Goodfellow, and M.W.A. Asad, Simultaneous stochastic optimization of production sequence and dynamic cut-off grades in an open pit mining operation, Resour. Policy 66 (2020), p. 101634. doi:10.1016/j.resourpol.2020.101634.
  • Z. Levinson and R. Dimitrakopoulos, Adaptive simultaneous stochastic optimization of a gold mining complex: A case study, J. South Afr. Inst. Min. Metall. 120 (3) (2020), pp. 221–232. doi:10.17159/2411-9717/829/2020.
  • Z. Saliba and R. Dimitrakopoulos, Simultaneous stochastic optimization of an open pit gold mining complex with supply and market uncertainty, Min. Tech. 128 (4) (2019), pp. 216–229. doi:10.1080/25726668.2019.1626169.
  • Z. Saliba and R. Dimitrakopoulos, An application of simultaneous stochastic optimisation of an open-pit mining complex with tailings management, Int. J. Min. Reclam. Environ. 34 (8) (2020), pp. 592–607. doi:10.1080/17480930.2019.1688954.
  • Z. Brika, M. Gamache, and R. Dimitrakopoulos, Optimising the mine production scheduling accounting for stockpiling and investment decisions under geological uncertainty, Int. J. Min. Reclam. Environ. 37 (5) (2023), pp. 1–20. doi:10.1080/17480930.2023.2199378.
  • Y. Li, E. Topal, and D. Williams, Waste rock dumping optimisation using mixed integer programming (MIP), Int. J. Min. Reclam. Environ. 27 (6) (2013), pp. 425–436. doi:10.1080/17480930.2013.794513.
  • Y. Li, E. Topal, and D.J. Williams, Optimisation of waste rock placement using mixed integer programming, Min. Tech. 123 (4) (2014), pp. 220–229. doi:10.1179/1743286314Y.0000000070.
  • V. Vaziri, A.R. Sayadi, A. Mousavi, A. Parbhakar-Fox, and M. Monjezi, Mathematical modeling for optimized mine waste rock disposal: Establishing more effective acid rock drainage management, J. Cleaner Prod. 288 (2021), p. 125124. doi:10.1016/j.jclepro.2020.125124.
  • V. Vaziri, A.R. Sayadi, A. Parbhakar-Fox, A. Mousavi, and M. Monjezi, Improved mine waste dump planning through integration of geochemical and mineralogical data and mixed integer programming: Reducing acid rock generation from mine waste, J. Environ. Manage. 309 (2022), p. 114712. doi:10.1016/j.jenvman.2022.114712.
  • Z. Fu, M.W.A. Asad, and E. Topal, A new model for open-pit production and waste-dump scheduling, Eng. Optim. 51 (4) (2019), pp. 718–732. doi:10.1080/0305215X.2018.1476501.
  • J. Lin, M.W.A. Asad, E. Topal, P. Chang, J. Huang, and W. Lin, A novel model for sustainable production scheduling of an open-pit mining complex considering waste encapsulation, Resour. Policy 91 (2024), p. 104949. doi:10.1016/j.resourpol.2024.104949.
  • Z. Levinson and R. Dimitrakopoulos, Simultaneous stochastic optimisation of an open-pit gold mining complex with waste management, Int. J. Min. Reclam. Environ. 34 (6) (2020), pp. 415–429. doi:10.1080/17480930.2019.1621441.
  • E. Ben-Awuah and H. Askari-Nasab, Incorporating waste management into oil sands long term production planning, Min. Tech. 122 (1) (2013), pp. 33–45. doi:10.1179/1743286312Y.0000000031.
  • D.K. Nordstrom, Mine waters: Acidic to circumneutral, Elements 7 (6) (2011), pp. 393–398. doi:10.2113/gselements.7.6.393.
  • S.J. Day, Small-scale field evaluation of geochemical blending of waste rock to mitigate acid rock drainage potential, Geochem. Explor. Environ. Anal. 22 (1) (2022), doi:10.1144/geochem2021-066.
  • J. Skousen, A. Rose, G. Geidel, J. Foreman, R. Evans, and W. Hellier, Handbook of Technologies for Avoidance and Remediation of Acid Mine Drainage, National Mine Land Reclamation Center, Morgantown, 1998, p. 131.
  • J. Skousen, C.E. Zipper, A. Rose, P.F. Ziemkiewicz, R. Nairn, L.M. McDonald, R.L. Kleinmann, et al., Review of passive systems for acid mine drainage treatment, Mine Water Environ. 36 (1) (2017), pp. 133–153. doi:10.1007/s10230-016-0417-1.
  • J.O. Grimalt, M. Ferrer, and E. Macpherson, The mine tailing accident in Aznalcollar, Sci. Total Environ. 242 (1–3) (1999), pp. 3–11. doi:10.1016/S0048-9697(99)00372-1.
  • D. Kossoff, W. Dubbin, M. Alfredsson, S. Edwards, M. Macklin, and K.A. Hudson-Edwards, Mine tailings dams: Characteristics, failure, environmental impacts, and remediation, Appl. Geochem. 51 (2014), pp. 229–245. doi:10.1016/j.apgeochem.2014.09.010.
  • F. Thompson, B.C. de Oliveira, M.C. Cordeiro, B.P. Masi, T.P. Rangel, P. Paz, T. Freitas, G. Lopes, B.S. Silva, A.S. Cabral, M. Soares, D. Lacerda, C. dos Santos Vergilio, M. Lopes-Ferreira, C. Lima, C. Thompson, C.E. de Rezende, et al., Severe impacts of the Brumadinho dam failure (Minas Gerais, Brazil) on the water quality of the Paraopeba River, Sci. Total Environ. 705 (2020), p. 135914. doi:10.1016/j.scitotenv.2019.135914.
  • R.W. Lawrence and M. Scheske, A method to calculate the neutralization potential of mining wastes, Environ. Geol. 32 (2) (1997), pp. 100–106. doi:10.1007/s002540050198.
  • P. Mehling, S. Day, and K. Sexsmith, (eds.), Blending and layering waste rock to delay, mitigate or prevent acid generation, Fourth International Conference on Acid Rock Drainage; 1997 May 31–May 6. Vancouver, BC, Canada.
  • P. Goovaerts, Geostatistics for Natural Resources Evaluation, Oxford University Press, New York, 1997.
  • A. Boucher and R. Dimitrakopoulos, Block simulation of multiple correlated variables, Math Geosci. 41 (2) (2009), pp. 215–237. doi:10.1007/s11004-008-9178-0.
  • J. Straker, J. McConnachie, T. Baker, G. McKenna, and T. Antill (eds.), Guiding Principles for Successful Mine Reclamation in British Columbia, British Columbia Mine Reclamation Symposium, 2020, Online.
  • C. Grant and J. Koch, Decommissioning western Australia’s first bauxite mine: Co-evolving vegetation restoration techniques and targets, Ecol. Manage. Restor. 8 (2) (2007), pp. 92–105. doi:10.1111/j.1442-8903.2007.00346.x.
  • J.M. Koch, Restoring a jarrah forest understorey vegetation after bauxite mining in Western Australia, Restor. Ecol. 15 (s4) (2007), pp. S26–S39. doi:10.1111/j.1526-100X.2007.00290.x.
  • C.E. Zipper, J.A. Burger, J.G. Skousen, P.N. Angel, C.D. Barton, V. Davis, J.A. Franklin, et al., Restoring forests and associated ecosystem services on appalachian coal surface mines, Environ. Manage 47 (5) (2011), pp. 751–765. doi:10.1007/s00267-011-9670-z.
  • P.M. Holmes, Shrubland restoration following woody alien invasion and mining: Effects of topsoil depth, seed source, and fertilizer addition, Restor. Ecol. 9 (1) (2001), pp. 71–84. doi:10.1046/j.1526-100x.2001.009001071.x.
  • D.D. Mackenzie and M.A. Naeth, The role of the forest soil propagule bank in assisted natural recovery after oil sands mining, Restor. Ecol. 18 (4) (2010), pp. 418–427. doi:10.1111/j.1526-100X.2008.00500.x.
  • S.E. Macdonald, S.M. Landhäusser, J. Skousen, J. Franklin, J. Frouz, S. Hall, et al., Forest restoration following surface mining disturbance: Challenges and solutions, New. For. 46 (5) (2015), pp. 703–732. doi:10.1007/s11056-015-9506-4.
  • A. Bley, N. Boland, G. Froyland, and M. Zuckerberg, Solving mixed integer nonlinear programming problems for mine production planning with stockpiling, Optim. Online (2012).
  • J. Zhang and R.G. Dimitrakopoulos, Stochastic optimization for a mineral value chain with nonlinear recovery and forward contracts, J. Oper. Res. Soc. 69 (6) (2018), pp. 864–875. doi:10.1057/s41274-017-0269-5.
  • M. Kumral and P.A. Dowd, A simulated annealing approach to mine production scheduling, J. Oper. Res. Soc. 56 (8) (2005), pp. 922–930. doi:10.1057/palgrave.jors.2601902.
  • M. Kumral, Optimizing ore–waste discrimination and block sequencing through simulated annealing, Appl. Soft Comput. 13 (8) (2013), pp. 3737–3744. doi:10.1016/j.asoc.2013.03.005.
  • J.A. Ferland, J. Amaya, and M.S. Djuimo, Application of a particle swarm algorithm to the capacitated open pit mining problem, in Autonomous Robots and Agents, S. Mukhopadhyay and G. Gupta, (eds.), Springer, Berlin, Heidelberg, 2007, pp. 127–133.
  • A. Lamghari and R. Dimitrakopoulos, A diversified Tabu search approach for the open-pit mine production scheduling problem with metal uncertainty, Eur. J. Oper. Res. 222 (3) (2012), pp. 642–652. doi:10.1016/j.ejor.2012.05.029.
  • A. Lamghari, R. Dimitrakopoulos, and J.A. Ferland, A variable neighbourhood descent algorithm for the open-pit mine production scheduling problem with metal uncertainty, J. Oper. Res. Soc. 65 (9) (2014), pp. 1305–1314. doi:10.1057/jors.2013.81.
  • M.S. Shishvan and J. Sattarvand, Long term production planning of open pit mines by ant colony optimization, Eur. J. Oper. Res. 240 (3) (2015), pp. 825–836. doi:10.1016/j.ejor.2014.07.040.
  • R. Senécal and R. Dimitrakopoulos, Long-term mine production scheduling with multiple processing destinations under mineral supply uncertainty, based on multi-neighbourhood Tabu search, Int. J. Min. Reclam. Environ. 34 (7) (2020), pp. 459–475. doi:10.1080/17480930.2019.1595902.
  • M. Godoy, The Effective Management of Geological Risk in Long-Term Scheduling of Open Pit Mines, University of Queensland, Brisbane, Australia, 2003.
  • R.S. Sutton and A.G. Barto, Reinforcement Learning: An Introduction, 2 nd ed., MIT Press, Cambridge, MA, 2018.
  • J.R. Birge and F. Louveaux, Introduction to Stochastic Programming, 2 nd ed., Springer, New York, NY, 2011.
  • S. Kirkpatrick, C.D. Gelatt, Jr, and M.P. Vecchi, Optimization by simulated annealing, Science 220 (4598) (1983), pp. 671–680. doi:10.1126/science.220.4598.671.
  • A. Lamghari and R. Dimitrakopoulos, Hyper-heuristic approaches for strategic mine planning under uncertainty, Comput. Oper. Res. 115 (2020), p. 104590. doi:10.1016/j.cor.2018.11.010.
  • S. Strebelle and C. Cavelius, Solving speed and memory issues in multiple-point statistics simulation program SNESIM, Math. Geosci. 46 (2) (2014), pp. 171–186. doi:10.1007/s11004-013-9489-7.
  • F. Albor Consuegra and R. Dimitrakopoulos, Stochastic mine design optimisation based on simulated annealing: Pit limits, production schedules, multiple orebody scenarios and sensitivity analysis, Min. Tech. 118 (2) (2009), pp. 79–90. doi:10.1179/037178409X12541250836860.