Full article: Time-stage driven pathfinding framework for optimized medical treatments

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

Coming up with the correct patient-specific treatment procedure is a non-trivial task. One of the main challenges is that the problem is combinatorial, and each treatment can be followed by numerous treatments. In real-life scenarios, one cannot experiment with all probable treatment combinations that may provide the necessary positive outcome. In addition, the task of correct drug prescription is challenging because, at different time-stages, the prescription may provide different results, a concept not widely explored in the literature on computational modeling. To address this task, we model the problem as a search problem and propose two algorithms that construct a directed acyclic graph (DAG), a directed cyclic graph, and a tree for each patient to explore various treatment combinations in an iterative and recursive manner. Each patient has three corresponding datasets, one for each stage, representing the features the patient has demonstrated during the recovery process. As a result, we provide a framework for identifying treatment options that may not have been explored previously while incorporating the concept of time-stage-based observations in the search procedure as novel contributions to the existing literature. We provide evaluation and scaling methods and identify current limitations and future research directions.

Keywords:

1. Introduction

The problem that we try to address in this work is closely related to the task of iterative target class allocation proposed by (Aslanyan et al., Citation2022), where the goal is to move the object to the marked class with successive classification steps. This task can be addressed using machine learning-based techniques, such as through cascade classifiers; however, we model it using a graph-based approach. For solving this problem for the medical treatment domain from the perspective of graph search, where the target class is the Home discharge outcome, we obtain and use historical treatment and charted patient observations as a result of MIMIC-III clinical database processing.

Although not at a larger scale, searching algorithms have previously been applied to drug discovery tasks. Based on our conducted literature review, such applications imply a one-stage search process and do not extend to multiple stages, which brings additional complexities to the task of finding the optimal drug treatment prescription procedure. In this work, each node in the graph represents a patient who has a corresponding feature vector, and the graph itself describes a patient–feature interaction. There are similar works in the literature that are based on protein–protein interaction networks which search for a target node in the graph. We construct a graph and perform a search not only to identify desirable nodes (features) but also to derive a treatment strategy based on the medications of the patient using feature similarity.

As a result of our experiments, each patient may have zero, one, or many treatment possibilities depending on the algorithmic approach. After constructing the graphs, we search for the optimal solutions using existing pathfinding algorithms, which are needed in case multiple shortest paths exist, propose cost and heuristic functions, and show the optimality for one of the functions in the scope of the current graph search domain. In addition, we also provide an API-driven software pipeline for the user to obtain a graph and a treatment path for the requested features and diagnosis. The paper’s contributions can be summarized in the following points.Footnote¹

• We provide two algorithms that, based on stage-based logic, construct a cyclic graph, a DAG, and a tree for each patient based on the admission features by dynamically searching for the optimal treatment paths. One of the algorithms can have both deterministic and non-deterministic behavior. The algorithms are applied to datasets that group patients’ recovery into three stages, namely initial, intermediate, and final, which include charted patient observations (features) and drugs given to the patient for a given stage.

• We provide heuristic and cost functions devised for performing the search on the constructed graphs for the medical treatment-search task.

• We provide a clustering-based evaluation of the DAG construction experiment to examine its effectiveness. Similar to the approach described in (Chen et al., Citation2016), we also provide betweenness and cosine similarity-based evaluations of both experiments to identify the most important nodes and observe their similarities to the target features.

• We provide a serverless pipeline based on the modern backend and cloud technologies as a scaling platform for algorithmic results.

2. Related work

Network-based approaches are not uncommon in developing drug combination therapies and candidate-drug prediction tasks. Rivas-Barragan et al. (Citation2020) present Drug2ways, a novel methodology that leverages multi-modal causal networks to predict drug candidates. They implement an algorithm that reasons over causal paths in large-scale biological networks to propose drug candidates for a given disease. The algorithm traverses the ensemble of paths between a drug and disease to propose the drugs that are likely to cure the disease based on the information contained in the network. Directionality inferred through the paths can be used to identify drug candidates, and the algorithm can also find ways of using existing drugs and propose combination therapies.

Many knowledge-graph methods have been developed to identify new drug therapies for diseases. Most such methods are based on counting the number of biomedical concepts, such as drugs, diseases, and genes in the indirect paths defined as sequences of two triples between two drugs, two diseases, or between a drug and a disease. The assumption is that a high number of intermediate concepts indicate similarity between drugs (Malas et al., Citation2019). Successful network-based frameworks have also been developed for accelerating drug discovery and quantifying disease–disease and drug–disease relationships (Cheng et al., Citation2018; Guney et al., Citation2016; Menche et al., Citation2015). These approaches have allowed moving from “one-drug, one-target” paradigm towards “multiple-drugs, multiple-targets” possibilities (Hopkins, Citation2008; Vidal et al., Citation2011; Yıldırım et al., Citation2007).

(Chen et al., Citation2016) apply a shortest path algorithm to identify candidate genes in the protein–protein network, and the analysis shows that some of them are deemed breast cancer-related genes according to the most recently published literature. In this work, the model is based on the information on protein–protein interactions, and such methods have been previously applied to identify genes of other diseases, such as age-related macular degeneration (Zhang et al., Citation2013) and hepatocellular carcinoma (Jiang et al., Citation2013). The weighted network is constructed by using proteins as nodes, and two nodes are adjacent if the corresponding proteins can interact with each other. The score of the interaction is defined as $Q (p_{1}, p_{2})$ , which ranges between 150 and 950, where $p_{1}$ and $p_{2}$ are corresponding proteins of nodes $v_{1}$ and $v_{2}$ . The final weight is given by 1000- $Q (p_{1}, p_{2})$ . The shortest paths are constructed for any two known breast cancer-related genes using Dijkstra’s algorithm, and for each node gene in the network, the number of times the node/gene appears as an intermediary node is counted, and this value is referred to as betweenness in the study. To avoid false candidate-gene discoveries, random gene-set selection and permutation FDR calculations are performed, and the genes with small FDRs are picked as significant breast-cancer genes. Li et al. (Citation2013) propose a computational method and apply Dijkstra’s algorithm for identifying cell lung cancer (SCLC) and non-small cell lung cancer (NSCLC) related genes with a shortest path approach in the protein–protein interaction (PPI) network. Based on data from the STRING database resource, the authors show that some of the genes identified from shortest paths have been reported to be related to lung cancer, and the candidate genes identified possessed more functional similarity with known cancer genes than those identified from gene expression profiles. The shortest path algorithm has also been applied to identify Gastric Cancer-related genes in PPI networks by Jiang et al. (Citation2014) and thyroid cancer-related genes by Jiang et al. (Citation2015).

The task of finding the optimal drug treatment procedures is also related to the research area of constructing drug recommendation systems. Health recommender systems (HRS) can make early diagnoses, predict disease progression, and make appropriate recommendations according to the health status of the patients. Some HRSs help support medical treatment and prognosis, while others offer medicine based on patient reviews using sentiment analysis and feature engineering (Bhimavarapu et al., Citation2022). There are also examples of graph-based approaches for providing personalized medical recommendations. Bhoi et al. (Citation2021) propose a two-stage recommender system comprising attention-based RNNs and graph networks to model drug co-occurrences in the EHR. They also adapt GAT to incorporate the varying importance of drug interactions for learning effective drug embeddings and justify a particular medication by providing contribution percentages among diagnoses and procedures. Similar to this work, MIMIC-III is one of the datasets used for the experiment. An existing recommender system, such as LEAP (Zhang et al., Citation2017), uses patients’ current visits and drug-to-drug interactions to predict a list of medications; however, patients’ previous visits are not considered. Another example is the recommender system DMNC (Le et al., Citation2018) which uses temporal differences in patients’ past visits when predicting medications. Mao et al. (Citation2022) present a medical intelligent system called MEDGCN that can automatically recommend patients’ medications based on their incomplete lab tests and estimate lab values that have not yet been taken using graph convolutional networks (GCN). The experiments are validated on NMEDW and MIMIC-III datasets. Some other examples of drug recommendations using graph-based methods are GAMENet by (Shang et al., Citation2019), which integrates drug–drug interaction knowledge for providing safe and personalized medical recommendations using graph neural networks, and G-BERT by (Shang et al., Citation2019), which integrates GNNS with transformer-based encoders for solving a medical recommendation task, achieving state-of-the-art performance.

Finding an optimal path with various quality measures is often regarded as an NP-hard problem, even for low-dimensional spaces (Canny & Reif, Citation1987). Most common deterministic and sampling-based algorithms aim to capture topological space connectivity using a discrete graph structure, and the final solution is extracted by searching the graph using Dijkstra’s algorithm or one of its variants. Overall, there exists a challenge in designing high-quality paths using a representative graph structure, as stated in Raveh et al. (Citation2011), who introduce an algorithm for merging an arbitrary number of input motion paths into a hybrid output path, which is based on the observation that the quality of certain sub-paths within each solution may be higher than the quality of the entire path. To optimize the combinations of therapeutic interventions and better manage the clinical outcomes of complex diseases, Calzolari et al. (Citation2008), propose a novel application of searching algorithms. Their findings demonstrate the effectiveness of the algorithms in identifying optimal combinations compared to random search and suggest that modified searching algorithms have the potential to enhance the discovery of novel therapeutic drug combinations. Calzolari et al. (Citation2008) also state that with searching strategies for combined interventions, there is no need for an experimental design where possible combinations of drugs for all selected doses are tested.

There are also approaches aimed at finding new drugs or a combination of drugs that can be used in combination therapies. Combination therapies are thus widely used in the treatment of complex diseases, from hypertension to cancer and infectious diseases. The systematic identification of drug combinations that offer high efficacy and low toxicity is mostly driven by intuition rather than by established principles. In their work, to tackle such challenges (Cheng et al., Citation2019), come up with a network-based drug combination strategy to quantify the relationship between drug targets and disease proteins.

The availability and scalability of medical software systems that can computationally serve the methodological findings in the medical/healthcare domain are as much important as the findings themselves. Gribova et al. (Citation2019) propose a concept of a cloud environment that can be used for the development and use of medical intelligent systems. Software systems such as BioXcel Therapeutics, BERG, XtalPi’s ID4, Deep Genomics, IBM’s Watson, and Google’s DeepMind Health can reduce the mortality rate in complicated medical diagnoses based on clinical data (Rajan & Paranthaman, Citation2022).

3. Materials and methods

3.1. Data

There are datasets for three stages for each diagnosis. The first stage is the initial stage, the second is the intermediate stage, and the third stage is the final stage. Each stage has two datasets. The first dataset has data about the historical patient measurements for 10 features. We label them as the Features dataset. The second dataset has data about the historical list of drugs for that stage, and we label them as the Drugs dataset. Such datasets are constructed for four diagnoses, which are PNEUMONIA, CONGESTIVE HEART FAILURE, SEPSIS, and DIABETIC KETOACIDOSIS. The drugs are the procedures or the graph edges, which tell which procedure should be given to the patient who is at a certain state with the given features. The datasets for stages are constructed based on the concept of time. Features datasets contain averaged information about 10 distinct features for $t$ days for each stage. An example can be seen from Table . The features are O2 saturation pulse oximetry, Heart Rate, Respiratory Rate, Noninvasive Blood Pressure mean, Noninvasive Blood Pressure systolic, Noninvasive Blood Pressure diastolic, Temperature Fahrenheit, Arterial Blood Pressure systolic, Arterial Blood Pressure diastolic, Arterial Blood Pressure mean. $t$ is 2 for stages 1 and 2 for all patients, and 2 $\leq t \leq$ 4 for stage 3. This variation in $t$ allows to increase the number of observations for the experiment. Similar to the Features datasets, the Drugs datasets contain drug prescriptions based on the same logical division of $t$ described above. Possible discharge locations are ICF, HOME, DEAD/EXPIRED, SHORT TERM HOSPITAL, HOME HEALTH CARE, etc. If the location is HOME, we encode it as 1; otherwise, the encoding is 0. This indicates that there is only one positive discharge, and the optimal path should lead to a location of this type. All the datasets are derived from pre-processing four database tables from the MIMIC-III clinical database, a freely accessible database comprising de-identified health-related data from over 40,000 patients (Johnson et al., Citation2016). The tables that have been used are ADMISSIONS, D_Items, and PRESCRIPTIONS datasets and a subset of the CHARTEVETS dataset.

Table 1. Features dataset example for a single patient in a stage before averaging

Display Table

3.2. DAG construction experiment

We first model the target class allocation problem as a directed acyclic graph (DAG) construction and search problem using existing pathfinding algorithms adapted to our multi-stage treatment logic. Figure shows that the number of time-stages in our datasets corresponds to the depth of the tree and is 3. Each node or state represents the health state of the patient represented by a 1-dimensional feature vector having 10 numerical observations, one for each feature. Actions are represented by drug sequences that move the patient from state $a$ at time-stage $i$ to state $a^{'}$ at time-stage $i + 1$ . Each state can have at most five child nodes; in other words, the set of actions for each state is limited to five by default but can vary as an input parameter. How the algorithm responsible for constructing the state space works is described in detail in the following steps. Each time-stage has its historical list of patients, features, and treatments that have been prescribed in each stage. The data for these three stages is denoted as the “training” set. For the first stage, in addition, we also possess information about patients and their features who have just been admitted, and regard these observations are part of the “test” set. We construct one directed acyclic graph (DAG) for each patient. More specifically, we iterate over each “test” instance, which possesses the features at the first stage, and also iterate over the available “training” instances for the first stage. Our step cost function $C$ is based on an RMSE measure (1), which takes two nodes and calculates the square root of the sum of squared distance between the feature vectors $F$ of these nodes. The ideal RMSE score will have a score of 0. We add 1 to make the concept of the path more intuitive, making the minimum path cost to be 1. The first five “training” instances with the lowest cost values become the child nodes or the next possible states for the 2nd stage. The paths (edges) that connect the “testing” instance with its successor states are the drug-treatment actions. This procedure is repeated for each of the obtained five child nodes, finding the most similar child nodes in terms of features from 2nd stage “training” features, then is repeated for the resulting child nodes, connecting them to the states at the final 3rd stage. This process is conducted for each “testing” instance. What makes this process unique is that all the states available at the final time-stage are the instances that have been positively discharged (discharged to Home). The $target$ is a state selected from such states, and this approach makes sure the final selected path will lead to a state having the features of a patient with a successful historical discharge. If the final node has a successful discharge, at least conceptually, each child node from each time-stage will at least be as good as its parent node, given they all are part of a successful drug treatment path. Each drug will have a target node, meaning there is at least one possible solution path for each graph. The intuition is to construct a stage-progression-based drug treatment graph based on similar but not identical historical treatment information for that stage. By default, the search happens using Dijkstra’s algorithm, which for a given vertex in the graph finds the path with the lowest cost between the vertex and any other vertex. It is also often used for finding the shortest path from a single vertex to a single destination vertex (Konakalla, Citation2014), in this case, the target node. The algorithm, however, is uninformed as the target node is not taken into consideration until the node obtained from path expansion is found to be the target. Search can become informed by taking into account the value of some heuristic function, which can say some nodes are more promising and will lead to a solution faster. More formally, a heuristic function $h (n)$ , where $n$ is a node, is said to be admissible if $h (n)$ is always less than or equal to the actual cost of a lowest-cost path from node $n$ to a target (David Poole & Mackworth, Citation2017). To identify the best path using heuristic information, we offer two heuristic functions and do the search using the A Star algorithm, a variant of Dijkstra’s algorithm, which takes into account heuristic information. The first heuristic function (2) is analogous to a “straight-line” heuristic, which for a node $n$ directly measures the $RMSE$ score between node $n^{'} s$ features $F$ and the target node’s features. Note that for all nodes located $n_{d - 1}$ step away from the target, the cost function $C (n)$ and heuristic function $h (n)$ will be equal, which still satisfies the admissibility condition. The heuristic is admissible in the scope of this experiment but may not be admissible for other datasets. We also provide another heuristic function (3) and a proof of admissibility in (3.2). $kstest$ is the default function name from the SciPy library, performing a Kolmogorov–Smirnov test for goodness of fit (Virtanen et al., Citation2020) and providing a $P$ value, where if the $P$ value is bigger than 0.05, we accept the null hypothesis that the distributions are identical. The intuition is that the estimated cost from node $n$ to the target node is the node’s depth weighted by the similarity to the target node, which for the given cost function, is always admissible. The algorithm is provided in the Appendix A as Algorithm (1). In the cost and heuristic function formulas, we use the terms RMSE and kstest to avoid extra formula complications and demonstrate that the names correspond to function names from external libraries.

Figure 1. DAG visualizations for the diagnoses.

(1)

C (n, m) = 1 + RMSE (F_{n}, F_{m})

(1)

(2)

h (n, target) = 1 + RMSE (F_{n}, F_{target})

(2)

(3)

h (n, target) = 1 - kstest (F_{n}, F_{target})

(3)

(4)

C (n_{d - k}, target) = k * (1 + RMSE (F_{n}, F_{target}))

(4)

(5)

h (n_{d - k}, target) = k * (1 - kstest (F_{n}, F_{target}))

(5)

Definition 3.1. Suppose $G$ is a DAG representing the multi-stage drug treatment state space. $start$ is the initial state, $T (s_{i}, a)$ is the transition model that returns state $s {^{'}}_{i + 1}$ belonging to stage $i + 1$ after taking action $a$ at state $s$ from stage $i$ . $d$ denotes the maximum depth $d$ which is equal to the number of treatment stages, and $target$ denotes the end state located at depth $d$ .

Definition 3.2. Let $C (n_{d - k}, target)$ denote the cost of the shortest path from node $n$ at depth $d - k$ to the target, and $h (n_{d - k}, target)$ denote the heuristic cost of $n$ to the target. $C (n_{d - k}, target)$ is given by Equationequation (4)(4) $C (n_{d - k}, target) = k * (1 + RMSE (F_{n}, F_{target}))$ (4) , and $h (n_{d - k}, target)$ is given by Equationequation (5)(5) $h (n_{d - k}, target) = k * (1 - kstest (F_{n}, F_{target}))$ (5) , where $F_{n}$ and $F_{target}$ are the feature vectors of node $n$ and the node $target$ . The range of $C (n_{d - k}, target)$ is $R_{C (n_{d - k}, target)} : k * [1, 1024]$ , and the range of $R_{h (n_{d - k}, target)} : k * [0, 1]$ .

Proposition 3.3. Given a DAG $G$ , $h (n_{d - k}, target) \leq C (n_{d - k}, target)$ for any node $n$ $k$ steps away from the maximum depth $d$ .

In other words, the theorem states that the heuristic is admissible for a given node at a given depth of the graph. The admissibility is visible from the definition, but we also provide a formal proof.

Proof. Base Case: When $k = 1$ , $h (n_{d - 1}, target) = 1 - kstest (F_{n}, F_{target})$ and $C (n_{d - 1}, target) = 1 + RMSE (F_{n}, F_{target})$ . $h (n_{d - 1}, target) \leq C (n_{d - 1}, target)$ as the range of $h (n_{d - 1}, target)$ is always smaller than the range of $C (n_{d - 1}, target)$ .

Inductive Case: Suppose $h (n_{d - k}, target) \leq C (n_{d - k}, target)$ , then $h (n_{d - (k + 1)}, target)$ = $(k + 1) * (1 - kstest (F_{n}, F_{target}))$ and $C (n_{d - (k + 1)}, target) = (k + 1) * (1 + RMSE (F_{n}, F_{target})) .$ The expressions reduce to the base case if we divide both sides by $k + 1$ , which proves the induction.

3.3. Graph and tree construction experiment

Although in this experiment we construct a cyclic graph and a tree, which again are graphs, it differs from the DAG experiment (3.2) in multiple ways. First, the stages are not fixed, a target state may exist at depth 0 or depths deeper than 3, 2 new cost functions, in addition to cost function (1), are provided. Here, every cost function output is compared to a threshold value to check whether the algorithm should terminate or not. The threshold function for cost function (1) is given by (6), which calculates the average RMSE across the rows of the target feature matrix $T$ , which is the third-time features of those patients who have been discharged to Home, $n$ denoting the number of rows, $T [i, :], T [j, :]$ , denoting rows of the matrix.

(6)

threshold = 1 + \frac{1}{n^{2}} \sum_{i}^{n} \sum_{j}^{n} RMSE (T [i, :], T [j, :])

(6)

1 is added to prevent the threshold from being 0. The algorithm can have non-deterministic behavior as part of the imposed complexity, and finally, the algorithm is recursive. The path can have no solution, acceptable solution, or a goal solution, the details of which can be found in the Appendix A as part of Algorithm (2). The algorithm of the experiment can search for the optimal drug path by constructing a graph, where directed edges but not self-loops are allowed, or a tree, where undirected edges are not allowed. Similar to the breadth-first search traversal, the algorithm uses a queue for node expansion where the most similar nodes are added to the queue first. The graph can have multiple shortest paths, while the tree will always have a single shortest path given there exists a target solution node. By default, each node can have three child nodes, but the number can vary as an input parameter to the algorithm. Furthermore, both the graph and the tree can be searched in a deterministic and non-deterministic manner. Deterministic search means that the child nodes will have features that belong to a dataset from one of the three stages. The idea is that the existing features as a result of existing procedures are a good approximation, representing the mean of all the potential features that could be obtained as a result of procedure application. Meanwhile, non-deterministic means we do allow features to vary which implies deviating the features with a certain probability from the observed to include some randomness and bring it closer to the real-life scenario. We experiment with two additional cost functions and also provide threshold functions for our recursive algorithm to terminate. The cost function (7) shows the average combined deviation for the features of node $n$ , which in the algorithm is compared with the threshold denoting the sum of standard deviations obtained from target features from the third time-stage (8). We add 1 to the cost and threshold functions for the same reason as described in the case of cost function (1).

(7)

C (n) = 1 + \sum_{i = 1}^{10} \sqrt{{(F_{n} [i] - A [i])}^{2}}

(7)

where $F$ and $A$ are 1D vectors with 10 observations, representing the feature vector and the target feature vector averaged across rows, respectively. Note that here $C (n)$ depends only on a single input, as $A$ is the same for each $n$ .

(8)

threshold = 1 + \sum_{i = 1}^{10} σ (T [:, i])

(8)

where $T$ is the target feature matrix, $T [:, i]$ is a feature column in the matrix. The intuition is that if the node’s total feature deviations are within the acceptable deviation range, then the algorithm can stop the execution.

The second cost function given by (9) is a vector-based subtraction operation, counting the number of features that are in the acceptable deviation range, and the threshold is a constant of 2, which is a varying input parameter to the algorithm and is not explicitly given. The intuition is that if the patient does not demonstrate acceptable values for the majority of the features, then the search should continue.

(9)

C (n) = 1 + (10 - count (\sqrt{F_{n} - A^{2}} < σ (T)))

(9)

4. Results

By looking at Figure , you can see the DAG visualizations for three of the diagnoses. The concept of time-stage is incorporated in subfigure (3.1) but applies to all Figure (1’s) subfigures. The nodes at each stage show the features that the patient can have after being given a drug procedure. Each such DAG has a solution located at depth 3. Meanwhile, by looking at Figure and Table , the latter showing sample paths per patient using a given experimental approach, you see a tree constructed with a deterministic approach and what a drug treatment path looks like for a given diagnosis. If a path exists, it either has found an acceptable solution or a goal solution. The tree, described in the table, has no solution, as it reaches a depth of 5, and a target node still has not been found, so a base case is met, and the algorithm terminates without giving a solution. Overall the paths are obtained for different cost functions and graph types, but only some of the combinations are provided in Table . From the table, “start:0” means the path is for the patient with id “0”, “start:4” is for the patient with id 4, etc. Figures are examples of a non-deterministic tree and a deterministic graph, respectively, and the search paths can be found in Table . The paths can be explained as follows: if you look at the first observation from Table , you see that the path is “start:0” $\to$ “t1:5” $\to$ “t2:35”, $\to$ “t3:9”. This means after the initial patient features are observed, the patient should be given the medication of the patient with id 5 from time-stage 1, after that, the medication of patient with id 35 from time-stage 2, then the medication of patient 9 in time-stage 3 for a successful discharge. Note that a non-deterministic path has some extra alpha-numeric letters attached to the path, such as “t2:14:0.88:jKp”. This is done to assign the node a unique id and make sure no two nodes share the same label as a result of a non-deterministic action. You can also see that for all DAGs, the paths include one node from each stage, meaning one procedure is assigned relevant to the stage the patient currently is in. For the tree and the graph, the path search is more complex, and if the patient shares similar features with patients from later time-stages, such medications can be given disregarding the stage progression. An interesting example is the result of the non-deterministic tree obtained for SEPSIS diagnosis for the patient with id “0”, where all medications are chosen from the second time-stage before the algorithm suggests that a solution has been found. The corresponding medications are not listed, but each patient for each time-stage has a drug medication sequence from the Drugs dataset. Although the real effectiveness of the procedure can only be verified with actual hospital-level medication, this method of searching for the best procedure computationally is expected to yield the desired results.

Figure 2. Deterministic tree.

Figure 3. Non-deterministic tree.

Figure 4. Deterministic graph.

Table 2. Shortest path examples

Display Table

4.1. Evaluation

4.1.1. K-Means

We provide a clustering-based approach for evaluating the performance of the DAG construction algorithm. K-Means algorithm with three clusters is fit on the features datasets of each stage, assigning each unique instance to a given cluster. The algorithm is fit on whole datasets, meaning we obtain clusters for the “testing” instances of each diagnosis. We then obtain the treatment suggestions from the graph search for each of the three stages. If the suggested path (treatment) of the patient belongs to the same cluster as the patient’s actual treatment cluster, then the algorithm has been successful in performing the search. By looking at Table , we see that in all cases, the algorithm with above 60% accuracy assigns the correct medication cluster. We take these results as a baseline for improvement. We also note that the results are subject to small variations for different runs as the clustering algorithm is fit during execution. Changing the number of clusters may also affect the results.

Table 3. Evaluation results

Download CSV Display Table

4.1.2. Betweenness and cosine similarity

We calculate how many times each node appears as part of a solution path for each experiment, and this count is referred to as a betweenness value similar to Chen et al.’s (Citation2016) work. For the nodes having the highest betweenness values, we calculate cosine similarity for each node using node’s and target node’s vectors, where values closer to 1 indicate high similarity.Footnote² The paths are constructed using the default cost function (1) and heuristic function (3). We can observe from Figures that the nodes that appear the most as part of a treatment path have above 75% similarity with the target node/features for all diagnoses, which is an indication of potentially effective candidate treatment procedures.

Figure 5. DAG evaluation.

Figure 6. Tree evaluation.

Figure 7. Graph evaluation.

5. Parallel processing and scaling

To be able to perform graph construction and path generation efficiently, we also construct a serverless API-driven pipeline based on Django (Django Software Foundation, Citation2022), AWS Lambda Functions (Amazon Web Services, Citation2023b), and S3 (Amazon Web Services, Citation2023a). Figure demonstrates the pipeline logic, where the client sends a request with patient features and the diagnosis. Serverless Lambda Functions start and perform the graph construction and optimal treatment recommendation, then return the generated path and image URL; meanwhile, the images get uploaded to the S3 storage bucket. By calling the URL of the image, the backend makes a request to S3 storage and returns the constructed graph. The framework for calling the Lambda Function can itself be deployed on Lambda, making the overall pipeline available on demand, providing efficiency. This approach provides high scalability, meaning for newly admitted patients, the results can be obtained efficiently as the pipeline is serverless and fast, as the process is parallel, with each process running in its container environment. Further instructions on how to test the pipeline on the local Django server are provided in the project repository.

Figure 8. API-based serverless path recommendation pipeline.

5.1. Setting

The experiments are conducted in Python, using the Networkx library (Hagberg et al., Citation2008). The software pipeline is constructed using the Django backend framework, and the serverless cloud functionality is achieved with AWS Lambda and S3 services.

6. Discussion

In this work, graph construction algorithms, cost, and heuristic functions, along with a software pipeline were proposed with the aim of finding the correct medical treatment for patients across multiple recovery stages using graphs where each node denoted patient features. As a limitation, we acknowledge that the approaches need further medical testing and verification. In addition, due to the stage-based dataset pre-processing and novel graph construction logic for performing a graph-search-based recommendation, which also provides dataset size limitations, the effectiveness of the approach with existing works can be validated only in terms of methodological and experimental similarities. K-Means and Cosine Similarity-based validation are provided for the evaluation of the methods. As part of future work, providing a graph construction pipeline for an increased number of time-stages, which can also include dataset benchmarking, scalability in searching performance for bigger graphs or trees, and further optimization of the algorithms, can be considered.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was partially supported by the RA MESCS Science Committee and BRFFR in the frames of the joint research project SC No21SC-BRFFR-1B029 and by the research project [21T-1B314].

Notes on contributors

Karen Gishyan

Karen Gishyan. Bachelor of Arts in Business, American University of Armenia, Yerevan, Armenia 2015–2019. Master of Science in Computer Science, University of Bath, Bath, UK 2019–2020. PhD student at the Institute for Informatics and Automation Problems of the National Academy of Sciences of the Republic of Armenia. Research interests: Intelligent Control and Cognitive Systems, Software Systems and Design, Artificial Intelligence and Time Series Analysis.

Hasmik Sahakyan

Hasmik Artem Sahakyan. Graduated from Yerevan State University. She received her PhD in 2002 and her doctorate in 2018. She is currently the scientific secretary of the Institute for Informatics and Automation Problems of the National Academy of Sciences of the Republic of Armenia and leading researcher at the Discrete Mathematics Department. Scientific interests: Combinatorics, Discrete Optimization, Discrete Tomography, Artificial Intelligence, Data Science, and Data Mining.

Levon Aslanyan

Levon Hakob Aslanyan. Graduated from Novosibirsk State University in 1968. He received his PhD in 1976, and his doctorate in 1997. Professor since 1997. Corresponding member of National Academy of Sciences of the Republic of Armenia, 2014. Currently heads the Department of Discrete Modeling, Analysis, and Recognition Technologies at the Institute for Informatics and Automation Problems of the NAS RA. Research interests: mathematical logic, discrete mathematics, mathematical theory of pattern recognition and artificial intelligence.

Notes

1. Code source: https://github.com/karen-gishyan/project

2. For Algorithm 1, similarity is calculated with the target node’s feature vector, for Algorithm 2, it is calculated with the averaged vector of 3rd stage’s target feature vectors.

References

Amazon Web Services. Amazon s3; 2023a. Available from: https://aws.amazon.com/s3/.
Google Scholar
Amazon Web Services. Aws lambda; 2023b. Available from: https://aws.amazon.com/lambda/.
Google Scholar
Aslanyan, L., Gishyan, K., Sahakyan, H. Target class classification recursion preliminaries. In: 13th Conference for Data Analysis Methods for Software Systems; December; 2022. p. 6–17. Available from: https://www.mii.lt/damss/files/damss_2022.pdf.
Google Scholar
Bhimavarapu, U., Chintalapudi, N., & Battineni, G. (2022). A fair and safe usage drug recommendation system in medical emergencies by a stacked ann. Algorithms, 15(6), 186. https://doi.org/10.3390/a15060186
Web of Science ®Google Scholar
Bhoi, S., Lee, M. L., Hsu, W., Fang, H. S. A., & Tan, N. C. (2021). Personalizing medication recommendation with a graph-based approach. ACM Transactions on Information Systems (TOIS), 40(3), 1–23. https://doi.org/10.1145/3488668
Web of Science ®Google Scholar
Calzolari, D., Bruschi, S., Coquin, L., Schofield, J., Feala, J. D., Reed, J. C., McCulloch, A. D., & Paternostro, G. (2008). Search algorithms as a framework for the optimization of drug combinations. PLoS Computational Biology, 4(12), e1000249. https://doi.org/10.1371/journal.pcbi.1000249
PubMed Web of Science ®Google Scholar
Canny, J., & Reif, J. (1987). New lower bound techniques for robot motion planning problems. Proceedings of the 28th Annual Symposium on Foundations of Computer Science (sfcs 1987), Los Angeles, CA (pp. 49–60). http://ieee-focs.org/
Google Scholar
Cheng, F., Desai, R. J., Handy, D. E., Wang, R., Schneeweiss, S., Barabási, A.-L., & Loscalzo, J. (2018). Network-based approach to prediction and population-based validation of in silico drug repurposing. Nature Communications, 9(1), 2691. https://doi.org/10.1038/s41467-018-05116-5
PubMed Web of Science ®Google Scholar
Cheng, F., Kovács, I. A., & Barabási, A. L. (2019). Publisher Correction: Network-based prediction of drug combinations. Nature Communications, 10(1), 1–11. https://doi.org/10.1038/s41467-019-09692-y
PubMedGoogle Scholar
Chen, L., Hao Xing, Z., Huang, T., Shu, Y., Huang, G., & Li, H.-P. (2016). Application of the shortest path algorithm for the discovery of breast cancer-related genes. Current Bioinformatics, 11(1), 51–58. https://doi.org/10.2174/1574893611666151119220024
Web of Science ®Google Scholar
David Poole, A. M., Mackworth, A. K. (2017). Artificial intelligence: Foundations of computational agents, 2e. Cambridge University Press. https://doi.org/10.1017/9781108164085
Google Scholar
Django Software Foundation. Django; (2022). Available from: https://djangoproject.com.
Google Scholar
Gribova, V., Moskalenko, P., Petryaeva, M., & Okun, D. (2019). Cloud environment for development and use of software systems for clinical medicine and education. Proceedings of the 7th Scientific Conference on Information Technologies for Intelligent Decision Making Support (ITIDS 2019), Ufa, Russia (pp. 225–229). Atlantis Press. https://www.atlantis-press.com/proceedings/itids-19#:~:text=Welcome%20to%20the%207th%20Scientific,%2C%202019%20in%20Ufa%2C%20Russia
Google Scholar
Guney, E., Menche, J., Vidal, M., & Barábasi, A.-L. (2016). Network-based in silico drug efficacy screening. Nature Communications, 7(1), 10331. https://doi.org/10.1038/ncomms10331
PubMedGoogle Scholar
Hagberg, A., Swart, P. S., & Chult, D. (2008). Exploring network structure, dynamics, and function using networkx. Los Alamos National lab.(lanl). NM (United States).
Google Scholar
Hopkins, A. L. (2008). Network pharmacology: The next paradigm in drug discovery. Nature Chemical Biology, 4(11), 682–690. https://doi.org/10.1038/nchembio.118
PubMed Web of Science ®Google Scholar
Jiang, M., Chen, Y., Zhang, Y., Chen, L., Zhang, N., Huang, T., Cai, Y.-D., & Kong, X. (2013). Identification of hepatocellular carcinoma related genes with k-th shortest paths in a protein–protein interaction network. Molecular BioSystems, 9(11), 2720–2728. https://doi.org/10.1039/c3mb70089e
PubMedGoogle Scholar
Jiang, Y., Shu, Y., Shi, Y., Li, L.-P., Yuan, F., & Ren, H. (2014). Identifying gastric cancer related genes using the shortest path algorithm and protein-protein interaction network. BioMed Research International 2014, 2014, 1–9. https://doi.org/10.1155/2014/371397
Web of Science ®Google Scholar
Jiang, Y., Zhang, P., Li, L. P., He, Y.-C., Gao, R.-J., & Gao, Y.-F. (2015). Identification of novel thyroid cancer-related genes and chemicals using shortest path algorithm. BioMed Research International 2015, 2015, 1–8. https://doi.org/10.1155/2015/964795
Web of Science ®Google Scholar
Johnson, A. E., Pollard, T. J., Shen, L., Lehman, L. W. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Anthony Celi, L., & Mark, R. G. (2016). Mimic-iii, a freely accessible critical care database. Scientific Data, 3(1), 1–9. https://doi.org/10.1038/sdata.2016.35
Google Scholar
Konakalla, S. V. (2014). PATH FINDING – Dijkstra’s algorithm. https://cs.indstate.edu/rjaliparthive/dijkstras.pdf
Google Scholar
Le, H., Tran, T., & Venkatesh, S. (2018). Dual memory neural computer for asynchronous two-view sequential learning. Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, London, United Kingdom (pp. 1637–1645). https://dl.acm.org/doi/proceedings/10.1145/3219819
Google Scholar
Li, B. Q., You, J., Chen, L., Zhang, J., Zhang, N., Li, H.-P., Huang, T., Kong, X.-Y., & Cai, Y.-D. (2013). Identification of lung-cancer-related genes with the shortest path approach in a protein-protein interaction network. BioMed Research International 2013, 2013, 1–8. https://doi.org/10.1155/2013/267375
Web of Science ®Google Scholar
Malas, T. B., Vlietstra, W. J., Kudrin, R., Starikov, S., Charrout, M., Roos, M., Peters, D. J. M., Kors, J. A., Vos, R., ‘t Hoen, P. A. C., van Mulligen, E. M., & Hettne, K. M. (2019). Drug prioritization using the semantic properties of a knowledge graph. Scientific Reports, 9(1), 1–10. https://doi.org/10.1038/s41598-019-42806-6
PubMedGoogle Scholar
Mao, C., Yao, L., & Luo, Y. (2022). Medgcn: Medication recommendation and lab test imputation via graph convolutional networks. Journal of Biomedical Informatics, 127, 104000. https://doi.org/10.1016/j.jbi.2022.104000
PubMed Web of Science ®Google Scholar
Menche, J., Sharma, A., Kitsak, M., Ghiassian, S. D., Vidal, M., Loscalzo, J., & Barabasi, A.-L. (2015). Uncovering disease-disease relationships through the incomplete interactome. Science, 347(6224), 1257601. https://doi.org/10.1126/science.1257601
PubMed Web of Science ®Google Scholar
Rajan, S. P., & Paranthaman, M. (2022). Artificial Intelligence in Healthcare: Algorithms and Decision Support Systems. In C. Venkatesh, N. Rengarajan, P. Ponmurugan, & S. Balamurugan (Eds.), Smart Systems for Industrial Applications (pp. 173–197). https://doi.org/10.1002/9781119762010.ch7
Google Scholar
Raveh, B., Enosh, A., & Halperin, D. (2011). A little more, a lot better: Improving path quality by a path-merging algorithm. IEEE Transactions on Robotics, 27(2), 365–371. https://doi.org/10.1109/TRO.2010.2098622
Web of Science ®Google Scholar
Rivas-Barragan, D., Mubeen, S., Guim Bernat, F., Hofmann-Apitius, M., & Domingo-Fernández, D. (2020). Drug2ways: Reasoning over causal paths in biological networks for drug discovery. PLoS Computational Biology, 16(12), e1008464. https://doi.org/10.1371/journal.pcbi.1008464
PubMed Web of Science ®Google Scholar
Shang, J., Ma, T., & Xiao, C. (2019). Pre-training of graph augmented transformers for medication recommendation. arXiv Preprint arXiv: 190600346.
Google Scholar
Shang, J., Xiao, C., Ma, T., Li, H., & Sun, J. (2019). Gamenet: Graph augmented memory networks for recommending medication combination. Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, Hawaii, USA (Vol. 33, No. (01), pp. 1126–1133). https://ojs.aaai.org/index.php/AAAI/issue/view/246
Google Scholar
Vidal, M., Cusick, M. E., & Barabási, A. L. (2011). Interactome networks and human disease. Cell, 144(6), 986–998. https://doi.org/10.1016/j.cell.2011.02.016
PubMed Web of Science ®Google Scholar
Virtanen, P., Gommers, R., Oliphant, T. E., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright, J., van der Walt, S. J., Brett, M., Wilson, J., Millman, K. J., Mayorov, N., Nelson, A. R. J., Jones, E., Kern, R. … Halchenko, Y. O. (2020). SciPy 1.0: Fundamental algorithms for Scientific Computing in Python. Nature Methods, 17(3), 261–272. https://doi.org/10.1038/s41592-019-0686-2
PubMed Web of Science ®Google Scholar
Yıldırım, M. A., Goh, K. I., Cusick, M. E., Barabási, A.-L., & Vidal, M. (2007). Drug—target network. Nature Biotechnology, 25(10), 1119–1126. https://doi.org/10.1038/nbt1338
PubMed Web of Science ®Google Scholar
Zhang, Y., Chen, R., Tang, J., Stewart, W. F., & Sun, J. (2017). LEAP: learning to prescribe effective and safe treatment combinations for multimorbidity. Proceedings of the 23rd ACM SIGKDD international conference on knowledge Discovery and data Mining, Halifax NS, Canada (pp. 1315–1324). https://dl.acm.org/doi/proceedings/10.1145/3097983
Google Scholar
Zhang, J., Jiang, M., Yuan, F., Feng, K.-Y., Cai, Y.-D., Xu, X., & Chen, L. (2013). Identification of age-related macular degeneration related genes by applying shortest path algorithm in protein-protein interaction network. BioMed Research International 2013, 2013, 1–8. https://doi.org/10.1155/2013/523415
Web of Science ®Google Scholar

Appendix A.

Algorithms

Algorithm 1 DAG construction pseudocode

Obtain “train” patient features for the first and second stages.
Obtain those “train” patient features for the third stage, which have been discharged to Home (target node will be selected from one of these features).
Obtain first stage “test” features as test nodes.
Initialize a DAG with the start node for each “test” node.
For each “testing” node, iterate over stage 1 “training” features, calculate closeness based on cost function (1), select top five closest “training” features from stage 1, make DAG nodes, construct edges, and assign closeness values to the edge (If there are no actual data for the given “training” feature for that stage, assign a fixed penalty of 2 * *10).
For each closest node from step 5, iterate over stage 2 “training” features, calculate closeness based on cost function (1), select top five closest “training” features from stage 2, make DAG nodes, construct edges, and assign closeness values to the edge (If there are no actual data for the given “training” feature for that stage, assign a fixed penalty of 2 * *10).
For each closest node from step 6, iterate over stage 3 “training” features, calculate closeness based on cost function (1), select top five closest “training” features from stage 3, make DAG nodes, construct edges, and assign closeness values to the edge (If there are no actual data for the given “training” feature for that stage, assign a fixed penalty of 2 * *10).
Search the graph from step 3 with A Star algorithm, using the start node and the target node.

Algorithm 2 Graph/Tree construction pseudocode (recursive)

Obtain “train” patient features for the first and second stages.
Obtain those “train” patient features for the third stage, which have been discharged to Home (target node will be selected from one of these features).
Obtain first stage “test” features as test nodes.
Select target features as the average (over the rows) for the feature matrix from step 2, and choose a cost function and a respective threshold.
Initialize a Graph with the start node for each “test” node. Initialize a “frontier” queue, dictionary storing visited nodes, dictionary storing cycle-resulting nodes (for the graph version).
Base case 1: Terminate the algorithm if there are no more nodes in the “frontier” queue to explore.
Base case 2: Terminate the algorithm if the node is at a depth of 3 or deeper, has features that are better than the “start node” features, mark the solution as “acceptable”.
Base case 3: Terminate the algorithm if the node is at a depth of 5, marking the solution as “not found”.
Base case 4: Terminate the algorithm if the node’s features are within the threshold, mark the solution as ‘goal’.
Inductive case. Select a node from the queue, calculate similarity between target features using cost functions (1), (7), or (9). If one of the base cases is not met, iteratively find the three most similar features from the features from steps 1 and 2, consider them as child nodes.
If tree version: For a given child node, if it results in a cycle, remove the edge connecting the parent to the child node. Add the node to the frontier queue with default features if the algorithm is deterministic; otherwise deviate the features with some probability before adding them to the queue. Relabel the node if it has non-deterministic features. After considering all child nodes, do a recursive call.
If graph version: Add a given child node to the frontier queue with default features if the algorithm is deterministic and has not already been visited; otherwise deviate the features with some probability before adding to the queue. Relabel the node if it has non-deterministic features. After considering all child nodes, do a recursive call.
Search the Graph from step 5 with Dijkstra’s algorithm, using the start and target nodes (the tree version will always have a single solution path if one exists).

Time-stage driven pathfinding framework for optimized medical treatments

Abstract

1. Introduction

2. Related work