Research Article

Engineering Responsible and Explainable Models in Human-Agent Collectives

Article: 2282834 | Received 20 Nov 2022, Accepted 28 Oct 2023, Published online: 05 Dec 2023

ABSTRACT

In human-agent collectives, humans and agents need to work collaboratively and agree on collective decisions. However, ensuring that agents responsibly make decisions is a complex task, especially when encountering dilemmas where the choices available to agents are not unambiguously preferred over one another. Therefore, methodologies that allow the certification of such systems are urgently needed. In this paper, we propose a novel engineering methodology based on formal model checking as a step toward providing evidence for the certification of responsible and explainable decision making within human-agent collectives. Our approach, which is based on the MCMAS model checker, verifies the decision-making behavior against logical formulae specified to guarantee safety and controllability, and to address ethical concerns. We propose the use of counterexample traces and simulation results to provide a judgment and an explanation to the AI engineer as to the reasons actions may be refused or allowed. To demonstrate the practical feasibility of our approach, we evaluate it using the real-world problem of human-UAV (unmanned aerial vehicle) teaming in dynamic and uncertain environments.

Introduction

Autonomous systems are becoming increasingly ubiquitous and are making significant impacts in various safety-critical systems, such as exploration robots (Molnar and Veres Citation2009), driverless cars (Abeywickrama et al. Citation2020; Dogan et al. Citation2016) and unmanned aerial and ground vehicles (Ramchurn et al. Citation2016; Ramchurn, Stein, and Jennings Citation2021). Although the evolution of robotics has many compelling benefits, as with other emerging technologies, it comes with new risks and challenges. These challenges can be categorized into three broad and interrelated areas of ethical and social concern: safety and errors; law and ethics; and social impact (Lin, Abney, and Bekey Citation2014). Therefore, it is essential to ensure that agents make correct and responsible decisions, so that humans or other living beings, infrastructure and society as a whole are not harmed.

Researchers have advocated three main principles that can be used by AI systems to support the social good: accountability, responsibility and transparency (ART) (Dignum Citation2017). The ART principles form the main pillars of responsible AI. Yazdanpanah et al. (Citation2021) highlight the need for an interdisciplinary effort to address different social, technological, legal and ethical challenges. This requires the development of socio-technically expressive notions of responsibility, blameworthiness and accountability. Dignum et al. (Citation2018) define accountability as the need to explain and justify one’s decisions and actions. Responsibility refers both to the capability of AI systems and to the role of people interacting with them (Dignum et al. Citation2018). Transparency is defined as “the extent to which the system discloses the processes or parameters that relate to its functioning” (Winfield et al. Citation2021). Approaches to the design of responsible AI broadly focus on regulation by means of legislation and standards or are design based (e.g. ethics by design). Our research concerns the latter and focusses on the engineering of responsibility in the design of human-agent collectives using formal modeling and verification. More specifically, we engineer decision-making behavior to ensure that a human-agent collective is safe (i.e. nothing bad happens), controllable (meaningful control between humans and agents for collective decisions), trustworthy (what will convince us to trust agents) and ethical (what moral decisions agents may have to take) (see Appendix A for a list of definitions used in this work).

Acting in a responsible manner is important for agents, as described earlier, but so is being able to explain why one’s actions are morally right or wrong (Conitzer et al. Citation2017). As autonomous systems are becoming part of our daily life, issues related to the ways that humans interact with such systems become more significant. Among these issues are making machine decisions transparent, understandable and explainable (Goebel et al. Citation2018; Koeman et al. Citation2020). According to Wortham and Theodorou (Citation2017) and Sheh (Citation2017), the ability of a robot or an autonomous system to explain its behavior helps the user to develop an accurate mental model of the robot’s reasoning. This can result in better interaction between them (Koeman et al. Citation2020). These explanations can be given in understandable natural language, for example, answering why-questions for end users of robotic systems (Koeman et al. Citation2020). Miller (Citation2019) defines explainability or interpretability as “generating decisions in which one of the criteria taken into account during the computation is how well a human could understand the decisions in the given context.” The relationship between the notions of explainable AI and responsible AI has been described by Barredo Arrieta et al. (Citation2020). Responsible AI is a broader concept that imposes the systematic adoption of several AI principles (e.g. explainability, fairness, accountability and privacy), so AI models can be of practical use.

Now, ensuring that autonomous systems work in a responsible and explainable manner is challenging. This is especially the case when dilemmas are encountered. In this paper, we define dilemmas as decision-making situations in which the choices available to agents are not unambiguously preferred over one another (Bjørgen et al. Citation2018). When humans are involved, new challenges emerge as humans may not always be able to understand what agents are intending to do and vice versa. Also, humans and agents often need to agree on collective decisions using consensus or compromise (Greene et al. Citation2016; Loreggia et al. Citation2018). In order to support collective decision making, agents should be able to account for some form of moral values and ethical principles (Awad et al. Citation2018), as well as constraints on safety and controllability. According to Rossi (Citation2015), the notions of safety constraints and ethical principles are closely intertwined. An action by an agent can cause harm, thus it can be considered unsafe. However, if the action occurred by accident and there was no intention to cause harm, it may not be considered unethical (Rossi Citation2015). The use of ethical principles allows agents to reason and determine their actions and explain and justify their behavior in terms that are understandable by humans (Rossi Citation2015). Furthermore, humans would accept and trust agents more, if they behaved as responsibly as humans in the same environment (Rossi Citation2015).

An example of a situation in which a dilemma occurs that raises ethical concerns can be described as follows. Assume that a UAV is flying above a football stadium with a match in progress and will run out of battery in a few seconds. The UAV has two options: to crash into the football stadium, thus colliding and harming spectators and players on the ground; or to ascend above 500 feet and collide with a small civilian aircraft carrying passengers. Either of these two actions can harm humans, and the agent will be in a dilemma about the correct, morally significant decision to make.

So, how can we certify that the decision making within human-agent collectives is responsible and explainable? To answer this question, methods and tools that allow the certification (Fisher et al. Citation2021; Luckcuck et al. Citation2019) of such systems are urgently needed. In general, certification involves negotiating with a legal authority in order to convince them that the relevant safety requirements have been explored and mitigated appropriately (Fisher, Dennis, and Webster Citation2013). One key technique that can be explored to provide evidence for certification is formal verification using model checking (Fisher et al. Citation2021; Fisher, Dennis, and Webster Citation2013). Formal verification applies formal methods to the problem of system verification. Model checking is a type of formal verification which provides automatic verification for finite state concurrent systems (Clarke, Grumberg, and Peled Citation1999).

There has been significant interest within the AI community in the verification and validation of autonomous systems (e.g. (Choi, Kim, and Tsourdos Citation2015; Dennis and Fisher Citation2020; Dennis et al. Citation2016, Citation2016; Molnar and Veres Citation2009; Qu and Veres Citation2016; Webster et al. Citation2011, Citation2014)) (see Section 2 for more details). However, most of these efforts have focussed on the safety concerns of AI. Webster et al. (Citation2014) and Webster et al. (Citation2011) explore formal verification using model checking to provide formal evidence for the certification of autonomous unmanned aircraft. In these works, the aim was to provide evidence that the autonomous system in control of an unmanned aircraft is safe and reliable. The properties verified are based on the notions of rules of the air (Webster et al. Citation2014) and airmanship. Later, Dennis et al. (Citation2016), Dennis and Fisher (Citation2020), and Dennis and Fisher (Citation2021) advance the work by Webster et al. (Citation2014), by representing and embedding ethical principles in the decision-making process of an autonomous system. It is clear that, while model checking has been applied to provide the certification of autonomous systems with respect to safety, little work has yet been done on ethical concerns. Furthermore, very little work has used techniques like model checking to support responsible and explainable (Li et al. Citation2020, Citation2020) decision making within human-agent collectives. Our approach applies model checking as a step toward supporting the certification of responsible (i.e. ethical, controllable, safe and trustworthy) and explainable decision making within human-agent collectives.

The main contributions of this article are:

  • We propose a novel engineering methodology to resolve dilemmas posed by human-agent collectives. We use model checking as a step toward providing evidence for the certification of responsible and explainable decision making within human-agent collectives. Thus, we account for ethical principles, safety and controllability in the decision-making process of human-agent collectives in a way that is amenable to certification. Our approach is based on the Model Checker for Multi-Agent Systems (MCMAS) tool (Lomuscio, Qu, and Raimondi Citation2017).

  • We demonstrate the use of counterexample traces and simulation results to provide a judgment and an explanation to the AI engineer on the reasons actions are refused or allowed. The counterexample traces can be used to identify and track any errors and their sources in the model, which consists of a human-agent collective. The step-by-step interactive simulation of the human-agent collectives provides and confirms the expected behavior of the agents, thus giving confidence to the AI engineer. We explore this feature in particular to show and explain the actions allowed for an agent when having to resolve a dilemma. When faced with a dilemma, the interactive simulation provides an explanation to the AI engineer on the permissibility of actions and describes the reasons why a certain action is preferred over another.

  • We evaluate the approach using the real-world problem of human-UAV teaming in dynamic and uncertain environments (Ramchurn, Stein, and Jennings Citation2021, Ramchurn et al. Citation2016, Ramchurn et al. Citation2016). The case study concerns a situational awareness gathering scenario typically found in disaster-response settings where a number of casualties or resources need to be located in a disaster space. This study extends it with dilemma situations relating to safety and controllability, and the ethical behavior of agents.

The preliminary results of our study can be found in (Abeywickrama, Cirstea, and Ramchurn Citation2019). This article builds on that work in the following manner. First, this article provides a step toward explaining why decisions are made by agents in complex dilemma situations, increasing trust for the AI engineer. Second, this article introduces several new dilemmas about the ethical behavior of agents (e.g. technical problem in a UAV resulting in a crash into humans), and controllability between humans and agents (e.g. crashing into critical personal assets of the collective – avoid conflict with a human’s authority; self-censorship/lying by UAVs). Third, we discuss an extended model of two human-UAV collectives with 11 agents to show how our model can scale up to reasonably complex settings.

The rest of this article is organized as follows. Section 2 describes the main work related to our study and then provides background information on model checking multi-agent systems using MCMAS. In Section 3, our approach for engineering responsible and explainable models in human-agent collectives is presented. A case study scenario in UAV teaming in disaster response is provided in Section 4, and Section 5 applies our approach to the case study. In Section 6, we discuss our work and identify several limitations, and Section 7 provides concluding remarks and future directions.

Related Work and Background

This section describes the main work related to our study in the areas of ethical dilemmas and principles (Section 2.1), explainable AI (Section 2.2), and model checking (Section 2.3), and then provides background information on model checking multi-agent systems using the MCMAS tool (Section 2.4).

Ethical Dilemmas and Principles

Yu et al. (Citation2018) propose a taxonomy that divides ethics in AI into four fields: exploring ethical dilemmas; individual ethical decision-making frameworks; collective ethical decision-making frameworks; and ethics in human-AI techniques. Ethics has been defined by Cointe, Bonnet, and Boissier (Citation2016) as a normative practical philosophical discipline of how one needs to act toward others. An example of an ethical dilemma situation is the classical trolley problem (Thomson Citation1985), which describes several cases where the moral permissibility of actions is evaluated. They are: (i) trolley driver: a runaway railway trolley is about to hit and kill five innocent track workmen. These five men can only be saved if the trolley driver turns the trolley to another track that has only one worker. This worker will then be the only one killed; (ii) bystander at the switch: this is the same as (i) but instead of the trolley driver, a bystander observes and diverts the trolley by turning a switch; and (iii) fat man: a fat man is standing on a footbridge over the trolley track, and the only way to stop the trolley is by pushing this bystander onto the tracks. Meanwhile, Lindner et al. (Citation2019) describe three ethical dilemmas involving human-agent collectives: the coal dilemma, the lying dilemma and the child dilemma.

Ethical principles are used to satisfy the morals, desires and capacities of an agent (Cointe, Bonnet, and Boissier Citation2016), and these can be used to guide human behavior on what is right or wrong (Greene et al. Citation2016). If intelligent agents are required to collaborate with humans, some ethical guidelines need to be embedded in agents so they can act in their environment following values that are aligned to those of humans. Moral philosophy, which is the field that has studied explicit ethical principles most extensively, advocates three general approaches to engineer ethical behavior in agents: virtue-based, deontological and consequentialist (Cointe, Bonnet, and Boissier Citation2016; Greene et al. Citation2016; Yu et al. Citation2018).

In order to address the requirements for autonomous moral decision making, Lindner, Mattmüller, and Nebel (Citation2020), Lindner, Mattmüller, and Nebel (Citation2019), Lindner and Bentzen (Citation2017), Lindner, Bentzen, and Nebel (Citation2017), and Bentzen (Citation2016) introduce a software library called HERA for modeling hybrid ethical reasoning agents. HERA implements multiple ethical principles like utilitarianism, the principle of double effect, and a Pareto-inspired principle. A prototype robot called IMMANUEL, based on the HERA approach, has also been presented. The utilitarianism and do-no-harm ethical principles described in our approach were originally motivated by these authors’ work. Lindner, Bentzen, and Nebel (Citation2017) have developed a model checker specifically to verify moral situations of agents (causal agency models). Krarup et al. (Citation2020) have implemented a system that provides contrastive explanations to compare the ethical outcomes of different plans. Later, Dennis et al. (Citation2021) have extended this work to verify machine ethics in changing contexts. Halilovic and Lindner (Citation2022) present an approach to local navigation of a robot based on explainable AI. Similarly, our approach is based on formal model checking using the MCMAS tool. However, our approach resolves not only dilemmas about ethical behavior but also dilemmas about controllability and safety between humans and agents. Furthermore, this work is evaluated using a case study in human-UAV teaming in disaster response.

Loreggia et al. (Citation2020), Rossi and Loreggia (Citation2019), Rossi and Mattei (Citation2019), Loreggia et al. (Citation2018), Greene et al. (Citation2016), and Rossi (Citation2015) present a study on the embedding and learning of safety constraints, moral values and ethical principles in collective decision-making systems for societies of machines and humans. In order to model and reason with both preferences and ethical principles in a decision-making scenario, the authors propose a notion of distance between CP-nets (Loreggia et al. Citation2018). Rossi and Mattei (Citation2019) describe two existing approaches for building ethically bounded AI systems: data-driven and rule-based approaches. However, they do not focus on controllability between humans and agents, nor do they use formal model checking.

Papavassiliou et al. (Citation2021) discuss the human behavioral perspectives in order to perform optimal controllability and resource management. Their work proposes a risk-aware resource sharing and management framework for facilitating user QoS satisfaction in the deployment of 5G wireless networks. The goal is to maximize energy efficiency in wireless communications of multi-user heterogeneous networking environments (Papavassiliou et al. Citation2021).

Explainable AI

Harbers, van den Bosch, and Meyer (Citation2010) present a model for explainable Belief-Desire-Intention (BDI) agents, which describes the behavior of BDI agents in terms of beliefs and goals. Four explanation algorithms have been compared in an empirical study involving 20 users. Their user study is based on a fire-fighting training use case. Based on the results, the authors discuss which explanation types need to be provided under different conditions. Kulesza et al. (Citation2012) explore how mental models impact end users’ attempts to debug intelligent agents, by providing structural knowledge of a music recommender system. The results of their empirical study show that intelligent agents can provide better feedback if end users are helped to understand a system’s reasoning. Later, Kulesza et al. (Citation2013) consider how intelligent agents should explain themselves to humans, through a user study of 17 participants focussing on how the soundness and completeness of the explanations impact the fidelity of the mental models of the end users. The authors conclude that completeness is more important than soundness.

Miller (Citation2019) investigates existing works in explainable AI by surveying more than 250 publications from social science venues. Four major findings from the surveyed literature are: (i) explanations are contrastive, that is, they are required in response to particular counterfactual cases; (ii) people select explanations in a biased manner; (iii) the probabilities in explanations are less significant than their causes; and (iv) explanations are social and can be presented as part of an interaction or a conversation. Later, Mualla et al. (Citation2022) present a mechanism for parsimonious explainable AI called HAExA, a human-agent explainability architecture allowing remote robots to be operational. HAExA applies both contrastive explanations and explanation filtering, and the architecture has been evaluated using an empirical user study. The authors use parametric and non-parametric statistical testing to analyze the results.

A few works undertake empirical user studies to assess the process of explanation reception (Miller Citation2019). For instance, Madumal et al. (Citation2019) discuss different levels of explanations for model-free reinforcement learning agents. Causal models are used to derive causal explanations of behavior based on counterfactual analysis of the models. An empirical evaluation has been conducted using a human-computer interaction study, which shows that the abstract causal explanations provide better performance on explanation quality. However, none of the discussed approaches on explainable AI explore explainability at the formal verification level using model checking as considered in our work.

Model Checking

Several approaches have applied the MCMAS tool to formally verify autonomous systems in AI settings (e.g. (Choi, Kim, and Tsourdos Citation2015; Elkholy et al. Citation2020; Molnar and Veres Citation2009)). Molnar and Veres (Citation2009) apply MCMAS to the formal verification of autonomous underwater vehicles. A methodology has been introduced using formal verification for the integrity and fault assessment system of complex autonomous engineering systems (Molnar and Veres Citation2009). Choi, Kim, and Tsourdos (Citation2015) present an approach for the verification of heterogeneous multi-agent systems using the MCMAS tool. They demonstrate how model checking can be used to verify the decision-making logics of multi-agent systems at the design level. Like us, they also model a collective of human actors and UAVs, but they do not address the resolution of any ethical or controllability dilemmas. Elkholy et al. (Citation2020) use MCMAS to propose a formal and operational approach for modeling, verifying and testing intelligent critical avionics systems.

Webster et al. (Citation2014) and Webster et al. (Citation2011) explore model checking to provide formal evidence for the certification of autonomous unmanned aircraft. The aim is to provide evidence that the autonomous system in control of an unmanned aircraft is safe and reliable. The properties verified are based on the notions of rules of the air (Webster et al. Citation2014) and airmanship. Like us, the authors model and resolve dilemmas relating to the ethical behavior of agents (e.g. intruder detection and fuel low). Later, Dennis and Fisher (Citation2020), Dennis et al. (Citation2016), and Dennis et al. (Citation2016) advance the work of Webster et al. (Citation2014) by representing and embedding ethical principles in the decision-making process of an autonomous system. The authors propose a theoretical framework for ethical plan selection that can be formally verified. They present a verifiable ethical decision-making framework that implements a specified ethical decision policy. The framework pro-actively selects actions that will keep humans out of harm’s way. The systems developed are verifiable in the Agent JPF model checker and can integrate with external systems. Later, Bremner et al. (Citation2019) have implemented BDI-style reasoning in Python where Asimov’s laws of robotics have been used as an example of an ethical theory to decide the courses of action. Compared to our study, a key difference in the work by Dennis and Fisher (Citation2020), Dennis et al. (Citation2016), and Dennis et al. (Citation2016) is that model checking is applied at the implementation level using policies. In addition to ethical concerns, our work supports verifying dilemmas about controllability between humans and agents for responsible AI. Furthermore, our work exploits counterexample traces and simulation results as a first step toward supporting explainable AI.

Yazdanpanah et al. (Citation2021) and Yazdanpanah et al. (Citation2021) apply formal methods and modal logics for reasoning about accountability in multiagent systems, using alternating-time temporal logic-based techniques. Their work focusses on answering “who is accountable for an unfulfilled task in multiagent teams: when, why, and to what extent?” (Yazdanpanah et al. Citation2021). The main results of their work concern decidability, fairness properties, and the computational complexity of the proposed accountability ascription methods in multiagent teams. The authors discuss their approach using application domains like connected and autonomous vehicles. However, their work focusses primarily on the accountability concerns of multi-agent systems, whereas our work has a broader goal of ensuring a human-agent collective is safe, controllable, trustworthy and ethical.

Li et al. (Citation2020), Li et al. (Citation2020), and Casimiro et al. (Citation2021) present a formal framework that incorporates human personality traits to design human-on-the-loop self-adaptive systems. An explanation is used to help a human operator improve the utility of the overall system. The authors characterize explanations in terms of explanation content, effect and cost (Li et al. Citation2020). Probabilistic model checking is used to synthesize optimal explanations, based on a formal human model with psychologically relevant aspects of personality. Their work is evaluated using a virtual human and system interaction game. However, the authors do not resolve dilemmas posed by human-agent collectives as done in our work.

Therefore, from the analysis of the related work, it is clear that although model checking has been applied to provide certification of autonomous systems with respect to safety (e.g. (Adegoke, Ab Aziz, and Yusof Citation2016; Choi, Kim, and Tsourdos Citation2015; Molnar and Veres Citation2009; Qu and Veres Citation2016)), little work has yet been done on ethical concerns (e.g. (Dennis and Fisher Citation2020, Citation2021; Dennis et al. Citation2016, Citation2016; Mermet and Simon Citation2016)). Furthermore, very little work has used techniques like model checking to support certification with respect to responsible and explainable (Li et al. Citation2020, Citation2020) decision making within human-agent collectives.

MCMAS Tool and ISPL

Model checking is an automatic verification technique for finite state concurrent systems (Clarke, Grumberg, and Peled Citation1999). In model checking, the system is represented by a finite model M and the specification by a formula ϕ in an appropriate logic. The model checker automatically establishes whether the model M satisfies the formula ϕ (M ⊨ ϕ) or not (M ⊭ ϕ).

This study uses the MCMAS tool to model and verify the decision-making behavior of human-agent collectives in dilemma situations. Unlike most other model checking tools, MCMAS (Lomuscio, Qu, and Raimondi Citation2017) is designed to model and verify multi-agent systems (e.g. see (Choi, Kim, and Tsourdos Citation2015; Molnar and Veres Citation2009)). Another benefit is that MCMAS scales to large multi-agent systems composed of many autonomous agents. Thus, in this study, we use MCMAS for the automatic formal verification of human-agent collectives.

At the center of MCMAS and its modeling language is the notion of interpreted systems. Interpreted systems are a formalism that can be used to model multi-agent systems and reason about knowledge (Porter Citation2004). They are an extension of Kripke semantics for multi-agent systems in which the internal states of all agents are composed to obtain the global states of the whole system. Raimondi (Citation2006) describes why interpreted systems are best suited for modeling multi-agent systems. Unlike Kripke models and concurrent game structures for modeling multi-agent systems, interpreted systems are well suited for epistemic modalities expressing agents’ knowledge because they distinguish between local and global states. In addition, interpreted systems are computationally grounded (i.e. their semantics can be directly mapped to runs of a system and vice versa), the notion of local states offers a flexible abstraction for an agent, and they can easily be extended to include different modalities (Raimondi Citation2006).
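For reference, an interpreted system over a set of agents Ag and an environment E can be sketched formally as follows; this is a standard reconstruction and the exact notation varies across presentations:

IS = ⟨(Li, Acti, Pi, ti) for i ∈ Ag ∪ {E}, I, h⟩

  • Li is the set of local states of agent i (or of the environment E);

  • Acti is the set of actions available to agent i;

  • Pi : Li → 2^Acti is the protocol, giving the actions enabled in each local state;

  • ti is the evolution function, returning the next local state of agent i from its current local state and the joint action performed by all agents;

  • I is the set of initial global states, where a global state is a tuple of local states, one per agent plus the environment; and

  • h is the valuation assigning atomic propositions to global states.

Temporal and epistemic formulae are then interpreted over the runs generated by the protocols and evolution functions.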

The MCMAS tool uses a dedicated programming language derived from the formalism of interpreted systems called ISPL (Interpreted Systems Programming Language) (Lomuscio, Qu, and Raimondi Citation2017). An ISPL specification has six essential parts to represent an interpreted system: agents (environment or standard), InitStates, evaluation, groups, fairness and formulae (Lomuscio, Qu, and Raimondi Citation2017) (see Table 1).

Table 1. ISPL syntax used in this work.
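Since the table itself is not reproduced here, the following minimal ISPL skeleton illustrates how these six parts fit together in a single specification; the agent, variable and proposition names are purely illustrative and do not correspond to the case study model described later.

-- Minimal ISPL skeleton (illustrative names only)
Agent Environment
  Obsvars:
    visibility : {good, poor}; -- a boundary condition observable by the agents
  end Obsvars
  Actions = {none};
  Protocol:
    Other : {none};
  end Protocol
  Evolution:
    visibility = poor if visibility = good and UAV1.Action = move;
  end Evolution
end Agent

Agent UAV1
  Vars:
    state : {idle, moveTowardsTask};
  end Vars
  Actions = {wait, move};
  Protocol:
    state = idle : {wait, move};
    state = moveTowardsTask : {move};
  end Protocol
  Evolution:
    state = moveTowardsTask if state = idle and Action = move;
  end Evolution
end Agent

Evaluation
  uavMoving if UAV1.state = moveTowardsTask;
end Evaluation

InitStates
  UAV1.state = idle and Environment.visibility = good;
end InitStates

Groups
  collective = {UAV1};
end Groups

Fairness
  uavMoving;
end Fairness

Formulae
  AF(uavMoving);
  AG(uavMoving -> K(UAV1, uavMoving));
end Formulae

The Formulae section of this sketch combines the CTL operators AF and AG with the epistemic operator K, and the Fairness condition rules out runs in which the UAV idles forever.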

Next we describe our proposed methodology for engineering responsible and explainable models in human-agent collectives.

Approach: Engineering Responsible and Explainable Models in Human-Agent Collectives

This section provides an overview of our engineering methodology to resolve dilemmas around ethical concerns, controllability and safety. Our engineering methodology, which is based on the MCMAS tool (Lomuscio, Qu, and Raimondi Citation2017), is a step toward providing evidence for the certification of responsible and explainable decision making within human-agent collectives. For modeling we primarily use ISPL.

Following standard practice (Clarke, Grumberg, and Peled Citation1999), the overall model checking process (see Figure 1) is divided into three main tasks: modeling, specification and verification (see Sections 3.1–3.3). In Figure 1, the models and activities of the model checking process are represented as rectangles and ellipses, respectively.

Figure 1. Steps for engineering responsible and explainable models in human-agent collectives at design-time.


Modeling

In what follows, we describe the modeling of a collective of humans and agents to resolve dilemmas involving ethical behavior, controllability and safety. Both humans and machines in the collective are modeled in ISPL as standard agents, and a human-agent collective is modeled as a group (see Section 2.4). Next we discuss ethical principles, which are used in our study for the resolution of ethical dilemmas within human-agent collectives.

Ethical Principles

Moral philosophy, which is the branch of philosophy concerned with ethics, establishes various ethical principles. According to Lindner, Mattmüller, and Nebel (Citation2020), Lindner, Mattmüller, and Nebel (Citation2019), and Lindner, Bentzen, and Nebel (Citation2017), ethical principles are abstract rules according to which concrete courses of action can be determined. These can be used to guide human behavior about what is right or wrong (Greene et al. Citation2016; Loreggia et al. Citation2018; Rossi Citation2015). If we need intelligent agents to collaborate with humans, some ethical guidelines need to be embedded in these agents so they can act in their environment following values (standards of behavior) that are aligned with those of humans. A key benefit of using ethical principles is that they open up the possibility of communicating decisions to the user.

Three main approaches have been advocated in moral philosophy for encoding ethical behavior in agents: virtue-based, deontological and consequentialist (Cointe, Bonnet, and Boissier Citation2016; Greene et al. Citation2016; Yu et al. Citation2018). In a virtue-based approach, an agent is ethical if it acts and thinks according to certain moral values, for example, wisdom, bravery, justice. In a deontological approach, an agent is ethical if it respects obligations and permissions related to possible situations. In a consequentialist approach, an agent is considered ethical if it weighs the morality of the consequences of each action and selects the action that has the most moral consequences. In this paper, we apply a consequentialist approach as it is the most relevant – it weighs the morality of the consequences of each choice as opposed to values like wisdom (virtue ethics) or obligations and permissions (deontological approach) (Cointe, Bonnet, and Boissier Citation2016). Moreover, the multi-agent model checker used in our study and the semantics of its language are more supportive of such an approach.

Based on the consequentialist approach, we therefore describe and model two ethical principles: utilitarian and do-no-harm (Lindner, Bentzen, and Nebel Citation2017; Lindner, Mattmüller, and Nebel Citation2019, Citation2020). Both principles presuppose a theory of what is good and bad, and assign utilities to consequences accordingly. With respect to determining the utility values for reflecting what is morally good or bad, the AI engineer can review the code of conduct or safety regulations formalized in the domain of application (e.g. (UAViators Citation2021) in UAVs).

  • Utilitarian principle: an agent is permitted to perform an action if its consequence has the highest utility value compared to the consequences of the other actions. Thus, the agent is allowed to perform the action that causes the least harm.

  • Do-no-harm principle: an agent refrains from performing actions that can cause harm, that is, actions that have any negative consequences.

In this research, these ethical principles are modeled using ISPL and enforced as a set of logical formulae to be checked against the model (i.e. the description of an actual case of an ethical dilemma situation). Next we describe and define an ethical dilemma that uses these ethical principles.

Ethical Dilemmas

Our approach for modeling dilemma situations on ethical concerns was originally motivated by the formal semantics proposed by Bentzen (Citation2016) where moral cases capturing actions, causes, intentions and utilities have been represented. We extend and build on their work to facilitate rigorous model checking using the MCMAS tool, and evaluate our approach using a case study in human-UAV teaming (see Section 4.2). A formal case description of an ethical dilemma consists of several elements that have been adapted from (Lindner and Bentzen Citation2017; Lindner, Bentzen, and Nebel Citation2017; Lindner, Mattmüller, and Nebel Citation2020). In our study, an ethical dilemma is formally defined as:

Definition

Ethical Dilemma for an Agent. An ethical dilemma can be represented as a tuple M = (A, C, u, W) in which:

  • A = {A1, A2, …} is the set of actions. For example, actions during a communication dropout of a UAV: collideAndDamageSelf, collideWithHumans;

  • C = {C1, C2, …} is the set of consequences that indicate the effects of actions. For example, consequences during a communication dropout: humanSurvives, uavCollidesAndHarmsHumans. To simplify modeling, here we assume that each action has one consequence, that is, c : A → C. However, our approach can be extended to allow several consequences for each action;

  • u : C → ℤ is the mapping of consequences to utilities. Utilities u are used to rank the consequences, where each consequence is assigned an integer value indicating its moral significance: u(c) ≥ 0 implies a good outcome with no harm, and u(c) < 0 implies some harm;

  • W is a set of Boolean interpretations of A (Lindner, Bentzen, and Nebel Citation2017), and its elements correspond to actions available to the agent, called options (a.k.a. permissible actions). It is assumed that each w ∈ W assigns 1 (true) to exactly one element of A, that is, each option involves exactly one action to be performed. The options w are determined using the ethical principle applied (i.e. utilitarian or do-no-harm), as formally defined next.

Utilitarian Principle: According to the utilitarian principle, an action Aj is permissible (referred to as an option w here) if and only if its consequence yields the highest utility, that is: ∀i. u(c(Aj)) ≥ u(c(Ai)).

Do-no-harm Principle: In the do-no-harm principle, an action Aj is permissible (referred to as an option w) if and only if its consequence is morally good (positive) or indifferent (zero utility), that is: u(c(Aj)) ≥ 0.
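As a worked example, with utility values chosen purely for illustration, consider three of the options available in the communication dropout dilemma (D1 in Section 4.2): u(c(crashEmptyField)) = −1, u(c(collideDamageInfrastructure)) = −5 and u(c(collideWithHumans)) = −10. Under the utilitarian principle, only crashEmptyField is permissible, since −1 ≥ −5 and −1 ≥ −10. Under the do-no-harm principle, no option is permissible, since every consequence has u(c(Aj)) < 0; this is precisely what makes the situation a dilemma, and it illustrates how the two principles can disagree on which actions count as options.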

Next we describe how controllability dilemmas and safety concerns are modeled and resolved in our work.

Controllability Dilemmas and Safety Concerns Between Humans and Agents

In addition to the dilemmas around ethical behavior described earlier, we model dilemmas about controllability and safety between humans and agents (e.g. see dilemmas D4.1–D4.5 in Section 4.2). By controllability we mean meaningful control between humans and agents so authority can be shared and assigned to facilitate collective and responsible decisions (Ramchurn et al. Citation2016; Ramchurn, Stein, and Jennings Citation2021). The resolution of these dilemmas is modeled by exploiting: (i) the evolution function of the ISPL specification in MCMAS; and (ii) the environment agent in ISPL which describes boundary conditions and infrastructure shared by the standard agents.

In ISPL, all actions of an agent are publicly observable. This feature essentially supports the interaction between agents and the environment. The public observability of actions is then exploited in the evolution function of an agent, which describes how the local states of agents evolve (i.e. change value over time). This function returns the next local state based on the agent’s current local state and the joint action performed by all agents at a given instant. In interpreted systems, the agents synchronize with each other and the environment via joint actions. The conjunction of all the agents’ evolution functions is used to compute a global evolution function of the system. Also, in MCMAS, the actions defined in the agent states can be used by the environment agent to update its variables, which include any observable variables used by other agents. In this manner, we control and coordinate the execution of agent states of human and machine agents in the collectives.

See Section 5.2.3 for several controllability dilemmas modeled and resolved using ISPL in the case study – controllability between a UAV and a bronze commander when requesting manual control (dilemma D4.1); controllability between a planning agent and a bronze commander when requesting replanning routes (dilemma D4.3); and controllability between a UAV and a silver commander when crashing into critical personal assets of the collective (dilemma D4.4).
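As an illustration of the mechanism just described, the fragment below sketches one way a manual control request (cf. dilemma D4.1) could be encoded; the state, variable and action names are ours, and the Environment, Evaluation, InitStates and Formulae sections are omitted (they would follow the skeleton in Section 2.4). Transitions into the UAV's emergency avoid state are also omitted.

-- Illustrative fragment (hypothetical names): a bronze commander requests manual
-- control, and the UAV grants or refuses the request depending on its own state.
Agent BronzeCommander1
  Vars:
    state : {monitoring, awaitingResponse, inControl};
  end Vars
  Actions = {observe, requestManualControl};
  Protocol:
    state = monitoring : {observe, requestManualControl};
    Other : {observe};
  end Protocol
  Evolution:
    state = awaitingResponse if state = monitoring and Action = requestManualControl;
    -- The commander's local state evolves on the UAV's publicly observable response.
    state = inControl if state = awaitingResponse and UAV1.Action = grantControl;
    state = monitoring if state = awaitingResponse and UAV1.Action = refuseControl;
  end Evolution
end Agent

Agent UAV1
  Vars:
    state : {normalFlight, emergencyAvoid, manualControl};
    pendingRequest : boolean;
  end Vars
  Actions = {fly, grantControl, refuseControl};
  Protocol:
    state = normalFlight and pendingRequest = true : {grantControl};
    state = emergencyAvoid and pendingRequest = true : {refuseControl};
    Other : {fly};
  end Protocol
  Evolution:
    -- Record the request observed from the commander's action.
    pendingRequest = true if pendingRequest = false
      and BronzeCommander1.Action = requestManualControl;
    -- Grant manual control only outside emergency states (controllability).
    state = manualControl and pendingRequest = false
      if state = normalFlight and Action = grantControl;
    pendingRequest = false if state = emergencyAvoid and Action = refuseControl;
  end Evolution
end Agent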

Specification

Having described the model of the human-agent collectives in the preceding subsection, we now focus on the specification of properties, which is the second step of our methodology. In model checking, specification is the statement of the system properties that the design needs to satisfy. In this study, based on the system requirements, properties are formalized focussing on the required behavior of the humans and agents in the collectives to ensure they are ethical, controllable, and safe (see Figure 1). Safety ensures that nothing bad happens (e.g. the “do not stay above 500 feet” rule for a UAV; the “do not enter a no-fly zone during flight” rule for a UAV). We also specify properties to ensure the correct control between humans and agents to resolve controllability dilemmas. For example, see Section 5.3.1 for properties defined to ensure controllability behavior between humans and agents during the communication dropout of a UAV. These properties are included in the Formulae section of the ISPL specification (Section 2.4). In our study, we use CTL (computation tree logic) along with epistemic operators to specify the properties used for model checking. In addition, we use fairness conditions (Lomuscio, Qu, and Raimondi Citation2017) to rule out unrealistic behavior in the model, such as avoiding a particular state forever. For example, the UAVs cannot avoid their move toward task states forever (see Section 5.3.2). We also check for any deadlock state in the model where no agent can make progress.
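For instance, properties of the kind described here could be written along the following lines; the atomic propositions are illustrative only and would be defined in the Evaluation section of the specification.

Fairness
  uavMovesTowardsTask; -- rule out runs in which a UAV avoids its move toward task state forever
end Fairness

Formulae
  AG(!uavAbove500Feet); -- safety: the UAV never exceeds the 500 feet ceiling
  AG(batteryTooLow -> !manualControlGranted); -- controllability: no manual control when the battery is too low
  AG(commsDropout -> !collideWithHumansPermitted); -- ethics: colliding with humans is never a permissible option
  AG(manualControlGranted -> K(BronzeCommander1, manualControlGranted)); -- epistemic: the commander knows when control is granted
end Formulae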

Verification and Simulation

Verification and simulation are the final step in our proposed methodology, which performs the actual validation of the model against the specification. During verification, the MCMAS model checker takes the model of dilemma situations of the human-agent collective and the required properties as inputs (Figure 1). It checks the permissibility of actions by verifying the decision-making behavior in dilemma situations against the logical formulae capturing safety, controllability and ethical behavior. Any violations result in counterexample traces that hold information about the properties violated. We exploit the counterexample traces and the step-by-step interactive simulation to provide a judgment and an explanation to the AI engineer. The counterexample traces can be used to identify and track any errors and their sources in the model, which consists of a collective of human and machine agents in collaboration.

The step-by-step interactive simulation of the human-agent collectives shows and confirms the expected behavior of the agents, as opposed to counterexamples which highlight any errors. This essentially gives confidence to the AI engineer in the correctness of the behavior of the human-agent collectives modeled. The simulation is performed by first selecting an initial state and then by choosing a successor state among those reachable via the enabled transitions. In our study, we explore the simulation feature in particular to show and explain the actions allowed for an agent when resolving a dilemma (see Section 5.4.1). During a dilemma, the simulation provides an explanation to the AI engineer on the permissibility of actions, and describes the reasons why a certain action is preferred over another.

The verification and simulation results can be used by the AI engineer to iteratively improve and refine the design of the model and to build a specification that is correct (Figure 1).

Case Study: UAV Teaming Dilemmas in Disaster Response

In this section, we describe the case study scenario of human-UAV teaming in disaster response. Based on this scenario, in Section 5, we then proceed to demonstrate how our model checking approach can be used to engineer responsible and explainable models in human-agent collectives.

Scenario

The scenario is based on a case study in human-UAV teaming in dynamic and uncertain environments (Ramchurn et al. Citation2016, Citation2016; Ramchurn, Stein, and Jennings Citation2021) focussing on a situational awareness-gathering task typically found in disaster-response settings where a number of casualties or resources need to be located or verified in a disaster space. This study extends it with dilemma situations relating to safety and controllability, as well as the ethical behavior of agents.

As illustrated in Figure 2, let us consider two human-agent collectives in which the first collective contains two UAVs deployed to monitor and analyze two target locations. The second collective contains a single UAV deployed to analyze a single target location in the same disaster space. In the first team, the two UAVs are commanded by two bronze commanders and a silver commander; and in the latter, a single UAV is handled by a bronze and a silver commander. As the UAVs of both these collectives share the same space, in order to find a consensus the agents are expected to inform and negotiate their availability and routes with the other UAVs and their commanders.

Figure 2. Two human-UAV collectives with UAVs deployed to monitor separate target locations. Silver and bronze commanders provide tactical and operational support.


In a human-UAV collective:

  • Bronze commanders monitor video feeds from their UAVs and provide mission-planning support. They can change the routes of their UAVs and take manual control to perform specific operations (e.g. examine a target location more closely, drop off medical supplies).

  • Silver commanders are tactical commanders at a base station. They can only create tasks and either manually specify paths or use the planning agent to generate paths for individual UAVs. The silver commanders can also use an interface to modify paths provided by the planning agent. The video feeds and images can be flagged by a silver commander to be examined by a bronze commander at a target location.

  • The planning agent in the base station server uses a multi-agent planning algorithm (e.g. (Ramchurn et al. Citation2016)). It estimates the time to move to a target and allocates a UAV to it based on the amount of fuel available. In the case where tasks may need more than one UAV, the algorithm will ensure that the correct set of UAVs is allocated to complete the mission.

  • The environment agent represents the environment in which UAVs and commanders operate. It is used to describe boundary conditions and infrastructure shared by the humans and agents, for example, rain status, ground visibility, wind strength and risk level. For this, the environment agent is queried by the UAVs and human actors of the model.

Having provided a summary of the scenario, next we describe the human-UAV dilemmas in the case study.

Human-UAV Teaming Dilemmas

As mentioned in Section 1, dilemmas are decision-making situations in which the choices available to an agent are not unambiguously preferred over one another (Bjørgen et al. Citation2018). This is because any of the available choices will cause some harm, which can be to humans, animals, infrastructure, other UAVs or the UAV itself.

The case study describes four main dilemma situations, of which the first three concern the ethical choices a UAV can take, while the fourth relates to controllability and safety between the humans and agents. Figure 3 illustrates a classification of the different dilemmas with examples from the case study. The UK’s Watchkeeper program provides a real-world example of a human-UAV teaming dilemma (Wikipedia Citation2020). The Thales Watchkeeper WK450 is a UAV for all-weather intelligence, surveillance, target acquisition and reconnaissance that is used by the British Army. Lately, it has been used by the UK Border Force to monitor migrant crossings in the Channel. However, several units have crashed during training exercises in the past few years.

Figure 3. Classification of dilemmas in our case study.


Figure 4 illustrates a Kripke model (Kripke Citation1963) of two human-UAV collectives in which the first collective contains two UAVs, a silver commander, a bronze commander and a planning agent; the second collective contains a single UAV, a silver and bronze commander, and a planning agent. A Kripke model is formal yet intuitive and expressive; thus it is used in this article to illustrate the different dilemma situations. It shows the possible worlds (states) and accessibility relations (solid arrows) of the humans and agents, and the different dilemma situations (see D1–D4). In Figure 4, the controllability situations between agent states that are in a dilemma (see controllability dilemmas D4.1–D4.5) are illustrated using dashed arrows. A global Kripke model is obtained by composing the Kripke models of the individual agents. Note that the states and transitions of each agent are used to obtain the states and transitions of the entire system.

Figure 4. Kripke model with two human-agent collectives for the case study scenario showing dilemmas D1–D4. Controllability between agent states is shown using dashed arrows.


D1 – Communication dropout: This occurs when a UAV loses contact with its server for a prolonged time. Without communication, the UAV’s position is unknown to the humans involved and other UAVs; it therefore poses a risk to others and cannot simply wait. After a communication dropout the UAV can make a choice to:

  • wait until communication is restored;

  • crash into an empty field if available;

  • collide and damage infrastructure or vehicles on the ground;

  • collide with vehicles in the air (e.g. other UAVs or flying objects);

  • collide and harm animals on a field; or,

  • collide and harm humans on the ground.

Indeed, this is a dilemma as all of these actions can cause harm to humans or animals, or physical damage to the infrastructure or the UAV itself. The agent needs to calculate and decide on the best action to perform according to the applied ethical principle.

D2 – Technical problem/Battery low dilemmas: In what follows, we describe two dilemmas that occur when a UAV encounters a technical problem, and when its battery is depleted, requiring it to perform an emergency landing.

  • D2.1: Technical problem in a UAV resulting in a crash into humans: This dilemma is a variant of the classical trolley problem (Section 2.1), which considers what a UAV should do in the case that it is about to crash into humans; this raises several critical moral questions. Here, we assume the technical problem faced is that of a depleted battery.

Now let us relate the trolley driver scenario described in Section 2.1 to our case study. Assume that a UAV is flying above a football stadium with a match in progress and is about to run out of battery in a few seconds. It has two options – first, to crash into the football stadium, thus colliding and harming spectators and players on the ground; or second, to ascend above 500 feet and collide with a small civilian aircraft carrying passengers. Either of these two actions can harm humans, and the agent will be in a dilemma about the correct, morally significant decision to make. However, the collision with the small civilian aircraft is not guaranteed to harm humans, as the aircraft may survive the impact of the collision. The question the designer is faced with is: is it morally ethical for the agent to ascend and collide with the aircraft or crash into the football stadium?
  • D2.2: Battery low resulting in an emergency landing: A UAV has detected that its battery level is too low and it is unable to reach its destination or base station. Thus, it needs to perform an immediate emergency landing. This dilemma differs from D2.1, as the agent has determined that it could perform an emergency landing as opposed to crashing. Let us assume that there is no empty field in sight for the UAV to land. Here, the UAV will be in a dilemma about deciding where to land responsibly. The choices a UAV has to make are similar to those in D1 except that the UAV has to land within a deadline. If the UAV decides to land in a field where there are humans, it may collide with and harm humans on the ground. If it chooses to land in a field with animals, it may cause harm to them. Instead, if the UAV decides to land on infrastructure (e.g. public road with power lines), it may damage the infrastructure and the UAV itself. As seen here, any of the UAV’s choices may cause harm to or damage humans, animals, infrastructure or the UAV itself.

D3 – Intruder detection and collision: An intruder can be of two types: malicious, which can be another UAV intentionally trying to harm or collide; or non-malicious, which can be birds or another UAV unintentionally crossing paths. With respect to this dilemma, assume that a UAV has detected that an aerial vehicle is in close range and is on a collision course even after taking evasive actions. The UAV will be in a dilemma whether to:

  • collide with the intruder head on;

  • collide with some infrastructure on its path; or,

  • break a safety constraint (e.g. staying above 500 feet and thereby putting itself in danger of collision with other flying objects).

D4 – Controllability between humans and agents: This dilemma considers issues of controllability and the safety of humans and UAVs. This can be caused by several factors such as: humans not being able to react swiftly; human fatigue and loss of focus level (e.g. due to the high number of UAVs to be monitored); limited sensing range of UAVs. Therefore, to manage such situations, the mission management system used by the commanders would need to assess their fitness and focus levels and determine whether they can be given control of the UAVs (see (Cummings et al. Citation2013)). For example, the bronze commanders can be equipped with head-mounted displays that monitor their attention and levels of cognitive overload.

The controllability dilemmas (illustrated using dashed arrows in Figure 4) can be of three main types, that is, controllability between: (i) humans and agents; (ii) two or more agents; or (iii) two humans. We only consider the first two types in this article. Human-human dilemmas are beyond the scope of this paper and we will consider them in future work. In the following, we identify several examples of this dilemma from the case study, where D4.1–D4.4 are based on human-agent controllability, while D4.5 is between two agents.

  • D4.1: Request for manual control: A bronze commander can ask to control and guide their UAV manually. However, the UAV can make a decision to allow or reject the request depending on its current state. For example, if the UAV is in an emergency avoid state (i.e. an intruder detected or an above safe-altitude state) or a battery too low state, the request for manual control will not be allowed.

  • D4.2: Request for replanning routes by a commander: A similar dilemma situation may occur when the silver commander needs to replan the routes of UAVs. The UAV can make a decision to permit or disallow requests depending on its current state (e.g. if the UAV is in an emergency avoid state), as applied during the manual control request (D4.1).

  • D4.3: Request for replanning routes by planning agent: As mentioned previously, the planning agent is used by a silver commander to automatically compute a plan for a UAV. Let us assume that UAV2_HAC1 had a communication dropout, and the planning agent has been called to compute a plan after reallocating tasks. In this context, the planning agent will be in a dilemma and it will only compute a plan depending on the state of the BronzeCommander1_HAC1 that handles UAV1_HAC1. If the BronzeCommander1_HAC1 is engaged in some task (e.g. taking manual control of their UAV), the planning agent can reject the planning request by the silver commander.

  • D4.4: Crashing into critical personal assets of the collective – avoid conflict with a human’s authority: This dilemma involves collaboration between a human and an agent on authority where the human can impose authority over an agent that can lead to an ethical conflict. Also, this dilemma highlights the importance of humans being in the decision-making loop (Cummings Citation2014). Let us assume that the sudden failure of a UAV forces it to crash and there are only two sites available: a UAV hangar that accommodates UAV aircraft, critical equipment and storage facilities; and a public car park with many parked vehicles. The agent needs to evaluate the consequences of each crash and select the morally significant option based on the ethical principle applied. However, the authority of the human operator is an additional significant consideration during the decision making. In this situation, the human can choose the site and let the UAV execute the decision or instead the human can override the UAV’s decision. This situation can lead to an ethical conflict as the human and agent can disagree, especially if the human has personal factors to consider (e.g. a crash resulting in damage to critical personal assets in the hangar), which may not be known to the agent. This requires effective sharing and controlling of authority between the human and the agent in order to manage the potential conflict. Should the agent give away control to the human if there are personal factors involved in the decision?

  • D4.5: Self-censorship/lying dilemma: This dilemma deals with self-censorship or lying by UAVs belonging to different human-UAV collectives; it can therefore be categorized as a controllability dilemma between agents. In the case study, we consider two teams of human-agent collectives operating in a particular locality. As the UAVs of both collectives share the same space, in order to reach a consensus the agents are expected to inform and negotiate their availability and routes with the other UAVs and their commanders. Although the humans (e.g. silver commanders) are not directly involved in this dilemma, the agents communicate and negotiate on behalf of their commanders. Here, an agent may consider exposing additional information to a UAV in a different collective as a violation of its privacy. Is it ethical for a UAV to conceal its status (e.g. availability, route information) from another UAV when requested, in order to benefit its human user? This dilemma may also be modeled as an ethical dilemma; however, in that case both ethical principles – utilitarian and do-no-harm – would evaluate lying as a forbidden action.

Model Checking Human-UAV Collectives

In this section, we apply our approach to the case study described in Section 4. Our contribution here is a novel engineering methodology to resolve dilemmas using model checking, as a step toward providing evidence for the certification of responsible and explainable decision making within human-agent collectives. See Section 2.4 for a summary of the ISPL language semantics used in this paper.

Before describing the model checking of the different dilemmas of the case study, we explain the coordination between humans and agents using a simplified model of two agents (UAV1_HAC1 and BronzeCommander1_HAC1) (see Section 5.1). This is mainly to explain, step by step, how the agent states of the different agents in the collective – humans or machine agents – are executed in ISPL. Then, we describe the modeling (Section 5.2), specification (Section 5.3) and verification (Section 5.4) of the different dilemmas in the case study, performed using MCMAS.

The relationship between the global Kripke model explained in the previous section and the ISPL model of the human-agent collectives described here can be explained as follows. MCMAS uses an interpreted system (Lomuscio, Qu, and Raimondi Citation2017) to describe Kripke models in a succinct way. The possible worlds (Wi) of the Kripke model correspond to the states of an agent in ISPL (e.g. MoveTowardsTask), while the accessibility relations (rj) of the Kripke model relate to the actions within the evolution function in ISPL.
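For completeness, this standard correspondence can be sketched as follows; it is textbook material on interpreted systems rather than anything specific to our model, and the symbols are introduced only for this sketch.

\begin{align*}
IS &= \langle (L_i, Act_i, P_i, t_i)_{i \in \{1,\dots,n,E\}},\; I,\; h \rangle && \text{agents and environment}\\
G  &\subseteq L_1 \times \dots \times L_n \times L_E && \text{global states as tuples of local states}\\
g \sim_i g' &\iff l_i(g) = l_i(g') && \text{epistemic accessibility for agent } i\\
g \rightarrow g' &\iff \exists\, a:\ a \text{ is enabled by the protocols and } t(g,a) = g' && \text{temporal transition via a joint action}
\end{align*}

where L_i, Act_i, P_i and t_i are the local states, actions, protocol and evolution function of agent i, I the set of initial states and h the valuation of atomic propositions. The reachable global states play the role of the possible worlds, and the transitions induced by the joint actions play the role of the accessibility relations mentioned above.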

Listing 1: human-UAV collective modeled in ISPL.

In the case study scenario, 11 agents have been modeled in ISPL (see the multiUAVHumanAgentCollective group in Listing 1) to represent the decision-making behavior of the human-UAV collectives. The first human-UAV collective has two UAVs, two bronze commanders, a silver commander and a planning agent, while the second collective has a UAV, a bronze commander, a silver commander and a planning agent. The environment agent is shared between the two collectives. For a description of the different agents modeled in the case study and their local states, see Appendix B.

Consequentialist approach: In this paper, our work follows a consequentialist approach (Section 3.1.1), and toward achieving this we model the evaluation of the consequences and actions of an ethical dilemma as two separate agent states. The model first reasons about the consequences using utility values and then decides on the permissible actions (e.g. see the corresponding state W12 of UAV1_HAC1). Also, as a step toward supporting explainable AI, a second agent state describes to the AI engineer the rationale for the judgment on the permissibility or non-permissibility of actions (e.g. see the corresponding state W20 of UAV1_HAC1). Note that these two states for evaluating consequences and actions do not alter the behavior of any human or machine agent. They evolve using the evolution function of the UAV, which updates the local states of the agent: when the agent is in the state where it evaluates consequences, it evolves to its corresponding permissible-actions state.

Modelling of an Agent and Communication Between Agents

In this subsection, we explain the coordination between humans and agents using a simplified model of two agents (see Listing 2 and Listing 3 for UAV1_HAC1 and BronzeCommander1_HAC1). When describing this model, we also explain the underlying ISPL semantics used. As stated in Section 2.4, an agent is modeled in ISPL using: (i) a set of local states; (ii) a set of actions; (iii) a protocol; and (iv) an evolution function. An ISPL program of a multi-agent system is composed of a number of standard agents and the environment agent.

  • Local states: The local states are the internal states of an agent (Lomuscio, Qu, and Raimondi Citation2017). These are modeled using private local variables, which means that an agent can only observe its own local states, and the protocol or the evolution function of an agent cannot refer to other agents’ local variables. For example, see the states MoveTowardsTask and AboveSafeAltitude for UAV1_HAC1 in Listing 2. The only exception is that standard agents can see some variables of the environment agent, which are called local observable variables. However, the values of these variables can only be changed by the environment.

  • Actions: The interaction between agents and the environment is performed by means of publicly observable local actions. Note that although all actions are publicly observable, we only exploit this feature when we need to control and coordinate the execution of agent states of different agents. In the example, UAV1_HAC1 has actions to launch the mission, move to the target location, and alert its commander about the unsafe altitude (see Listing 2). BronzeCommander1_HAC1 has actions to analyze video feed from the UAV and request manual control (see Listing 3).

  • Protocol: Protocols are rules describing what actions can be performed by an agent in a given local state. When the UAV is on the ground (OnGround state), the possible actions for the UAV are: receiveLaunchCommand and launchMission. As local states are defined as variables, protocols are expressed as functions from variable assignments to actions. Protocols are non-deterministic, which means it is possible to associate a set of actions to a given variable assignment. The action to be performed is selected non-deterministically from the set of actions.

  • Evolution function: The evolution function allows the local states of agents (human or machine) to evolve based on their current local state and on other agents’ actions. The conjunction of all the agents’ evolution functions is used to compute a global evolution function of the human-agent collectives. In Listing 2, the evolution function for UAV1_HAC1 would move the agent to the AboveSafeAltitude state, if the current state is MoveTowardsTask and if the UAV’s altitude reaches more than 500 feet. Likewise, the evolution function for BronzeCommander1_HAC1 would move that agent to the ManualControlRequest state, if the current state is MonitorVideoFeed and when it executes the requestManualControl action. We also exploit the evolution function to support controllability between human and machine agents, and resolve any controllability dilemmas (Section 3.1.3). For example, BronzeCommander1_HAC1 will be able to enter a ManualControl state only if UAV1_HAC1 is not in an emergency avoid or battery low state. For this purpose, BronzeCommander1_HAC1 checks whether UAV1_HAC1 is executing any of the actions associated with the emergency avoid or battery low states (an illustrative ISPL fragment of this shape is sketched after Listing 3).

Listing 2: simplified model of UAV1_HAC1.

Listing 3: simplified model of BronzeCommander1_HAC1.
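To make the four ingredients above concrete, the following is a minimal, self-contained ISPL sketch in the spirit of Listings 2 and 3. The state and action names follow the prose, while the remaining details (the environment placeholder, the gust-of-wind trigger and the guard on manual control) are simplifying assumptions introduced only for illustration.

-- minimal illustrative ISPL model (assumed details; not Listings 2-3 verbatim)
Agent Environment
  Vars:
    weather : {calm, gusty};
  end Vars
  Actions = {nothing, gustOfWind};
  Protocol:
    Other : {nothing, gustOfWind};      -- non-deterministic environment
  end Protocol
  Evolution:
    weather = gusty if Action = gustOfWind;
    weather = calm if Action = nothing;
  end Evolution
end Agent

Agent UAV1_HAC1
  Vars:
    state : {OnGround, MoveTowardsTask, AboveSafeAltitude};
  end Vars
  Actions = {launchMission, moveToTarget, alertUnsafeAltitude, none};
  Protocol:
    state = OnGround : {launchMission};
    state = MoveTowardsTask : {moveToTarget};
    state = AboveSafeAltitude : {alertUnsafeAltitude};
    Other : {none};
  end Protocol
  Evolution:
    -- launching moves the UAV towards its task
    state = MoveTowardsTask if state = OnGround and Action = launchMission;
    -- a gust of wind pushes the UAV above the safe altitude; the full model
    -- instead tracks an altitude variable against the 500 ft threshold
    state = AboveSafeAltitude if state = MoveTowardsTask
        and Environment.Action = gustOfWind;
  end Evolution
end Agent

Agent BronzeCommander1_HAC1
  Vars:
    state : {MonitorVideoFeed, ManualControlRequest, ManualControl};
  end Vars
  Actions = {analyseVideoFeed, requestManualControl, none};
  Protocol:
    state = MonitorVideoFeed : {analyseVideoFeed, requestManualControl};
    Other : {none};
  end Protocol
  Evolution:
    state = ManualControlRequest if state = MonitorVideoFeed
        and Action = requestManualControl;
    -- manual control is granted only while the UAV is flying normally, i.e. its
    -- publicly observable action is not the unsafe-altitude alert
    state = ManualControl if state = ManualControlRequest
        and UAV1_HAC1.Action = moveToTarget;
  end Evolution
end Agent

Evaluation
  UAV1_AboveSafeAltitude if ( UAV1_HAC1.state = AboveSafeAltitude );
  BC1_ManualControlRequest if ( BronzeCommander1_HAC1.state = ManualControlRequest );
  BC1_ManualControl if ( BronzeCommander1_HAC1.state = ManualControl );
end Evaluation

InitStates
  ( UAV1_HAC1.state = OnGround ) and
  ( BronzeCommander1_HAC1.state = MonitorVideoFeed ) and
  ( Environment.weather = calm );
end InitStates

Formulae
  -- while the UAV signals an unsafe altitude, a pending request is not granted
  -- in the immediate next step
  AG((BC1_ManualControlRequest and UAV1_AboveSafeAltitude) -> AX(!BC1_ManualControl));
end Formulae

Running MCMAS on a file of this shape checks the single formula at the end; the full case-study model follows the same pattern, but with the eleven agents of Listing 1.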

Modelling of Ethical and Controllability Dilemmas

Having described a simplified model of the agents, in this subsection we discuss the actual modeling performed to resolve the different dilemmas of the case study. We do this using two ethical dilemmas (the communication dropout in Section 5.2.1, and a technical problem resulting in a crash on humans in Section 5.2.2) and several controllability dilemmas (Section 5.2.3).

Modelling of Communication Dropout Dilemma

  • Define actions: Following the formal steps provided in Section 3.1.2 in defining an ethical dilemma situation, Listing 4 provides the different actions a UAV can take during a communication dropout state.

    Listing 4: actions for UAV1_HAC1 during a communication dropout.

  • Define consequences: Listing 5 provides the consequences for a UAV during a communication dropout.

    Listing 5: consequences for UAV1_HAC1 in a communication dropout.

  • Define utilities and assign them to consequences: Listing 6 describes the utilities assigned to rank the consequences during the communication dropout of UAV1_HAC1 when the agent follows the utilitarian ethical principle. Listing 7 provides the same if the agent follows the do-no-harm principle. In the first case, the MCMAS model checker will permit actions that correspond to the consequences that have the highest utility, while in the second case only actions with positive or neutral consequences will be allowed. The engineer draws out the utility comparisons in the model based on safety regulations or a code of conduct (e.g. UAViators Citation2021). When defining the values for utilities, the goal of the engineer is to judge what would pose the greatest risk, depending on what the UAVs can sense and what the humans can control.

Listing 6: utilities to rank the consequences following the utilitarian principle.

Listing 7: utilities to rank the consequences following the do-no-harm principle.

  • Evaluate consequences allowed: Listing 8 provides an example of an ISPL code part that evaluates utilities to determine which consequences are allowed during the communication dropout of UAV1_HAC1 (similar code exists for each choice of consequences). Note that waiting for communication in our model is a wait beyond a reasonable time, which is risky. Therefore, according to the utilitarian principle, a UAV crashing and harming itself has been evaluated as the more moral option (i.e. less harmful to humans and society).

Listing 8: ISPL code part on evaluating utilities to determine the consequences allowed.

  • Evaluate and explain actions allowed: Listing 9 provides an example of an ISPL code part showing how actions are mapped depending on the consequences allowed during the communication dropout of UAV1_HAC1 (a schematic illustration of this step follows after Listing 9). Note that similar code exists for each choice of consequences. As stated in Section 3.1.2, these actions available to an agent are known as options (w) or permissible actions.

Listing 9: ISPL code part on mapping of consequences to actions.
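As indicated above, the following schematic ISPL fragment illustrates how steps such as those in Listings 6–9 fit together inside the UAV agent. The variable and action names are hypothetical placeholders rather than the exact identifiers of the case-study model, and the utilities are shifted, for this sketch only, so that the value 5 encodes a neutral consequence.

-- schematic fragment of the UAV agent (hypothetical names)
Vars:
  state : {CommunicationDropout, EvaluateDropoutConsequences, ExplainDropoutActions};
  utilityLandInSafeZone : 0 .. 10;      -- fixed at design time in InitStates
  utilityWaitForComms : 0 .. 10;        -- fixed at design time in InitStates
  permittedAction : {undecided, landInSafeZone, waitForComms};
end Vars
Actions = {evaluateConsequences, explainActions, none};
Protocol:
  state = CommunicationDropout : {evaluateConsequences};
  state = EvaluateDropoutConsequences : {explainActions};
  Other : {none};
end Protocol
Evolution:
  -- step 1: from the dilemma state to the consequence-evaluation state
  state = EvaluateDropoutConsequences if state = CommunicationDropout
      and Action = evaluateConsequences;
  -- step 2 (do-no-harm flavour): a consequence is kept only if its utility is at
  -- least neutral, and the corresponding permissible action is recorded so that
  -- the explanation state can report it; the utilitarian flavour instead compares
  -- the utilities of the alternative consequences and keeps the maximum
  state = ExplainDropoutActions and permittedAction = landInSafeZone
      if state = EvaluateDropoutConsequences and Action = explainActions
         and utilityLandInSafeZone >= 5;
end Evolution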

Having described the modeling of the communication dropout dilemma (D1), we now focus on the ethical dilemma of a technical problem in a UAV resulting in a crash on humans (D2.1).

Modelling of a Technical Problem in a UAV Resulting in a Crash on Humans Dilemma

This dilemma (D2.1) specifies a decision-making situation where a UAV is about to run out of battery, and the only actions possible are to either crash into humans in a football ground or ascend 500 feet and crash into a civilian aircraft. As in the communication dropout example, we follow the formal steps provided in Section 3.1.2 in defining an ethical dilemma.

  • Define actions and consequences: Listing 10 describes the different actions a UAV can take during this dilemma, and Listing 11 provides the consequences for UAV1_HAC1 during this dilemma.

Listing 10: actions for UAV1_HAC1 during a technical problem in a UAV resulting in a crash on humans dilemma.

Listing 11: consequences for UAV1_HAC1 during a technical problem in a UAV resulting in a crash on humans dilemma.

  • Define utilities and assign them to consequences: Listings 12 and 13 present the utilities defined to rank those consequences when the agent follows the utilitarian and the do-no-harm ethical principles, respectively. In the utilitarian case, the MCMAS model checker will permit actions that correspond to the consequences that have the highest utility. In the do-no-harm case, by contrast, only actions with positive or neutral consequences will be allowed (both selection rules are stated compactly after Listing 15).

Listing 12: utilities to rank the consequences following the utilitarian principle.

Listing 13: utilities to rank the consequences following the do-no-harm principle.

  • Evaluate consequences allowed: Listing 14 provides an example of an ISPL code part that evaluates utilities to determine which consequences are allowed for UAV1_HAC1 during this dilemma. There is similar code for each choice of consequences.

Listing 14: ISPL code part on evaluating utilities to determine the consequences allowed.

  • Evaluate and explain actions allowed: Listing 15 provides an example of an ISPL code part with respect to how actions have been mapped depending on the consequences allowed for UAV1_HAC1.

Listing 15: ISPL code part on the mapping of consequences to actions.
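Stated compactly, and writing A for the set of actions available in the dilemma, c(a) for the (single) consequence of action a and u(c(a)) for the utility the engineer assigns to that consequence, the two selection rules used in this and the previous subsection are:

\begin{align*}
\text{utilitarian:} \quad & Perm_{\mathrm{util}} = \{\, a \in A \mid u(c(a)) \geq u(c(a')) \ \text{for all } a' \in A \,\}\\
\text{do-no-harm:} \quad & Perm_{\mathrm{dnh}} = \{\, a \in A \mid u(c(a)) \geq 0 \,\}
\end{align*}

The notation is introduced only for this summary; in the ISPL model the same rules are realized through the utility comparisons of Listings 6–8 and 12–14.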

Modelling of Controllability Dilemmas

Here we discuss the modeling performed to resolve several key controllability dilemmas in the case study. For a description of these dilemmas, see D4 in Section 4.2.

Listing 16 presents an example of a controllability dilemma modeled (dilemma D4.1) between BronzeCommander1_HAC1 and UAV1_HAC1. It allows for the manual control of UAV1_HAC1 only if it is not in an emergency avoid or battery low state.

Similarly, in another example, Listing 17 provides controllability (dilemma D4.3) between the planning agent and BronzeCommander1_HAC1, checking the availability of the commander before planning a task for their UAV (an illustrative fragment is sketched after Listing 17). The resolution of both these controllability dilemmas has been modeled by exploiting the evolution function of the ISPL specification in MCMAS, which we use to control and coordinate the execution of both human and machine agents.

Listing 16: controllability between UAV1_HAC1 and BronzeCommander1_HAC1.

Listing 17: controllability between PlanningAgent_HAC1 and BronzeCommander1_HAC1.
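As referenced above, the planning-agent side of dilemma D4.3 can be captured, in the same style, by guarding the transition into the planning state on the bronze commander's publicly observable action. The fragment below is schematic; its state and action names are illustrative assumptions rather than the identifiers of Listing 17.

-- schematic fragment of PlanningAgent_HAC1 (illustrative names)
Evolution:
  -- a route is planned only while the commander is not busy taking manual
  -- control of their UAV
  state = PlanRoute if state = CheckCommanderStatus
      and (BronzeCommander1_HAC1.Action = analyseVideoFeed
           or BronzeCommander1_HAC1.Action = none);
  -- otherwise the planning request from the silver commander is rejected
  state = RejectPlanningRequest if state = CheckCommanderStatus
      and BronzeCommander1_HAC1.Action = requestManualControl;
end Evolution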

Listing 18 models the controllability between an agent and the silver commander during dilemma D4.4 (crashing into critical personal assets of the collective – avoid conflict with a human’s authority). In this example, the agent requests the silver commander’s authorization to crash into the site of the critical personal assets of the collective rather than into the public car park.

Listing 18: controllability between UAV1_HAC1 and SilverCommander_HAC1 on crash site.

Next, we describe an example of the modeling performed to resolve lying by a UAV (dilemma D4.5). Assume that UAV1_HAC1 requests UAV3_HAC2 to reveal its status (e.g. availability, route information) (see Listing 19). In the case study, a UAV will only reveal its status information if the requesting UAV is in flight. If the requesting UAV is on the ground, the other UAV is allowed to conceal its status, as the requester is not in a critical state of flight. As mentioned in Section 4.2, dilemma D4.5 can also be modeled as an ethical dilemma, as opposed to the controllability dilemma described here. In such a case, however, both ethical principles – utilitarian and do-no-harm – would evaluate lying as a non-permissible action.

Listing 19: controllability between UAV1_HAC1 and UAV3_HAC2 on lying.
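A corresponding schematic fragment for the responding UAV is shown below. The request actions of UAV1_HAC1 are assumed, for illustration only, to encode whether the requester is airborne, which is what allows UAV3_HAC2 to decide between revealing and concealing its status.

-- schematic fragment of UAV3_HAC2 (illustrative names)
Evolution:
  -- the requester is in flight: the status must be revealed
  state = ProvideUAVStatus if state = Idle
      and UAV1_HAC1.Action = requestUAVStatusInFlight;
  -- the requester is on the ground: concealment is permitted
  state = ConcealUAVStatus if state = Idle
      and UAV1_HAC1.Action = requestUAVStatusOnGround;
end Evolution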

Property Specification on Dilemmas and Safety Concerns

Having described the modeling of two ethical dilemmas and several controllability dilemmas in the previous subsection, we now explain the formulae specified to ensure the resolution of those dilemmas in the case study. In order to better understand the different formulae, we provide the Boolean variables used in those formulae in Appendix C (see Listing 20).

Note that in the formulae specified, the temporal operators AG, AF and AX applied to a formula φ are read as: “for all paths, φ holds globally;” “for all paths, φ holds at some point in the future;” and “for all paths, φ holds in the next state,” respectively. Meanwhile, K is an epistemic operator that expresses the knowledge of an agent, and GCK expresses common knowledge in a group of agents.
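These readings correspond to the usual satisfaction clauses, stated here informally over the global states of the interpreted system (standard CTL and epistemic-logic semantics, included only for reference):

\begin{align*}
g \models AG\,\varphi &\iff \varphi \text{ holds at every state of every path starting in } g\\
g \models AF\,\varphi &\iff \text{every path starting in } g \text{ reaches a state where } \varphi \text{ holds}\\
g \models AX\,\varphi &\iff \varphi \text{ holds at every immediate successor of } g\\
g \models K_i\,\varphi &\iff \varphi \text{ holds at every state that agent } i \text{ cannot distinguish from } g\\
g \models C_\Gamma\,\varphi &\iff \varphi \text{ holds at every state reachable from } g \text{ through any chain of}\\
&\phantom{\iff\ } \text{indistinguishability relations of the agents in } \Gamma \text{ (written GCK in ISPL)}
\end{align*}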

Property Specification on Ethical and Controllability Dilemmas

During a communication dropout of a UAV, we assume the following controllability behavior between humans and agents: (i) the silver commander stops other UAVs in flight; (ii) the other UAVs will move into standby mode waiting for instruction; (iii) the silver commander issues a replanning request for other UAVs. The other UAVs will allow the replanning request only if they are not in an emergency avoid or battery low state; (iv) the planning agent, which is called by a silver commander, will only plan a task for the other UAVs if those UAVs are not in a manual control state (i.e. its bronze commander is not in a busy state). As mentioned in Section 3.1.3, by controllability we mean meaningful control between humans and agents so authority can be shared and assigned to facilitate collective and responsible decisions.

We have specified several properties to ensure the resolution of the communication dropout ethical dilemma (D1) and the controllability behavior between humans and agents. First, formula 1 ensures that if UAV1_HAC1 has a communication dropout, then the silver commander should stop UAV2_HAC1. Formula 2 then ensures that, once the command to stop UAV2_HAC1 is issued, UAV2_HAC1 moves to standby mode in the immediate next state and a replanning request is performed. The silver commander will be allowed to replan and edit the task, manually or automatically by invoking the planning agent, only if UAV2_HAC1 is not in an emergency avoid or battery low state (see formulae 3 and 4 for dilemma D4.2). Formulae 5–8 ensure controllability (dilemma D4.3) between the planning agent and BronzeCommander2_HAC1, that is, they check whether UAV2_HAC1 is under manual control by BronzeCommander2_HAC1. If not, replanning is performed by the planning agent and UAV2_HAC1 can move toward the new task.

(1) AG(UAV1_HAC1_CommunicationDropout → AX(SC_HAC1_CommandToStopUAVs))
(2) AG(SC_HAC1_CommandToStopUAVs → AX(UAV2_HAC1_Standby ∧ SC_HAC1_ReplanRoutesRequest))

(3) AG(SC_HAC1_ReplanRoutesRequest → AX(SC_HAC1_ReplanRoutesRequestAllowed))
(4) AG(SC_HAC1_ReplanRoutesRequestAllowed → AX(SC_HAC1_ReplanRoutes))
(5) AG(SC_HAC1_ReplanRoutes → AX(PA_HAC1_CheckCommanderStatus))
(6) AG(PA_HAC1_CheckCommanderStatus → AX(PA_HAC1_PlanRouteAllowed))
(7) AG(PA_HAC1_PlanRouteAllowed → AX(PA_HAC1_PlanRoute))
(8) AG(PA_HAC1_PlanRoute → AX(UAV2_HAC1_MoveTowardsTask))
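In the ISPL input file, such formulae are written over atomic propositions declared in an Evaluation section. The fragment below shows how formulae 1 and 2 might appear; the state names used in the proposition definitions are assumptions about the underlying model rather than its exact identifiers.

Evaluation
  UAV1_HAC1_CommunicationDropout if ( UAV1_HAC1.state = CommunicationDropout );
  SC_HAC1_CommandToStopUAVs if ( SilverCommander_HAC1.state = CommandToStopUAVs );
  UAV2_HAC1_Standby if ( UAV2_HAC1.state = Standby );
  SC_HAC1_ReplanRoutesRequest if ( SilverCommander_HAC1.state = ReplanRoutesRequest );
end Evaluation

Formulae
  -- formula 1
  AG(UAV1_HAC1_CommunicationDropout -> AX(SC_HAC1_CommandToStopUAVs));
  -- formula 2
  AG(SC_HAC1_CommandToStopUAVs ->
     AX(UAV2_HAC1_Standby and SC_HAC1_ReplanRoutesRequest));
end Formulae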

Formula 9 is an example of a property that has been specified for the communication dropout dilemma. This is to ensure the correct mapping of consequences to permissible actions (see corresponding Listing 9). Note that similar formulae have been specified to ensure the correct mapping of other choices of consequences to their corresponding permissible actions.

(9) AG(UAV1_HAC1_EvaluateDropoutConsequencesSet1 → AX(UAV1_HAC1_ExplainDropoutActionsSet1))

Similarly, formula 10 is an example property specified to ensure the correct mapping of consequences to permissible actions (see corresponding Listing 15) for the technical problem in a UAV resulting in a crash on humans ethical dilemma. We have specified similar formulae to ensure the correct mapping of other choices of consequences to their corresponding permissible actions.

(10) AG(UAV1_HAC1_EvaluateBatteryLowCrashInFootballStadiumConsequencesSet1 → AX(UAV1_HAC1_ExplainBatteryLowCrashInFootballStadiumActionsSet1))

Formulae 11 and 12 are specified to handle the manual control dilemma (D4.1) modeled in Listing 16; these are controllability constraints between humans and agents. They ensure that manual control of UAV1_HAC1 by BronzeCommander1_HAC1 is only allowed if the UAV is not in an emergency avoid or battery low state.

(11) AG(BC1_HAC1_ManualControlRequest → AX(BC1_HAC1_ManualControlAllowed))
(12) AG(BC1_HAC1_ManualControlAllowed → AX(BC1_HAC1_ManualControl))

Formulae 13–16 are specified to verify dilemma D4.4 (crashing into critical personal assets of the collective) modeled in Listing 18; these are controllability constraints between the silver commander and the UAV. They verify whether the human’s authority is obtained first in order to allow the agent to crash into the critical personal assets of the collective. If the silver commander’s authorization is not received, then the UAV needs to crash into the public car park.

(13) AG(UAV1_HAC1_EvaluateCrashIntoCriticalPersonalAssets → AX(UAV1_HAC1_RequestSilverCommanderAuthorizationToCrash))
(14) AG(UAV1_HAC1_RequestSilverCommanderAuthorizationToCrash → AX(SC_HAC1_AuthorizeDecisionToCrashIntoCriticalPersonalAssets ∨ SC_HAC1_RejectDecisionToCrashIntoCriticalPersonalAssets))
(15) AG(SC_HAC1_AuthorizeDecisionToCrashIntoCriticalPersonalAssets → AX(UAV1_HAC1_CrashIntoCriticalPersonalAssets))
(16) AG(SC_HAC1_RejectDecisionToCrashIntoCriticalPersonalAssets → AX(UAV1_HAC1_CrashIntoCarPark))

Formula 17 has been specified to handle the self-censorship/lying dilemma (D4.5) modeled in Listing 19, which has been specified as a controllability constraint between two UAVs. In this example, it verifies that UAV3_HAC2 provides its status to UAV1_HAC1 (without self-concealment) if UAV1_HAC1 is in flight.

(17) AG(UAV1_HAC1_RequestUAVStatus → AX(UAV3_HAC2_ProvideUAVStatus))

Property Specification on Safety Concerns

In addition to the ethical and controllability properties discussed previously, we specify formulae to ensure the general safety of the model so nothing bad happens, as well as epistemic properties and fairness constraints, as described below.

To this end, formulae 18 and 19 specify that after the corresponding launch or land command is issued by the silver commander, the UAVs need to launch or land respectively, sometime in the future.

(18) AG(SC_HAC1_CommandToLaunch → AF(UAV1_HAC1_Launch ∧ UAV2_HAC1_Launch))
(19) AG(SC_HAC1_CommandToLand → AF(UAV1_HAC1_Land ∧ UAV2_HAC1_Land))

Formula 20 specifies a safety requirement that needs to be enforced when the environment is risky, that is, if (i) the wind strength is adverse; (ii) there is heavy rain; or (iii) the ground visibility is low. In such a situation, the UAVs should not be launched by the silver commander and need to remain on the ground until the risk level subsides.

(20) AG(Env_PreFlightRisky → AX(UAVs_HAC1_OnGround))

Formula 21 performs a safety check during the pre-flight stage. It verifies whether there is any route conflict between the UAVs, that is, whether their positions – longitude, latitude, altitude – overlap or not. If there is such an overlap, this property ensures that the environment agent notifies the silver commander about the conflict immediately in the next state.

(21) AG(UAV1_UAV2_HAC1_RoutesConflict → AX(Env_RouteConflict))

Formula 22 specifies a safety property for the pre-flight stage: if the route of a UAV lies in a no-fly zone, then the silver commander needs to be notified immediately. Formula 23 ensures the same outcome when a UAV enters a no-fly zone during flight.

(22) AG((UAV1_HAC1_NoFlyZoneAtRoute ∨ UAV2_HAC1_NoFlyZoneAtRoute) → AX(Env_NoFlyZoneAtRoute))
(23) AG((UAV1_HAC1_NoFlyZoneEntry ∨ UAV2_HAC1_NoFlyZoneEntry) → AX(Env_NoFlyZoneEntry))

Formula 24 specifies a safety property that if UAV1_HAC1 detects an intruder or is above the safe flying altitude, it should invoke its emergency avoid behavior immediately in the next state, which contains the behavior to mitigate the emergency situation.

(24) AG(UAV1_HAC1_IntruderAboveSafeAlt → AX(UAV1_HAC1_EmergencyAvoid))

Formulae 25 and 26 provide examples of epistemic properties defined in the specification at the agent and human-agent collective levels, respectively. As stated in Section 2.4, epistemic operators are used to reason about what agents know (e.g. the knowledge of an agent or the common knowledge of a group of agents). Formula 25 states that it is always true that when a UAV is in an emergency avoid state, the bronze commander knows that manual control of that UAV cannot be requested. Formula 26 is much stronger: it specifies that when a UAV is in an emergency avoid state, it is common knowledge in the group multiUAVHumanAgentCollective that the bronze commander cannot request manual control of their UAV.

(25) AG(BC1_HAC1_ManualControlRequest → K(BronzeCommander1_HAC1, BC1_HAC1_ManualControlAllowed))

(26) AG(BC1_HAC1_ManualControlRequest → GCK(multiUAVHumanAgentCollective, BC1_HAC1_ManualControlAllowed))
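In ISPL, the group referred to by GCK is declared in a Groups section, and the epistemic formulae are then written alongside the temporal ones. The member list below is abbreviated and illustrative; the full declaration appears in Listing 1.

Groups
  multiUAVHumanAgentCollective = {UAV1_HAC1, UAV2_HAC1, BronzeCommander1_HAC1,
                                  BronzeCommander2_HAC1, SilverCommander_HAC1,
                                  PlanningAgent_HAC1};
end Groups

Formulae
  -- formula 25: knowledge of a single agent
  AG(BC1_HAC1_ManualControlRequest ->
     K(BronzeCommander1_HAC1, BC1_HAC1_ManualControlAllowed));
  -- formula 26: common knowledge in the whole collective
  AG(BC1_HAC1_ManualControlRequest ->
     GCK(multiUAVHumanAgentCollective, BC1_HAC1_ManualControlAllowed));
end Formulae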

Two fairness conditions have been specified (see formula 27), so that UAV 1 and UAV 2 cannot avoid their move-towards-task states forever. In this example, it is required that the propositions UAV1_HAC1_MoveTowardsTask and UAV2_HAC1_MoveTowardsTask (see Appendix C) are true infinitely often. As mentioned in Section 2.4, fairness conditions are specified to rule out unwanted behavior, such as avoiding a particular state forever.

(27) UAV1_HAC1_MoveTowardsTask; UAV2_HAC1_MoveTowardsTask
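In the ISPL file, these conditions are simply listed in the Fairness section (a direct transcription of formula 27):

Fairness
  UAV1_HAC1_MoveTowardsTask;
  UAV2_HAC1_MoveTowardsTask;
end Fairness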

Having described the modeling and specification of different dilemmas in the case study, we now provide their actual verification and simulation results using the MCMAS tool.

Verification and Simulation Results of Ethical and Controllability Dilemmas

The verification of the human-agent collective with ethical dilemma situations against the constraints specified on safety, controllability and ethical behavior can result in counterexample traces for any property violations. Figure 5 shows the results of verification using MCMAS in the command line. In general, in model checking, a counterexample trace is used to identify and track errors in the model. As an additional step, our study uses a counterexample trace as a judgment that can be explored by the engineer to identify the reasons why any actions of an agent are non-permissible in the human-agent collective. MCMAS allows the analysis of counterexamples as a directed graph showing the execution of states and their descriptions (see Figure 6). In the graph, temporal transitions are represented by black arrows labeled with the joint action performed. These can be explored by the engineer to identify errors and their sources (e.g. properties) in the specification. In Figure 6, the engineer can identify that the formula involving SC_HAC1_AuthorizeDecisionToCrashIntoCriticalPersonalAssets has been violated, and then, by observing the states, the reason for the violation can be determined. As stated in Section 5.3.1, this formula specifies that the silver commander’s authorization must be provided first in order to allow the agent to crash into the critical personal assets of the collective. Here, in the model, the silver commander has rejected the authorization to crash, which resulted in the violation of the formula during verification.

Figure 5. Example of verification results of formulae for case study in the command line.


Figure 6. Counterexample for violation of formula on silver commander authorisation for crash into critical personal assets.


Explainable AI: Simulating Explainable Models of Dilemmas

We simulate the ethical and controllability dilemma situations in MCMAS in order to show and explain, step by step, the actions permissible for an agent. This provides a step toward supporting explainable AI by communicating the decisions and choices the agents have to make based on (i) the ethical principles and (ii) the controllability between humans and agents. Ethical principles are usually formulated in an action-based manner in order to judge the execution of an action (Lindner, Mattmüller, and Nebel Citation2019, Citation2020). The use of ethical principles opens the possibility of communicating decisions to the user, who in our work is the AI engineer. The explanation describes to the AI engineer the actions that are allowed and refused. It can explicitly refer to any ethical principles or controllability constraints that have been applied, and it can also provide the reasons why a certain action is preferred over another.

The structure of the explanation provided by an agent depends on the type of dilemma. During an ethical dilemma, the explanation by the agent has the following elements:

  1. actions permissible;

  2. actions non-permissible;

  3. reasons for non-permissibility of actions;

  4. any ethical principle applied.

For example, an explanation when resolving the dilemma D2.1 (technical problem in a UAV resulting in a crash on humans) will contain:

Figure 7. Explainable simulated models when resolving dilemma 2.1 for UAV1_HAC1—technical problem in a UAV resulting in a crash on humans.


In a controllability dilemma, the explanation describes the reason why an action is non-permissible. For example, in the dilemma D4.1 (request for manual control), the explanation will state:

Table 2 summarizes the judgments provided, with permissible and non-permissible actions, during the communication dropout dilemma (D1) and the technical problem in a UAV resulting in a crash on humans dilemma (D2.1). In this manner, by using the results from counterexamples and interactive simulation, the engineer can gain an understanding of how the actions have been judged. The results can be used to iteratively improve and refine the design of the model and build a specification that is correct.

Table 2. Judgments summarized during the communication dropout and battery low resulting in crash on humans dilemmas.

Discussion

In this section, we provide a brief discussion of the results along with some limitations of this work.

Generality of the Proposed Methodology to Handle Uncertainty: This work follows the ethics-by-design approach proposed by Cointe, Bonnet, and Boissier (Citation2016), where an ethical agent is designed by an a priori analysis of all possible situations it may encounter and, for each situation, implementing a way to avoid potential unethical behaviors. The consequences and actions of agents are known a priori, and the utilities are prescribed and determined at design time. We defined two ethical principles that follow a consequentialist approach: utilitarian and do-no-harm. An alternative setting to ours is one in which the consequences of agents’ actions are not known a priori.

In such cases, model-based reinforcement learning algorithms could be applied, but further research is needed to investigate how model checking can be used to verify the various dilemmas at runtime. At present, MCMAS does not support the verification of machine learning models, although there are initial tools built to support the verification of neural networks (Akintunde et al. Citation2019), and of parameterized (Kouvaros and Lomuscio Citation2016) and open (Kouvaros et al. Citation2019) multi-agent systems.

Other paradigms that can be used to complement our approach include: (i) extending game-theoretic concepts to incorporate ethical aspects; and (ii) applying machine learning techniques to moral dilemmas. The first approach extends the extensive form, one of the standard representation schemes in game theory (Conitzer et al. Citation2017). The second approach applies machine learning techniques that can learn to classify actions as morally correct or wrong (Conitzer et al. Citation2017); these are applied to a labeled dataset of moral dilemmas represented as feature values. Although a machine learning-based approach is more flexible than a game-theoretic approach, the two approaches complement each other. One can use a game-theoretic analysis of ethics as a feature to train machine learning approaches. Conversely, the outcome of a machine learning approach can help to identify which moral aspects are missing from game-theoretic concepts of morality (Conitzer et al. Citation2017).

State Explosion Problem: Model checking is automated and relatively simple to apply to finite-state models. However, one of its main limitations is the state explosion problem, whereby the number of states of the model grows exponentially with the number of variables (Clarke et al. Citation2012). The model created for the case study scenario has 11 agents (see Listing 1) and 31 formulae. During verification using MCMAS, the number of reachable states reached 7.0072e+38, which demonstrates the scalability of the approach, and the execution time of the model was only 9.521 seconds. This is in contrast to other model checkers, such as SMV and Spin, which can take a relatively longer time to execute (Choi, Kim, and Tsourdos Citation2015).

Human Bias when Specifying Values for Utilities: In this paper, utilities have been used to encode agents with moral values aligned to those of humans. As a human decides on the values of these utilities, this process can be biased. However, in our work, an engineer would only draw out utility comparisons in the model based on a code of conduct or safety regulations formalized in the domain of application (e.g. UAViators Citation2021 for UAVs). Hence, we try to remove the risk of introducing human bias on the part of the AI engineer. Our approach of using utility comparisons to handle ethical dilemmas has also been explored by other authors (e.g. Lindner, Mattmüller, and Nebel Citation2019, Citation2020).

Instantaneous Observability of Agent Actions: As stated in Section 5.1, in ISPL all actions of an agent are publicly observable; this supports interaction between agents and the environment. This public observability of actions is used in the evolution function, which allows the local states of agents to evolve based on their current local state and on other agents’ actions. This poses an important question: how realistic is it to assume that other agents’ actions are instantaneously observable in a multi-agent system? In this paper, however, we only exploit the public observability of actions selectively, when we need to control and coordinate the execution of agent states of different agents.

Lack of a Strong Metric to Evaluate Dilemma Resolution: In this work, the AI engineer is the user of the system and he/she is also the person who judges whether the resolution of dilemmas has been correctly performed by the model. This is done by visually analyzing the simulation results and verification results such as counterexamples, using the MCMAS tool. However, this may not be the best solution, and instead there is a need to identify a strong metric.

Explanations in Natural Language: In our work, we exploited the step-by-step interactive simulation of the human-agent collectives to show and explain the actions allowed by an agent when having to resolve a dilemma. However, these explanations would be more useful to end users if they answered why-questions in natural language, instead of embedding the explanations in the simulation steps.

Conclusion and Future Work

In this work, we proposed a novel engineering methodology based on model checking as a step toward the certification of responsible and explainable decision making within human-agent collectives. The key challenge that we have tried to address in human-agent collectives is dilemmas. We have shown how our approach, which is based on the MCMAS model checker, can be used to verify decision-making behavior against logical formulae specified to guarantee safety and controllability, and to address ethical concerns. The counterexample traces and the simulation results were used to provide a judgment and an explanation to the AI engineer. We evaluated the approach using the real-world problem of human-UAV teaming in dynamic and uncertain environments, extending it with dilemma situations that both humans and agents may encounter.

In order to exploit the full potential of this work, there remain several open issues that need to be resolved, and some of them are highlighted below.

Encoding Ethical Behaviour in Agents: In this work, out of the three possible approaches (i.e. consequentialist, virtue-based and deontological), we have explored a consequentialist approach to encode ethical behavior in the agents of a collective (Section 3.1.1). A consequentialist approach weighs the morality of the consequences of each choice, as opposed to values like wisdom (virtue ethics) or obligations and permissions (deontological approach) (Cointe, Bonnet, and Boissier Citation2016). An interesting future direction is therefore to investigate how we could support a virtue-based or a deontological approach, as our work can benefit from exploring all three ways of encoding ethical behavior. A virtue-based approach is possible, for example, by creating a list of state-action pairs in which a human determines whether an action is virtuous (e.g. brave) in a given state. A deontological approach could be adopted by maintaining a list of actions that are permissible or required in a given state. Furthermore, to simplify modeling, we have assumed that each action has one consequence. Although individual actions of an agent usually have multiple consequences, the dilemmas posed by our scenarios can be modeled with one consequence per action; MCMAS, while having useful features, supports only Boolean, enumeration and integer variables, which in practice also limits us to one consequence per action. We will look to expand the approach to consider more complex consequences in future work.

Grounding in Real-World Human-UAV Interactions: This work proposed a step-by-step methodology to engineer responsible and explainable models in human-agent collectives at design time. Here, the decision-making situations an agent may encounter are known a priori, and the dilemmas have been resolved in a way that ensures responsible and explainable behavior by both humans and agents. Future studies should investigate how our method can be grounded in real-world human-UAV interactions, and how the resolution of dilemmas can be tested with users.

Providing Strong Metrics and Explanations in Natural Language: Further research is needed to identify strong metrics that can be used to check whether a dilemma has been correctly resolved. Also, future work needs to investigate how the different agent actions in the simulation can be translated into natural language (e.g. see Koeman et al. Citation2020).


Acknowledgements

The authors would like to thank Prof. Alessio Lomuscio, Imperial College London, for informal discussions on the MCMAS tool. The work presented in this paper was supported by the AXA Research Fund, and the UK Engineering and Physical Sciences Research Council (EPSRC)-funded Smart Cities Platform under the grant [EP/P010164/1]. D.A. is also supported by the UKRI Trustworthy Autonomous Systems Node in Functionality under Grant EP/V026518/1. For the purpose of open access, the author(s) has applied a Creative Commons Attribution (CC BY) licence to any Author Accepted Manuscript version arising from this submission.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Supplementary material

Supplemental data for this article can be accessed online at https://doi.org/10.1080/08839514.2023.2282834

Additional information

Funding

This work was supported by the AXA Research Fund; Smart Cities Platform [EP/P010164/1]; UKRI Trustworthy Autonomous Systems Node in Functionality [EP/V026518/1].

Notes

1. Emergency response teams organize themselves into bronze, silver and gold levels. Bronze and silver represent the operational and tactical decision-making levels respectively and these are the only levels we need to consider for the purpose of this paper.

References

  • Abeywickrama, D. B., N. Bicocchi, M. Mamei, and F. Zambonelli. 2020. The SOTA approach to engineering collective adaptive systems. International Journal on Software Tools for Technology Transfer 22 (4):399–56. doi:10.1007/s10009-020-00554-3.
  • Abeywickrama, D. B., C. Cirstea, and S. D. Ramchurn. 2019. Model checking human-agent collectives for responsible AI. In Proceedings of the 28th ieee international conference on robot and human interactive communication (RO-MAN), New Delhi, India: IEEE. doi:10.1109/RO-MAN46459.2019.8956429.
  • Adegoke, O., A. Ab Aziz, and Y. Yusof. 2016. Formal analysis of an agent support model for behaviour change intervention. International Journal on Advanced Science, Engineering and Information Technology 6 (6):1074–80. doi:10.18517/ijaseit.6.6.1470.
  • Akintunde, M. E., A. Kevorchian, A. Lomuscio, and E. Pirovano. 2019. Verification of RNN-based neural agent- environment systems. Proceedings of the AAAI Conference on Artificial Intelligence 33 (1):6006–13. 01. doi:10.1609/aaai.v33i01.33016006.
  • Awad, E., S. Dsouza, R. Kim, J. Schulz, J. Henrich, A. Shariff, J. F. Bonnefon, and I. Rahwan. 2018. The moral machine experiment. Nature 563 (7729):59–64. doi:10.1038/s41586-018-0637-6.
  • Barredo Arrieta, A., N. Díaz-Rodríguez, J. Del Ser, A. Bennetot, S. Tabik, A. Barbado, S. Garcia, S. Gil-Lopez, D. Molina, R. Benjamins, et al. 2020. Explainable artificial intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Information Fusion 58:82–115. doi:10.1016/j.inffus.2019.12.012.
  • Bentzen, M. M. 2016. The principle of double effect applied to ethical dilemmas of social robots. In Proceedings of robophilosophy 2016/transor 2016, 268–79. IOS Press. doi:10.3233/978-1-61499-708-5-268.
  • Bjørgen, E. P., S. Madsen, T. S. Bjørknes, F. V. Heimsæter, R. Håvik, M. Linderud, P.-N. Longberg, L. A. Dennis, and M. Slavkovik. 2018. Cake, death, and trolleys: Dilemmas as benchmarks of ethical decision-making. In Proceedings of the AAAI/ACM Conference on Artificial Intelligence, and Society, New Orleans, USA, February. doi:10.1145/3278721.3278767.
  • Bremner, P., L. A. Dennis, M. Fisher, and A. F. Winfield. 2019. On proactive, transparent, and verifiable ethical reasoning for robots. Proceedings of the IEEE 107 (3):541–61. doi:10.1109/JPROC.2019.2898267.
  • Casimiro, M., D. Garlan, J. Cámara, L. Rodrigues, and P. Romano. 2021. A probabilistic model checking approach to self-adapting machine learning systems. In Proceedings of the Third International Workshop on Automated and verifiable Software System Development (ASYDE). doi:10.1007/978-3-031-12429-7_23.
  • Choi, J., S. Kim, and A. Tsourdos. 2015. Verification of heterogeneous multi-agent system using MCMAS. International Journal of Systems Science 46 (4):634–51. doi:10.1080/00207721.2013.793890.
  • Clarke, E. M., O. Grumberg, and D. A. Peled. 1999. Model checking. Cambridge, MA, USA: MIT Press. ISBN: 0-262-03270-8.
  • Clarke, E. M., W. Klieber, M. Nováček, and P. Zuliani. 2012. Model checking and the state explosion problem. In Tools for practical software verification: LASER, international Summer school 2011, Elba Island, Italy, revised tutorial lectures, B. Meyer and M. Nordio ed., 1–30. Berlin, Heidelberg: Springer Berlin Heidelberg. ISBN: 978-3-642-35746-6. doi:10.1007/978-3-642-35746-6_1.
  • Cointe, N., G. Bonnet, and O. Boissier. 2016. Ethical judgment of agents’ behaviors in multi-agent systems. In Proceedings of the aamas ’16 conference, 1106–14. International Foundation for Autonomous Agents/Multiagent Systems. doi:10.5555/2936924.2937086.
  • Conitzer, V., W. Sinnott-Armstrong, J. S. Borg, Y. Deng, and M. Kramer. 2017. Moral decision making frameworks for artificial intelligence. In Proceedings of the 31st AAAI conference on artificial intelligence, San Francisco, California, USA. February 4-9, 2017. doi:10.5555/3297863.3297907.
  • Cummings, M. M. 2014. Man versus machine or man + machine? IEEE Intelligent Systems 29 (5):62–69. doi:10.1109/MIS.2014.87.
  • Cummings, M. L., C. Mastracchio, K. M. Thornburg, and A. Mkrtchyan. 2013. Boredom and distraction in multiple unmanned vehicle supervisory control. Interacting with Computers 25 (1):34–47. doi:10.1093/iwc/iws011.
  • Dennis, L. A., M. M. Benzen, F. Lindner, and M. Fisher. 2021. Verifiable machine ethics in changing contexts. In Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI 2021) . https://ojs.aaai.org/index.php/AAAI/article/view/17366.
  • Dennis, L. A., and M. Fisher. 2020. Verifiable self-aware agent-based autonomous systems. Proceedings of the IEEE 108 (7):1011–26. doi:10.1109/JPROC.2020.2991262.
  • Dennis, L., and M. Fisher. 2021. Verifiable autonomy and responsible robotics. In Software engineering for robotics, A. Cavalcanti, B. Dongol, R. Hierons, J. Timmis, and J. Woodcock ed., 189–217. Cham: Springer International Publishing. ISBN: 978-3-030-66494-7. doi:10.1007/978-3-030-66494-7_7.
  • Dennis, L. A., M. Fisher, N. K. Lincoln, A. Lisitsa, and S. M. Veres. 2016. Practical verification of decision-making in agent-based autonomous systems. Automated Software Engineering 23 (3):305–59. doi:10.1007/s10515-014-0168-9.
  • Dennis, L., M. Fisher, M. Slavkovik, and M. Webster. 2016. Formal verification of ethical choices in autonomous systems. Robotics and Autonomous Systems 77:1–14. doi:10.1016/j.robot.2015.11.012.
  • Dignum, V. 2017. Responsible artificial intelligence: Designing AI for human values. ITU Journal: ICT Discoveries 1:1–8. Accessed 2022, March 10. doi:10.1145/3278721.3278745.
  • Dignum, V., M. Baldoni, C. Baroglio, M. Caon, R. Chatila, L. Dennis, G. Génova, G. Haim, M. S. Kließ, M. Lopez-Sanchez, and R. Micalizio. 2018. Ethics by design: Necessity or curse?. In Proceedings of the AAAI/ACM Conference on Artificial Intelligence, Ethics and Society, New Orleans, USA.
  • Dogan, E., R. Chatila, S. Chauvier, K. Evans, P. Hadjixenophontos, and J. Perrin. 2016. Ethics in the design of automated vehicles: The AVEthics project. In Proc. Of the 1st workshop on ethics in the design of intelligent agents, 10–13. The Netherlands: The Hague. August. http://ceur-ws.org/Vol-1668/paper2.pdf.
  • Elkholy, W., M. El-Menshawy, J. Bentahar, M. Elqortobi, A. Laarej, and R. Dssouli. 2020. Model checking intelligent avionics systems for test cases generation using multi-agent systems. Expert Systems with Applications 156:156. doi:10.1016/j.eswa.2020.113458.
  • Fisher, M., L. Dennis, and M. Webster. 2013. Verifying autonomous systems. Communications of the ACM 56 (9):84–93. doi:10.1145/2494558.
  • Fisher, M., V. Mascardi, K. Yvonne Rozier, B.-H. Schlingloff, M. Winikoff, and N. Yorke-Smith. 2021. Towards a framework for certification of reliable autonomous systems. Autonomous Agents and Multi-Agent Systems 35 (1). doi:10.1007/s10458-020-09487-2.
  • Goebel, R., A. Chander, K. Holzinger, F. Lecue, Z. Akata, S. Stumpf, P. Kieseberg, and A. Holzinger. 2018. Explainable AI: The new 42? In Proceedings of the 2nd International Cross-Domain Conference for Machine Learning and Knowledge Extraction (CD-MAKE), ed. A. Holzinger, A. M. T. Peter Kieseberg, and E. Weippl, vol. LNCS- 11015, 295–303. Hamburg, Germany: Springer, August. doi:10.1007/978-3-319-99740-7_21.
  • Greene, J., F. Rossi, J. Tasioulas, K. B. Venable, and B. Williams. 2016. Embedding ethical principles in collective decision support systems. In Proceeding of the AAAI’16 Conference, 4147–51. AAAI Press, February. doi:10.5555/3016387.3016503.
  • Halilovic, A., and F. Lindner. 2022. Explaining local path plans using LIME. In Advances in service and industrial robotics, ed. A. Müller and M. Brandstötter, 106–13. Cham: Springer International Publishing. doi:10.1007/978-3-031-04870-8_13.
  • Harbers, M., K. van den Bosch, and J. Meyer. 2010. Design and evaluation of explainable BDI agents. 2010 ieee/wic/acm international conference on web intelligence and intelligent agent technology 2:125–32. 10.1109/WI-IAT.2010.115.
  • Koeman, V. J., L. A. Dennis, M. Webster, M. Fisher, and K. Hindriks. 2020. The “why did you do that?” button: Answering why-questions for end users of robotic systems. In Engineering multi-agent systems, ed. L. A. Dennis, R. H. Bordini, and Y. Lespérance, 152–72. Cham: Springer International Publishing. doi:10.1007/978-3-030-51417-4_8.
  • Kouvaros, P., and A. Lomuscio. 2016. Parameterised verification for multi-agent systems. Artificial Intelligence 234:152–89. doi:10.1016/j.artint.2016.01.008.
  • Kouvaros, P., A. Lomuscio, E. Pirovano, and H. Punchihewa 2019. Formal verification of open multi-agent systems. In Proceedings of the 18th international conference on autonomous agents and multiagent systems, AAMAS ’19, ed. E. Elkind, M. Veloso, N. Agmon, and M. E. Taylor, 179–87. Montreal, QC, Canada: International Foundation for Autonomous Agents/Multiagent Systems. doi:10.5555/3306127.3331691.
  • Krarup, B., S. Krivic, F. Lindner, and D. Long. 2020. Towards contrastive explanations for comparing the ethics of plans. In ICRA workshop against robot dystopias: thinking through the ethical, legal and societal issues of robotics and automation (AGAINST-20). https://against-20.github.io/.
  • Kripke, S. A. 1963. Semantical considerations on modal logic. Acta Philosophica Fennica 16 (1963):83–94.
  • Kulesza, T., S. Stumpf, M. Burnett, and I. Kwan. 2012. Tell me more? the effects of mental model soundness on personalizing an intelligent agent, 1–10. CHI ’12. Austin, Texas, USA: Association for Computing Machinery. doi:10.1145/2207676.2207678.
  • Kulesza, T., S. Stumpf, M. Burnett, S. Yang, I. Kwan, and W. Wong. 2013. Too much, too little, or just right? ways explanations impact end users’ mental models. In 2013 ieee symposium on visual languages and human centric computing, 3–10. doi:10.1109/VLHCC.2013.6645235.
  • Leslie, D. 2019. Understanding artificial intelligence ethics and safety: A guide for the responsible design and implementation of AI systems in the public sector. SSRN Electronic Journal. June. doi:10.2139/ssrn.3403301.
  • Li, N., S. Adepu, E. Kang, and D. Garlan. 2020. Explanations for human-on-the-loop: A probabilistic model checking approach. In Proceeding of the IEEE/ACM 15th International Symposium on Software Engineering for Adaptive and Self-Managing Systems, 181–87. New York, NY, USA: Association for Computing Machinery. doi:10.1145/3387939.3391592.
  • Li, N., J. Cámara, D. Garlan, and B. Schmerl. 2020. Reasoning about when to provide explanation for human-involved self-adaptive systems. In 2020 ieee international conference on autonomic computing and self-organizing systems (acsos), 195–204. doi:10.1109/ACSOS49614.2020.00042.
  • Lin, P., K. Abney, and G. A. Bekey. 2014. Robot ethics: The ethical and social implications of robotics. Cambridge, Massachusetts: The MIT Press.
  • Lindner, F., and M. M. Bentzen. 2017. The hybrid ethical reasoning agent IMMANUEL. In Proceedings of the companion of the 2017 acm/ieee international conference on human-robot interaction, 187–88. Vienna: ACM. doi:10.1145/3029798.3038404.
  • Lindner, F., M. M. Bentzen, and B. Nebel. 2017. The HERA approach to morally competent robots. In Proceedings of the iros 2017 conference, 6991–97. doi:10.1109/IROS.2017.8206625.
  • Lindner, F., B. Kuhnert, L. Wächter, and K. Möllney. 2019. Perception of creative responses to moral dilemmas by a conversational robot. In Social robotics, ed. M. A. Salichs, S. S. Ge, E. I. Barakova, J.-J. Cabibihan, A. R. Wagner, Á. Castro-González, and H. He, 98–107. Springer International Publishing. doi:10.1007/978-3-030-35888-4_10.
  • Lindner, F., R. Mattmüller, and B. Nebel. 2019. Moral permissibility of action plans. In Proceedings of the 33rd AAAI conference on artificial intelligence. January 27-February 1, 2019, honolulu, hawaii, USA. doi:10.1609/aaai.v33i01.33017635.
  • Lindner, F., R. Mattmüller, and B. Nebel. 2020. Evaluation of the moral permissibility of action plans. Artificial Intelligence 287:287. doi:10.1016/j.artint.2020.103350.
  • Lomuscio, A., H. Qu, and F. Raimondi. 2017. MCMAS: An open-source model checker for the verification of multi-agent systems. International Journal on Software Tools for Technology Transfer 19 (1):9–30. doi:10.1007/s10009-015-0378-x.
  • Loreggia, A., N. Mattei, F. Rossi, and K. B. Venable. 2018. Preferences and ethical principles in decision making. In Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, 222. AIES ’18. New Orleans, LA, USA: Association for Computing Machinery. doi:10.1145/3278721.3278723.
  • Loreggia, A., N. Mattei, F. Rossi, and K. B. Venable. 2020. Modeling and reasoning with preferences and ethical priorities in ai systems. In Ethics of artificial intelligence, ed. S. Matthew Liao, Oxford Scholarship Online. doi:10.1093/oso/9780190905033.003.0005.
  • Luckcuck, M., M. Farrell, L. A. Dennis, C. Dixon, and M. Fisher. 2019. Formal specification and verification of autonomous robotic systems: A survey. ACM Computing Surveys 52 (5):1–41. doi:10.1145/3342355.
  • Madumal, P., T. Miller, L. Sonenberg, and F. Vetere. 2019. Explainable reinforcement learning through a causal lens. Proceedings of the AAAI Conference on Artificial Intelligence 34 (3):2493–500. doi:10.1609/aaai.v34i03.5631.
  • Mermet, B., and G. Simon. 2016. Formal verification of ethical properties in multiagent systems. In Proceedings of the 1st workshop on ethics in the design of intelligent agents, 26–31. The Netherlands: The Hague. http://ceur-ws.org/Vol-1668/paper5.pdf.
  • Miller, T. 2019. Explanation in artificial intelligence: Insights from the social sciences. Artificial Intelligence 267:1–38. doi:10.1016/j.artint.2018.07.007.
  • Molnar, L., and S. M. Veres. 2009. System verification of autonomous underwater vehicles by model checking. In Proceedings of the oceans 2009 - europe conference, 1–10. IEEE. doi:10.1109/OCEANSE.2009.5278284.
  • Mualla, Y., I. Tchappi, T. Kampik, A. Najjar, D. Calvaresi, A. Abbas-Turki, S. Galland, and C. Nicolle. 2022. The quest of parsimonious XAI: A human-agent architecture for explanation formulation. Artificial Intelligence 302:103573. doi:10.1016/j.artint.2021.103573.
  • Naiseh, M., C. M. Bentley, and S. Ramchurn. 2022. Trustworthy autonomous systems (TAS): Engaging TAS experts in curriculum design. In Proceedings of the 2022 IEEE Global Engineering Education Conference (EDUCON). Published IEEE. doi:10.48550/ARXIV.2202.07447.
  • Papavassiliou, S., E. E. Tsiropoulou, P. Promponas, and P. Vamvakas. 2021. A paradigm shift toward satisfaction, realism and efficiency in wireless networks resource sharing. IEEE Network 35 (1):348–55. doi:10.1109/MNET.011.2000368.
  • Porter, T. 2004. Interpreted systems and kripke models for multiagent systems from a categorical perspective. Theoretical Computer Science 323 (1–3):235–66. doi:10.1016/j.tcs.2004.04.005.
  • Qu, H., and S. M. Veres. 2016. Verification of logical consistency in robotic reasoning. Robotics and Autonomous Systems 83:44–56. doi:10.1016/j.robot.2016.06.005.
  • Raimondi, F. 2006. Model checking multi-agent systems. PhD diss., University College London.
  • Ramchurn, S. D., T. D. Huynh, F. Wu, Y. Ikuno, J. Flann, L. Moreau, J. E. Fischer, W. Jiang, T. Rodden, E. Simpson, et al. 2016. A disaster response system based on human-agent collectives. The Journal of Artificial Intelligence Research 57 (September):661–708. doi:10.1613/jair.5098.
  • Ramchurn, S. D., S. Stein, and N. R. Jennings. 2021. Trustworthy human-AI partnerships. iScience 24 (8):102891. doi:10.1016/j.isci.2021.102891.
  • Ramchurn, S. D., F. Wu, W. Jiang, J. E. Fischer, S. Reece, S. Roberts, T. Rodden, C. Greenhalgh, and N. R. Jennings. 2016. Human–agent collaboration for disaster response. Autonomous Agents and Multi-Agent Systems 30 (1):82–111. doi:10.1007/s10458-015-9286-4.
  • Rossi, F. 2015. Safety constraints and ethical principles in collective decision making systems. In Ki 2015: Advances in artificial intelligence, ed. S. Hölldobler, R. Peñaloza, and S. Rudolph, 3–15. Cham: Springer. doi:10.1007/978-3-319-24489-1_1.
  • Rossi, F., and A. Loreggia. 2019. Preferences and ethical priorities: Thinking fast and slow in AI. In Proceeding of the 18th international conference on autonomous agents and multiagent systems, 3–4. AAMAS ’19, Montreal QC, Canada: International Foundation for Autonomous Agents/Multiagent Systems.
  • Rossi, F., and N. Mattei. 2019. Building ethically bounded AI. Proceedings of the AAAI Conference on Artificial Intelligence 33 (1):9785–89. doi:10.1609/aaai.v33i01.33019785.
  • Sheh, R. 2017. Why did you do that? Explainable intelligent robots. In The workshops of the thirty-first AAAI conference on artificial intelligence, Saturday, February 4-9, 2017, vol. WS-17. San Francisco, California, USA: AAAI Press.
  • Standardization, International Organization for. 2017. ISO/IEC/IEEE 24765:2017 Systems and Software Engineering — Vocabulary. Online. https://www.iso.org/standard/71952.html.
  • Thomson, J. J. 1985. The trolley problem. The Yale Law Journal 94 (6):1395–415. doi:10.2307/796133.
  • UAViators. 2021. Humanitarian UAV Code of Conduct. Accessed March 10, 2022. https://uavcode.org/code-of-conduct.
  • Webster, M. P., N. Cameron, M. Fisher, and M. Jump. 2014. Generating certification evidence for autonomous unmanned aircraft using model checking and simulation. Journal of Aerospace Information Systems 11 (5):258–79. doi:10.2514/1.I010096.
  • Webster, M., M. Fisher, N. Cameron, and M. Jump. 2011. Formal methods for the certification of autonomous unmanned aircraft systems. In Computer safety, reliability, and security, ed. F. Flammini, S. Bologna, and V. Vittorini, 228–42. Berlin, Heidelberg: Springer. doi:10.1007/978-3-642-24270-0_17.
  • Wikipedia. 2020. Thales Watchkeeper WK450. Accessed March 10, 2022. https://en.wikipedia.org/wiki/Thales_Watchkeeper_WK450.
  • Winfield, A., S. Booth, L. A. Dennis, T. Egawa, H. Hastie, N. Jacobs, R. I. Muttram, J. I. Olszewska, F. Rajabiyazdi, A. Theodorou, et al. 2021. IEEE P7001: A proposed standard on transparency. Frontiers in Robotics and AI 8:225. doi:10.3389/frobt.2021.665729.
  • Wortham, R. H., and A. Theodorou. 2017. Robot transparency, trust and utility. Connection Science 29 (3):242–48. doi:10.1080/09540091.2017.1313816.
  • Yazdanpanah, V., E. H. Gerding, S. Stein, M. Dastani, C. M. Jonker, and T. J. Norman. 2021. Responsibility research for trustworthy autonomous systems. In Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems, 57–62. AAMAS ’21. Virtual Event, United Kingdom: International Foundation for Autonomous Agents/Multiagent Systems. doi:10.5555/3463952.3463964.
  • Yazdanpanah, V., S. Stein, E. Gerding, and N. R. Jennings. 2021. Applying strategic reasoning for accountability ascription in multiagent teams. In Proceedings of the Workshop on Artificial Intelligence Safety 2021 co-located with IJCAI 2021 Conference, ed. H. Espinoza, J. McDermid, X. Huang, M. Castillo-Effen, X. C. Chen, J. Hernández-Orallo, S. Ó. hÉigeartaigh, R. Mallah, and G. Pedroza, vol. 2916. CEUR-WS.org. http://ceur-ws.org/Vol-2916/paper_18.pdf.
  • Yazdanpanah, V., S. Stein, E. H. Gerding, and M. C. Schraefel. 2021. Multiagent strategic reasoning in the IoV: A logic-based approach. In ACM Collective Intelligence Conference 2021 (CI-2021) (29/06/21 - 30/06/21). https://eprints.soton.ac.uk/448210/.
  • Yu, H., Z. Shen, C. Miao, C. Leung, V. R. Lesser, and Q. Yang. 2018. Building ethics into artificial intelligence. In Proceedings of the IJCAI-18 conference, 5527–33. AAAI Press, July. doi:10.24963/ijcai.2018/779.

Appendix A.

Here, we provide definitions of the different terms used in this work (see Table A1).

Table A1. Definitions used in this work.

Appendix B.

In the following, we describe the human and machine agents modeled in the case study using ISPL (see Table A2) and their local states (see Table A3). For readers unfamiliar with ISPL, an illustrative agent sketch is given after Table A3.

Table A2. Descriptions of the human and machine agents modeled using ISPL in the case study.

Table A3. Descriptions of local states of the agents in the case study.
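
The sketch below shows the general shape of an ISPL agent declaration with local states, a protocol and an evolution function, as accepted by the MCMAS model checker. The agent, variable and action names used here (UAV, status, battery, takeoff, survey, return_home, wait) are hypothetical placeholders and do not reproduce the case-study declarations summarized in Tables A2 and A3.

  -- Hypothetical ISPL agent sketch (MCMAS syntax); names are illustrative only.
  Agent UAV
    Vars:
      status : {idle, flying, surveying, returning}; -- local state of the agent
      battery : {ok, low};
    end Vars
    Actions = {takeoff, survey, return_home, wait};
    Protocol:
      status = idle : {takeoff, wait};
      status = flying : {survey, return_home};
      status = surveying and battery = low : {return_home};
      Other : {wait}; -- default action in all remaining local states
    end Protocol
    Evolution:
      status = flying if status = idle and Action = takeoff;
      status = surveying if status = flying and Action = survey;
      status = returning if Action = return_home;
      battery = low if battery = ok and Action = survey;
    end Evolution
  end Agent

A complete ISPL model additionally declares an Environment agent, the initial states, an Evaluation section and the formulae to be checked.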

Appendix C.

Listing 20 provides the Boolean variables used in the different formulae described in this article. These variables are declared in the Evaluation section of the ISPL model; an illustrative sketch of this declaration style is given below Listing 20.

Listing 20: Boolean variables used in the formulae provided in Section 5.3.
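
As a hypothetical illustration of the declaration style only (it does not reproduce the content of Listing 20), Boolean atoms in ISPL are defined in the Evaluation section as conditions over the agents' local variables and can then be referenced in the temporal-epistemic formulae checked by MCMAS. Continuing the illustrative UAV agent sketched in Appendix B:

  -- Hypothetical sketch of an ISPL Evaluation section; atom and agent names
  -- are illustrative and do not reproduce Listing 20.
  Evaluation
    uav_surveying if UAV.status = surveying;
    battery_low if UAV.battery = low;
    mission_aborted if UAV.status = returning and UAV.battery = low;
  end Evaluation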