<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="brief-report">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Artif. Intell.</journal-id>
<journal-title>Frontiers in Artificial Intelligence</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Artif. Intell.</abbrev-journal-title>
<issn pub-type="epub">2624-8212</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/frai.2023.1212336</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Artificial Intelligence</subject>
<subj-group>
<subject>Perspective</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Counterfactual learning in enhancing resilience in autonomous agent systems</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Samarasinghe</surname> <given-names>Dilini</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1931059/overview"/>
</contrib>
</contrib-group>
<aff><institution>School of Systems and Computing, University of New South Wales</institution>, <addr-line>Canberra, ACT</addr-line>, <country>Australia</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Hao Liu, Chiba University, Japan</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Alexander Nikolaevich Raikov, National Supercomputer Center, China</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Dilini Samarasinghe <email>d.samarasinghe&#x00040;adfa.edu.au</email></corresp>
</author-notes>
<pub-date pub-type="epub">
<day>28</day>
<month>07</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>6</volume>
<elocation-id>1212336</elocation-id>
<history>
<date date-type="received">
<day>26</day>
<month>04</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>11</day>
<month>07</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2023 Samarasinghe.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Samarasinghe</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license> </permissions>
<abstract>
<p>Resilience in autonomous agent systems is about having the capacity to anticipate, respond to, adapt to, and recover from adverse and dynamic conditions in complex environments. It is associated with the intelligence possessed by the agents to preserve the functionality or to minimize the impact on functionality through a transformation, reconfiguration, or expansion performed across the system. Enhancing the resilience of systems could pave way toward higher autonomy allowing them to tackle intricate dynamic problems. The state-of-the-art systems have mostly focussed on improving the redundancy of the system, adopting decentralized control architectures, and utilizing distributed sensing capabilities. While machine learning approaches for efficient distribution and allocation of skills and tasks have enhanced the potential of these systems, they are still limited when presented with dynamic environments. To move beyond the current limitations, this paper advocates incorporating counterfactual learning models for agents to enable them with the ability to predict possible future conditions and adjust their behavior. Counterfactual learning is a topic that has recently been gaining attention as a model-agnostic and post-hoc technique to improve explainability in machine learning models. Using counterfactual causality can also help gain insights into unforeseen circumstances and make inferences about the probability of desired outcomes. We propose that this can be used in agent systems as a means to guide and prepare them to cope with unanticipated environmental conditions. This supplementary support for adaptation can enable the design of more intelligent and complex autonomous agent systems to address the multifaceted characteristics of real-world problem domains.</p></abstract>
<kwd-group>
<kwd>autonomous agent systems</kwd>
<kwd>multi-agent system (MAS)</kwd>
<kwd>resilience</kwd>
<kwd>counterfactual learning</kwd>
<kwd>machine learning</kwd>
<kwd>explainability</kwd>
<kwd>explainable agents</kwd>
<kwd>robustness</kwd>
</kwd-group>
<counts>
<fig-count count="1"/>
<table-count count="0"/>
<equation-count count="0"/>
<ref-count count="73"/>
<page-count count="8"/>
<word-count count="7555"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Machine Learning and Artificial Intelligence</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>1. Introduction</title>
<p>In the fast-paced society of today, the application demands for collective intelligence are increasing. Applications of autonomous systems are often found in fields that require distributed sensing and action. These include both engineering-related disciplines that are interested in autonomous self-organizing artifacts and science-related disciplines that are interested in understanding the behaviors and evolution of the natural world. The benefits in safety, productivity, and in saving money and time has induced intelligent systems use to be more widespread across a range of domains (Singh et al., <xref ref-type="bibr" rid="B64">2013</xref>). Research related to autonomous aerial (Kulkarni et al., <xref ref-type="bibr" rid="B37">2022</xref>), aquatic (Bu et al., <xref ref-type="bibr" rid="B16">2022</xref>), and ground (Chakraborty et al., <xref ref-type="bibr" rid="B18">2022</xref>) agent systems has demonstrated interest in domains such as search and rescue (Huamanchahua et al., <xref ref-type="bibr" rid="B31">2022</xref>), exploration (Azp&#x000FA;rua et al., <xref ref-type="bibr" rid="B6">2023</xref>), planning (Raja and Pugazhenthi, <xref ref-type="bibr" rid="B55">2012</xref>), and surveillance (Filipovi&#x00107; et al., <xref ref-type="bibr" rid="B22">2023</xref>). In the sense of industrial applications, autonomous intelligent systems are sought after in fields such as healthcare (Kyrarini et al., <xref ref-type="bibr" rid="B38">2021</xref>), manufacturing (Shneier and Bostelman, <xref ref-type="bibr" rid="B62">2015</xref>), retail (Bogue, <xref ref-type="bibr" rid="B12">2019</xref>), military (Voth, <xref ref-type="bibr" rid="B70">2004</xref>), and mining (Nanadrekar et al., <xref ref-type="bibr" rid="B46">2022</xref>).</p>
<p>Advances in these domains cannot be achieved without automation that can facilitate intelligent capabilities primarily related to learning, adaptation, and evolution. Learning in an agent system can be viewed from two perspectives as: passive learning and active learning (Taylor et al., <xref ref-type="bibr" rid="B68">2021</xref>). When prior-knowledge is abundant and the environment is not dynamic, passive learning is useful to construct the parametric representations of relationships within the model. However, intelligent systems that can make decisions to best realize goals in dynamic environments require individual agents that follow an active learning policy where they can understand and exploit the interactions they encounter. Active learning sees a wide range of applications particularly in terms of prioritized decision-making; object recognition, detection, and classification; and inspection related tasks. It is also not sufficient for agent systems to be <italic>functionally adequate</italic> to address the complex problems in the current application domains. They need to be adaptive, thus, have the ability to self organize (Bernon et al., <xref ref-type="bibr" rid="B10">2005</xref>). The components or the individual agents in the system should be endorsed with the capability to rearrange their behaviors and relations with each other locally without the requirement of manual intervention of a programmer. Adaptive characteristics in an agent system are specifically appreciated for requirements such as maintaining intrinsic safety and learning and accomplishing new tasks. The capacity to evolve its characteristics and functionality in an agent system is closely related to the requirements of learning and adaptation as well. While autonomous agent systems have progressed a long way in terms of accuracy and speed of completing monotonous tasks compared to their human counterparts, they have not yet reached the level of cognition possessed by humans that could ideally lead to modifications and advancements in the system&#x00027;s design based on an evolutionary process. Autonomous evolution in such systems would facilitate long term existence of agent ecosystems that can survive in diverse environmental niches and specialize in different tasks without direct oversight from human designers (Lan et al., <xref ref-type="bibr" rid="B39">2021</xref>).</p>
<p>These expectations of agent systems reveal that there exists a requirement for them to have the capacity to anticipate, respond to, adapt to, and recover from adverse and dynamic conditions in complex environments. This can be attributed to the resilience of these systems which facilitates incorporating agile policies that allow adaptation to newly perceived conditions and overcome disturbances that were not modeled. In a resilient system, the agents should possess the intelligence to apply a transformation, reconfiguration, or an expansion across the system to preserve their functionality, change their functionality, or to minimize the impact on functionality (Prorok et al., <xref ref-type="bibr" rid="B53">2021</xref>).</p>
<p>The early work related to adapting and overcoming unexpected encounters within agent systems have primarily focused on parametric uncertainty (Luders et al., <xref ref-type="bibr" rid="B43">2010</xref>) and fault tolerance (Parker, <xref ref-type="bibr" rid="B51">1998</xref>). Although these adaptive control methods can alter process dynamics to suit the conditions faced, they are restricted by design assumptions and conditions (Barber et al., <xref ref-type="bibr" rid="B7">2000</xref>; &#x000C5;str&#x000F6;m and Wittenmark, <xref ref-type="bibr" rid="B1">2013</xref>). While they accommodate uncertainty and risk, these models are mostly concerned with the sensitivity of the system output to parametric changes and bounded disturbances that are within expected design conditions. In contrast, resilient agent systems are more relaxed on such assumptions and investigate the capability of adapting to and withstanding disturbances and adverse conditions which have not been modeled (Prorok et al., <xref ref-type="bibr" rid="B53">2021</xref>).</p>
<p>In this perspective, the dynamic and adverse conditions and environmental states should be addressed through agile approaches that can facilitate system-wide transformations. This paper emphasizes the need for resilient agent systems to anticipate unmodeled disturbances and leverage emerging behaviors to retain desired performance levels. Resilience can be discussed in two contexts under morphological and behavioral aspects. The focus here is on achieving resilience through improvements to the behavioral aspects of the agents rather than improving the morphological aspects such as self-reconfigurability of hardware components.</p>
<p>Toward this end, Section 2 identifies the state-of-the-art in resilient autonomous agent systems and approaches to improve resilience which are limited in their capacity to anticipate adverse conditions. Means of addressing this limitation through counterfactual learning is presented in Section 3. This concept is gaining attention in the research community as a mode of explaining and making inferences on past, present, and future relations within AI systems. In this paper, the application of counterfactual learning is extended toward maintaining resilience of agent systems. Ways of using counterfactual causality to help gain insights into unforeseen circumstances and make inferences about the probability of an outcome are presented in Section 3. This section identifies counterfactual learning as an approach to guide and prepare them to cope with unanticipated environmental conditions. Finally, Section 5 concludes the paper with a potential way forward in implementing the proposed approach and a discussion of the current perspectives and limitations of the approach.</p></sec>
<sec id="s2">
<title>2. Resilience in autonomous agent systems</title>
<p>As discussed in Section 1, collective intelligence is a highly demanding attribute for agent systems, but, is equally challenging to automate. Specifically, automating the strategic planning process for identifying conditions for purposeful decision making requires the agents to follow an active and adaptive learning architecture (Samarasinghe et al., <xref ref-type="bibr" rid="B59">2022a</xref>) It is associated with constantly enhancing distributed intelligence of agents that is focussed on achieving better results as a group than any individual agent through individual enrichment and mutual recognition (Ha and Tang, <xref ref-type="bibr" rid="B27">2022</xref>). Decentralization, diversity, and independence are all attributes that impact the potential of an agent system that possesses collective intelligence. As such, resilience is an invariable feature that should be enhanced with regard to maintaining autonomy and collective intelligence of systems. Resilience is not simply about having adaptive controls; rather, it refers to having the capacity to anticipate and respond to dynamic conditions. Therefore, the focus on resilience as discussed in this paper is a timely requirement to enhance the potential of autonomous agent systems.</p>
<p>Four primary attributes of resilience are identified here that characterize the ability to retain and/or adjust the functioning of the system in agent systems, namely: robustness; flexibility; scalability; and modularity. Robustness is the ability of a system either to preserve a certain property (such as the stability or performance) or to continue operations amid the presence of partial or individual failures, uncertainties, and disturbances to the environment (Gazi and Fidan, <xref ref-type="bibr" rid="B25">2007</xref>). It does not cover the ability to anticipate and respond to adverse impacts. For agent systems with time delay, robustness has generally been implemented using approaches such as continuous time consensus protocols (Olfati-Saber and Murray, <xref ref-type="bibr" rid="B50">2004</xref>) and other scenarios concerned with disturbances and model uncertainties have also looked at introducing foraging controllers to agents and sliding-mode control technique which reduces the motion of a system to a lower dimensional space (Gazi, <xref ref-type="bibr" rid="B24">2005</xref>). Flexibility refers to the capacity to adapt to changing environments and requirements (Bayindir and &#x0015E;ahin, <xref ref-type="bibr" rid="B8">2007</xref>). This should not be confused with robustness, as flexibility relates to identifying and shifting to suitable new behaviors to solve a different problem that was a consequence of the changed circumstances. Proactive agent control is gaining attention as an approach used to facilitate flexibility in an agent system where multiple agents, and potentially humans, collaborate to operate the tasks based on each others&#x00027; capabilities, resources, and goals (Li et al., <xref ref-type="bibr" rid="B42">2023</xref>). This enables flexible management of tasks in dynamic and uncertain circumstances. Agile scheduling using negotiation mechanisms (Rabelo et al., <xref ref-type="bibr" rid="B54">1999</xref>) is another common approach used to improve the integration of information, coordination, and communication in agent systems targeting flexibility. The potential of the agent system to operate under a range of agent group sizes is known as scalability (&#x0015E;ahin, <xref ref-type="bibr" rid="B57">2005</xref>). Decentralized and self-organized systems using approaches such as component distribution and replication (Deters, <xref ref-type="bibr" rid="B20">2001</xref>), are capable of supporting the functioning of a group of agents irrespective of unexpected adversities such as malfunctioning agents, with a relatively low impact on performance. Lastly, modularity refers to the ability to use individual components, to operate on multiple and diverse functions and configurations to better pursue dynamic and complex environments (Beni, <xref ref-type="bibr" rid="B9">2005</xref>). A modularized system can use individual agents interchangeably within the system for different tasks more easily than a single centralized system. Improving capability relations (Nunes, <xref ref-type="bibr" rid="B49">2014</xref>) including association, composition, and generalization within agents is a common approach that is utilized in facilitating modularity of agent systems. Ideally, a resilient autonomous agent system should be able to adopt each of these respective attributes in response to the diverse environments and interactions it encounters.</p>
<p>The state-of-the-art in achieving resilience in agent systems have mostly focussed on decentralized control architectures (Ghedini et al., <xref ref-type="bibr" rid="B26">2018</xref>; Sartoretti et al., <xref ref-type="bibr" rid="B61">2019</xref>; Zhang et al., <xref ref-type="bibr" rid="B72">2020</xref>; Blumenkamp et al., <xref ref-type="bibr" rid="B11">2022</xref>). As discussed by Iocchi et al. (<xref ref-type="bibr" rid="B32">2001</xref>), a decentralized architecture could be distinguished from a centralized architecture based on how the decision-making process is designed. In a centralized system, there exists a central governing unit, an entity, or an agent that is in-charge of organizing and allocating tasks to the rest of the agents in the system. There are also weak centralized architectures where the central unit or the leader is dynamically assigned (Swaminathan et al., <xref ref-type="bibr" rid="B67">2015</xref>) or the trajectory of the leader is dynamically decided (Rezaee et al., <xref ref-type="bibr" rid="B56">2021</xref>) based on the environmental changes or interruptions and failures. In contrast, a decentralized control architecture (also referred to as distributed control architectures) has no central authority or a leader and the decision-making process is distributed among all agents in the system. These architectures have proven to be more scalable compared to centralized architectures as the computational burden of having a central controlling unit is mitigated increasing adaptivity in dynamic environments (Ismail and Sariff, <xref ref-type="bibr" rid="B33">2018</xref>). However, they are not without limitations. Efficient task distribution is still challenging in adverse circumstances as the impacts cannot be anticipated beforehand (Steele and Thomas, <xref ref-type="bibr" rid="B66">2007</xref>). Decentralized systems can reorganize the task allocation strategy but real-time adaptation can introduce unforeseen issues due to conflicts in capacity and skills associated with individual agents in the system.</p>
<p>Improving redundancy is another approach that has been used to achieve resilience in agent systems (Hazon and Kaminka, <xref ref-type="bibr" rid="B29">2005</xref>). An agent system can be redundant in terms of the number of agents that can address a functionality or in terms of the number of approaches that a single task can be executed by the system. A failure or malfunction of a single agent or a functionality can be addressed by another agent or a different strategy in a redundant system. Therefore, it has been proven that redundancy can be leveraged to achieve parallelization in task execution while withstanding individual failures (Hsieh and Mather, <xref ref-type="bibr" rid="B30">2013</xref>). However, coordinating and resolving the redundancies in a system is still a challenge and requires the exploitation of different managing criteria (Krizmancic et al., <xref ref-type="bibr" rid="B36">2020</xref>). Further, these systems can often only vary within a defined spectrum of functions/parameters/sensory attributes and are not equipped with the decision-making capacity to identify new strategies or solutions to problems in real-time under dynamic circumstances.</p>
<p>Facilitating agent systems with distributed sensing capabilities such that multiple agents could sense the environment, can increase the signal-to-noise ratio of the system. This can help detect changes in the environment and identify and track anomalies and adversarial impacts on the system while collaboratively mapping dynamic environments (Bossens et al., <xref ref-type="bibr" rid="B14">2022</xref>). Similar measures also include approaches such as distributed diffusion where information collected from different sensors or input sources of multiple agents is incorporated and the aggregation is used in decision-making of each agent in the system (Li et al., <xref ref-type="bibr" rid="B41">2020</xref>). While distributed sensing facilitates robustness and increased relevancy of sensory measurements, the decision-making process after collecting this information has to be separately addressed.</p>
<p>The investigation of the said state-of-the-art approaches for achieving resilience demonstrates that these models suffer from limitations associated with anticipating dynamic and adverse environmental conditions. While they facilitate adaptive controls across a bounded range of potential dynamics, they lack the capacity to analyse an existing situation and make insights into unforeseen circumstances as a means of making predictions or preparations to adapt to these environments.</p></sec>
<sec id="s3">
<title>3. Counterfactual learning as a means to interpret machine learning models</title>
<p>In this section, the concept of counterfactuals is introduced as discussed within the literature on causal inference and how it has evolved as counterfactual learning which is an emerging technique used in explainable AI to improve the interpretability of machine learning models.</p>
<p>Counterfactuals have long been used in inferring causal relations (Lewis, <xref ref-type="bibr" rid="B40">1974</xref>). They enable explaining past events and outcomes, and also predict future ones through links between causes and effects. They are essentially <italic>what-if</italic> questions; or conditional assertions where the antecedent is false, and the consequent describes what would have happened if it was true. Counterfactuals take a similar form to the following: &#x0201C;You were denied a loan because your annual income was $30,000. If your income had been $45,000, you would have been offered a loan&#x0201D; (Wachter et al., <xref ref-type="bibr" rid="B71">2017</xref>). In this example, the statement of decision states the current representation of the world. The counterfactual that follows, describes what needs to change for the representation of the world to be favorable. As such, counterfactuals are, ideally, inferences that are made based on alternative possibilities.</p>
<p>Research in AI and machine learning has started recognizing the knowledge of counterfactuals as a useful attribute in multiple fields such as computer vision, natural language processing (NLP), and recommender systems. It is used as a means to supplement incomplete data and partial feedback received from learning models to improve performance through estimations and inferences. In computer vision models, counterfactual analysis has been used to clarify the existences of biases. For example, Joo and K&#x000E4;rkk&#x000E4;inen (<xref ref-type="bibr" rid="B35">2020</xref>) uses a model that generates counterfactural images across the dimensions of race and gender to evaluate the fairness of image classification techniques. Counterfactual image generation methods are also used in open set recognition (Neal et al., <xref ref-type="bibr" rid="B47">2018</xref>) to train classifiers to detect unknown classes which are close to the training examples yet do not belong to any training category. Similar to computer vision models, NLP models have also utilized counterfactuals to mitigate annotation biases in datasets through causal inference. Tian et al. (<xref ref-type="bibr" rid="B69">2022</xref>) propose a counterfactual reasoning framework to mitigate the biases formulated in tasks of natural language inference, and Zhu et al. (<xref ref-type="bibr" rid="B73">2020</xref>) use counterfactual reasoning for open-domain dialogue generation where outcomes of alternatives for trained policies are used in the learning framework to expand the search space for higher performance levels. In recommender systems, counterfactual estimators enable investigating how a particular recommendation policy would have performed if it had been used instead of the policy that was used (Saito and Joachims, <xref ref-type="bibr" rid="B58">2021</xref>).</p>
<p>Among these, the most prominent applications of counterfactuals in AI are perhaps seen in explainable AI (XAI) where it is commonly used as a model-agnostic and post-hoc technique to improve explainability in machine learning models. XAI is conceptualized in three phases of explanation which explore the aspects of: explanation generation, explanation communication, and explanation reception (Neerincx et al., <xref ref-type="bibr" rid="B48">2018</xref>). In this regard, XAI is discussed across perceptual and cognitive categories. Perceptual XAI is concerned with providing perceptual foundations of agent behavior and the AI architecture used; whereas cognitive XAI is associated with describing why a certain action was chosen by relating them to the agents&#x00027; beliefs and goals (Neerincx et al., <xref ref-type="bibr" rid="B48">2018</xref>). Existing XAI approaches include the use of: saliency maps that describe the important aspects of inputs to a system (Adebayo et al., <xref ref-type="bibr" rid="B2">2018</xref>); explainable BDI (belief-desire-intention) agent architecture that provides reasoning for changes in behavior (Harbers et al., <xref ref-type="bibr" rid="B28">2012</xref>); and fuzzy-cognitive maps that can simulate the behavior of systems under different conditions to explain potential outcomes and decision making scenarios (Apostolopoulos and Groumpos, <xref ref-type="bibr" rid="B3">2023</xref>), among others. Causal inferences generated through counterfactual learning could be used in two aspects: local and non-local in generating XAI models for agent systems in contrast to the other existing approaches. Local counterfactual learning focuses on what features, attributes, or characteristics would need to be changed to achieve a certain outcome (Chou et al., <xref ref-type="bibr" rid="B19">2022</xref>). Further, non-local counterfactuals can be used to determine absence causations which identify the lacking, failure, or non-occurrence of attributes or characteristics that lead to certain outcomes of the system (Dowe, <xref ref-type="bibr" rid="B21">2009</xref>). The dynamic and ad-hoc explanations of these models and the understanding of their decision-making process can be utilized to improve the accuracy of the models as well as in increasing the trust of human users in these systems (Byrne, <xref ref-type="bibr" rid="B17">2019</xref>).</p></sec>
<sec id="s4">
<title>4. Proposed counterfactual learning-based approach for resilience in autonomous agent systems</title>
<p>The existing AI applications of counterfactual learning discussed in Section 3 have identified the means of using the causal relations generated by counterfactuals to further analyse and gain insights into these applications. Based on this understanding, this paper proposes that the exploitation of the causal inferences that can be extracted through counterfactuals is useful in enhancing the resilience of autonomous agent systems in terms of analysing the present experiences to foresee and prepare for dynamic environmental conditions that are beyond the boundaries of experienced outcomes.</p>
<p><xref ref-type="fig" rid="F1">Figure 1</xref> elaborates how the knowledge of counterfactuals can be integrated in an agent system. In a typical agent-based environment, the agent(s) are endowed with pre-knowledge on the domain, a set of actions, and goals. As each agent senses and interacts with the other agents and/or the environment, they build perceptions which result in actions that affect the neighboring environment (Samarasinghe et al., <xref ref-type="bibr" rid="B60">2022b</xref>). The said perceptions can be used in generating the counterfactual knowledge in an active learning environment to influence and improve the actions and thus the performance of the system. Three primary avenues on how counterfactuals can facilitate resilience in agent systems are focussed in this aspect. In this regard, this paper identifies that counterfactuals can be used to:</p>
<list list-type="bullet">
<list-item><p>Generate new knowledge for agents based on their current awareness of the system.</p></list-item>
<list-item><p>Describe and assess the decision-making process of agents in the system.</p></list-item>
<list-item><p>Construct explanations of cause-effect or reason-action relationships between events and agents in the system to improve the implementation architecture.</p></list-item>
</list>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Incorporating counterfactual knowledge in an agent system within an active learning environment. Each agent is endowed with prior domain knowledge, actions, and goals which are used in building the perception of the current world. The counterfactual knowledge derived from the current perceptions can influence the actions and thus the performance of the system.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="frai-06-1212336-g0001.tif"/>
</fig>
<p>As the first approach, counterfactuals can be used to generate new knowledge based on the current awareness on a system, its operations, and the environment it is operating in. They have been used in multiple applications for agent systems to supplement incomplete and/or limited data. Counterfactuals can be used to analyse the current situation and envisage the different outcomes by changing the antecedents to add to the knowledge. Counterfactual predictions are used by Jin et al. (<xref ref-type="bibr" rid="B34">2022</xref>) to guide the exploration of a reinforcement learner for robotic manipulation tasks. The active counterfactual predictions generated add to the existing body of knowledge about the task for the learning model to improve its performance. Mannion et al. (<xref ref-type="bibr" rid="B44">2016</xref>) propose a novel reward shaping technique to improve multi-agent reinforcement learning (MARL) for autonomous control of agents in complex environments. The method they discuss: Estimated Counterfactual as Potential, is capable of generating new potential functions to shape the behaviors of the agents, using approximations of knowledge on different evaluations using counterfactuals. The estimated potentials of the agents can help these agents outperform the agents learning only with system evaluation functions. Further, counterfactual multi-agent policy gradients are also used with similar intentions to estimate new knowledge to optimize agent policies learned with machine learning models (Foerster et al., <xref ref-type="bibr" rid="B23">2018</xref>). However, the expected new knowledge could be unique and belong to a different dimension from the current knowledge due to changed environmental conditions and interactions. Therefore, it should be noted that approaches such as logical autoregression and extrapolation which impose strong assumptions on the agents&#x00027; rationality and the dynamics of the environment may not always be accurate. Methods such as Robust Multi-Agent Counterfactual Prediction (RMAC; Peysakhovich et al., <xref ref-type="bibr" rid="B52">2019</xref>) can be utilized to analyse the sensitivity of counterfactual predictions and derived knowledge to violation of such assumptions.</p>
<p>Secondly, counterfactuals could be used as a means to describe or assess how the outcome of a certain decision-making process could have been better. Counterfactual regret minimization (CFR) is a widely adopted algorithm in solving large imperfect-information games (Brown et al., <xref ref-type="bibr" rid="B15">2019</xref>). These games typically simulate real world domains including cybersecurity interactions, auctions, and negotiations which consist of partial information-based strategic interactions between multiple agents. CFR algorithms can train agents to solve problems under such conditions through self-play. The current strategy of each agent is improved by comparing it to others through calculating a <italic>counterfactual regret</italic> for not taking an action at a particular state. This regret represents a quantitative approximation of how good the outcome would have been had it picked a certain action based on the information it gains by observing the system. Counterfactual reasoning is also used in interactive environments for motion planning through inferences about other interacting agents that could have led to improved results/performance. Inference about potential motion of other agents derived through counterfactuals is used by Bordallo et al. (<xref ref-type="bibr" rid="B13">2015</xref>) to understand how the current motion plan could be improved in a real-time learning setting. Efficient iterative planning is achieved for a multi-robot navigation scenario through the predictions made on how good the outcome could be, by observing the actions of other agents.</p>
<p>Finally, the construction of explanations through the identification of cause-effect or reason-action relationships between events can further enhance the resilience of agent systems by incorporating the insights thus gained into the implementation architecture. They can enable users and designers of such systems to make inferences such as Modus Tollens (of the form: <italic>If P, then Q. Not Q. Therefore, not P</italic>) through simulation of multiple possibilities of interactions and events which is otherwise strenuous within the limits of human cognitive capacity. Mueller et al. (<xref ref-type="bibr" rid="B45">2021</xref>) propose a visual interactive model for robot skill learning from visual demonstrations. An interactive augmented reality-based approach is used to iteratively program robot skills through the demonstrations. They utilize Robot Behavior Counterfactuals (RBCs) to provide <italic>what-if</italic> visualizations to explain the effects that alternatives and constraints have on adapting skills to altered environments. The behavior verification indicators provided by the counterfactuals support the users in gaining a better understanding of the model and when and where it can be applied and succeed or fail to further improve the robot training models. Similarly, counterfactual explanations have also been used in envisioning the potential movement paths in autonomous robotic control while coping with the disturbances and uncertainties in the environment. In Smith and Ramamoorthy (<xref ref-type="bibr" rid="B65">2020</xref>), a generative model to produce counterfactual disturbances is proposed as a means to characterize the potential of the robot controls. They are used to explain the perception-action relations which in turn, can be used to modify the configurations to achieve an expected outcome.</p></sec>
<sec sec-type="conclusions" id="s5">
<title>5. Conclusion</title>
<p>Intelligent agent systems demand the need for automation that can facilitate active learning, adaptation, and evolution in order to be able to sustain in complex and dynamic environmental niches without direct human supervision. Therefore, such systems should be resilient, i.e., should have the capacity to anticipate, respond to, adapt to, and recover from adverse and dynamic conditions in complex environments. In particular, resilient agent systems should be able to generate agile policies that can predict and overcome adverse conditions that are beyond the modeled boundaries (Prorok et al., <xref ref-type="bibr" rid="B53">2021</xref>). As such, a resilient system should possess the four attributes: robustness, flexibility, scalability, and modularity. As discussed in Section 2, each of these attributes uniquely characterize different aspects of resilience. Robustness is the ability to preserve the functionality of the system in partial failures, whereas flexibility is associated with the ability to adapt to suitable new behaviors when the environment and circumstances change. Scalability relates to the potential of the agents to operate with different agent group sizes facilitating failed or newly added agents in the system. Modularity covers the ability to use individual components/agents of the system to operate different functionalities. A resilient agent system should encompass all these aspects to be successful in interacting with dynamic environments. In this perspective, this paper proposes the use of counterfactual knowledge as a means to facilitate resilience in agent systems with its ability to make inferences about existing causal relations and predictions on future impacts.</p>
<p>The avenues discussed in Section 3 have provided evidence that they can enhance the boundaries of the current agent learning models that are limited in their capacity to anticipate and react to unmodeled unforeseen scenarios. In order to cater to all four attributes of resilience, an ensemble of all three avenues of counterfactuals: generating new knowledge to understand the potential of the agent system; assessing how the outcomes of decision-making processes could improve, and explaining the cause-effect relationships between the events; is suggested as the way forward. The implementation of the model could take an iterative evolutionary approach where the counterfactual knowledge generated from the three avenues influence the actions of the agent system which in turn feed the evolutionary model as the current perceptions to generate new counterfactual knowledge in the subsequent iteration. The generation of new knowledge based on the current awareness can be made possible with the use of an Estimate Potential approach for counterfactuals as discussed in Section 4. The explanations of the cause-effect relationships of the current system can be derived from these counterfactuals that can enhance the understanding of the decision-making process of the agents. With the explanations, the new potential functions generated can be used to shape the agent behaviors in conjunction with the existing knowledge. An RMAC method can be utilized to analyse the sensitivity of the counterfactual knowledge to the dynamics of the environment, which can inform the decision to of selecting the most robust actions. CFR approach discussed in Section 4 can be used as a mean to assess whether and how the current decision process of the system could be improved based on comparisons against the derived counterfactuals. The combined knowledge derived from the union of these counterfactual-based approaches can then be used to identify the most suitable actions for the agent system. The variations in the environment and consequences of the actions can then be reiterated as perceptions to feed the counterfactual knowledge module to generate the next set of actions for the system.</p>
<p>The said model can enhance the four aspects of resilience (robustness, flexibility, scalability, and modularity) facilitating intelligent autonomous agents that can better perform in dynamic and complex environments. As expected of a robust system, the ability to preserve functionalities and continue operation amid partial failures can be ensured through the understanding of causal inferences of the current domain provided by counterfactual analysis. As discussed by Peysakhovich et al. (<xref ref-type="bibr" rid="B52">2019</xref>), counterfactuals can be used to achieve robust performance across different environments with changing rules. The flexibility to switch to new behaviors as a consequence of changed circumstances and requirements is ensured through the avenues that explore counterfactuals in generating new knowledge based on the current perceptions. Both scalable and modular characteristics of an agent system that can operate under a range of agent group sizes and optimally utilize different components (agents or skills of individual agents) of the system to operate multiple functions, can be facilitated through the knowledge of current cause-effect relationships and the understanding of how the outcomes of the decision-making process can be improved. This can enable appropriate distribution of tasks among the available number of agents based on their individual skills and dynamic requirements that arise (Singh et al., <xref ref-type="bibr" rid="B63">2021</xref>).</p>
<p>As identified, counterfactual learning has the potential to advance the future of intelligent resilient agent systems. However, it is not without limitations. Specifically in agent system, there exists a requirement for the counterfactual explanations and new knowledge generated to represent the system&#x00027;s operation along causal, justificatory, and purposive lines (Arnold et al., <xref ref-type="bibr" rid="B4">2021</xref>). As counterfactuals can make inferences and predictions resulting in actions that are outside the boundaries of the modeled circumstances, another layer of validation could be required based on determinants and constraints on the course of actions the agents can take. Further, there exist a tendency for counterfactuals to be unstable with respect to the changes in the environment (Artelt et al., <xref ref-type="bibr" rid="B5">2021</xref>). A small change in the input can result in a drastic change in the output. As a result, different actions recommended may have arbitrary changes in complexity which might cause issues in practicability of application. Therefore, incorporation of counterfactuals require careful consideration and further research into how the limitations can be overcome. However, the discussion presented in this paper lends valuable insights into the potential of counterfactuals as a successful alternative to model resilient autonomous agent systems to address the complex and dynamic requirements of the real-world applications.</p></sec>
<sec sec-type="data-availability" id="s6">
<title>Data availability statement</title>
<p>The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.</p></sec>
<sec sec-type="author-contributions" id="s7">
<title>Author contributions</title>
<p>DS conceived the idea of the paper, prepared, and reviewed the article.</p></sec>
</body>
<back>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of interest</title>
<p>The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s8">
<title>Publisher&#x00027;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>&#x000C5;str&#x000F6;m</surname> <given-names>K. J.</given-names></name> <name><surname>Wittenmark</surname> <given-names>B.</given-names></name></person-group> (<year>2013</year>). <source>Adaptive Control</source>. <publisher-loc>Chelmsford, MA</publisher-loc>: <publisher-name>Courier Corporation</publisher-name>.</citation>
</ref>
<ref id="B2">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Adebayo</surname> <given-names>J.</given-names></name> <name><surname>Gilmer</surname> <given-names>J.</given-names></name> <name><surname>Muelly</surname> <given-names>M.</given-names></name> <name><surname>Goodfellow</surname> <given-names>I.</given-names></name> <name><surname>Hardt</surname> <given-names>M.</given-names></name> <name><surname>Kim</surname> <given-names>B.</given-names></name></person-group> (<year>2018</year>). <article-title>&#x0201C;Sanity checks for saliency maps,&#x0201D;</article-title> in <source>Advances in Neural Information Processing Systems, Vol. 31</source>, eds <person-group person-group-type="editor"><name><surname>Bengio</surname> <given-names>S.</given-names></name> <name><surname>Wallach</surname> <given-names>H.</given-names></name> <name><surname>Larochelle</surname> <given-names>H.</given-names></name> <name><surname>Grauman</surname> <given-names>K.</given-names></name> <name><surname>Cesa-Bianchi</surname> <given-names>N.</given-names></name> <name><surname>Garnett</surname> <given-names>R.</given-names></name></person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Curran Associates, Inc.</publisher-name>).</citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Apostolopoulos</surname> <given-names>I. D.</given-names></name> <name><surname>Groumpos</surname> <given-names>P. P.</given-names></name></person-group> (<year>2023</year>). <article-title>Fuzzy cognitive maps: their role in explainable artificial intelligence</article-title>. <source>Appl. Sci</source>. <volume>13</volume>:<fpage>3412</fpage>. <pub-id pub-id-type="doi">10.3390/app13063412</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arnold</surname> <given-names>T.</given-names></name> <name><surname>Kasenberg</surname> <given-names>D.</given-names></name> <name><surname>Scheutz</surname> <given-names>M.</given-names></name></person-group> (<year>2021</year>). <article-title>Explaining in time: Meeting interactive standards of explanation for robotic systems</article-title>. <source>J. Hum. Robot Interact</source>. <volume>10</volume>:<fpage>23</fpage>. <pub-id pub-id-type="doi">10.1145/3457183</pub-id></citation>
</ref>
<ref id="B5">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Artelt</surname> <given-names>A.</given-names></name> <name><surname>Vaquet</surname> <given-names>V.</given-names></name> <name><surname>Velioglu</surname> <given-names>R.</given-names></name> <name><surname>Hinder</surname> <given-names>F.</given-names></name> <name><surname>Brinkrolf</surname> <given-names>J.</given-names></name> <name><surname>Schilling</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2021</year>). <article-title>&#x0201C;Evaluating robustness of counterfactual explanations,&#x0201D;</article-title> in <source>2021 IEEE Symposium Series on Computational Intelligence (SSCI)</source> (<publisher-loc>Orlando, FL</publisher-loc>), <fpage>1</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1109/SSCI50451.2021.9660058</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Azp&#x000FA;rua</surname> <given-names>H.</given-names></name> <name><surname>Saboia</surname> <given-names>M.</given-names></name> <name><surname>Freitas</surname> <given-names>G. M.</given-names></name> <name><surname>Clark</surname> <given-names>L.</given-names></name> <name><surname>Akbar Agha-Mohammadi</surname> <given-names>A.</given-names></name> <name><surname>Pessin</surname> <given-names>G.</given-names></name> <etal/></person-group>. (<year>2023</year>). <article-title>A survey on the autonomous exploration of confined subterranean spaces: perspectives from real-word and industrial robotic deployments</article-title>. <source>Robot. Auton. Syst</source>. <volume>160</volume>:<fpage>104304</fpage>. <pub-id pub-id-type="doi">10.1016/j.robot.2022.104304</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barber</surname> <given-names>K. S.</given-names></name> <name><surname>Goel</surname> <given-names>A.</given-names></name> <name><surname>Martin</surname> <given-names>C. E.</given-names></name></person-group> (<year>2000</year>). <article-title>Dynamic adaptive autonomy in multi-agent systems</article-title>. <source>J. Exp. Theor. Artif. Intell</source>. <volume>12</volume>, <fpage>129</fpage>&#x02013;<lpage>147</lpage>. <pub-id pub-id-type="doi">10.1080/095281300409793</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bayindir</surname> <given-names>L.</given-names></name> <name><surname>&#x0015E;ahin</surname> <given-names>E.</given-names></name></person-group> (<year>2007</year>). <article-title>A review of studies in swarm robotics</article-title>. <source>Turk. J. Electr. Eng. Comput. Sci</source>. <volume>15</volume>, <fpage>115</fpage>&#x02013;<lpage>147</lpage>.</citation>
</ref>
<ref id="B9">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Beni</surname> <given-names>G.</given-names></name></person-group> (<year>2005</year>). <article-title>&#x0201C;From swarm intelligence to swarm robotics,&#x0201D;</article-title> in <source>Swarm Robotics</source>, eds <person-group person-group-type="editor"><name><surname>&#x0015E;ahin</surname> <given-names>E.</given-names></name>  <name><surname>Spears</surname> <given-names>W. M.</given-names></name></person-group> (<publisher-loc>Berlin; Heidelberg</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>1</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-540-30552-1_1</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Bernon</surname> <given-names>C.</given-names></name> <name><surname>Camps</surname> <given-names>V.</given-names></name> <name><surname>Gleizes</surname> <given-names>M.-P.</given-names></name> <name><surname>Picard</surname> <given-names>G.</given-names></name></person-group> (<year>2005</year>). <article-title>&#x0201C;Engineering adaptive multi-agent systems: the Adelfe methodology,&#x0201D;</article-title> in <source>Agent-Oriented Methodologies</source> (<publisher-loc>IGI Global</publisher-loc>), eds <person-group person-group-type="editor"><name><surname>Henderson-Sellers</surname> <given-names>B.</given-names></name> <name><surname>Giorgini</surname> <given-names>P.</given-names></name></person-group> (<publisher-loc>Pennsylvania, PA</publisher-loc>: <publisher-name>Hershey</publisher-name>), <fpage>172</fpage>&#x02013;<lpage>202</lpage>. <pub-id pub-id-type="doi">10.4018/978-1-59140-581-8.ch007</pub-id></citation>
</ref>
<ref id="B11">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Blumenkamp</surname> <given-names>J.</given-names></name> <name><surname>Morad</surname> <given-names>S.</given-names></name> <name><surname>Gielis</surname> <given-names>J.</given-names></name> <name><surname>Li</surname> <given-names>Q.</given-names></name> <name><surname>Prorok</surname> <given-names>A.</given-names></name></person-group> (<year>2022</year>). <article-title>&#x0201C;A framework for real-world multi-robot systems running decentralized GNN-based policies,&#x0201D;</article-title> in <source>2022 International Conference on Robotics and Automation (ICRA)</source> (<publisher-loc>Philadelphia, PA</publisher-loc>), <fpage>8772</fpage>&#x02013;<lpage>8778</lpage>. <pub-id pub-id-type="doi">10.1109/ICRA46639.2022.9811744</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bogue</surname> <given-names>R.</given-names></name></person-group> (<year>2019</year>). <article-title>Strong prospects for robots in retail</article-title>. <source>Indus. Robot</source> <volume>46</volume>, <fpage>326</fpage>&#x02013;<lpage>331</lpage>. <pub-id pub-id-type="doi">10.1108/IR-01-2019-0023</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Bordallo</surname> <given-names>A.</given-names></name> <name><surname>Previtali</surname> <given-names>F.</given-names></name> <name><surname>Nardelli</surname> <given-names>N.</given-names></name> <name><surname>Ramamoorthy</surname> <given-names>S.</given-names></name></person-group> (<year>2015</year>). <article-title>&#x0201C;Counterfactual reasoning about intent for interactive navigation in dynamic environments,&#x0201D;</article-title> in <source>2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)</source> (<publisher-loc>Hamburg</publisher-loc>), <fpage>2943</fpage>&#x02013;<lpage>2950</lpage>. <pub-id pub-id-type="doi">10.1109/IROS.2015.7353783</pub-id></citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bossens</surname> <given-names>D. M.</given-names></name> <name><surname>Ramchurn</surname> <given-names>S.</given-names></name> <name><surname>Tarapore</surname> <given-names>D.</given-names></name></person-group> (<year>2022</year>). <article-title>Resilient robot teams: a review integrating decentralised control, change-detection, and learning</article-title>. <source>Curr. Robot. Rep</source>. <volume>3</volume>, <fpage>85</fpage>&#x02013;<lpage>95</lpage>. <pub-id pub-id-type="doi">10.1007/s43154-022-00079-4</pub-id></citation>
</ref>
<ref id="B15">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>N.</given-names></name> <name><surname>Lerer</surname> <given-names>A.</given-names></name> <name><surname>Gross</surname> <given-names>S.</given-names></name> <name><surname>Sandholm</surname> <given-names>T.</given-names></name></person-group> (<year>2019</year>). <article-title>&#x0201C;Deep counterfactual regret minimization,&#x0201D;</article-title> in <source>Proceedings of the 36th International Conference on Machine Learning, Vol. 97 of Proceedings of Machine Learning Research</source>, eds <person-group person-group-type="editor"><name><surname>Chaudhuri</surname> <given-names>K.</given-names></name> <name><surname>Salakhutdinov</surname> <given-names>R.</given-names></name></person-group> (<publisher-loc>Cambridge, MA</publisher-loc>), <fpage>793</fpage>&#x02013;<lpage>802</lpage>.</citation>
</ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bu</surname> <given-names>K.</given-names></name> <name><surname>Gong</surname> <given-names>X.</given-names></name> <name><surname>Yu</surname> <given-names>C.</given-names></name> <name><surname>Xie</surname> <given-names>F.</given-names></name></person-group> (<year>2022</year>). <article-title>Biomimetic aquatic robots based on fluid-driven actuators: a review</article-title>. <source>J. Mar. Sci. Eng</source>. <volume>10</volume>:<fpage>735</fpage>. <pub-id pub-id-type="doi">10.3390/jmse10060735</pub-id></citation>
</ref>
<ref id="B17">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Byrne</surname> <given-names>R. M.</given-names></name></person-group> (<year>2019</year>). <article-title>&#x0201C;Counterfactuals in explainable artificial intelligence (XAI): evidence from human reasoning,&#x0201D;</article-title> in <source>IJCAI</source> (<publisher-loc>California, CA</publisher-loc>), <fpage>6276</fpage>&#x02013;<lpage>6282</lpage>. <pub-id pub-id-type="doi">10.24963/ijcai.2019/876</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chakraborty</surname> <given-names>S.</given-names></name> <name><surname>Elangovan</surname> <given-names>D.</given-names></name> <name><surname>Govindarajan</surname> <given-names>P. L.</given-names></name> <name><surname>ELnaggar</surname> <given-names>M. F.</given-names></name> <name><surname>Alrashed</surname> <given-names>M. M.</given-names></name> <name><surname>Kamel</surname> <given-names>S.</given-names></name></person-group> (<year>2022</year>). <article-title>A comprehensive review of path planning for agricultural ground robots</article-title>. <source>Sustainability</source> <volume>14</volume>:<fpage>9156</fpage>. <pub-id pub-id-type="doi">10.3390/su14159156</pub-id></citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chou</surname> <given-names>Y.-L.</given-names></name> <name><surname>Moreira</surname> <given-names>C.</given-names></name> <name><surname>Bruza</surname> <given-names>P.</given-names></name> <name><surname>Ouyang</surname> <given-names>C.</given-names></name> <name><surname>Jorge</surname> <given-names>J.</given-names></name></person-group> (<year>2022</year>). <article-title>Counterfactuals and causability in explainable artificial intelligence: theory, algorithms, and applications</article-title>. <source>Inform. Fus</source>. <volume>81</volume>, <fpage>59</fpage>&#x02013;<lpage>83</lpage>. <pub-id pub-id-type="doi">10.1016/j.inffus.2021.11.003</pub-id></citation>
</ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Deters</surname> <given-names>R.</given-names></name></person-group> (<year>2001</year>). <article-title>&#x0201C;Scalability &#x00026; multi-agent systems,&#x0201D;</article-title> in <source>2nd International Workshop Infrastructure for Agents, MAS and Scalable MAS, Vol. 10</source>. Montreal.</citation>
</ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dowe</surname> <given-names>P.</given-names></name></person-group> (<year>2009</year>). <article-title>Absences, possible causation, and the problem of non-locality</article-title>. <source>Monist</source> <volume>92</volume>, <fpage>23</fpage>&#x02013;<lpage>40</lpage>. <pub-id pub-id-type="doi">10.5840/monist20099212</pub-id></citation>
</ref>
<ref id="B22">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Filipovi&#x00107;</surname> <given-names>F.</given-names></name> <name><surname>Balen</surname> <given-names>J.</given-names></name> <name><surname>Damjanovic</surname> <given-names>D.</given-names></name> <name><surname>Rimal</surname> <given-names>R.</given-names></name> <name><surname>Azeem</surname> <given-names>M. M.</given-names></name> <name><surname>Shukla</surname> <given-names>A.</given-names></name></person-group> (<year>2023</year>). <article-title>&#x0201C;Proposal of multi-agent robotic fire extinguishing model for industrial premises,&#x0201D;</article-title> in <source>2023 22nd International Symposium INFOTEH-JAHORINA (INFOTEH)</source> (<publisher-loc>East Sarajevo</publisher-loc>), <fpage>1</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1109/INFOTEH57020.2023.10094113</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Foerster</surname> <given-names>J.</given-names></name> <name><surname>Farquhar</surname> <given-names>G.</given-names></name> <name><surname>Afouras</surname> <given-names>T.</given-names></name> <name><surname>Nardelli</surname> <given-names>N.</given-names></name> <name><surname>Whiteson</surname> <given-names>S.</given-names></name></person-group> (<year>2018</year>). <article-title>Counterfactual multi-agent policy gradients</article-title>. <source>Proc. AAAI Conf. Artif. Intell</source>. <volume>32</volume>:<fpage>1</fpage>. <pub-id pub-id-type="doi">10.1609/aaai.v32i1.11794</pub-id><pub-id pub-id-type="pmid">32446194</pub-id></citation></ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gazi</surname> <given-names>V.</given-names></name></person-group> (<year>2005</year>). <article-title>Swarm aggregations using artificial potentials and sliding-mode control</article-title>. <source>IEEE Trans. Robot</source>. <volume>21</volume>, <fpage>1208</fpage>&#x02013;<lpage>1214</lpage>. <pub-id pub-id-type="doi">10.1109/TRO.2005.853487</pub-id></citation>
</ref>
<ref id="B25">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Gazi</surname> <given-names>V.</given-names></name> <name><surname>Fidan</surname> <given-names>B.</given-names></name></person-group> (<year>2007</year>). <article-title>&#x0201C;Coordination and control of multi-agent dynamic systems: models and approaches,&#x0201D;</article-title> in <source>Swarm Robotics</source>, eds <person-group person-group-type="editor"><name><surname>&#x0015E;ahin</surname> <given-names>E.</given-names></name> <name><surname>Spears</surname> <given-names>W. M.</given-names></name> <name><surname>Winfield</surname> <given-names>A. F. T.</given-names></name></person-group> (<publisher-loc>Berlin; Heidelberg</publisher-loc>. <publisher-name>Springer</publisher-name> Berlin Heidelberg), <fpage>71</fpage>&#x02013;<lpage>102</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-540-71541-2_6</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Ghedini</surname> <given-names>C.</given-names></name> <name><surname>Ribeiro</surname> <given-names>C. H. C.</given-names></name> <name><surname>Sabattini</surname> <given-names>L.</given-names></name></person-group> (<year>2018</year>). <source>A Decentralized Control Strategy for Resilient Connectivity Maintenance in Multi-Robot Systems Subject to Failures</source>. <publisher-loc>Cham</publisher-loc>: <publisher-name>Springer International Publishing</publisher-name>, <fpage>9</fpage>&#x02013;<lpage>102</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-319-73008-0_7</pub-id></citation>
</ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ha</surname> <given-names>D.</given-names></name> <name><surname>Tang</surname> <given-names>Y.</given-names></name></person-group> (<year>2022</year>). <article-title>Collective intelligence for deep learning: a survey of recent developments</article-title>. <source>Collect. Intell</source>. <volume>1</volume>:<fpage>26339137221114874</fpage>. <pub-id pub-id-type="doi">10.1177/26339137221114874</pub-id></citation>
</ref>
<ref id="B28">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Harbers</surname> <given-names>M.</given-names></name> <name><surname>Bradshaw</surname> <given-names>J. M.</given-names></name> <name><surname>Johnson</surname> <given-names>M.</given-names></name> <name><surname>Feltovich</surname> <given-names>P.</given-names></name> <name><surname>van den Bosch</surname> <given-names>K.</given-names></name> <name><surname>Meyer</surname> <given-names>J.-J.</given-names></name></person-group> (<year>2012</year>). <article-title>&#x0201C;Explanation in human-agent teamwork,&#x0201D;</article-title> in <source>Coordination, Organizations, Institutions, and Norms in Agent System VII</source>, eds <person-group person-group-type="editor"><name><surname>Cranefield</surname> <given-names>S.</given-names></name> <name><surname>van Riemsdijk</surname> <given-names>M. B.</given-names></name> <name><surname>V&#x000E1;zquez-Salceda</surname> <given-names>J.</given-names></name> <name><surname>Noriega</surname> <given-names>P.</given-names></name></person-group> (<publisher-loc>Berlin; Heidelberg</publisher-loc>: <publisher-name>Springer Berlin Heidelberg</publisher-name>), <fpage>21</fpage>&#x02013;<lpage>37</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-642-35545-5_2</pub-id><pub-id pub-id-type="pmid">36158603</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Hazon</surname> <given-names>N.</given-names></name> <name><surname>Kaminka</surname> <given-names>G.</given-names></name></person-group> (<year>2005</year>). <article-title>&#x0201C;Redundancy, efficiency and robustness in multi-robot coverage,&#x0201D;</article-title> in <source>Proceedings of the 2005 IEEE International Conference on Robotics and Automation</source> (<publisher-loc>Barcelona</publisher-loc>), <fpage>735</fpage>&#x02013;<lpage>741</lpage>. <pub-id pub-id-type="doi">10.1109/ROBOT.2005.1570205</pub-id></citation>
</ref>
<ref id="B30">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Hsieh</surname> <given-names>M. A.</given-names></name> <name><surname>Mather</surname> <given-names>T. W.</given-names></name></person-group> (<year>2013</year>). <source>Robustness in the Presence of Task Differentiation in Robot Ensembles</source>. <publisher-loc>Berlin; Heidelberg</publisher-loc>: <publisher-name>Springer Berlin Heidelberg</publisher-name>, <fpage>93</fpage>&#x02013;<lpage>108</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-642-33971-4_6</pub-id><pub-id pub-id-type="pmid">27534393</pub-id></citation></ref>
<ref id="B31">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Huamanchahua</surname> <given-names>D.</given-names></name> <name><surname>Aubert</surname> <given-names>K.</given-names></name> <name><surname>Rivas</surname> <given-names>M.</given-names></name> <name><surname>Guerrero</surname> <given-names>E.</given-names></name> <name><surname>Kodaka</surname> <given-names>L.</given-names></name> <name><surname>Guevara</surname> <given-names>D.</given-names></name></person-group> (<year>2022</year>). <article-title>&#x0201C;Land-mobile robots for rescue and search: a technological and systematic review,&#x0201D;</article-title> in <source>2022 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS)</source> (<publisher-loc>Toronto, ON</publisher-loc>), <fpage>1</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1109/IEMTRONICS55184.2022.9795829</pub-id></citation>
</ref>
<ref id="B32">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Iocchi</surname> <given-names>L.</given-names></name> <name><surname>Nardi</surname> <given-names>D.</given-names></name> <name><surname>Salerno</surname> <given-names>M.</given-names></name></person-group> (<year>2001</year>). <article-title>&#x0201C;Reactivity and deliberation: a survey on multi-robot systems,&#x0201D;</article-title> in <source>Balancing Reactivity and Social Deliberation in Multi-Agent Systems</source> (<publisher-loc>Berlin; Heidelberg</publisher-loc>: <publisher-name>Springer Berlin Heidelberg</publisher-name>), <fpage>9</fpage>&#x02013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.1007/3-540-44568-4_2</pub-id></citation>
</ref>
<ref id="B33">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Ismail</surname> <given-names>Z. H.</given-names></name> <name><surname>Sariff</surname> <given-names>N.</given-names></name></person-group> (<year>2018</year>). <article-title>&#x0201C;A survey and analysis of cooperative multi-agent robot systems: challenges and directions,&#x0201D;</article-title> in <source>Applications of Mobile Robots</source> (<publisher-loc>IntechOpen</publisher-loc>), <fpage>8</fpage>&#x02013;<lpage>14</lpage>.</citation>
</ref>
<ref id="B34">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Jin</surname> <given-names>J.</given-names></name> <name><surname>Graves</surname> <given-names>D.</given-names></name> <name><surname>Haigh</surname> <given-names>C.</given-names></name> <name><surname>Luo</surname> <given-names>J.</given-names></name> <name><surname>Jagersand</surname> <given-names>M.</given-names></name></person-group> (<year>2022</year>). <article-title>&#x0201C;Offline learning of counterfactual predictions for real-world robotic reinforcement learning</article-title>, in <italic>2022 International Conference on Robotics and Automation (ICRA)</italic> (<publisher-loc>Philadelphia, PA</publisher-loc>), <fpage>3616</fpage>&#x02013;<lpage>3623</lpage>. <pub-id pub-id-type="doi">10.1109/ICRA46639.2022.9811963</pub-id></citation>
</ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><collab>Joo J. and K&#x000E4;rkk&#x000E4;inen, K.</collab></person-group> (<year>2020</year>). <article-title>Gender slopes: counterfactual fairness for computer vision models by attribute manipulation</article-title>. <source>arXiv:2005.10430</source>. <pub-id pub-id-type="doi">10.1145/3422841.3423533</pub-id></citation>
</ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krizmancic</surname> <given-names>M.</given-names></name> <name><surname>Arbanas</surname> <given-names>B.</given-names></name> <name><surname>Petrovic</surname> <given-names>T.</given-names></name> <name><surname>Petric</surname> <given-names>F.</given-names></name> <name><surname>Bogdan</surname> <given-names>S.</given-names></name></person-group> (<year>2020</year>). <article-title>Cooperative aerial-ground multi-robot system for automated construction tasks</article-title>. <source>IEEE Robot. Autom. Lett</source>. <volume>5</volume>, <fpage>798</fpage>&#x02013;<lpage>805</lpage>. <pub-id pub-id-type="doi">10.1109/LRA.2020.2965855</pub-id></citation>
</ref>
<ref id="B37">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Kulkarni</surname> <given-names>M.</given-names></name> <name><surname>Dharmadhikari</surname> <given-names>M.</given-names></name> <name><surname>Tranzatto</surname> <given-names>M.</given-names></name> <name><surname>Zimmermann</surname> <given-names>S.</given-names></name> <name><surname>Reijgwart</surname> <given-names>V.</given-names></name> <name><surname>De Petris</surname> <given-names>P.</given-names></name> <etal/></person-group>. (<year>2022</year>). <article-title>&#x0201C;Autonomous teamed exploration of subterranean environments using legged and aerial robots,&#x0201D;</article-title> in <source>2022 International Conference on Robotics and Automation (ICRA)</source> (<publisher-loc>Philadelphia, PA</publisher-loc>), <fpage>3306</fpage>&#x02013;<lpage>3313</lpage>. <pub-id pub-id-type="doi">10.1109/ICRA46639.2022.9812401</pub-id></citation>
</ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kyrarini</surname> <given-names>M.</given-names></name> <name><surname>Lygerakis</surname> <given-names>F.</given-names></name> <name><surname>Rajavenkatanarayanan</surname> <given-names>A.</given-names></name> <name><surname>Sevastopoulos</surname> <given-names>C.</given-names></name> <name><surname>Nambiappan</surname> <given-names>H. R.</given-names></name> <name><surname>Chaitanya</surname> <given-names>K. K.</given-names></name> <etal/></person-group>. (<year>2021</year>). <article-title>A survey of robots in healthcare</article-title>. <source>Technologies</source> <volume>9</volume>:<fpage>8</fpage>. <pub-id pub-id-type="doi">10.3390/technologies9010008</pub-id></citation>
</ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lan</surname> <given-names>G.</given-names></name> <name><surname>van Hooft</surname> <given-names>M.</given-names></name> <name><surname>De Carlo</surname> <given-names>M.</given-names></name> <name><surname>Tomczak</surname> <given-names>J. M.</given-names></name> <name><surname>Eiben</surname> <given-names>A.</given-names></name></person-group> (<year>2021</year>). <article-title>Learning locomotion skills in evolvable robots</article-title>. <source>Neurocomputing</source> <volume>452</volume>, <fpage>294</fpage>&#x02013;<lpage>306</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2021.03.030</pub-id><pub-id pub-id-type="pmid">23080038</pub-id></citation></ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lewis</surname> <given-names>D.</given-names></name></person-group> (<year>1974</year>). <article-title>Causation</article-title>. <source>J. Philos</source>. <volume>70</volume>, <fpage>556</fpage>&#x02013;<lpage>567</lpage>. <pub-id pub-id-type="doi">10.2307/2025310</pub-id></citation>
</ref>
<ref id="B41">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Abbas</surname> <given-names>W.</given-names></name> <name><surname>Shabbir</surname> <given-names>M.</given-names></name> <name><surname>Koutsoukos</surname> <given-names>X. D.</given-names></name></person-group> (<year>2020</year>). <article-title>&#x0201C;Resilient distributed diffusion for multi-robot systems using centerpoint,&#x0201D;</article-title> in <source>Robotics: Science and Systems</source> (<publisher-loc>Corvalis, OR</publisher-loc>). <pub-id pub-id-type="doi">10.15607/RSS.2020.XVI.021</pub-id></citation>
</ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>S.</given-names></name> <name><surname>Zheng</surname> <given-names>P.</given-names></name> <name><surname>Liu</surname> <given-names>S.</given-names></name> <name><surname>Wang</surname> <given-names>Z.</given-names></name> <name><surname>Wang</surname> <given-names>X. V.</given-names></name> <name><surname>Zheng</surname> <given-names>L.</given-names></name> <etal/></person-group>. (<year>2023</year>). <article-title>Proactive human robot collaboration: mutual-cognitive, predictable, and self-organising perspectives</article-title>. <source>Robot. Comput. Integr. Manufact</source>. <volume>81</volume>:<fpage>102510</fpage>. <pub-id pub-id-type="doi">10.1016/j.rcim.2022.102510</pub-id></citation>
</ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luders</surname> <given-names>B.</given-names></name> <name><surname>Kothari</surname> <given-names>M.</given-names></name> <name><surname>How</surname> <given-names>J.</given-names></name></person-group> (<year>2010</year>). <article-title>&#x0201C;Chance constrained RRT for probabilistic robustness to environmental uncertainty,&#x0201D;</article-title> in <source>AIAA Guidance, Navigation, and Control Conference</source>. <pub-id pub-id-type="doi">10.2514/6.2010-8160</pub-id></citation>
</ref>
<ref id="B44">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Mannion</surname> <given-names>P.</given-names></name> <name><surname>Duggan</surname> <given-names>J.</given-names></name> <name><surname>Howley</surname> <given-names>E.</given-names></name></person-group> (<year>2016</year>). <article-title>&#x0201C;Generating multi-agent potential functions using counterfactual estimates,&#x0201D;</article-title> in <source>Proceedings of Learning, Inference and Control of Multi-Agent Systems (at NIPS 2016)</source> (<publisher-loc>New York, NY</publisher-loc>), <fpage>3643</fpage>&#x02013;<lpage>3652</lpage>.</citation>
</ref>
<ref id="B45">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Mueller</surname> <given-names>C.</given-names></name> <name><surname>Tabrez</surname> <given-names>A.</given-names></name> <name><surname>Hayes</surname> <given-names>B.</given-names></name></person-group> (<year>2021</year>). <article-title>&#x0201C;Interactive constrained learning from demonstration using visual robot behavior counterfactuals,&#x0201D;</article-title> in <source>Proceedings of the Accessibility of Robot Programming and Work of the Future Workshop at RSS, Vol. 2021</source>. <publisher-loc>California, CA</publisher-loc>.</citation>
</ref>
<ref id="B46">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Nanadrekar</surname> <given-names>J. S. J.</given-names></name> <name><surname>Subramanian</surname> <given-names>M.</given-names></name> <name><surname>Aditya</surname> <given-names>U.</given-names></name> <name><surname>Jain</surname> <given-names>G.</given-names></name> <name><surname>Rathinasamy</surname> <given-names>V.</given-names></name> <name><surname>Bhatawdekar</surname> <given-names>R. M.</given-names></name></person-group> (<year>2022</year>). <article-title>&#x0201C;Robots in mining,&#x0201D;</article-title> in <source>Proceedings of Geotechnical Challenges in Mining, Tunneling and Underground Infrastructures</source>, eds <person-group person-group-type="editor"><name><surname>Verma</surname> <given-names>A. K.</given-names></name> <name><surname>Mohamad</surname> <given-names>E. T.</given-names></name> <name><surname>Bhatawdekar</surname> <given-names>R. M.</given-names></name> <name><surname>Raina</surname> <given-names>A. K.</given-names></name> <name><surname>Khandelwal</surname> <given-names>M.</given-names></name> <name><surname>Armaghani</surname> <given-names>D.</given-names></name> <name><surname>Sarkar</surname> <given-names>K.</given-names></name></person-group> (<publisher-loc>Singapore</publisher-loc>: <publisher-name>Springer Nature Singapore</publisher-name>), <fpage>625</fpage>&#x02013;<lpage>631</lpage>. <pub-id pub-id-type="doi">10.1007/978-981-16-9770-8_41</pub-id></citation>
</ref>
<ref id="B47">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Neal</surname> <given-names>L.</given-names></name> <name><surname>Olson</surname> <given-names>M.</given-names></name> <name><surname>Fern</surname> <given-names>X.</given-names></name> <name><surname>Wong</surname> <given-names>W.-K.</given-names></name> <name><surname>Li</surname> <given-names>F.</given-names></name></person-group> (<year>2018</year>). <article-title>&#x0201C;Open set learning with counterfactual images,&#x0201D;</article-title> in <source>Proceedings of the European Conference on Computer Vision (ECCV)</source> (<publisher-loc>Munich</publisher-loc>). <pub-id pub-id-type="doi">10.1007/978-3-030-01231-1_38</pub-id></citation>
</ref>
<ref id="B48">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Neerincx</surname> <given-names>M. A.</given-names></name> <name><surname>Waa</surname> <given-names>J.</given-names></name> <name><surname>Kaptein</surname> <given-names>F.</given-names></name> <name><surname>Diggelen</surname> <given-names>J. V.</given-names></name></person-group> (<year>2018</year>). <article-title>&#x0201C;Using perceptual and cognitive explanations for enhanced human-agent team performance,&#x0201D;</article-title> in <source>International Conference on Engineering Psychology and Cognitive Ergonomics</source> (<publisher-loc>Las Vegas, NV</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>204</fpage>&#x02013;<lpage>214</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-319-91122-9_18</pub-id></citation>
</ref>
<ref id="B49">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Nunes</surname> <given-names>I.</given-names></name></person-group> (<year>2014</year>). <article-title>&#x0201C;Improving the design and modularity of BDI agents with capability relationships,&#x0201D;</article-title> in <source>Engineering Multi-Agent Systems</source>, eds <person-group person-group-type="editor"><name><surname>Dalpiaz</surname> <given-names>F.</given-names></name> <name><surname>Dix</surname> <given-names>J.</given-names></name> <name><surname>van Riemsdijk</surname> <given-names>M. B.</given-names></name></person-group> (<publisher-loc>Cham</publisher-loc>: <publisher-name>Springer International Publishing</publisher-name>), <fpage>58</fpage>&#x02013;<lpage>80</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-319-14484-9_4</pub-id></citation>
</ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Olfati-Saber</surname> <given-names>R.</given-names></name> <name><surname>Murray</surname> <given-names>R.</given-names></name></person-group> (<year>2004</year>). <article-title>Consensus problems in networks of agents with switching topology and time-delays</article-title>. <source>IEEE Trans. Autom. Control</source> <volume>49</volume>, <fpage>1520</fpage>&#x02013;<lpage>1533</lpage>. <pub-id pub-id-type="doi">10.1109/TAC.2004.834113</pub-id><pub-id pub-id-type="pmid">24109207</pub-id></citation></ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parker</surname> <given-names>L.</given-names></name></person-group> (<year>1998</year>). <article-title>Alliance: an architecture for fault tolerant multirobot cooperation</article-title>. <source>IEEE Trans. Robot. Autom</source>. <volume>14</volume>, <fpage>220</fpage>&#x02013;<lpage>240</lpage>. <pub-id pub-id-type="doi">10.1109/70.681242</pub-id></citation>
</ref>
<ref id="B52">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Peysakhovich</surname> <given-names>A.</given-names></name> <name><surname>Kroer</surname> <given-names>C.</given-names></name> <name><surname>Lerer</surname> <given-names>A.</given-names></name></person-group> (<year>2019</year>). <article-title>&#x0201C;Robust multi-agent counterfactual prediction,&#x0201D;</article-title> in <source>Advances in Neural Information Processing Systems, Vol. 32</source>, eds <person-group person-group-type="editor"><name><surname>Wallach</surname> <given-names>H.</given-names></name> <name><surname>Larochelle</surname> <given-names>H.</given-names></name> <name><surname>Beygelzimer</surname> <given-names>A.</given-names></name> <name><surname>d&#x00027;Alch&#x000E9;-Buc</surname> <given-names>F.</given-names></name> <name><surname>Fox</surname> <given-names>E.</given-names></name> <name><surname>Garnett</surname> <given-names>R.</given-names></name></person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Curran Associates, Inc.</publisher-name>).</citation>
</ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Prorok</surname> <given-names>A.</given-names></name> <name><surname>Malencia</surname> <given-names>M.</given-names></name> <name><surname>Carlone</surname> <given-names>L.</given-names></name> <name><surname>Sukhatme</surname> <given-names>G. S.</given-names></name> <name><surname>Sadler</surname> <given-names>B. M.</given-names></name> <name><surname>Kumar</surname> <given-names>V.</given-names></name></person-group> (<year>2021</year>). <article-title>Beyond robustness: a taxonomy of approaches towards resilient multi-robot systems</article-title>. <source>CoRR, abs/2109.12343</source>.</citation>
</ref>
<ref id="B54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rabelo</surname> <given-names>R.</given-names></name> <name><surname>Camarinha-Matos</surname> <given-names>L.</given-names></name> <name><surname>Afsarmanesh</surname> <given-names>H.</given-names></name></person-group> (<year>1999</year>). <article-title>Multi-agent-based agile scheduling</article-title>. <source>Robot. Auton. Syst</source>. <volume>27</volume>, <fpage>15</fpage>&#x02013;<lpage>28</lpage>. <pub-id pub-id-type="doi">10.1016/S0921-8890(98)00080-3</pub-id></citation>
</ref>
<ref id="B55">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Raja</surname> <given-names>P.</given-names></name> <name><surname>Pugazhenthi</surname> <given-names>S.</given-names></name></person-group> (<year>2012</year>). <article-title>Optimal path planning of mobile robots: a review</article-title>. <source>Int. J. Phys. Sci</source>. <volume>7</volume>, <fpage>1314</fpage>&#x02013;<lpage>1320</lpage>. <pub-id pub-id-type="doi">10.5897/IJPS11.1745</pub-id><pub-id pub-id-type="pmid">30235894</pub-id></citation></ref>
<ref id="B56">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rezaee</surname> <given-names>H.</given-names></name> <name><surname>Parisini</surname> <given-names>T.</given-names></name> <name><surname>Polycarpou</surname> <given-names>M. M.</given-names></name></person-group> (<year>2021</year>). <article-title>Resiliency in dynamic leader follower multiagent systems</article-title>. <source>Automatica</source> <volume>125</volume>:<fpage>109384</fpage>. <pub-id pub-id-type="doi">10.1016/j.automatica.2020.109384</pub-id></citation>
</ref>
<ref id="B57">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>&#x0015E;ahin</surname> <given-names>E.</given-names></name></person-group> (<year>2005</year>). <article-title>&#x0201C;Swarm robotics: from sources of inspiration to domains of application,&#x0201D;</article-title> in <source>Swarm Robotics</source>, eds <person-group person-group-type="editor"><name><surname>&#x0015E;ahin</surname> <given-names>E.</given-names></name> <name><surname>Spears</surname> <given-names>W. M.</given-names></name></person-group> (<publisher-loc>Berlin; Heidelberg</publisher-loc>: <publisher-name>Springer Berlin Heidelberg</publisher-name>), <fpage>10</fpage>&#x02013;<lpage>20</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-540-30552-1_2</pub-id></citation>
</ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saito</surname> <given-names>Y.</given-names></name> <name><surname>Joachims</surname> <given-names>T.</given-names></name></person-group> (<year>2021</year>). <article-title>&#x0201C;Counterfactual learning and evaluation for recommender systems: Foundations, implementations, and recent advances,&#x0201D;</article-title> in <source>Proceedings of the 15th ACM Conference on Recommender Systems, RecSys &#x00027;21</source>, (New York, NY: Association for Computing Machinery), <fpage>828</fpage>&#x02013;<lpage>830</lpage>. <pub-id pub-id-type="doi">10.1145/3460231.3473320</pub-id></citation>
</ref>
<ref id="B59">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Samarasinghe</surname> <given-names>D.</given-names></name> <name><surname>Barlow</surname> <given-names>M.</given-names></name> <name><surname>Lakshika</surname> <given-names>E.</given-names></name></person-group> (<year>2022a</year>). <article-title>Flow-based reinforcement learning</article-title>. <source>IEEE Access</source> <volume>10</volume>, <fpage>102247</fpage>&#x02013;<lpage>102265</lpage>. <pub-id pub-id-type="doi">10.1109/ACCESS.2022.3209260</pub-id></citation>
</ref>
<ref id="B60">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Samarasinghe</surname> <given-names>D.</given-names></name> <name><surname>Barlow</surname> <given-names>M.</given-names></name> <name><surname>Lakshika</surname> <given-names>E.</given-names></name> <name><surname>Kasmarik</surname> <given-names>K.</given-names></name></person-group> (<year>2022b</year>). <article-title>Grammar-based cooperative learning for evolving collective behaviours in multi-agent systems</article-title>. <source>Swarm Evol. Comput</source>. <volume>69</volume>:<fpage>101017</fpage>. <pub-id pub-id-type="doi">10.1016/j.swevo.2021.101017</pub-id></citation>
</ref>
<ref id="B61">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sartoretti</surname> <given-names>G.</given-names></name> <name><surname>Paivine</surname> <given-names>W.</given-names></name> <name><surname>Shi</surname> <given-names>Y.</given-names></name> <name><surname>Wu</surname> <given-names>Y.</given-names></name> <name><surname>Choset</surname> <given-names>H.</given-names></name></person-group> (<year>2019</year>). <article-title>Distributed learning of decentralized control policies for articulated mobile robots</article-title>. <source>IEEE Trans. Robot</source>. <volume>35</volume>, <fpage>1109</fpage>&#x02013;<lpage>1122</lpage>. <pub-id pub-id-type="doi">10.1109/TRO.2019.2922493</pub-id></citation>
</ref>
<ref id="B62">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shneier</surname> <given-names>M.</given-names></name> <name><surname>Bostelman</surname> <given-names>R.</given-names></name></person-group> (<year>2015</year>). <source>Literature Review of Mobile Robots for Manufacturing</source>. Technical report. <pub-id pub-id-type="doi">10.6028/NIST.IR.8022</pub-id></citation>
</ref>
<ref id="B63">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Singh</surname> <given-names>A. J.</given-names></name> <name><surname>Kumar</surname> <given-names>A.</given-names></name> <name><surname>Lau</surname> <given-names>H. C.</given-names></name></person-group> (<year>2021</year>). <article-title>&#x0201C;Approximate difference rewards for scalable multigent reinforcement learning,&#x0201D;</article-title> in <source>Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems, AAMAS &#x00027;21</source> (<publisher-loc>Richland, SC</publisher-loc>: <publisher-name>International Foundation for Autonomous Agents and Multiagent Systems</publisher-name>), <fpage>1655</fpage>&#x02013;<lpage>1657</lpage>.</citation>
</ref>
<ref id="B64">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Singh</surname> <given-names>B.</given-names></name> <name><surname>Sellappan</surname> <given-names>N.</given-names></name> <name><surname>Kumaradhas</surname> <given-names>P.</given-names></name></person-group> (<year>2013</year>). <article-title>Evolution of industrial robots and their applications</article-title>. <source>Int. J. Emerg. Technol. Adv. Eng</source>. <volume>3</volume>, <fpage>763</fpage>&#x02013;<lpage>768</lpage>.</citation>
</ref>
<ref id="B65">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Smith</surname> <given-names>S. C.</given-names></name> <name><surname>Ramamoorthy</surname> <given-names>S.</given-names></name></person-group> (<year>2020</year>). <article-title>&#x0201C;Counterfactual explanation and causal inference in service of robustness in robot control,&#x0201D;</article-title> in <source>2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)</source> (<publisher-loc>Valparaiso</publisher-loc>), <fpage>1</fpage>&#x02013;<lpage>8</lpage>. <pub-id pub-id-type="doi">10.1109/ICDL-EpiRob48136.2020.9278061</pub-id></citation>
</ref>
<ref id="B66">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Steele</surname> <given-names>F.</given-names></name> <name><surname>Thomas</surname> <given-names>G.</given-names></name></person-group> (<year>2007</year>). <article-title>&#x0201C;Directed stigmergy-based control for multi-robot systems,&#x0201D;</article-title> in <source>2007 2nd ACM/IEEE International Conference of Human-Robot Interaction (HRI)</source> (<publisher-loc>Arlington, VA</publisher-loc>), <fpage>223</fpage>&#x02013;<lpage>230</lpage>. <pub-id pub-id-type="doi">10.1145/1228716.1228747</pub-id></citation>
</ref>
<ref id="B67">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Swaminathan</surname> <given-names>S.</given-names></name> <name><surname>Phillips</surname> <given-names>M.</given-names></name> <name><surname>Likhachev</surname> <given-names>M.</given-names></name></person-group> (<year>2015</year>). <article-title>&#x0201C;Planning for multi-agent teams with leader switching,&#x0201D;</article-title> in <source>2015 IEEE International Conference on Robotics and Automation (ICRA)</source> (<publisher-loc>Seattle, WA</publisher-loc>), <fpage>5403</fpage>&#x02013;<lpage>5410</lpage>. <pub-id pub-id-type="doi">10.1109/ICRA.2015.7139954</pub-id></citation>
</ref>
<ref id="B68">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Taylor</surname> <given-names>A. T.</given-names></name> <name><surname>Berrueta</surname> <given-names>T. A.</given-names></name> <name><surname>Murphey</surname> <given-names>T. D.</given-names></name></person-group> (<year>2021</year>). <article-title>Active learning in robotics: a review of control principles</article-title>. <source>Mechatronics</source> <volume>77</volume>:<fpage>102576</fpage>. <pub-id pub-id-type="doi">10.1016/j.mechatronics.2021.102576</pub-id></citation>
</ref>
<ref id="B69">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tian</surname> <given-names>B.</given-names></name> <name><surname>Cao</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Xing</surname> <given-names>C.</given-names></name></person-group> (<year>2022</year>). <article-title>Debiasing nlu models via causal intervention and counterfactual reasoning</article-title>. <source>Proc. AAAI Conf. Artif. Intell</source>. <volume>36</volume>, <fpage>11376</fpage>&#x02013;<lpage>11384</lpage>. <pub-id pub-id-type="doi">10.1609/aaai.v36i10.21389</pub-id></citation>
</ref>
<ref id="B70">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Voth</surname> <given-names>D.</given-names></name></person-group> (<year>2004</year>). <article-title>A new generation of military robots</article-title>. <source>IEEE Intell. Syst</source>. <volume>19</volume>, <fpage>2</fpage>&#x02013;<lpage>3</lpage>. <pub-id pub-id-type="doi">10.1109/MIS.2004.30</pub-id></citation>
</ref>
<ref id="B71">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wachter</surname> <given-names>S.</given-names></name> <name><surname>Mittelstadt</surname> <given-names>B.</given-names></name> <name><surname>Russell</surname> <given-names>C.</given-names></name></person-group> (<year>2017</year>). <article-title>Counterfactual explanations without opening the black box: automated decisions and the GDPR</article-title>. <source>Harv. JL Tech</source>. <volume>31</volume>:<fpage>841</fpage>. <pub-id pub-id-type="doi">10.2139/ssrn.3063289</pub-id></citation>
</ref>
<ref id="B72">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Sun</surname> <given-names>Y.</given-names></name> <name><surname>Barth</surname> <given-names>A.</given-names></name> <name><surname>Ma</surname> <given-names>O.</given-names></name></person-group> (<year>2020</year>). <article-title>Decentralized control of multi-robot system in cooperative object transportation using deep reinforcement learning</article-title>. <source>IEEE Access</source> <volume>8</volume>, <fpage>184109</fpage>&#x02013;<lpage>184119</lpage>. <pub-id pub-id-type="doi">10.1109/ACCESS.2020.3025287</pub-id></citation>
</ref>
<ref id="B73">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhu</surname> <given-names>Q.</given-names></name> <name><surname>Zhang</surname> <given-names>W.-N.</given-names></name> <name><surname>Liu</surname> <given-names>T.</given-names></name> <name><surname>Wang</surname> <given-names>W. Y.</given-names></name></person-group> (<year>2020</year>). <article-title>&#x0201C;Counterfactual off-policy training for neural dialogue generation,&#x0201D;</article-title> in <source>Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)</source> (Association for Computational Linguistics), <fpage>3438</fpage>&#x02013;<lpage>3448</lpage>. <pub-id pub-id-type="doi">10.18653/v1/2020.emnlp-main.276</pub-id><pub-id pub-id-type="pmid">36568019</pub-id></citation></ref>
</ref-list> 
</back>
</article> 