Article,

Proactive hybrid learning and optimisation in self-adaptive systems: The swarm-fleet infrastructure scenario

, , , and .
Information and Software Technology, (2022)
DOI: 10.1016/j.infsof.2022.106826

Abstract

Context: Smart and adaptive Systems, such as self-adaptive and self-organising (SASO) systems, typically consist of a large set of highly autonomous and heterogeneous subsystems that are able to adapt their behaviour to the requirements of ever-changing, dynamic environments. Their successful operation is based on appropriate modelling of the internal and external conditions. Objective: The control problem for establishing a near-to-optimal coordinated behaviour of systems with multiple, potentially conflicting objectives is either approached in a distributed (i.e., fully autonomous by the autonomous subsystems) or in a centralised way (i.e. one instance controlling the optimisation and planning process). In the distributed approach, selfish behaviour and being limited to local knowledge may lead to sub-optimal system behaviour, while the centralised approach ignores the autonomy and the coordination efforts of parts of the system. Method: In this article, we present a concept for a hybrid (i.e., integrating a central optimisation with a distributed decision-making process) system management that combines local reinforcement learning and self-awareness mechanisms of fully autonomous subsystems with external system-wide planning and optimisation of adaptation freedom that steers the behaviour dynamically by issuing plans and guidelines augmented with incentivisation schemes. Results: This work addresses the inherent uncertainty of the dynamic system behaviour, the local autonomous and context-aware learning of subsystems, and proactive control based on adaptiveness. We provide the ‘swarm-fleet infrastructure’—a self-organised taxi service established by autonomous, privately-owned cars—as a testbed for structured comparison of systems. Conclusion: The ‘swarm-fleet infrastructure’ supports the advantages of a proactive hybrid self-adaptive and self-organising system operation. Further, we provide a system model to combine the system-wide optimisation while ensuring local decision-making through reinforcement learning for individualised configurations.

Tags

Users

  • @ies

Comments and Reviews