Abstract
?

This paper presents a distributed set-membership estimation and formation control algorithm for a fleet of vehicles. Each vehicle is equipped with an agent with sensing, acting, computational and communication capacities. The agent's objective is to compute estimation set for the state of all the fleet of vehicles and, additionally, generate control actions to keep its local vehicle in formation. In order to improve the estimation, the agents exchange information with other neighbouring agents. The estimation sets are mathematically characterised by zonotopes. The novelty of the proposed solution lies in the fact that the estimation is guaranteed in spite of the lack of knowledge of the control signal applied by the rest of vehicles. Then, in the presence of bounded disturbances and noises, the position reference for the vehicles can be tracked with a bounded estimation. The proposed solution is tested by simulations.
1. Introduction
The emergence of Wireless Sensor Networks (WSNs), together with the cheapening of IoT (Internet of Things) devices, has multiplied the number of tasks that can be performed in a distributed environment. Among them, distributed monitoring using a WSN is perhaps the field that has grown more in the last years. It is easy to find applications of distributed monitoring in many fields, such as agriculture (Bencini, Collodi, Di Palma, Manes, & Manes, 2009), water distribution system (Lin, Wu, & Wassell, 2008), pipelines (Stoianov, Nachman, & Madden, 2007), electrical power grids (Grilo et al., 2012), structural health monitoring (Yu-liang, Yu, Lian-fen, Jian, & Ying, 2012) or traffic control systems (Tubaishat, Shang, & Shi, 2007), just to name a few.
It is important to mention that distributed estimation and distributed monitoring are not exactly the same, but sometimes these terms are confused with each other. Whereas distributed monitoring implies the deployment of a WSN to measure certain parameters, and to reduce uncertainty and increase robustness, distributed estimation additionally implies to get to know certain hidden variables that are not directly measured and that can be inferred from the data using a dynamical description of the plant.
The literature of distributed estimation is vast, since it has become a hot topic in the last years. Most of the estimation algorithms are modifications of classical centralised estimators to cope with the new distributed paradigm. Then, different modifications of the distributed Kalman filter have been proposed for plants affected by Gaussian noises and disturbances (Battistelli & Chisci, 2016; Cattivelli & Sayed, 2010; Das & Moura, 2015; del Nozal, Orihuela, & Millán, 2017; Olfati-Saber, 2007; Wang, Li, & Zhang, 2017). Distributed Luenberger observers have been also developed for linear systems (del Nozal, Millán, Orihuela, Seuret, & Zaccarian, 2019; Matei & Baras, 2012; Mitra & Sundaram, 2018). For nonlinear systems, distributed Bayesian estimators have been developed in Forti et al. (2018). A complete survey of distributed estimation algorithms based on consensus strategies can be found in Garin and Schenato (2010).
However, the objective of guaranteed estimators is somewhat different. The estimation goal is to compute a set in which the actual state of the plant is contained for sure. Then, in addition to the estimation of the state, a guaranteed or set-membership estimator must provide a set that contains, with probability equal to one, the state of the system. In contrast to Kalman or stochastic filters, guaranteed estimators do not require to know the statistical description of noises and disturbances, but only their extreme bounds. Compared with classical methodologies, guaranteed observers find their place in applications where the uncertainty in the estimation might become a critical issue, such as constrained control.
Perhaps the first feature that makes the difference between the approaches found in the literature is the mathematical description of the sets, being the most common solutions those based in ellipsoids (Wei, Liu, Song, & Liu, 2015), zonotopes (Alamo, Bravo, & Camacho, 2005; Combastel, 2015) and interval observers (Chabane, Maniu, Alamo, Camacho, & Dumur, 2014; Mazenc & Bernard, 2011). Recently, the combination of probability-based and set-membership estimation has led to interesting novel formulations, as in Combastel (2016). Guaranteed estimators must not be confused with robust estimators that deal with uncertain models of the plant dynamics, as in Dong, Wang, Alsaadi, and Ahmad (2015).
When it comes to the extension of the aforementioned formulation to a distributed paradigm, a revision of the state of the art reveals that the literature is rather scarce. Using a zonotope-based description, the authors in Keiffer (2009), García, Rubio, Orihuela, Millán, and Ortega (2017) were able to present distributed estimation algorithms that make use of intersections between sets. Another idea was used in Orihuela, Roshany-Yamchi, Garcia, and Millán (2017), Orihuela, Millán, Roshany-Yamchi, and Garcia (2018), where the authors presented a Kalman-inspired set-membership observer extending the idea in Combastel (2015) to a distributed framework with network-based problems. The natural extension of the previously mentioned work (Combastel, 2016) has been recently published in Combastel and Zolghadri (2018). In Wang, Alamo, Puig, and Cembrano (2018), the authors propose a distributed zonotope-based estimator valid for a family of subsystems coupled by the state. Finally, the use of consensus techniques for distributed estimation with ellipsoidal sets was reported in Ma, Wang, Lam, and Kyriakoulis (2017).
Sometimes, the result of distributed estimation algorithms are used as inputs to distributed controllers. These are the so-called distributed joint estimation and control problem. When an agent is able to apply control actions to the plant, the estimation of the rest of agents is compromised, since those actions are unknown for the rest of agents. Hence, the solution for the joint problem is rather more complex. There exists a common line of research that proposes different formulations of a cascade structure, in which the estimation error must converge faster than the tracking error, see Rubio, Millán, Orihuela, and Vivas (2014), Zhang, Hengster-Movric, Sebek, Desmet, and Faria (2017). In all these papers, the authors assume that the control actions applied to the system at any moment are the same that the local agent would apply if it has access to all control channels. This incurs in an estimation error that disappears when the estimated states of every agent actually converge to the system state.
However, this idea cannot be applied directly to set-membership paradigms, since the described assumption will lead to non-guaranteed estimations in the transient. From the authors' knowledge, there exists no paper in which this problem has been solved, even for linear systems. The only solutions are restricted to formulations of the plant divided into subsystems for which the estimation objectives are just local, as in Orihuela et al. (2017), Orihuela et al. (2018), Combastel and Zolghadri (2018), Wang et al. (2018).
Distributed control methodologies have found application in the control of vehicle formations, where different approaches can be found in the literature either for path planning and formation control. For example, in Achtelik, Lynen, Weiss, Chli, and Siegwart (2014), Ousingsawat and Campbell (2004), path planning strategies are presented for aerial vehicles, ensuring knowledge of the intervals containing critical variables such as temperature, velocity, flow and so on. For surface vehicles, Liu and Bucknall (2015) tested its results in real navigation maritime environment, by using fast matching methodology to improve the path planning algorithm.
Regarding formation control, in Yang, Naeem, Irwin, and Li (2014) the task is transformed into a group of decentralised problems and a stability analysis is presented. In Dong and Geng (2016), a distributed control law is developed to achieve trajectory tracking for N vehicles in rigid formation. For one-dimensional formation of vehicles, Lin, Fardad, and Jovanović (2012) compute optimal control based on the information of the nearest neighbours. More flexibility in the communication is taken into consideration in Peng, Wen, Rahmani, and Yu (2015), where the formation control problem is solved by a consensus algorithm, or in Millan, Orihuela, Jurado, and Rodriguez Rubio (2014), where the control strategy proposed makes possible to deal with delays and packet dropouts. Non-linear approaches found in Li, Shi, and Yan (2016), Cai and De Queiroz (2015) deal with backstepping and receding horizon control, respectively. For constrained underactuated underwater vehicles, the leader–follower paradigm has been successfully applied in Li, Xie, and Yan (2016). Application on a quadrotor platform is presented in Dong, Yu, Shi, and Zhong (2015). Finally, model and environmental uncertainties are considered in Peng, Wang, Chen, Hu, and Lan (2013), Xue, Wu, and Wang (2015), respectively.
This paper tackles the joint problem of distributed set-membership estimation and formation control for fleets of vehicles. The main contributions of the paper are as follows:
A distributed set-membership estimator is applied for the first time for a fleet of vehicles. It is able to estimate, in a guaranteed way, the position and velocity of each vehicle in a distributed way, assuming that the vehicles measure only local information. In particular, one vehicle, the leader, is able to measure its absolute position, while the others measure their relative position with respect to their neighbours. In the literature, we can find an application of set-membership estimation for the localisation of a unique mobile robot in Zhou, Qian, Fang, Ma, and Dai (2015).
The proposed estimator provides guaranteed sets in spite of the presence of unknown control actions that are being applied to other vehicles. Distributed set-membership estimators dealing with unknown control signals have been only successfully applied when the estimation objectives were local (Combastel & Zolghadri, 2018; Orihuela et al., 2018, 2017; Wang et al., 2018).
In addition, the proposed methodology have some other relevant positive features. First, the sets are characterised as zonotopes, this easing the transmission of information through the network. Second, both the estimator and the controller are designed in a decentralised way. Finally, although the stability is proven for vehicles with linear models, it is shown that the proposed algorithm can be applied to the unicycle model under some mild assumptions.
The paper is organised as follows. Section 2 presents the notation and preliminary concepts related to set-membership estimation. Section 3 describes the fleet of vehicles, the agents and formulates the problem under consideration. Section 4 presents the distributed estimation algorithm. The formation control is described in Section 5. The proposed solution is tested by simulations in Section 6. Finally, some conclusions and remarks are drawn in Section 7.
2. Notation and preliminaries
A zonotope is an affine transformation of an unitary hypercube that can be used to characterise sets in a given state space. It will be represented with a calligraphic letter , and is defined by its centre
and its generator matrix
, that is:
(1)
(1) where
represent a column of matrix H. The dimension of the zonotope is n and its order is b. Let
and
be two zonotopes, and R a matrix, all of them of appropriate dimensions. Then, it holds:
(2)
(2)
(3)
(3) where ⊕ denotes the Minkowski sum of two sets.
The operator consists in an order reduction of a zonotope, in such a way that the order of
is smaller than the order of
, and
. The order of
is q. This operation has been defined in the literature in different ways. Throughout this document, the order reduction will be defined as in Combastel (2015). According to that paper, if the order is to be reduced from p to q, the procedure starts by sorting the columns of H on decreasing vector norm to construct matrix
:
and enclosing the set
generated by the b−q + n smaller columns into a box:
where
and
is computed as
A strip can be used to characterise a set of states consistent with a given scalar measurement. It will be represented with a capital letter S and is defined as the set
, where
and
.
The set of states resulting after the intersection of a strip and a zonotope, i.e. , or after the intersection of two zonotopes, i.e.
, can be enclosed by a new zonotope. With some abuse of notation, we will refer this new zonotope
as
or
. The construction of such zonotopes is well described in Alamo et al. (2005).
An estimation zonotope computed by agent i at time instant k with the information available at k is denoted as . Furthermore, a prediction zonotope at instant k + 1 with the information available at k is denoted as
.
The connectivity of the agents is topologically defined by a graph . The graph G is assumed connected and undirected, with vertices
and edges
. The set of vertices to which vertex i is connected is named the neighbourhood of i and is denoted by
. Edge
implies that vertex i sends information to vertex j. The distance between vertex i and vertex j, measured as the number of edges in a shortest path connecting them, is
. Finally, the diameter of the graph
is the greatest distance between any pair of vertices.
3. Problem formulation
This paper presents some contributions to the problem of estimation and formation control of fleet of vehicles. The objective is to present a distributed control algorithm that can be implemented when the disturbances affecting the vehicles are unknown but bounded. The distributed controllers require to know the state of the vehicles, which is not directly measurable. In order to solve this problem, a distributed guaranteed estimator is presented, in such a way that it is able to deal with the possible measurement noises and, at the same time, incorporate the information received from other estimators. The solution for the joint problem of guaranteed estimation and control and the complete algorithm that will be implemented in every agent is given in Sections 4–5.
Prior to that, the description of the fleet of vehicles and of the agents are given in the next sections. The control and estimation problem will be formally stated at the end of this section.
3.1. Fleet description
We consider a fleet of vehicles characterised as in Lafferriere, Williams, Caughman, and Veerman (2005), where p vehicles with the same dynamics1 are considered: (4)
(4) where the entries of
represent r configuration variables for vehicle i and their derivatives, and
represents control inputs. As in Lafferriere et al. (2005), it is further assumed that matrices
and
have the form:
(5)
(5) where ⊗ stands for the Kronecker product of two matrices. This model corresponds to the situation in which the individual configuration variables are decoupled and the acceleration is controlled separately. The odd-numbered entries of
represent position-like variables, whereas the even-numbered entries represent velocity-like variables.
The previous dynamics are discretised with sampling time and additive disturbances are included:
(6)
(6) where
correspond to the discrete-time version of
. The disturbances
can be caused by various reasons, such as non-modelled motor dynamics, lack of accuracy in certain actuators, wind (aerial or ground vehicles), currents (water vehicles) and so on. Throughout the paper, we consider that these disturbances are unknown, but they always belong to known bounded sets, that is,
. These sets are modelled as
(7)
(7) Let
be defined by stacking all the state vectors, control actions and disturbances of each vehicle, that is,
. Then, the dynamics of the complete fleet can be written as follows:
(8)
(8) where
and
.
3.1.1. Unicycle model for the vehicles
The presented formulation of the vehicles as linear systems in (4), although simple, can be used for more complicated models. In particular, this section summarises the results in Lawton, Beard, and Young (2003), in which they present an output feedback linearising control for the control of the hand position of a vehicle, whose dynamics can be written as a double integrator.
The hand position of the vehicle is a point that lies a distance
along the line that is normal to the wheel axis and intersects the wheel axis at the centre point
, as shown in Figure 1. As it is well known, the dynamical equations of the unicycle model are:
(9)
(9) where
,
,
,
are, respectively, the inertial position, orientation, linear speed and angular speed of vehicle i. The mass and moment of inertia of the vehicle are
and
. The input signals are the torque
and the force
.
Published online:
23 January 2020Figure 1. Unicycle model for the vehicle used for controlling the hand position.
Letting and
, the dynamical equations can be easily written as
(10)
(10) Considering the hand position of each vehicle as output, it is shown in Lawton et al. (2003) that system (10) has constant relative degree equal to 2 and can, therefore, be output feedback linearised about the hand position. Now, by using the next output feedback linearising control:
(11)
(11) the input–output dynamics of each vehicle can be represented by the double integrator system:
(12)
(12) It should be mentioned that the internal dynamics, related to the orientation of the vehicle itself, are unobservable and uncontrollable. However, it is also shown in Lawton et al. (2003) that this internal dynamics is stable, but not asymptotically stable. Therefore, we will assume that the orientation of each vehicle is locally measured.
3.2. Agent description
It is assumed that each vehicle is equipped with an agent that has sensing, control, computational and communication capacities. In what respect to the sensing ability, each agent i is assumed to measure the following output: (13)
(13) where
is the output vector,
is the output matrix and
represents the measurement noise. It is assumed that the noises are unknown, but they always belong to known bounded sets, that is,
. These sets are modelled as
(14)
(14) We distinguish two kinds of agents: (a) those who are able to measure the absolute position-like variables of its local vehicle (by using GPS-like sensors or similar) and (b) those who measure relative position-like variables (distances, with optical or acoustic sensors). Both kind of agents are assumed to get information concerning its local velocity-like variables. Matrix
, as supposed, is structurally different for each kind of agent.
Consider p = 3 vehicles moving in a one-dimensional space. Then, and vector
.
Let us assume that agent 1 is able to measure its own position, whereas agents 2 and 3 measure relative positions. Considering a graph topology given by , matrices
are given by
The agents are endowed with communication devices that let them send/receive information to/from their neighbours. The communication topology is mathematically described by a graph G. It is assumed that two possible topologies are valid, chain or star. This assumption will be required later to solve the problem of distributed guaranteed estimation. For instance, for a four vehicle fleet, the two possible schemes are depicted in Figure 2.2 The arrows represent the communication between the agents.
Published online:
23 January 2020Figure 2. Scheme of the formation of the vehicles with the communication between agents: (a) Chain and (b) Star.
It is worth mentioning that those topologies only apply for the communication, not for the very formation of the vehicles. It will be shown in the examples that both topologies can be used to keep different formations.
By using the available information, that is, the measured outputs and the information received from the neighbourhood, the agents must compute estimation sets within which the state must be contained. The construction of this estimation set is described in Section 4.
Regarding the control of the vehicle, it is considered that each agent computes and applies the control signals to its local vehicle. The control signal applied by agent i to the vehicle is chosen as (15)
(15) where the design of matrix
will be tackled in Section 5.
3.3. Problem statement
The goal of control formation problem is to keep a prescribed position for the vehicles. The global reference to be tracked is assumed to be known by each vehicle. Now, the problem can be formally stated.
Let us consider a fleet of p vehicles described by the dynamical model (6). Consider an identical number of agents deployed together with the vehicles. Those agents get some outputs from the fleet given by (13). Let us assume that the disturbances and noises affecting the state and output are bounded by (7)–(14). The agents are able to exchange information according to a communication graph G, with chain or star topology. Given a set of position-like references for each vehicle , the objective of each agent is as follows:
Find an estimation set
such that
for all instant k.
Design a local controller
in such a way that the fleet is able to follow the reference
.
Compared to Orihuela et al. (2017), the estimation objective is here more complex, since in that paper the agents' goal was to estimate the state of its associated subsystem . The ignorance of the control actions applied by the rest of agents, that is,
for
, does not lead to a difficulty in the estimation problem formulated in Orihuela et al. (2017). However, in Problem 1, this fact appears as the main complication.
4. Guaranteed distributed estimator
This section presents the estimation algorithm that aims to satisfy objective (1) of Problem 3.1. Nevertheless, there are some common obstacles that arise in this sort of estimation and control problems that should be described first.
4.1. Main obstacles and proposed solutions for the joint distributed control and estimation problem
When facing the joint problem of distributed estimation and control, the control engineer finds two main drawbacks:
Estimating the whole state with partial information: This difficulty is common in the literature of distributed estimation, no matter the sort of estimator used. The output measured by an agent does not contain enough information to estimate the whole state vector. Although the mathematical details vary between the different formulations, the solution typically consists in introducing the information received from neighbouring agents in the formulation of a centralised observer. For instance, distributed Kalman filter formulations modify the classical Kalman filter by adding the neighbouring information using consensus, as in Olfati-Saber (2007), Das and Moura (2015), Battistelli and Chisci (2016), del Nozal et al. (2017) or with diffusion strategies, see Cattivelli and Sayed (2010), Wang et al. (2017). Consensus algorithms have been also used to extend the Luenberger observer to a distributed formulation in Matei and Baras (2012), del Nozal et al. (2019), Mitra and Sundaram (2018), or for the distributed the Bayesian filter, see Forti et al. (2018).
The literature of distributed set-membership estimation is not that rich. However, some interesting solutions for this problem can be found. The information received from the neighbourhood can be incorporated in the estimated set by intersection, see Keiffer (2009), García et al. (2017), or by means of linear combinations of the sets, as in Orihuela et al. (2017), Combastel and Zolghadri (2018). Previous papers described the sets using zonotopes. Recently, it has been proposed the use of consensus techniques for distributed estimation with ellipsoidal sets, see Ma et al. (2017).
Predicting with an unavailable control signal: This problem only appears when the joint distributed and estimation scheme is tackled at the same time. When a local observer has to make predictions, it must incorporate, at least, two terms: the dynamics of the system and the control actions applied to it. Although a perfect knowledge of the dynamics is assumed, how can an individual agent know the actual control actions if they might be applied by agents that might be geographically distributed?
The solutions adopted in Millán et al. (2013), Rubio et al. (2014), Orihuela, Millán, Vivas, and Rubio (2016), Zhang et al. (2017) present different formulations of a cascade structure, where the estimation errors must converge to zero to achieve the control objectives. In all these papers, the authors assume that the control actions applied to the system at any moment are the same that the local agent would apply if it has access to all control channels. This incurs in an error that disappears when the estimated states of every agent converge to the system state.
This idea cannot be applied directly to set-membership paradigms, since the described assumption will lead to non-guaranteed estimations in the transient. The literature of distributed control and set-membership estimation is rather scarce, and it is limited to formulations of the plant divided into subsystems for which the estimation objectives are just local, see Orihuela et al. (2017), Orihuela et al. (2018), Combastel and Zolghadri (2018).
4.2. Proposed guaranteed estimation algorithm
This section presents the distributed estimation algorithm that will be executed by every agent deployed in the vehicles. In order to save the obstacle described before for the joint control and estimation problem, the proposed solution makes use of two key observations.
Recall that stands for the estimation set (described by a zonotope) computed by agent i at instant k with all the information available at instant k. Let us denote by
to the zonotope sent by agent i to all its neighbours. This zonotope differs, in general, with the estimation set computed by agent i. Then, if agent j is going to use the zonotope received from agent i at instant k to obtain an estimation set
by intersection with
, then agent i knows that the estimation set of agent j at instant k, at least, belong to
. Hence, when agent j computes and applies its local control action to the plant based on (15), agent i will know a region into which the actual control action applied by j is guaranteed to belong. This region will be used to construct the guaranteed prediction sets.
Nonetheless, observe that the previous mechanism is only valid for direct neighbours. To be applicable in general, an additional observation is required. When agent i sends a zonotope at instant k, the information carried in this set spreads through the network in direct paths (without loops) and will be received by every agent, at most, at instant
. This is true due to the particular topologies considered. Since every agent
will incorporate this information to construct their estimation set
by intersection, for some
, then agent i will be able to compute a possible control region for the control action applied by agent j at instant
by means of open-loop predictions.
Before introducing the mathematical details, an illustrative example is presented that will surely help to clarify these ideas.
Consider p = 3 vehicles with associated agents connected with a chain topology . Consider that agent 3 applies a control action at instant k + 1. Agent 1 requires to compute a region that contains for sure this control action
.
Figure 3 depicts, by using illustrative colours and shapes, the way the information spreads through the network. The zonotope sent by agent 1 at instant k (blue circle) affects the estimation set computed by agent 2 (brown triangle). Then, after an adequate evolution of this set (big brown triangle), it will affect the estimation set of agent 3 at instant k + 1. Since the information of each set is used by means of intersections, agent 1 will be able to evolve its estimation set at instant k (green rectangle) in open loop to obtain a predicted set (big green rectangle) that contains the actual control action .
Published online:
23 January 2020Figure 3. Illustrative example of the transmission of information through the network. Intersections and evolutions of the sets are not pretended to be exact.
The complete control and estimation algorithm performed by every agent has five main steps. Next, we provide a description of this algorithm, which has been summarised in Table .
Table 1. Control and estimation loop.
1. Measurement: Using the measured outputs, in (13), the agents compute a strip of consistent states with each component of this vector,
, for
. These strips are defined as in Section 2. In particular, considering for example a unidimensional output, the strip will be given by
, where
is related with the output noise, see (14). Note that the width of this strip directly grows with the energy of the measurement noise. Then, the prediction zonotope available from the last time step
is intersected with every
, obtaining
as a final result. The intersection between a strip and a zonotope can be written as a zonotope, as was explained in Section 2. Finally, an order reduction operation is executed to obtain zonotope
.
2. Communication: At this step of the algorithm the agents send the reduced-order zonotopes obtained before to all its neighbours. After that communication, each pair of connected agents computes the intersection zonotope , which allows them to reduce the estimation uncertainty. Observe that
, as they are related to the same edge in an undirected graph. As explained in Section 2, the intersection between zonotopes can be written as a zonotope.
3. Estimation: By including the information of every edge (in other words, of every neighbour), the agents obtain the best estimation set with the information available at instant k, that is, .
4. Control: The control action applied by agent i to its vehicle is obtained using the centre of the estimation set according to (15).
5. Prediction: Finally, the prediction sets are computed as
(16)
(16) Three terms appear in the previous equation, one for each term in (8): (1) the open-loop dynamics of the system, with the term
; (2) the set of possible control signals applied to the plant,
; and (3) the set of all possible disturbances, given by
. The set
should contain every control action applied to the plant at instant k. Next, we present a method to compute this set.
Note that can be written as
, where
is the submatrix of B for the control channels of agent i. With a little abuse of notation, let us denote
, where
is a set computed by agent i that must contain the vector
, and it is given by
(17)
(17) being
the prediction of the intersection zonotope computed as
(18)
(18) where
is a neighbour of agent i such that there exists a path from m to j. Observer that the zonotope
represents the open-loop evolution of the zonotope
for
time instants, which is the number of instants required for the zonotope
to reach agent m.
Please note that for expression (17)(b) matches (17)(c), for
. Nonetheless, three cases have been defined to distinguish between three important situations:
agent i applies its local control signal
;
agent i is a neighbour of the agent with access to the control channel and
agent i is not neighbour of the agent applying
.
Next theorem states that, with the proposed algorithm, objective (1) of Problem 1 is met, that is, every agent is able to compute estimation sets containing the actual state vector of the whole fleet of vehicles.
Assume that holds for all agents
at some arbitrary initial instant
. Also assume that
and
hold for all agents
and for
. Then, if each agent implements the distributed algorithm described in Table , it is satisfied that
and
,
.
The theorem will be proved by induction. Assume, hence, that .
The first part of the proof focuses on the estimation set . By construction, the strip
is the set consistent with the measurement, so
. Since zonotope
is the representation of the intersection of
and
, then
. The order-reduction operation verifies that
, so
,
.
It is trivially true that for the same reasons exposed above. As zonotope
is computed as the intersection of several
(as many as neighbours), then it holds
.
Now, let us move to the second part of the theorem that focuses on the prediction set. By comparing both Equations (8) and (16), it is easy to check that and
. Next, it will be proven that
.
For m = i it is trivially true, since . For
, note that
, and
is the centre of the estimation zonotope
. Zonotope
is obtained by the intersection of all the
. In particular,
is one of those sets, so
. Then,
and, therefore
.
Finally, consider the last case, whose proof is based on how the information travels though the communication network. The second assumption says that for all the agents, and for
, and by definition of
that can be expressed as
. It also says that
for all the agents, and for
. Summarising, if it is known as set where the state was in a known instant,
, the control signals applied and the set of disturbances, then the state at instant k belongs to
.
The set travel through the network and affect to
in an undirected way due to the interactions among the agents of the network (intersection of zonotopes). Its evolution due to matrix A gives a bigger set than
since it is not corrected with measurements or communication with other agents, and then the set of action signals and the set of disturbances are added in order to compute a set that is affected in the same way than the state. So the centre
and then
.
Then, by properties of the Minkowski sum of sets, it is proved that for all i. Since
was required by assumptions, the theorem is proved by induction.
The hypothesis of Theorem 1 is standard for induction-based proofs. Moreover, they can be easily fulfilled since the initial sets, namely for
, can be trivially enlarged so that they contain for sure the required state vector. Another initialisation scheme consists in a preliminary execution of the algorithm for
time instants with
(open-loop dynamics). Then, it will be easily satisfied that
for
. After these initial steps, the algorithm can run in its proposed manner.
Theorem 4.1 proves that every agent is able to find sets that contain for sure the actual state for the plant. However, it did not directly mention that these sets are found in an optimal way. In fact, the reader might think that the sets could grow until infinity, in such a way that the theorem would be fulfilled but the estimation would be inadequate.
Nevertheless, two main facts prevent the uncontrolled growing of the estimation sets. On the one hand, the mathematical operations involving zonotopes, such as intersection with strip, intersection between zonotopes and order reduction, are computed using published methods that ensure the optimality of those operations (see Section 2 and the references therein). On the other hand, under standard observability assumption, all the outputs collected by the whole fleet contain enough information to estimate the complete state. Since the algorithm is based in the inclusion of this information by means of intersections, the estimation sets are always kept under some bounds.
Finally, it is worth mentioning that the prediction step (Step 5 in Table ) must be deeply analysed in the future, since its performance can be improved if it takes into consideration the particular topology. It is left for future research.
5. Control formation
In the following, we briefly describe how to track position references in a space state representation, inspired on Orihuela et al. (2016). Given (8), a new system's output is set as , where
is a matrix that selects the position (X and Y) of each vehicle. On the other hand, the position references are given by the vector
, which in general, might take different values for different instants. The equilibrium point
of system (8) (without disturbances) associated with the reference
at instant k must satisfy
(19)
(19) which can be expressed as
(20)
(20) According to previous equation, the equilibrium point associated with
can be obtained as
(21)
(21) Then, in order to track the reference
, the following linear system must be stabilised
(22)
(22) where
and
. Previous system is completely equivalent to the one given (8).
Concerning the design of the local controllers , note that each vehicle has its own estimation about the positions and velocities of the whole fleet. Furthermore, observe that matrices A, B in (22) decouple the fleet control formation problem into p local control problems, assuming that each vehicle knows its own reference. Let us denote
as the equilibrium point for vehicle i related to the global reference
. Now, let us define the following local cost function:
(23)
(23) being
two positive-definite matrices. The local controller is then found by finding the gain
that minimises the previous cost function, following an LQR-based paradigm with the controllable pair
.
6. Simulation examples
The performance of the algorithm is illustrated with several simulations with a fleet of four vehicles. The dynamics of each of them in (5) are particularised as a double integrator and in a 2D scenario. Table presents the set of common parameters that have been chosen for the simulations. The rest of parameters, such as the weighting matrices for the LQR cost function or the noise zonotopes, will be given in the different examples, since they will take different values for each experiment.
Table 2. Parameters for the simulations.
6.1. Example 1
We consider a fleet of four vehicles, being the vehicle 1 the one that is able to measure its absolute position, and the rest measure relative positions. We will compare both topologies considered in the paper. The communication topology, then, will be different in each case:
.
,
,
.
This example will show that the fleet is able to keep a given formation (which indeed is different to a linear or star formation) when the proposed algorithm is used. In addition, all the agents will be able to compute adequate estimation sets for the whole fleet. We assume, as in Lafferriere et al. (2005), that the disturbances affecting the model are equal to zero. Measurement noises are, however, taken into consideration. It is also assumed that the centre of the initial zonotopes is located exactly where the initial state of the vehicles is.
Without loss of generality, the reference trajectory is divided into four waypoints (see Section 5). Figures 4(a,b) depict, for both situations, the actual trajectories of the vehicles on the plane xy using solid lines. The initial position is marked with a circle, and the final position with a triangle. The references are represented in dashed lines. It can be seen that the proposed control formation obtains very nice results. No remarkable differences can be found between both scenarios in what respect to control formation.
Published online:
23 January 2020Figure 4. Trajectory of the four vehicles considering both communication topologies: (a) Chain and (b) Star.
On the other hand, the estimation performance can be checked in Figure 5(a,b). The figures depict the horizontal position of the vehicle 1 (black solid line), its reference (blue solid line) and the estimation that every agent computes. Note that the zonotopes represent an interval when considering a unique variable (in this case, the horizontal position), and so, minimum and maximum values for the state are depicted by each agent (dashed lines of different colours).
Published online:
23 January 2020Figure 5. Vehicle 1, actual horizontal position, reference and estimation of the position x by all agents: (a) Chain and (b) Star.
It is easy to check that the state is always contained within the sets that all agents compute, this showing that the first requirement of a guaranteed estimator is fulfilled. Figure 5 also shows that the settling time of the tracking is around 40 sampling instants (4 s) with the chosen controller for both cases.
In addition, the example illustrates the effect of the distance in the estimation when the chain topology is considered, in the sense that the farther an agent is to the agent measuring the particular state (in this case, agent 1), the wider its bounds are. The centre of the zonotopes (the best estimation) does not suffer from this effect. On the contrary, for the star topology, all the agents can be considered symmetric and, therefore, they obtain a similar performance.
6.2. Example 2
This simulation example shows that the proposed distributed estimation and control method can be used for the nonlinear model of the unicycle, as long as the control of the hand is pursued.
We will assume, for the sake of simplicity, that the masses, moments of inertia and hand positions are the same for every vehicle, this is, . Furthermore, the bounds of noises and disturbances are also the same,
. A chain topology is considered for the communication between the vehicles.
In order to compute the output feedback linearising control (11), it is assumed that each vehicle is equipped with an additional sensor that measures its local orientation . The rest of variables, namely,
can be obtained with the information of the estimation sets. The local cost functions in (23) are defined with
.
Figure 6(a,b) depicts, respectively, the position of the hand and the position of the vehicle. Starting for different initial positions, the fleet is trying to join their hands at the origin. These figures effectively show how the proposed method is still valid for the control of the hand position of the fleet. The effect of the internal dynamics is clear in Figure 6(b).
Published online:
23 January 2020Figure 6. Position of the hand and vehicle considering the unicycle model: (a) hand position and (b) vehicle position.
Figure 6(b) also shows the bounds for the coordinate y obtained with the set-membership estimator. Similarly that in the previous example, wider bounds are obtained for the vehicles located far from the leader. Although it is possible to obtain bounds for the coordinate x as well, we have not included them in the figure to make it clearer for the reader. Note that the bounds could be used for collision avoidance. In particular, vehicles 2 and 3 might have a collision after some time when their bounds intersect each other.
6.3. Example 3
In this new example, we study the influence that the disturbances have in the estimation performance. The analysis is made considering again a chain communication topology, which increases the uncertainty. Since we are interested in the estimation bounds, we will assume that initial zonotopes are centred in the actual initial state of the vehicles.
Two scenarios are compared in Figure 7. In both cases, the observers consider that the disturbances might affect the vehicles, so zonotopes in (7) are included in the prediction step. Whereas Figure 7(a) corresponds to the case in which no actual disturbances are applied, Figure 7(b) shows the case in which the disturbances are actually affecting the dynamics.
Published online:
23 January 2020Figure 7. Comparative in the chain communication scenario between the simulation without disturbances and with them: (a) without disturbances and (b) with disturbances.
It can be seen that the proposed control and estimation algorithm is able to track the prescribed reference in spite the fact that the estimation is degraded. Although the control signal applied by the rest of the fleet is ignored and the disturbances are affecting the vehicles, the guaranteed estimation is achieved.
The reader might think, from the previous simulations, that the reference is not tracked in the presence of disturbances. Then, we have made a longer simulation, depicted in Figure 8, that shows that the actual position of the vehicles evolves around the equilibrium point, within a confined region. The size of this region is related, as expected, to the energy of the disturbances.
Published online:
23 January 2020Figure 8. Vehicle 1, actual position, reference and bounds of the estimation of the position x by all agents.
6.4. Example 4
This example analyses the relation between the control matrix designed and the estimation performance. We will consider two identical settings (same initial condition, no noises, no disturbances) to perform a fair comparison. Although no noises and disturbances actually affect the system, the agents will take into account the corresponding zonotopes. In particular,
. In addition, we will consider in this example that the initial position of the vehicles is unknown, so all the observers' initial condition is set to zero and the initial zonotope
is built big enough to encompass the initial state of the vehicles. This fact will let us compare the speed of convergence of the estimation with respect to the control.
Two families of LQR controller have been synthesised, with different aggressiveness. In particular, the local cost functions in (23) are defined with in the first scenario, and
in the second one. A chain communication topology has been considered here.
The results are illustrated in Figure 9 using black colour lines for the first scenario and red colour lines for the second one. The figures show the actual state in solid lines, the centre of the estimation with circles, and the bounds with dashed lines. They all show the estimation of the vertical position of the first vehicle.
Published online:
23 January 2020Figure 9. Comparative of the gain based on a more aggressive control and on a less aggressive control, estimation of state 1 by all agents: (a) Agent 1, (b) Agent 2, (c) Agent 3 and (d) Agent 4.
Several conclusions can be drawn from this set of figures:
As in the previous results, the closer an agent is respect to the agent measuring the variable under consideration, the narrower the estimation is.
The very first instants, in which the dynamics are frozen, represent the initialisation procedure described after Theorem 4.1.
The speed of convergence of the estimators is much faster than that of the controllers, this implementing the standard cascade structure shown by other combined algorithms, such as LQG.
The values of the weighting matrices in the cost functions have a double impact: first of all, they let us tune the speed of convergence to the desired reference. However, the more aggressive the controllers are, the wider the estimation bounds become. This is due to the calculations made in the prediction step (16) that uses the controller matrices of the neighbours to compute the predicted zonotope. More aggressive controller means bigger values for the control matrices
and, then, bigger prediction zonotopes following Equation (17). As expected, this effect does not appear in the first agent, because this is the one measuring the vertical position.
7. Conclusion
This paper proposes a set-membership distributed estimation and control algorithm. This approach assumes not only uncertainties in the model and in the measurement but also lack of knowledge about the signal control and about the complete plant by the agents.
It has been tested on two different topologies: chain and star communication. The star communication obtains better results, since the maximum distance in the network is smaller than in the chain case. However, the relative distance between the central vehicle and each of the neighbour must belong within radio range. The extension of the proposed algorithm to other topologies, specially those containing a spanning tree (so that a directed path from every agent to any node with control capacities exists), is a natural extension of this work.
The impact of the controller design method in the estimation performance, and vice versa, must also be analysed in the future. How is the control performance affected by the uncertainty in the estimation? How could the control matrix be designed with the combined objective of control and estimation performances? The answers to these questions will be subject of future research. Furthermore, considering unreliable links (with delays and dropouts) is another key problem in this field that will tackled in the future.
(0) Initial zonotope | |
(1) Measurement | |
(1.1) Strip | |
(1.2) Intersection | |
(1.3) Order reduction | |
(2) Communication | |
(2.1) Send | |
(2.2) Receive | |
(2.3) Compute | |
(3) Estimation | |
(4) Control | |
(5) Prediction |
Configuration variables | 2 |
Vehicle dynamics | |
Sampling time | |
Maximum order | 200 |
Disclosure statement
No potential conflict of interest was reported by the authors.
Notes
1 The estimation and control algorithm presented in the next section is still valid if vehicles with different dynamics are considered. For the matter of simplicity in the notation, we consider the case of equivalent vehicles.
2 The image of the vehicle is obtained from the url: http://i-hls.com/wp-content/uploads/2013/08/Autonomous-Underwater-Vehicle-AUV-Aselsan.jpg