ABSTRACT
This study presents the development of an event-driven hybrid control for position and force tracking applied on a mobile robotic manipulator for metal recycling tasks. The suggested controller operates in a sequenced strategy starting from a fixed spot, moving the mobile device towards a targeted zone () from where the i-th piece-to-be-recycled is attainable (considering the arm manipulation). Once the event of entering the zone is completed, the mobile robot is fixed at a position, and the end-effector of the robotic arm is enforced towards the piece-to-be-recycled. When the end-effector touches the piece in a given spot (), the hybrid control changes to the force tracking intending to carry the piece towards the spot () where it will be processed. Each piece location is identified based on a vision-based system that applies deep learning tools using convolutional neural networks. A multi-physics numerical simulation illustrated the application of the developed controller in a realistic scenario, showing all the elements of the event-driven operation. To validate the suggested controller, the comparison with a robust control that works on a wide range of carrying mass confirms the operational improvement of the event-driven hybrid position and force design.
1. Introduction
The interest in the recycling industry is growing continuously, considering the current problem of waste production that has increased with modern industrial development (Ezeah, Fazakerley, and Roberts 2013). This is particularly relevant given that almost all industries produce residual items from all their processes (Schlesinger 2006). Specifically, with the increase in public awareness of environmental topics, metal recycling has become of paramount relevance because incineration and landfilling represent a significant source of air and groundwater pollution (Grimaud, Perry, and Laratte 2016; Shinzato and Hypolito 2005). There is reported evidence that recycling has a significant fossil fuel energy efficiency. This has turned the issue of metal recycling into a problem for both the industry and the government due to the hygiene and health problems it may represent for the industrial workers and the general population (Capuzzi and Timelli 2018). In this recycling process, metal identification is a persistent challenge: an incorrect scrap inspection could increase time and cost in the process. In addition, the manual handling of metal pieces is time-consuming and may imply a relevant investment of economic and human resources. For the reasons mentioned above, automated metal recycling processes have gradually replaced the traditional method of manually sorting and processing metals, offering increased efficiency, improved workers’ safety, and reduced environmental impact (Capuzzi and Timelli 2018).
The use of advanced technology and machinery to speed the gathering, sorting, and processing of various metals such as steel, aluminium, copper, and brass is called automated metal recycling. These automated systems are built to handle enormous amounts of scrap metal, guaranteeing that valuable resources are recovered and repurposed while reducing waste and energy usage (Schmitz 2006). The process begins by collecting scrap metal from diverse sources, such as manufacturing companies, building sites, and consumer garbage. Sensors, magnets, and conveyor belts are used in automated systems to classify gathered materials depending on their magnetic characteristics and content. This sorting step allows for the separation of ferrous metals from non-ferrous metals and other materials (Lu et al. 2022). Today, many technological methods have been generated for automatic metal sorting; some of these options are:
Eddy Current Separation: This technique relies on the principle of electromagnetic induction. An eddy current separator creates an alternating magnetic field, which induces electrical currents in conductive materials. When non-ferrous metals pass through, they experience repulsive forces that cause them to be ejected from the stream. Eddy current separators are particularly effective in separating aluminium, copper, brass, and other non-ferrous metals from mixed materials.
Optical Sorting: Optical sorting systems use advanced sensors, cameras, and image processing algorithms to identify and separate non-ferrous metals based on their optical properties. These systems can detect differences in colour, shape, and reflectivity of materials, enabling accurate identification and sorting of various metals. Optical sorting separates different non-ferrous metal types, including aluminium, copper, zinc, and alloys.
X-ray Transmission Sorting: X-ray transmission sorting systems utilise X-ray technology to determine materials’ atomic density and composition. These systems can differentiate between metals and non-metallic materials by analysing the X-ray attenuation characteristics. This method is especially effective in separating heavy non-ferrous metals, such as lead and tungsten, from lighter materials.
Induction Sorting: Induction sorting technology uses electromagnetic fields to induce electrical currents in conductive materials. By analysing the induced currents and their response to the applied field, induction sorters can distinguish between various metals. This method is commonly used for sorting copper, brass, aluminium, and other non-ferrous metals.
Near-Infrared (NIR) Sorting: Near-infrared sorting systems utilise sensors that emit and detect near-infrared light to analyse the molecular structure of materials. Different non-ferrous metals exhibit distinct near-infrared absorption patterns, allowing for their identification and separation. NIR sorting is particularly effective in sorting aluminium, copper, and stainless steel materials.
Notice that all the previous sorting options are often combined or integrated into comprehensive automatic handling systems like robotic manipulators to achieve optimal results in precise classification and sorting. Speed and adaptability, which enhance the efficiency and effectiveness of the sorting operation, are the main design criteria of such systems. The main applications of robots are to pick and place identified objects and then grasp specific metals based on their shape, size, or other characteristics, enabling accurate sorting and organisation into designated containers or conveyor belts. When vision systems are included in the feedback loop of the mobile manipulator, variations in colour, texture, patterns, or other visual features can be determined to establish the composition and quality of metals, aiding in more accurate sorting, among others. Unfortunately, there are some limits to robotic systems employed in metal recycling, such as their efficient conjunction with automatic control rules to accomplish the work, because most of such systems only provide either force or position control when sorting metals. This results in flaws in the robot’s operation when removing metal from its place and placing it on the recycling conveyor belt. These deficiencies are derived from the changes the robot undergoes in its composition when carrying a metal that does not always have the same physical properties, mechanical design, or automatic motion. As a result, there are limitations in their application: they can be applied only in particular tasks, they have increased operating costs due to an increase in the energy required to operate, and they pose operating risks due to unwanted movements that can endanger human safety.
As one may identify, there is a significant necessity for developing controlled robotic systems that could be robust enough to operate under two working scenarios: the first one centred on solving the position tracking in the task space, and the second one focused on operating under the mass variation at the end-effector when the object to be recycled must be carried. Notice that the transient from the first to the second state is defined by the event when the mobile manipulator grasps the object. Hence, under the adequate mechanical configuration and the practical selection of electrical components to instrument the manipulator, there is a significant necessity for implementing an automatic controller that works in both scenarios, using the grasping event as a trigger for changing the controller’s objective (and its structure in consequence).
Reports show that the hybrid control offers stability and decreases the number of control signal updates (Branicky, Borkar, and Mitter 1994; Lennartson et al. 1996). As a result, the computational load decreases simultaneously with the energy consumption. Implementing these controllers could consider aperiodic sensing, communication, and computation, which are crucial for controlling complex cyber-physical systems such as the recycling robotic manipulator (Koutsoukos et al. 2000). In the proposed hybrid control approaches, the right-hand side of the controlled robotic system follows a variable structure form. This condition emphasises the idea of using hybrid control theories to design the individual controller within each stage, as well as the discrete variation of the controller when the grasping event occurs. For this reason, this paper proposes computer-aided studies of a hybrid position-force control to adjust the robot’s motion in a non-ferrous metal recycling process (Yoshikawa and Sudou 1993).
The main contributions of this study are the following:
A visual-based neural network for recyclable pieces recognition, location estimation, and object pose.
A hybrid controller feedback operating on the neural network estimation for performing its self-navigation towards .
The design of a tracking trajectory hybrid position force control that deals with external forces that affect the behaviour of the mobile manipulator.
The order of this manuscript is the following: Section 2 presents the central notation, and a few mathematical preliminaries are provided regarding the class of event-driven systems. Section 3 describes the problem related to the automated manipulation of objects to be recycled. Section 4 presents the considered mathematical model for the mobile robot, the associated robotic manipulator, and their dynamic characteristics, as well as the motion limitations. Section 5 presents the problem statement related to the tracking trajectory problem for mobile robotic manipulators under state restrictions considering the recycling tasks. Section 6 details the control design, including the construction of the adaptive gain form and the reasoning behind stabilising the error in the tracking space. A subsequent section presents a strategy to optimise the convergence zone for the tracking error, a positively invariant set. The same section presents a methodology that could use only the output information from the system to derive the closed-loop control design. Section 7 presents the numerical results showing the comparative evidence on the proposed control design and a couple of benchmark alternatives. Section 8 closes the study with some final remarks and future trends.
2. Notation and preliminaries on hybrid systems
The mathematical notations used in this work are as follows: The set of real scalars is defined by ; the collection of non-negative real numbers is characterised by ; the set of n-dimensional vectors is represented by ; and the Euclidean norm for is indicated by . The symbol denotes a vector or matrix’s transpose, while the matrix inverse, which exists if the determinant is non-zero, is denoted by the symbol . A square matrix with all zeros is represented by the notation , and the identity matrix with rows and columns is indicated by the notation .
A hybrid system displays a combination of continuous and discrete dynamical behaviour that can switch between these modes depending on the occurrence of an event, often characterised by holonomic state constraints. In the case of the industrial robotic system for recycling tasks, continuous dynamics come into play when the robot is moving towards the object and when the robot is carrying the object before depositing it in the position where it will be processed. On the other hand, discrete dynamics are activated when the robot grasps and releases the object at the correct spot.
The mechanical section of the manipulator dynamics follows a Lagrangian model, constituting the continuous component. In contrast, the hybrid model’s discrete aspect comprises a set of impact equations that instantaneously alter the velocities of generalised variables. We adhere to the standard representation of a hybrid model.
Definition 1.
(Ames, Cousineau, and Powell 2012) The following tuple defines a non-autonomous hybrid control system:
Here, the symbol represents an oriented graph with and representing sets of vertices and edges correspondingly. A source function and a target function that links an edge with its corresponding source and target, respectively, link these sets. The domain set where represents a smooth manifold for where represents the control space. The set is the set of admissible controls with . The set represents the guards where . The set corresponds to the reset maps, considering that is a smooth map. In addition, the set defines the dynamics of a controlled system (the mobile manipulator in this study) on and , where .
The permissible variations in velocity considered in the dynamics of the automated manipulator warrant the use of hybrid flows or hybrid executions as viable solutions for the hybrid system. The following definition outlines the dynamics of the hybrid system, which can be employed to describe the motion of the robotic device.
Definition 2.
Unilateral Constraints. According to the hybrid model describing the mobile manipulator, some domains and guards are derived from unilateral constraints. The definition of a unilateral constraint is given by a tuple , in which denotes the configuration space (usually when generalised variables are unconstrained), is the hyper-regular Lagrangian, and represents a unilateral restriction on the configuration space, guaranteeing that the zero-level set forms a smooth subspace.
Notice that the unilateral constraint determines the corresponding guard, reset map, and vector field for the continuous dynamics. Concerning the domain of the unilateral constraint, the provided guard satisfies:
The vector , represented as , describes the joint configuration of the robotic manipulator. When a unilateral constraint is only valid within specific domains of the model, particularly depending on a specific combination of generalised variables, it is referred to as a holonomic constraint, denoted as (notably excluding ). Holonomic constraints can be incorporated into the control system dynamics by applying Lagrange multipliers. The common procedure involves differentiating the constraint , which results in the equation , where . Consequently, the Euler-Lagrange model for the continuous dynamics of the mobile manipulator is represented as (Crampin 1981; Ortega et al. 1994):
Here the inertia matrix is , and the function containing the right-hand sides of the two differential equations is represented by . The starting conditions of (3) are represented by and respectively.
The signal represents the control, in line with the motor’s voltage, that regulates the set of direct current (DC) motors. The symbol represents the Lagrange multiplier, and denotes specific model domains where constraints are in effect. By differentiating twice, the precise expression of is obtained:
The substitution of (3) in (4) and a suitable algebraic manipulation leads to
where . In order to fulfil the holonomic constraint while transporting the recyclable item, the Lagrange multiplier applies force to the mobile manipulator. Both the domain and the guard sets can be defined (using the Lagrange theorem) according to the multiplier’s value as follows:
The controlled system is defined by applying the additional force, represented by the Lagrange multiplier. This is represented by:
where and define the states of the mobile manipulator with and defined in (3). The functions and are:
The nonlinear vector field satisfies the Lipschitz condition locally, with the fixed positive constant (Poznyak 2008).
The term represents the effects of external perturbations and internal uncertainties. The model includes this term to consider a more realistic version of the mobile manipulator.
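The role of the Lagrange multiplier described above can be illustrated with a minimal numerical sketch. It does not use the manipulator model: it assumes a hypothetical point mass in the plane whose position must stay on a straight line with normal vector `normal`, and the mass, force, and normal are illustrative values only. Differentiating such a linear constraint twice gives a closed-form multiplier, which is the same mechanism used for the holonomic constraint in the text.

```python
def constrained_acceleration(mass, force, normal):
    """Toy example (assumed system): point mass constrained to a line.

    Dynamics:   mass * qdd = force + normal * lam
    Constraint: normal . q = const, so normal . qdd = 0 after two
    differentiations, which yields
        lam = -(normal . force) / (normal . normal)
    """
    n_dot_f = sum(n * f for n, f in zip(normal, force))
    n_dot_n = sum(n * n for n in normal)
    lam = -n_dot_f / n_dot_n
    # Acceleration once the constraint force normal * lam is added.
    qdd = [(f + n * lam) / mass for f, n in zip(force, normal)]
    return lam, qdd


# Force (3, 4) on a 2 kg mass that may only slide along the x-axis.
lam, qdd = constrained_acceleration(2.0, (3.0, 4.0), (0.0, 1.0))
```

In this example the multiplier cancels the force component along the constraint normal, so the resulting acceleration has no component that would leave the line.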
Assumption 1.
The admissible class of uncertainties – perturbations belongs to the following admissible set
The symbol is a compact, convex set such that . To characterise the movement of the mobile manipulator near the constraint set that contains , it is not sufficient to include the Lagrange multiplier. The so-called kinematic constraint, which specifies the link between velocities before and after carrying the object to be recycled, explains this behaviour.
Definition 3.
A tuple defines the configuration spaces of the source domain and target domain, which are denoted by the following terms and , respectively. In addition, is the manipulator inertia matrix where , is a smooth enough vector field which typically defines the position of the final element of an open kinematic chain, is an embedding with push-forward and is an invertible matrix to relabel generalised coordinates in the domain attained after the constraint manifold is left.
The vector-valued function should consider the entire set of constraints of both the unilateral and holonomic natures if they exist. The so-called canonical projection needs to be taken into account in the case of manipulator movement. This canonical projection complies with the equation , which necessitates the use of the composite map . The impact equations can be introduced using kinematic constraints.
Definition 4.
The rigid impacts of the mobile robot’s end-effector with the handled object yield a set of possible discrete jumps over the unilateral and/or holonomic constraints. Formally, these effects can be represented as impulses at the precise locations that the constraints define. In the embedded space , let us consider the kinematic constraint and the generalised coordinates . Next, let us define a map , which describes the velocities following impact, and is described by
where . The map that resets the coordinates in the target domain is defined by and satisfies
The regular movement of the mobile manipulator during the individual object handling cycle enforces two independent stages (because no dynamic transition is considered when the object is grasped). The active device belongs to a class of hybrid systems with holonomic constraints defined by the transition between stages. The model of the mobile manipulator’s mechanical section was obtained to describe the robotic device’s continuous flow within each stage, both before and following the grasping. This simplified model is sufficient to complete the controller design suggested in this study.
3. The mobile manipulator and the changing scenario for aluminium recycling
The mobile manipulator considered in this study comprises a wheeled mobile (skid steering configuration with four independent wheels) robot with a differential configuration and a 7-degree-of-freedom robotic manipulator. This device operates in a non-structured scenario where the objects to be recycled (tyre rims in the example considered in this study) can be placed anywhere in the working procedure. shows a sample of the working procedure, including a feasible organisation of the objects and the mobile robotic device.
demonstrates an overlapped sequence of the mobile manipulator performing diverse activities in the working cycle from an initial position at time to a final position at time . Within this time window, the manipulator must move the end-effector to locate the following object that must be collected. Once the object has been located, the mobile manipulator must move towards the detected object to be recycled in a period defined by after . In this period, the manipulator must attain a specific region near the object from where it is attainable by the manipulator. Once the manipulator enters the mentioned region, it must move to catch the object within a given period defined by . Notice that the right-hand side of the robotic dynamics changes now, considering the object’s handling. Once this process has ended with grasping the next object to be recycled, the robot must move towards a region from where the mobile manipulator can release the handled object to be further processed. This part of the process must be completed after a time after the mobile manipulator reaches the region from which the recycled object can be released. After a time , the object must be located at the correct spot from where it will be processed. This task ends an entire cycle for a given specific object.
According to the motion dynamics shown in , there are two sequential stages. The first one is comprised between and , and the second one starts at and ends at . The working cycle is completed for each set of two sequential stages.
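The working cycle described above can be sketched as a small event-driven state machine. This is only an illustrative reading of the text: the five sub-stage names and the event names are hypothetical labels introduced here, and the grouping into the two sequential stages (without and with payload) is expressed by the helper `carries_payload`.

```python
from enum import Enum, auto


class Stage(Enum):
    SEARCH = auto()    # arm scans to locate the next object
    APPROACH = auto()  # mobile base drives towards the reach region
    REACH = auto()     # base fixed, end-effector moves to the object
    CARRY = auto()     # object grasped, base drives to the release region
    RELEASE = auto()   # object is placed at the processing spot


# Hypothetical event names; each event closes one sub-stage of the cycle.
TRANSITIONS = {
    (Stage.SEARCH, "object_located"): Stage.APPROACH,
    (Stage.APPROACH, "entered_reach_region"): Stage.REACH,
    (Stage.REACH, "object_grasped"): Stage.CARRY,
    (Stage.CARRY, "entered_release_region"): Stage.RELEASE,
    (Stage.RELEASE, "object_released"): Stage.SEARCH,
}


def step(stage, event):
    """Return the next stage; events that do not apply leave it unchanged."""
    return TRANSITIONS.get((stage, event), stage)


def carries_payload(stage):
    """True for the sub-stages in which the grasped object loads the arm."""
    return stage in (Stage.CARRY, Stage.RELEASE)
```

Feeding the five events in order returns the machine to `Stage.SEARCH`, which corresponds to one complete cycle for a single object.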
4. Model of the mobile manipulator
This section describes the mathematical model of the mobile manipulator, which is a coupled system comprising the mobile autonomous terrestrial vehicle (ATV) and the robotic manipulator. The motion details, the state restrictions, and the potential modification in each of the stages of the working cycle are explained.
4.1. Autonomous terrestrial vehicle
To describe the position of the ATV in the working environment depicted in , two motion reference frames were defined:
Inertial Coordinate System: This coordinate system is a global frame fixed in the ATV’s working environment. The position for this frame is defined by the coordinates tuple .
Robot Coordinate System (relative): This frame is attached to the ATV centre of mass, and it is denoted by the coordinates tuple .
displays the configuration of both reference frames and the matching set of variables that specify the mobile manipulator’s motion. An additional weight is placed over the ATV to balance the centre of gravity on the mid-point on the axis between wheels, coinciding with the first rotating axis for the first joint in the arm.
We assume that the robot’s centre of mass, designated as point , is positioned along the axis of symmetry. As illustrated in , the robot’s position and orientation in the Inertial Frame can be represented as .
In our design, we assumed no temporal fluctuation over . Hence, the triplet represents the position of any material point on the robot in the robot frame, and represents the position of any material point on the robot with respect to the inertial reference frame. The coordinates of the specified point in the inertial frame and robot frame, respectively, make up the components of this vector.
The following transformation relates these coordinates:
Here is the orthogonal rotation matrix with respect to the -axis (Spong and Vidyasagar 1989).
This transformation defines the motion relationship between frames.
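A frame transformation of this kind can be sketched for the planar case as follows. The sketch assumes a rotation about the vertical axis by the heading angle plus a translation by the position of the robot-frame origin; the function and argument names are hypothetical, and the exact form used in the study may differ (for instance, it may be written for three coordinates).

```python
import math


def robot_to_inertial(point_r, pose):
    """Map a planar point from the robot frame to the inertial frame.

    point_r: (x_r, y_r), coordinates in the robot frame.
    pose:    (x_a, y_a, theta), position of the robot-frame origin and
             heading angle, both expressed in the inertial frame.
    """
    x_r, y_r = point_r
    x_a, y_a, theta = pose
    c, s = math.cos(theta), math.sin(theta)
    # Rotation about the vertical axis followed by a translation.
    return (x_a + c * x_r - s * y_r, y_a + s * x_r + c * y_r)


# A point 1 m ahead of a robot at (2, 3) heading along the inertial y-axis.
x_i, y_i = robot_to_inertial((1.0, 0.0), (2.0, 3.0, math.pi / 2))
```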
4.1.1. Kinematic model
Here, we demonstrate that the motion of the selected ATV can be described by two non-holonomic constraint equations, which are derived by making two key assumptions:
No lateral slip motion restriction that implies the ATV can only travel in a curved motion (forward and backward) with a minimum curvature radius but not sideways. This condition in the relative frame indicates that the velocity of the centre-point A along the lateral axis is zero at any moment during the motion task, that is , . These two conditions can be represented by the following single non-holonomic condition:
The pure rolling restriction denotes the fact that each wheel has just one point of contact with the ground (). The wheel does not slip along its longitudinal axis () and does not skid along its orthogonal axis ().
The velocities of the contact points in the relative frame and are connected to the velocities and of the right (R) and left (L) wheels by:
here is the radius of each wheel. These velocities can be estimated in the inertial frame as a function of the velocities of the ATV centre-point A:
The linear velocity of the ATV in the robot frame is the average of the linear velocities of the two driving wheels in the robot frame,
and the angular velocity of the ATV is
Considering the case in which the material point corresponds to point A (implying that ), the linear and angular velocities of the ATV can be expressed as follows:
The translational and angular velocities can also be obtained in the inertial frame as follows:
The forward kinematic model of the ATV is represented by Equation (20).
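The kinematic relations of this subsection can be sketched numerically under the stated no-slip and pure-rolling assumptions. The sketch treats the vehicle as an equivalent differential-drive platform with the reference point at the mid-point of the wheel axis; the names `wheel_radius` and `half_track` (half the distance between the left and right wheels) are assumptions introduced for illustration.

```python
import math


def body_velocities(phi_dot_r, phi_dot_l, wheel_radius, half_track):
    """Linear and angular velocity of the vehicle from wheel angular rates."""
    v_r = wheel_radius * phi_dot_r  # speed of the right contact point
    v_l = wheel_radius * phi_dot_l  # speed of the left contact point
    v = 0.5 * (v_r + v_l)           # average of both wheel speeds
    omega = (v_r - v_l) / (2.0 * half_track)
    return v, omega


def inertial_rates(v, omega, theta):
    """Rates (x_dot, y_dot, theta_dot) in the inertial frame, no lateral slip."""
    return v * math.cos(theta), v * math.sin(theta), omega


# Equal wheel rates give straight motion; opposite rates give pure rotation.
v_straight, w_straight = body_velocities(2.0, 2.0, 0.1, 0.25)
v_spin, w_spin = body_velocities(1.0, -1.0, 0.1, 0.25)
```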
Given the contact points’ velocities and the corresponding constraint equations can be rewritten as follows with :
4.1.2. Dynamic modeling of the ATV system
For the ATV that is moving in a two-dimensional plane, the potential energy is constant, and hence, the dynamic model of such a vehicle can be presented as follows:
The dynamic model is necessary for both the creation of motion control algorithms and the simulation analysis of the ATV motion. The following equations of motion can describe the dynamics in the presence of non-holonomic restrictions:
where is the inertia matrix, is the centripetal and Coriolis matrix, is the input matrix, and is the input vector. The term represents all the uncertainties and the perturbations affecting the ATV motion. Furthermore, the expressions for the matrices , , and are the following:
where is the total mass of the ATV and is equal to the mass of the MR without the driving wheels and actuators () plus the mass of each driving wheel with the actuator (), is the total equivalent inertia; here is the moment of inertia of the MR about the vertical axis through the centre of mass, is the moment of inertia of each driving wheel with a motor about the wheel axis, and is the moment of inertia of each driving wheel with a motor about the wheel diameter. Introducing the relationship between and , the dynamics of and are related by
Given that is in the null space of the restriction matrix , then
The substitution of (26) in (23) leads to
Multiplying both sides by and using its relationship with matrix , one gets
This form represents the model to be controlled simultaneously with the robotic arm. Notice that (28) can be represented as follows
Here and , while and define the inner dynamics of the ATV system with and . The term represents the effect of uncertainties and perturbations induced by the arm and some non-modelled elements in the recycling facility. The term satisfies the following inclusion (implying the corresponding bounds for perturbations as well as uncertainties), according to the mechanical structure of the robotic arm:
4.2. Robotic arm
The proposed robotic arm device defines six active joints (degrees of freedom). The Euler-Lagrange equations from analytical mechanics are used to establish such a device’s mathematical structure (considering the joint motion’s mechanical dynamics). The aforementioned model’s time dependence establishes the acceleration dynamical model in the chosen generalised coordinates and as
Here, the variable refers to the generalised coordinates that describe the motion of the manipulator articulations, whereas relates to the time derivative of the generalised coordinates. In this case, , defines the drift dynamics associated with the manipulator device; the dynamics associated with the control are characterised by ; represents the impact of modelling errors on the manipulator dynamics and an abstraction of external perturbations; and corresponds to the control action’s impact on the manipulator dynamics. The term satisfies the following inclusion (implying the corresponding bounds for perturbations as well as uncertainties), according to the mechanical structure of the robotic arm:
In view of (32) notice that .
Taking into account the fundamentals of the Euler-Lagrange approach, both vector functions , and are described as:
Taking into account the Euler-Lagrange basis, the condition holds for all and the corresponding positive matrix , using the Frobenius norm.
The mechanical robotic arm establishes motion restrictions for and . Without loss of generality, the next class of asymmetric constraints is considered in this study. Such motion limitations are represented as:
The feasible positions for the manipulator end-effector could be calculated using direct kinematics and the corresponding Jacobian in light of the motion coordinate restrictions taken into account in (34). It should be noted that the control issue can be resolved by considering the end-effector’s motion, which may be connected to the joint dynamics when the inverse kinematics of the AM is used.
4.3. Integrated dynamics of the mobile robot
Based on the individual models of the ATV and the robotic arm, we can present the aggregated model represented as follows using the following definitions and :
Here, we used the notation to establish the application of the variable control form when the task scenario changes from the approach to the object to its handling towards the deployment. Hence, in the first scenario, while when the second state happens.
4.4. On the reference trajectories for the mobile manipulator
To define the problem to be solved in this study, introduce the variable corresponding to the tracking error between the trajectories of the combined robot (ATV and RM) and a suitable set of references . The reference dynamics satisfies the following differential equation
with and its derivative . The initial conditions are given vectors in (36). The function is a Lipschitz function that defines the reference acceleration within each stage of the robotic motion for the ATV and the RM. Notice that this function is proposed to change depending on the mobile manipulator’s active mode, given that the referred trajectories may not be differentiable in the edges, even if they are continuous.
Various methods can address the challenge of designing reference trajectories within each continuous domain. This issue is commonly referred to as trajectory planning for robotic systems. Among the widely used approaches is the Bezier polynomial, which involves defining a set of control points to determine a Bezier curve with points from to , where denotes its order (e.g. for linear, for quadratic, etc.). The first and last control points always correspond to the curve’s endpoints, representing the initial conditions after resetting and the achieved final state in each domain. However, the intermediate control points, if present, typically do not lie on the curve. This paper explores an alternative approach to generating reference trajectories. The concept involves constructing the reference trajectory by interpolating sigmoid functions, following the methodology proposed in (Cruz, Luviano-Juárez, and Chairez 2014). These functions are differentiable multiple times, allowing for the regulation of their transient trajectory from one steady state to another. Each component of the reference trajectories can be estimated using the following sigmoidal representation. The specific form of the sigmoid function employed in this article is:
where the scalar parameters , , and are present, and denotes the sigmoid function’s inflection point. The reader is directed to the study given in (Cruz, Luviano-Juárez, and Chairez 2014) for more information on the design method for the reference trajectories using sigmoid functions.
By using a derivation procedure, , one can easily obtain the expression of each component for the reference function based on the structure of the sigmoid function proposed in (37). In the vector field , the component is the function .
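A sigmoid-based reference of this kind can be sketched as follows. Since the exact sigmoid expression is not reproduced here, the sketch assumes a standard logistic transition between two steady values; the parameter names `start`, `end`, `slope`, and `t_inflect` are hypothetical and may not match the parameters used in the study. The velocity and acceleration are obtained by differentiating the logistic function analytically.

```python
import math


def sigmoid_reference(t, start, end, slope, t_inflect):
    """Position, velocity and acceleration of a logistic transition.

    Assumed form: r(t) = start + (end - start) * s(t), with
    s(t) = 1 / (1 + exp(-slope * (t - t_inflect))).
    """
    s = 1.0 / (1.0 + math.exp(-slope * (t - t_inflect)))
    amp = end - start
    pos = start + amp * s
    vel = amp * slope * s * (1.0 - s)                         # first derivative
    acc = amp * slope ** 2 * s * (1.0 - s) * (1.0 - 2.0 * s)  # second derivative
    return pos, vel, acc


# At the inflection point the reference is halfway and acceleration vanishes.
pos, vel, acc = sigmoid_reference(2.0, 0.0, 1.0, 4.0, 2.0)
```

The `slope` parameter sets how fast the transition occurs, while `t_inflect` places it in time, which is how the transient between two steady states can be regulated.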
5. Problem formulation
Based on the integrated dynamics presented in (35) and the reference trajectories (36), the dynamics of the tracking error is governed by the following differential equations:
which have a practically stable equilibrium point at the origin within each time window for each continuous mode. According to the definition of the hybrid system presented in (1), shows the configuration of the mobile manipulator throughout the interaction with the object to be handled. The sense of the vertices is characterised by the scenarios shown here and the edges shown as the interconnection lines. The set is characterised by both controllers operating in each scenario and the dynamics , which are defined by the ordinary differential Equation (38). The set of guards and resets is defined by the limits illustrated as ellipsoids in the centre of the referred figure, with resets defined by null jumps, given the reference trajectories’ continuity (although they are not necessarily differentiable at the edges).
According to the mobilisation problem, the edges of the hybrid system are defined by . Here and define the tracking error for the corresponding scenario characterised with or . The detected edges are
Here defines the maximum distance from where the manipulator could reach each object to be handled, while establishes the distance from where the handled object could be released at the position where it will be processed in the recycling process.
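The event-driven switching described above can be sketched as a small guard-checking routine. This is a minimal illustration under stated assumptions, not the authors' implementation: the mode names, the function `next_mode`, the thresholds `delta_reach` and `delta_release`, and the contact tolerance `contact_tol` are all hypothetical names introduced here.

```python
import math

# Hypothetical mode labels for the sequenced operation.
APPROACH, REACH, CARRY, RELEASE = "approach", "reach", "carry", "release"

def next_mode(mode, base_xy, effector_xyz, object_xyz, drop_xyz,
              delta_reach, delta_release, contact_tol=1e-2):
    """Return the active mode after evaluating the guard of the current one.

    delta_reach   : assumed maximum base-to-object distance for reachability
    delta_release : assumed distance to the processing spot that allows release
    contact_tol   : assumed tolerance used to declare end-effector contact
    """
    if mode == APPROACH:
        # Guard: the mobile base has entered the zone where the object is reachable.
        if math.dist(base_xy, object_xyz[:2]) <= delta_reach:
            return REACH
    elif mode == REACH:
        # Guard: the end-effector touches the object.
        if math.dist(effector_xyz, object_xyz) <= contact_tol:
            return CARRY
    elif mode == CARRY:
        # Guard: the carried object is close enough to the processing spot.
        if math.dist(effector_xyz, drop_xyz) <= delta_release:
            return RELEASE
    return mode  # no guard crossed: stay in the current continuous mode

mode = next_mode(APPROACH, (1.5, 0.0), (1.5, 0.0, 0.5),
                 (2.0, 0.0, 0.0), (5.0, 5.0, 0.0), 0.8, 0.1)
```

Because the resets are null jumps, the routine only changes the active mode and leaves the state untouched; the reference generator and the controller gains would be re-initialised by the caller on each mode change.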
The control set is defined by both controllers applied for the selected scenarios with
Here and are given by
The gains , , and that operate in the scenario related to pure tracking trajectory without payload position and the gains , , , and working for solving the tracking in the presence of payload are state dependent. The index refers to the object to be handled in the recycling procedure.
The laws used to obtain the evolution of such gains are the following:
The initial conditions for the adjustable matrices are , , , , , , and . These selections imply no jumps for the evolution of the gains.
The matrices , , , , , , and are positive definite, constant and symmetric. The initial conditions for (42) , , , , , , and are given as part of the resetting process on the scenario change. Besides, the time-varying gains are , , , and .
The matrices , and , satisfy the following Riccati-like matrix equations:
Here, the matrices that define the Riccati-like matrix Equations (43) are defined by
The adaptive PID controller (including its variation with extended uncertainties and perturbations compensation term) is considered in this study because it is regarded as one of the best ways to control electromechanical devices due to its capacity to approach reference trajectories and reject bounded perturbations. A desirable characteristic of any controller forcing the motion of a mobile robot, such as the robotic manipulator, is the ability to reduce the energy consumed during the operation in both scenarios. This is achieved by including the adaptive gains in (41).
6. Main result on the control design method
This section presents the main result of this study regarding the controller’s design, the differential equation’s existence for the control gains, and the reason that explains the existence of the stable trajectories for the hybrid system.
6.1. Main theorem of the study
The next Theorem summarises the main results obtained in this manuscript.
Theorem 6.1.
Consider the tracking error dynamics given in (38), regulated with the law proposed in (40) adjusted with the gains given in (42). Consider the assumptions regarding the admissible sets of external perturbations and uncertainties given in (30) and (32). If there exist positive definite matrices and such that the matrix inequality (43) has a positive definite solution , then the trajectories of the tracking error are stable in the Lyapunov sense with respect to the zero trajectories, that is
where and is a positive definite matrix of appropriate dimensions.
The controller proposed in Theorem 6.1 assumes that both the angular position and the angular velocity of each joint can be measured simultaneously. Notice that the expression (45) can be reduced only if the matrix is large in some sense.
The proof of this Theorem is presented in the Appendix of this manuscript.
6.2. Optimization of convergence region
The boundedness result proposed in Theorem 6.1 is a consequence of the non-matched uncertainties and the presence of external perturbations that may affect the mobile manipulator’s movement. The invariant ellipsoid method can minimise the bound of the convergence region (Ordaz and Poznyak 2015a, 2015b; Polyakov 2010).
Definition 5.
(Polyakov and Poznyak 2009) The proposed ellipsoid
is characterised with a centre placed at the origin and the configuration positive definite matrix which is said to be invariant for the trajectories of the system (38) if
the initial condition implies for all ,
the initial condition implies as .
The potential discovery of an invariant ellipsoid characterised by the matrix , with the maximum dimensions, facilitates the minimal deviation of all conceivable trajectories from the origin. While this understanding aligns seamlessly with the outcome elucidated in (43), it necessitates the establishment of a metric or criterion for determining the magnitude of the matrix . In this context, the trace operator serves as the chosen criterion, wherein or .
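The role of the configuration matrix and of the trace criterion can be illustrated with a short sketch. It assumes the convention that the ellipsoid is the set of vectors x satisfying x^T P x <= 1, so that a "larger" P (larger trace) gives a smaller region. This convention and the helper names `quad_form`, `in_ellipsoid`, and `trace` are assumptions made for the example.

```python
def quad_form(P, x):
    """Quadratic form x^T P x for a square matrix P given as nested lists."""
    n = len(x)
    return sum(x[i] * P[i][j] * x[j] for i in range(n) for j in range(n))

def in_ellipsoid(P, x):
    """Membership test under the ASSUMED convention x^T P x <= 1."""
    return quad_form(P, x) <= 1.0

def trace(P):
    """Trace of a square matrix, used here as the size criterion."""
    return sum(P[i][i] for i in range(len(P)))

P = [[4.0, 0.0], [0.0, 1.0]]          # semi-axes 1/2 and 1 under this convention
P_bigger = [[8.0, 0.0], [0.0, 2.0]]   # larger trace, smaller ellipsoid
x = (0.4, 0.5)

inside_P = in_ellipsoid(P, x)                 # quadratic form = 0.89
inside_P_bigger = in_ellipsoid(P_bigger, x)   # quadratic form = 1.78
```

The same tracking error lies inside the first ellipsoid but outside the second one, which is why maximising the trace of the configuration matrix (under this convention) tightens the guaranteed convergence region.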
In Theorem 6.1, the existence of the invariant set, where the trajectories of converge, has already been established for each evaluation period .
In accordance with the definition outlined in (43), the configuration matrix adheres to the concept of an invariant ellipsoid when taking into account the inequality (45). To determine the solution for the minimal invariant ellipsoid, the following Lemma provides sufficient conditions for designing the gain controller.
Lemma 1.
If the tuple (, ) is a solution of the optimisation problem , subject to , and
Then, the corresponding controller proposed in (40) guarantees that any trajectory of the system (38) converges to a quasi-minimal ellipsoid within each scenario for the mobile robot.
The proof of this Lemma follows straightforwardly from the application of the Schur complement on (43).
6.3. Output-based adaptive PID controller
The proposed solution for addressing the output-based tracking problem consists of an output feedback controller, comprising a robust exact differentiator (RED) as suggested in (Levant 1998) for obtaining velocity estimation. Additionally, a modified version of the adaptive proportional-derivative controller (APD) is employed to track reference trajectories. The velocity estimation is accomplished using this differentiator based on the Super-Twisting algorithm (STA). Employing the STA, the PID control is then selected as part of the control strategy.
where is the estimated vector of the tracking error. The application of the super-twisting algorithm (STA) as a robust and exact differentiator (RED) is based on the following description: If , where is the signal to be differentiated, and represents its derivative, the following auxiliary equation is obtained under the assumption .
The previous differential equation is a state representation of the signal. The STA algorithm to obtain its derivative is
where are the STA gains. Here is the output of the differentiator (Levant 1998). In this equation,
To implement the control method proposed in this article, the algorithm shown in was developed. The results of this algorithm are shown in the next section.
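The super-twisting differentiator described above can be sketched numerically as follows. This is a minimal illustration using the first-order differentiator structure of (Levant 1998) with an explicit Euler discretisation; the function name `sta_differentiate`, the gain choice 1.5 sqrt(L) and 1.1 L (a commonly used tuning, with L an assumed bound on the second derivative of the signal), and the sampling period are assumptions, not the settings used in this study.

```python
import math

def sta_differentiate(samples, dt, lipschitz):
    """Estimate the derivative of a sampled signal with a super-twisting differentiator.

    samples   : sequence of signal values taken every dt seconds
    lipschitz : assumed bound L on the second derivative of the signal
    Returns the list of derivative estimates (one per sample).
    """
    lam1 = 1.5 * math.sqrt(lipschitz)  # commonly used gain choice (assumption)
    lam2 = 1.1 * lipschitz             # commonly used gain choice (assumption)
    z0, z1 = samples[0], 0.0           # z0 tracks the signal, z1 its derivative
    estimates = []
    for f in samples:
        e = z0 - f
        sgn = (e > 0) - (e < 0)        # sign function with sign(0) = 0
        dz0 = z1 - lam1 * math.sqrt(abs(e)) * sgn
        dz1 = -lam2 * sgn
        z0 += dt * dz0                 # explicit Euler integration step
        z1 += dt * dz1
        estimates.append(z1)
    return estimates

# Example: differentiate sin(t); the estimate should approach cos(t).
dt = 5e-4
n = 20000
signal = [math.sin(k * dt) for k in range(n)]
estimate = sta_differentiate(signal, dt, lipschitz=2.0)
final_error = abs(estimate[-1] - math.cos((n - 1) * dt))
```

In the output-feedback controller, the input to such a differentiator would be the measured tracking error of each joint, and its second state would replace the unmeasured velocity error.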
7. Numerical results
In this section, the results are obtained by implementing the scenario for recycling car wheels in CoppeliaSim with the proposed mobile robot. To configure the scenario, MATLAB and CoppeliaSim were set to work in supervised synchronous mode through the API function for MATLAB. The declaration of commands and functional programming was developed in MATLAB. On the other hand, the speed of the robot’s active joints was measured using virtual instrumentation in CoppeliaSim.
Furthermore, the results compare two control laws: an automatic control algorithm with extended state feedback of Proportional Integral Derivative (PID) type, and a law that implements adaptive gains over an active disturbance rejection control (ADRC) configuration. The initial gains setting for the robotic device are shown in .
As has been mentioned, the task of taking one wheel to the recycling band is developed in four stages. The first stage corresponds to the period when the mobile robot approaches the object. The second one corresponds to the period when the object is lifted. The third refers to the case when the lifted object is moved towards the conveyor. The fourth and last stage corresponds to the period when the object is released.
The application of inverse kinematics allowed us to construct the set of reference trajectories for all the joints in the Kuka robot to reach the wheel and then leave it in the recycling band. Moreover, the difference between the wheel and the home position is computed to orientate the robot and move it close to the wheel.
Each stage of the Kuka robot is executed in an elapsed time of 120 s. Those events’ trajectories and movements are shown in . Both controllers show asymptotic convergence for the tracking errors. This can be appreciated in , which demonstrates that the tracking errors are monotonically decreasing, specifically in those joints that change their position between the stages; for those joints that have to maintain their position, the error is 2°, as in the case of q1,3,5,7.
After analysing the trajectory tracking of each joint, it is possible to observe that each stage change converges to the desired position about 60 s after the start of the corresponding stage. During stage 1, developed by the manipulator, both control laws reflect a similar performance. However, during stages 2 and 3, it is possible to observe a better performance in tracking the trajectories by the proposed controller (ADRC). Although during stage 2 there is no change in the manipulator dynamics, the motion of the child joints affects the parent joints, which is why slight position changes are observed in the joints q1,3,5,7 for (see ).
Finally, during the stage where the rim is loaded by the manipulator (third stage), it is possible to observe a more noticeable change in the proposed control law (ADRC) performance. The graphs corresponding to the error for reflect a lower error when the system has the ADRC control (magenta line). Specifically, the rim-attracting joint (q) exhibits lower error at the beginning and during that stage.
As part of the controllers’ performance demonstration, both the Mean Square Error (MSE) and the Error Integral (EI) per stage, controller, and joint were calculated and are presented in the and , respectively.
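The two performance indices can be computed from a sampled error signal as in the following sketch. The exact definitions used for the tables are not restated here, so the example assumes that the MSE is the mean of the squared samples and that the EI is the time integral of the absolute error, approximated with the trapezoidal rule; the function names are hypothetical.

```python
def mean_square_error(errors):
    """Mean of the squared error samples (assumed MSE definition)."""
    return sum(e * e for e in errors) / len(errors)

def error_integral(errors, dt):
    """Trapezoidal approximation of the integral of |e(t)| (assumed EI definition)."""
    abs_e = [abs(e) for e in errors]
    return dt * (sum(abs_e) - 0.5 * (abs_e[0] + abs_e[-1]))

def percent_change(new, old):
    """Relative change of an index between two controllers, in percent."""
    return 100.0 * (new - old) / old

# Example with four samples of a joint tracking error taken every 0.5 s.
errors = [2.0, 1.0, 0.0, -1.0]
mse = mean_square_error(errors)        # (4 + 1 + 0 + 1) / 4 = 1.5
ei = error_integral(errors, dt=0.5)    # 0.75 + 0.25 + 0.25 = 1.25
```

Applying the same functions to the control signal instead of the tracking error would give indices analogous to the mean square control and control integral used later in this section.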
For the first stage, the ADRC controller has an increase close to 0.5% in both EI and MSE compared to the PID type controller; this performance is consistent with the system dynamics not being modified, and the controller gains are adequate.
On the other hand, the second stage shows a 2% decrease in EI and MSE when the ADRC controller is used compared to the PID-type controller. Finally, during the third stage, it is possible to observe a better compensation of the manipulator dynamics modification, which is reflected in a decrease of almost 3% in the EI and MSE when using the ADRC controller compared to the PID controller.
As with the Kuka robot part (), an analysis was performed when the robotic system was moved from its ‘home’ state to a point where the rim to be picked up is reachable. From this stage, the error plot (see ) and the control plot (see ) were obtained. In this case, it is possible to observe how the error converges to zero at a time close to 55 s when the ADRC control law is applied, while when the traditional PID control law is applied, it does not converge to zero until 60 s.
An analysis of the control applied for each DoF of the manipulator was also performed. The corresponding graphs are presented in . With them, it is possible to notice that both for the joints that have a change of position in some stage and for those that do not, a lower energy expenditure is required when the ADRC controller is used than when the classical control law is used. This is accentuated in the first 20 s of each stage.
The Mean Square Control (MSC) and Control Integral (CI) per stage, controller, and joint were calculated to confirm the decrease in energy requirement. They are presented in and , respectively.
For the first stage, even though there is an increase of about 0.05% in the error when the system acts under the ADRC law, this law presents a decrease in both the MSC and the CI of about 0.05% compared to the system under the PID control law. During the second stage, a 4.1% decrease in CI and MSC can be seen when the ADRC controller is used compared to the PID-type controller. Finally, during the third stage, it is possible to observe a better compensation of the manipulator dynamics modification, which is reflected in a decrease of almost 5% in the CI and MSC when using the ADRC controller compared to the PID controller.
The numerical simulations show that ADRC and PID algorithms may exhibit similar error tracking for each DoF. Notice that the comparison of the transient evolution demonstrates the benefit of the ADRC controller showing the tracking error’s convergence, mitigating the change in the manipulator dynamics produced by the task of moving the wheel.
In addition, a significant reduction of the control signal and the tracking error due to ADRC law highlights the improvement enforced with the application of the adaptive form, despite the change in the robot’s dynamics due to the proposed task.
8. Conclusions
This paper introduces a set of successful controllers for robotic manipulators with joint restrictions, employing a novel adaptive gain strategy to address the tracking trajectory problem. The controllers encompass state feedback, output feedback, and a robust version of state feedback that accounts for limited knowledge of manipulator dynamics that can be used in recycling facilities working purely autonomously. All of them utilise state-dependent gains to enforce a better tracking of the reference trajectories. Applying a controlled Lyapunov function establishes the zero trajectory of the tracking error space as asymptotically stable in the Lyapunov sense. Several crucial factors must be considered for the potential implementation of the proposed controller, taking into account processing for the control implementation. Processing speed is a critical consideration for most controllers, but in the case of the proposed controller, higher processing speeds enhance its performance. Therefore, it is imperative to implement this class of controllers on high-speed processors.
Simulated results, incorporating physically inspired joint motion limitations, confirm the satisfaction of the proposed predefined joint motion. Additionally, the efficiency of the proposed adaptive controller enables effective management of external perturbations, as evidenced in the simulated results.
Notably, the presented results affirm the advantages of the adaptive controller compared to similar designs, such as the traditional proportional-integral-derivative control form. Moreover, meeting limitations with an arbitrary structure opens up possibilities for deriving novel alternatives to control complex robotic devices in diverse recycling industries where autonomous manipulators could solve the problem by implementing a hybrid system formulation. The suggested controller can be applied to accomplish more intricate tasks where the robotic device may have complex motion restrictions and velocities.
Acknowledgments
The authors would like to thank the Tecnologico de Monterrey Challenge-Based Research Program project ID IJXT070-22TE60001 and Programa para la Vinculación de Empresas con Instituciones de Educación Superior y Centros de Investigación Comecyt-EdoMex número de proyecto Vinculacion/2023/009.
Disclosure statement
No potential conflict of interest was reported by the author(s).
Data availability statement
The data that support the findings of this study are available from the corresponding author, [author initials], upon reasonable request.
Additional information
Notes on contributors
Karen Mendoza-Bautista
Karen Mendoza-Bautista received a B.S. degree in biomedical engineering from the National Polytechnic Institute (IPN), Mexico City, Mexico, in 2019 and a master’s degree in Robotics and Advanced Manufacturing from the Center of Investigation and Advanced Researching (CINVESTAV) campus Saltillo, IPN, in 2022. Currently, she is studying for a PhD in Engineering Sciences at the Technological Institute of Monterrey (ITESM). She is also working as a part-time professor on the ITESM campus in Guadalajara. Her current research interests include robot control theory, surgical robots, adaptive control, and vision systems applied to robots. She has published about 3 papers in recognized technical journals.
Mariel Alfaro-Ponce
Mariel Alfaro-Ponce is an assistant professor in the Biomedical Engineering Program at Tecnológico de Monterrey Ciudad de Mexico; she received a bachelor’s degree in Biomedical Engineering, a Master of Science in Microelectronic Engineering, and a Ph.D. in Computer Science from the Instituto Politecnico Nacional, Mexico. Her research interests include artificial intelligence, rehabilitation devices, and intelligent bioinstrumentation. She has been a member of the National System of Researchers of Mexico (SNI-Level I). Since 2022, she has been the head of the Manufacturing Processes for Advanced Materials CDMX research unit.
Isaac Chairez
Isaac Chairez earned the B.S. degree in biomedical engineering from the National Polytechnic Institute (IPN), Mexico City, Mexico, in 2002, and the master’s and Ph.D. degrees from the Department of Automatic Control, Center of Investigation and Advanced Researching (CINVESTAV), IPN, in 2004 and 2007, respectively. He is currently with the National School of Sciences and Engineering of the Tecnológico de Monterrey and the Professional Interdisciplinary Unit of Biotechnology, IPN. He has published over 230 contributions in indexed journals and 300 in international conferences. He has published two books on the applications of neural networks on diverse disciplines. His current research interests include neural networks, fuzzy control theory, nonlinear control, adaptive control, and game theory.
References
- Ames, A. D., E. A. Cousineau, and M. J. Powell. 2012. “Dynamically Stable Bipedal Robotic Walking with Nao via Human-Inspired Hybrid Zero Dynamics”. Proceedings of the 15th ACM International Conference on Hybrid Systems: Computation and Control, Beijing, China, April 17–19, 135–144.
- Branicky, M. S., V. S. Borkar, and S. K. Mitter 1994. “A Unified Framework for Hybrid Control”. Proceedings of 1994 33rd IEEE Conference on Decision and Control, Lake Buena Vista, FL, USA, December 14–16.
- Capuzzi, S., and G. Timelli. 2018. “Preparation and Melting of Scrap in Aluminum Recycling: A Review.” Metals 8 (4): 249. https://doi.org/10.3390/met8040249.
- Crampin, M. 1981. “On the Differential Geometry of the Euler-Lagrange Equations, and the Inverse Problem of Lagrangian Dynamics.” Journal of Physics A: Mathematical and General 14 (10): 2567. https://doi.org/10.1088/0305-4470/14/10/012.
- Cruz, D., A. Luviano-Juárez, and I. Chairez. 2014. “Output Sliding Mode Controller to Regulate the Gait of Gecko-Inspired Robot”. Memorias del XVI Congreso Latinoamericano de Control Automático, CLCA, Cancún, Quintana Roo, México, October 14–17. Mexico City.
- Ezeah, C., J. A. Fazakerley, and C. L. Roberts. 2013. “Emerging Trends in Informal Sector Recycling in Developing and Transition Countries.” Waste Management 33 (11): 2509–2519. https://doi.org/10.1016/j.wasman.2013.06.020.
- Grimaud, G., N. Perry, and B. Laratte. 2016. “Life Cycle Assessment of Aluminium Recycling Process: Case of Shredder Cables.” Procedia Cirp 48:212–218. https://doi.org/10.1016/j.procir.2016.03.097.
- Koutsoukos, X. D., P. J. Antsaklis, J. A. Stiver, and M. D. Lemmon. 2000. “Supervisory Control of Hybrid Systems.” Proceedings of the IEEE 88 (7): 1026–1049. https://doi.org/10.1109/5.871307.
- Lennartson, B., M. Tittus, B. Egardt, and S. Pettersson. 1996. “Hybrid Systems in Process Control.” IEEE Control Systems Magazine 16 (5): 45–56.
- Levant, A. 1998. “Robust exact differentiation via sliding mode technique.” Automatica 34 (3): 379–384. https://doi.org/10.1016/S0005-1098(97)00209-4.
- Lu, Y., B. Yang, Y. Gao, and Z. Xu. 2022. “An Automatic Sorting System for Electronic Components Detached from Waste Printed Circuit Boards.” Waste Management 137:1–8. https://doi.org/10.1016/j.wasman.2021.10.016.
- Ordaz, P., and A. Poznyak. 2015a. “‘Kl’-Gain Adaptation for Attractive Ellipsoid Method.” IMA Journal of Mathematical Control and Information 32 (3): 447–469. https://doi.org/10.1093/imamci/dnt046.
- Ordaz, P., and A. Poznyak. 2015b. “‘Kl’-Gain Adaptation for Attractive Ellipsoid Method.” IMA Journal of Mathematical Control and Information 32 (3): 447–469. https://doi.org/10.1093/imamci/dnt046.
- Ortega, R., A. Loria, R. Kelly, and L. Praly 1994. “On Passivity-Based Output Feedback Global Stabilization of Euler-Lagrange Systems”. Proceedings of 1994 33rd IEEE Conference on Decision and Control, Lake Buena Vista, FL, USA, December 14–16. https://doi.org/10.1109/CDC.1994.410898.
- Polyakov, A. 2010. “Invariant Ellipsoid Method for Time-Delayed Predictor-Based Sliding Mode Control System”. 2010 11th International Workshop on Variable Structure Systems (VSS), Mexico City, Mexico, June 26–28. https://doi.org/10.1109/VSS.2010.5544685.
- Polyakov, A., and A. Poznyak. 2009. “Minimization of the Unmatched Disturbances in the Sliding Mode Control Systems via Invariant Ellipsoid Method”. 2009 IEEE Control Applications (CCA) & Intelligent Control (ISIC), St. Petersburg, Russia, July 08–10. https://doi.org/10.1109/CCA.2009.5280842.
- Poznyak, A. 2008. Advanced Mathematical Tools for Automatic Control Engineers: Volume 1: Deterministic Systems, 775. Amsterdam: Elsevier Science.
- Schlesinger, M. E. 2006. Aluminum recycling, 248. Boca Raton, Fl, USA: CRC press.
- Schmitz, C. 2006. Handbook of Aluminium Recycling, 454. Essen, Germany: Vulkan-Verlag GmbH.
- Shinzato, M. C., and R. Hypolito. 2005. “Solid Waste from Aluminum Recycling Process: Characterization and Reuse of Its Economically Valuable Constituents.” Waste Management 25 (1): 37–46. https://doi.org/10.1016/j.wasman.2004.08.005.
- Spong, M. W., and M. Vidyasagar. 1989. Robot Dynamics and Control, 343. New York, USA: John Wiley & Sons.
- Yoshikawa, T., and A. Sudou. 1993. “Dynamic Hybrid Position/Force Control of Robot Manipulators-On-Line Estimation of Unknown Constraint.” IEEE Transactions on Robotics and Automation 9 (2): 220–226. https://doi.org/10.1109/70.238286.
Appendix: Proof of the main theorem
Proof. The application of the proposed controller in (40) over the tracking error dynamics presented in (38) leads to the following equivalent differential equation:
where could take the values of or depending on the active scenario.
The substitution of the selected forms for and in (41) implies the following equivalent dynamics:
This differential equation is valid with .
The expression presented in (53) is equivalent (using the expressions of , , , and ) to the following differential form with :
Here , , , and .
Consider the mixed Lyapunov candidate function (valid for each domain for continuous time and that we can analyse after each scenario change) given by:
Here, we keep the notation in the definition of the function to emphasise the continuous-time analysis. In contrast, in the case of , we considered the values of at the moments corresponding to the change of scenario.
The time derivative of the first function in (55), , is
The substitution of the dynamics corresponding to in the derivative of leads to:
The application of a symmetrisation operator and the relationship between the inner product and the trace operator yield the following upper estimate for the time derivative of
The previous differential inclusion was obtained after applying the so-called lambda inequality , with (Poznyak 2008).
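For reference, the lambda inequality is commonly stated in the following form, where the matrices $X$ and $Y$ have the same dimensions and $\Lambda$ is any symmetric positive definite matrix of compatible size. This is the standard statement of the inequality, given here as a reminder; the specific matrices substituted for $X$, $Y$, and $\Lambda$ in this proof are not restated.

```latex
X^{\top} Y + Y^{\top} X \;\le\; X^{\top} \Lambda X + Y^{\top} \Lambda^{-1} Y,
\qquad \Lambda = \Lambda^{\top} > 0 .
```

It follows from expanding $\left(\Lambda^{1/2} X - \Lambda^{-1/2} Y\right)^{\top} \left(\Lambda^{1/2} X - \Lambda^{-1/2} Y\right) \ge 0$, which is why it is useful for bounding the cross terms between the tracking error and the uncertain terms in the Lyapunov derivative.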
Taking the assumption on the existence of positive definite solutions for the Riccati equation depending on and presented in the Theorem description and considering the class of adjustment methods for the gains , , , and , (all of them included in ) presented in (42), one gets that last inclusion can be transformed to: .
By integrating the previous inclusion and applying the comparison Lemma, it can be proven that , which is valid . Notice that this analysis is valid within each scenario where the continuous dynamics is valid.
Considering the structure of the proposed continuous form of the Lyapunov-like equation, one may confirm that the following inequality is valid
Now, taking into consideration the evolution of matrix for the object to be handled, consider the study of the candidate Lyapunov function in the discrete sense.
The second part of the proof will take advantage of the explicit discretisation applied to the dynamics of the tracking error , implying that (54) can be presented as follows:
From the expression (60), we may estimate that
Now, using these discrete dynamics on the candidate Lyapunov function , one has to estimate the variation of such function at two consecutive times when the scenario changes
This function is proposed considering that there are no jumps on the gains on the boundary between scenarios. Then, based on the proposed Lyapunov candidate function, one has the following estimate considering the discrete dynamics (61):
Using the Rayleigh inequality and the upper bound for the admissible class of external uncertainties and modelling imprecision, the following upper bound is valid
Taking the assumption on the existence of positive definite solutions for the discrete Riccati equation establishing the evolution of matrix at each change of scenario leads to
Hence, one may notice that if
This last result completes the proof.