198
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Hybrid robust disturbance rejection position-force control to adjust the motion of mobile manipulator in automatic aluminum foundry recycling industries

, &
Pages 1-19 | Received 22 Nov 2023, Accepted 09 Feb 2024, Published online: 19 Apr 2024

ABSTRACT

This study presents the development of an event-driven hybrid control for position and force tracking applied on a mobile robotic manipulator for metal recycling tasks. The suggested controller operates in a sequenced strategy starting from a fixed spot, moving the mobile device towards a targeted zone (Ωir) from where the i-th piece-to-be-recycled is attainable (considering the arm manipulation). Once the event of entering the zone is completed, the mobile robot is fixed at a position, and the end-effector of the robotic arm is enforced towards the piece-to-be-recycled. When the end-effector touches the piece in a given spot (Ωie), the hybrid control changes to the force tracking intending to carry the piece towards the spot (Ωip) where it ill be processed. Each piece location is identified based on a vision-based system that applies deep learning tools using convolutional neural networks. A multi-physics numerical simulation illustrated the application of the developed controller in a realistic scenario, showing all the elements of the event-driven operation. To validate the suggested controller, the comparison with a robust control that works on a wide range of carrying mass confirms the operational improvement of the event-driven hybrid position and force design.

1. Introduction

The interest in the recycling industry is growing continuously, considering the current problem of waste production that has increased with modern industrial development (Ezeah, Fazakerley, and Roberts Citation2013). This is particularly relevant given that almost all industries produce residual items from all their processes (Schlesinger Citation2006). Specifically, with the increase in public awareness of environmental topics, metal recycling has become of paramount relevance because incineration and landfilling represent a significant source of air and groundwater pollution (Grimaud, Perry, and Laratte Citation2016; Shinzato and Hypolito Citation2005). There is reported evidence that recycling has a significant fossil fuel energy efficiency. This has turned the issue of metal recycling into a problem for both the industry and the government due to the hygiene and health problems it may represent for the industrial workers and the general population (Capuzzi and Timelli Citation2018). In this recycling process, metal identification is a persistent challenge: an incorrect scrap inspection could increase time and cost in the process. In addition, the manual handling of metal pieces is time-wasting and may imply a relevant investment of economic and human resources. For the reasons mentioned above, automated metal recycling processes have gradually replaced the traditional method of manually sorting and processing metals, which offers increased efficiency, improved workers’ safety, and reduced environmental impact (Capuzzi and Timelli Citation2018).

The use of advanced technology and machinery to speed the gathering, sorting, and processing of various metals such as steel, aluminium, copper, and brass is called automated metal recycling. These automated systems are built to handle enormous amounts of scrap metal, guaranteeing that valuable resources are recovered and repurposed while reducing waste and energy usage (Schmitz Citation2006). The process begins by collecting scrap metal from diverse sources, such as manufacturing companies, building sites, and consumer garbage. Sensors, magnets, and conveyor belts are used in automated systems to classify gathered materials depending on their magnetic characteristics and content. This sorting step allows for the separation of ferrous metals from non-ferrous metals and other non-ferrous metals (Lu et al. Citation2022). Today, many technological methods have been generated for automatic metal sorting; some of these options are:

  • Eddy Current Separation: This technique relies on the principle of electromagnetic induction. An eddy current separator creates an alternating magnetic field, which induces electrical currents in conductive materials. When non-ferrous metals pass through, they experience repulsive forces that cause them to be ejected from the stream. Eddy current separators are particularly effective in separating aluminium, copper, brass, and other non-ferrous metals from mixed materials.

  • Optical Sorting: Optical sorting systems use advanced sensors, cameras, and image processing algorithms to identify and separate non-ferrous metals based on their optical properties. These systems can detect differences in colour, shape, and reflectivity of materials, enabling accurate identification and sorting of various metals. Optical sorting separates different non-ferrous metal types, including aluminium, copper, zinc, and alloys.

  • X-ray Transmission Sorting: X-ray transmission sorting systems utilise X-ray technology to determine materials’ atomic density and composition. These systems can differentiate between metals and non-metallic materials by analysing the X-ray attenuation characteristics. This method is especially effective in separating heavy non-ferrous metals from lighter materials, such as lead and tungsten.

  • Induction Sorting: Induction sorting technology uses electromagnetic fields to induce electrical currents in conductive materials. By analysing the induced currents and their response to the applied field, induction sorters can distinguish between various metals. This method is commonly used for sorting copper, brass, aluminium, and other non-ferrous metals.

  • Near-Infrared (NIR) Sorting: Near-infrared sorting systems utilise sensors that emit and detect near-infrared light to analyse the molecular structure of materials. Different non-ferrous metals exhibit distinct near-infrared absorption patterns, allowing for their identification and separation. NIR sorting is particularly effective in sorting aluminium, copper, and stainless steel materials.

Notice that all the previous sorting options are often combined or integrated into comprehensive automatic handling systems like robotic manipulators to achieve optimal results in precise classification and sorting. Speed and adaptability, enhancing the efficiency and effectiveness of the sorting operation, are the main design criteria of such systems. The main applications of robots are to pick and place identified objects and then grasp-specific metals based on their shape, size, or other characteristics, enabling accurate sorting and organisation into designated containers or conveyor belts. When vision systems are included in the feedback loop of the mobile manipulator, variations in colour, texture, patterns, or other visual features can be determined to establish the composition and quality of metals, aiding in more accurate sorting, among others. Unfortunately, there are some limits to robotic systems employed in metal recycling, such as their efficient conjunction with automatic control rules to accomplish the work because most of such systems only give force or position controls when sorting metals. This signifies flaws in the robot’s operation when removing metal from its place and placing it on the recycling conveyor belt. These deficiencies are derived from the changes the robot undergoes in its composition when carrying a metal that does not always have the same physical properties, mechanical design, or automatic motion. As a result, there are limitations in their application, i.e. they can be applied only in particular tasks, increased operating costs due to an increase in the energy required to operate, and operating risks due to unwanted movements that can lead to risks to human safety.

As one may identify, there is a significant necessity for developing controlled robotic systems that could be robust enough to operate under two working scenarios: the first one centred on solving the position tracking in the task space, and the second one focused on operating under the mass variation at the end-effector when the object to be recycled must be carried. Notice that the transient from the first to the second state is defined by the event when the mobile manipulator grasps the object. Hence, under the adequate mechanical configuration and the practical selection of electrical components to instrument the manipulator, there is a significant necessity for implementing an automatic controller that works in both scenarios, using the grasping event as a trigger for changing the controller’s objective (and its structure in consequence).

Reports show that the hybrid control offers stability and decreases the number of control signal updates (Branicky, Borkar, and Mitter Citation1994; Lennartson et al. Citation1996). As a result, the computational load decreases simultaneously with the energy consumption. Implementing these controllers could consider aperiodic sensing, communication, and computation crucial for controlling complex cyber-physical systems such as the recycling robotic manipulator (Koutsoukos et al. Citation2000). In the proposed hybrid control approaches, the right-hand side of the controlled robotic follows a variable structure form. This condition emphasises the idea of using hybrid control theories to design the individual controller within each stage, as well as the discrete variation of the controller when the grasping event occurs. For this reason, this paper proposes computer aid studies of a hybrid position-force control to adjust the robot’s motion in a non-ferrous metal recycling process (Yoshikawa and Sudou Citation1993).

The main contributions of this study are the following:

  • A visual-based neural network for recyclable pieces recognition, location estimation, and object pose.

  • A hybrid controller feedback operating on the neural network estimation for performing its self-navigation towards Ωir.

  • The design of a tracking trajectory hybrid position force control that deals with external forces that affect the behaviour of the mobile manipulator.

The order of this manuscript is the following: Section 2 presents the central notation, and a few mathematical preliminaries are provided regarding the class of event-driven systems. Section 3 describes the problem related to the automation manipulation of objects to be recycled. Section 3 presents the considered mathematical model for the mobile robot, the associated robotic manipulator, and their dynamic characteristics, as well as the motion limitations. Section 4 presents the problem statement related to the tracking trajectory problem for mobile robotic manipulators under state restrictions considering the recycling tasks. Section 5 details the control design, including the construction of the adaptive gain form and the reasoning behind stabilising the error in the tracking space. Section presents a strategy to optimise the convergence zone for the tracking error, a positively invariant set. The same section presents a methodology that could use only the output information from the system to derive the close-loop control design. Section7 presents the numerical results showing the comparative evidence on the proposed control design and a couple of benchmark alternatives.Section 8 closes the study with some final remarks and future trends.

2. Notation and preliminaries on hybrid systems

The mathematical notations used in this work are as follows: The set of real scalars is defined by R; the collection of non-negative real numbers is characterised by R+; the set of n-dimensional vectors is represented by Rn; and the Euclidean norm for zRn is indicated by z. The symbol T denotes a vector or matrix’s transpose, while its matrix inverse if its determinant is non-zero, is denoted by the symbol Π1. A square matrix with all zeros is represented by the notation 0n, and the identity matrix with n rows and columns is indicated by the notation In.

A hybrid system displays a combination of continuous and discrete dynamical behaviour that can switch between these modes depending on the occurrence of an event, often characterised by holonomic state constraints. In the case of the industrial robotic system for recycling tasks, continuous dynamics come into play when the robot is moving towards the object and when the robot is carrying the object before depositing it in the position where it will be processed. On the other hand, discrete dynamics are activated when the robot grasps and releases the object at the correct spot.

The mechanical section of the manipulator dynamics follows a Lagrangian model, constituting the continuous component. In contrast, the hybrid model’s discrete aspect comprises a set of impact equations that instantaneously alter the velocities of generalised variables. We adhere to the standard representation of a hybrid model.

Definition 1.

(Ames, Cousineau, and Powell Citation2012) The following tuple defines a non-autonomous hybrid control system:

(1) H=Γ,D,U,G,R,FG.(1)

Here, the symbol Γ=V,E represents an oriented graph with V and E representing sets of vertices and edges correspondingly. A source function sor:EV and a target function tar:EV that links an edge with its corresponding source and target, respectively, link these sets. The domain set D=DvvV where DvRnv×Rkv represents a smooth manifold for Rnv×Rkv where Rkv represents the control space. The set U=UvvV is the set of admissible controls with UvRkv. The set G=GeeE represents the guards where GeDsor(e). The set R=ReeE corresponds to the reset maps, considering that Re:GeDtar(e) is a smooth map. In addition, the set FG=(fv,gv)vE defines the dynamics of a controlled system (the mobile manipulator in this study) on xDv and uUv, where (fv,gv).

The permissible variations in velocity considered in the dynamics of the automated manipulator warrant the use of hybrid flows or hybrid executions as viable solutions for the hybrid system. The following definition outlines the dynamics of the hybrid system, which can be employed to describe the motion of the robotic device.

Definition 2.

Unilateral Constraints. According to the hybrid model describing the mobile manipulator, some domains and guards are derived from unilateral constraints. The definition of a unilateral constraint is given by a tuple h=(Q,L,δ), in which Q denotes the configuration space (usually Rn when generalised variables are unconstrained), L:TQR is the hyper-regular Lagrangian, and δ:QR represents a unilateral restriction on the configuration space, guaranteeing that the zero-level set δ1(0) forms a smooth subspace.

Notice that the unilateral constraint h determines the corresponding guard, reset map, and vector field for the continuous dynamics. Concerning the domain of the unilateral constraint, the provided guard satisfies:

(2) Dh=q,dqdtTQ:hq0,Gh=q,dqdtTQ:hq=0,dhdtlt0.(2)

The vector qRn, represented as q=qii=1,,nT, describes the joint configuration of the robotic manipulator. When a unilateral constraint is only valid within specific domains of the model, particularly depending on a specific combination of generalised variables, it is referred to as a holonomic constraint, denoted as η(q) (notably excluding dq/dt). Holonomic constraints can be incorporated into the control system dynamics by applying LaGrange multipliers. The common procedure involves differentiating the constraint η(q), which results in the equation Aqdqdt=0, where Aq=∂η∂q. Consequently, the Euler-LaGrange model for the continuous dynamics of the mobile manipulator is represented as (Crampin Citation1981; Ortega et al. Citation1994):

(3) Mqd2qtdt2+Gqt,dqtdt,t+ATqtληqt,dqtdt,ut,t=ut,(3)

Here the inertia matrix is M:RnRn×n, and the function containing the right-hand sides of the two differential equations is represented by G:Rn×Rn×R+Rn. The starting conditions of (3) are represented by q(0) and dq0dt respectively.

The set of direct current (DC) motors that are regulated by the control that is in line with the motor’s voltage is represented by the signal uRn. λη:TQ×RkRc represents the LaGrange multiplier, and Rc denotes specific model domains where constraints are in effect. By differentiating η(q) twice, the precise expression of λη is obtained:

(4) dAqdtdqtdt+Aqd2qdt2=0.(4)

The substitution of (3) in (4) and a suitable algebraic manipulation leads to

(5) ληq,dqdt,u,t=ΓAdAqdtdqdt+AqM1qutΓAAqM1qGqt,dqtdt,t,(5)

where ΓA=A(q)M1qAT(q)1. In order to fulfil the holonomic constraint while transporting the recyclable item, the LaGrange multiplier applies force to the mobile manipulator. Both the domain and the guard sets can be defined (using the LaGrange theorem) according to the multiplier’s value as follows:

(6) Dη=q,dqdtTQ×Rk:λη0,Gη=q,dqdtTQ:λη=0,qdtlt0.(6)

The controlled system is defined by applying the additional force, represented by the LaGrange multiplier. This is represented by:

(7) ddtξ1t=ξ2t,ddtξ2t=fξ1t,ξ2t+gξ1tut+ηξ1t,ξ2t,t,(7)

where ξ1=q and ξ2=dq/dt define the states of the mobile manipulator with ξ1(0) and ξ2(0) defined in (3). The functions f:Rn×RnRn and g:RnRn×n are:

(8) f=M1ξ1Gξ1,ξ2,tATξ1ΓAdAξ1dtξ2+M1ξ1Aξ1ΓAAξ1M1ξ1Gξ1,ξ2,t,g=M1ξ1In×nΓAAξ1M1ξ1.(8)

The nonlinear vector field f satisfies the Lipschitz condition locally, with the fixed positive constant Lf (Poznyak Citation2008).

The term η:Rn×Rn×R+Rn represents the effects of external perturbations and internal uncertainties. The model includes this term to consider a more realistic version of the mobile manipulator.

Assumption 1.

The admissible class of uncertainties – perturbations η belongs to the following admissible set

(9) f=M1ξ1Gξ1,ξ2,tATξ1ΓAdAξ1dtξ2+M1ξ1Aξ1ΓAAξ1M1ξ1Gξ1,ξ2,t,g=M1ξ1In×nΓAAξ1M1ξ1.(9)

The symbol Θ is a compact, convex set such that 0Θ. To characterise the movement of the mobile manipulator near around the constraint set that contains η, it is not sufficient to include the LaGrange multiplier. The so-called kinematic constraint, which specifies the link between velocities before and after carrying the object to be recycled, explains this behaviour.

Definition 3.

A tuple κ=Qs,Qt,M,ϑ,ι,J defines the configuration spaces of the source domain and target domain, which are denoted by the following terms Qs and Qt, respectively. Complementary, M:QsJ1QtRs×s is the manipulator inertia matrix where s=dimQsJ1Qt, ϑ:QsJ1Qt is a smooth enough vector field which typically defines the position of the final element of an open kinematic chain, ι:QsJ1Qt is and embedding with push-forward ι:TQsTQsJ1Qt and J is an invertible matrix to relabel generalised coordinates in the domain attained after the constraint manifold is left.

The vector-valued function s should consider the entire set of d constraints of both the unilateral and holonomic natures if they exist. The so-called canonical projection needs to be taken into account in the case of manipulator movement. This canonical projection complies with the equation π:QsJ1QtJ1Qt, which necessitates the use of the composite map π:TQsJ1Qt. The impact equations can be introduced using kinematic constraints.

Definition 4.

The rigid impacts of the mobile robot’s end-effector with the handled object yield a set of possible discrete jumps over the unilateral and/or holonomic constraints. Formally, these effects can be represented as impulses at the precise locations that the constraints define. In the embedded space QsJ1Qt, let us consider the kinematic constraint κ=Qs,Qt,M,ϑ,ι,J and the generalised coordinates q. Next, let us define a map P:TQsJ1QtTQtJ1, which describes the velocities following impact, and is described by

(10) Pqt,dqtdt=dqtdtM1qtEqtEqtM1qtEqt1Eqtdqtdt,(10)

where Eq=∂κ∂q. The map that resets the coordinates in the target domain is defined by R:TQsTQt and satisfies

(11) Rqst,dqstdt=J00JπιqsPιqst,dqstdt.(11)

The regular movement of the mobile manipulator during the individual object handling cycle enforces two independent stages (because no dynamic transition is considered when the object is grasped). The active device belongs to a class of hybrid systems with holonomic constraints defined by the transition between stages. The mobile manipulator’s mechanical section was acquired to explain the robotic device’s continuous flow within each step, both before and following the grasping. This simplified model could complete the controller design suggested in this study.

3. The mobile manipulator and the changing scenario for aluminium recycling

The mobile manipulator considered in this study comprises a wheeled mobile (skid steering configuration with four independent wheels) robot with a differential configuration and a 7-degree-of-freedom robotic manipulator. This device operates in a non-structured scenario where the objects to be recycled (tyre rims in the example considered in this study) can be placed anywhere in the working procedure. shows a sample of the working procedure, including a feasible organisation of the objects and the mobile robotic device.

Figure 1. Motion example of the mobile manipulator showing the tasks that must be completed before grasping the object, during the contact, and after carrying it.

Render of the operative scenario for the mobile manipulator in different times of task development.
Figure 1. Motion example of the mobile manipulator showing the tasks that must be completed before grasping the object, during the contact, and after carrying it.

demonstrates an overlapped sequence of the mobile manipulator performing diverse activities in the working cycle from an initial position at them T0 to a final position at time T0+ΔT0. Within this time window, the manipulator must move the end-effector to locate the following object that must be collected. Once the object has been located, the mobile manipulator must move towards the detected object to be recycled in a period defined by TA after T0+ΔT0. In this period, the manipulator must attain a specific region near the object from where it is attainable by the manipulator. Once the manipulator enters the mentioned region, then the manipulator must move to catch the object within a given period defined by TA+ΔTA. Notice that the robotic dynamics’ right-hand side changes now, considering the object’s handling. Once this process has ended with grasping the next object to be recycled, the robot must move towards a region form where the mobile manipulator can release the handled object to be further processed. Such part of the process must be completed after a time TB after the mobile manipulator reaches the region from which the recycled object can be released. After a time TB+ΔTB, the object must be located at the correct stop from where it will be processed. This task ends an entire cycle for a given specific object.

According to the motion dynamics shown in , there are two sequential stages. The first one comprehended between T0 and TA+ΔTA, and the second one started at TA+ΔTA and ended at TB+ΔTB. The working cycle is completed for each set of two sequential stages.

4. Model of the mobile manipulator

This section describes the mathematical model of the mobile manipulator, which is a coupled system comprising the mobile autonomous terrestrial vehicle (ATV) and the robotic manipulator. The motion details, the state restrictions, and the potential modification in each of the stages of the working cycle are explained.

4.1. Autonomous terrestrial vehicle

To describe the position of the ATV in the working environment depicted in , two motion reference frames were defined:

  1. Inertial Coordinate System: This coordinate system is a global frame fixed in the ATV’s working environment. The position for this frame is defined by the coordinates tuple (x0,y0,z0).

  2. Robot Coordinate System (relative): This frame is attached to the ATV centre of mass, and it is denoted by the coordinates tuple (xrm,yrm,zrm).

displays the configuration of both reference frameworks and the matching set of variables that specify the mobile manipulator’s motion. An additional weight is placed over the ATV to balance the centre of gravity on the mid-point on the axis between wheels, coinciding with the first rotating axis for the first joint in the arm.

Figure 2. Scheme of the robot with all its frames.

Render of the mobile manipulator with all its joints reference frameworks.
Figure 2. Scheme of the robot with all its frames.

We assume that the robot’s centre of mass, designated as point C, is positioned along the axis of symmetry. As illustrated in , the robot’s position and orientation in the Inertial Frame can be represented as q0=x0y0θT.

Given that z0=zrm in our design, we assumed no temporal fluctuation over z. Hence, the triplet qrm=xrmyrmθrmT represents the position of any material point on the robot in the robot frame; qI=xIyIθIT represent the position of any material point on the robot with respect to the inertial reference frame. The coordinates of the specified point in the inertial frame and robot frame, respectively, make up the components of this vector.

The following transformation relates to these coordinates:

(12) qI=R(θ)qrm,(12)

Here R(θ) is the orthogonal rotation matrix with respect to the z-axis (Spong and Vidyasagar Citation1989).

This transformation defines the motion relationship between frames.

(13) ddtqI(t)=R(θ(t))ddtqrm(t).(13)

4.1.1. Kinematic model

Here, we demonstrate that the motion of the selected ATV can be described by two non-holonomic constraint equations, which are derived by making two key assumptions:

  1. No lateral slip motion restriction that implies the ATV can only travel in a curved motion (forward and backward) with a minimum curvature radius but not sideways. This condition in the relative frame indicates that the velocity of the centre-point A along the lateral axis is zero at any moment during the motion task, that is yrm=ddtyrm=0, ∀t0. These two conditions can be represented by the following single non-holonomic condition:

    (14) ddtx0\cdotsinθ+ddty0\cdotcosθ=0.(14)

  2. The pure rolling restriction denotes the fact that each wheel has just one point of contact with the ground (p). The wheel does not slip along its longitudinal axis (xr) and does not skid along its orthogonal axis (yr).

The velocities of the contact points in the relative frame vR and vL are connected to the velocities ddtϕR and ddtϕL of the right (R) and left (L) wheels by:

(15) vR=ρddtϕRvL=ρddtϕL,(15)

here ρ is the radius of each wheel. These velocities can be estimated in the inertial frame as a function of the velocities of the ATV centre-point A:

(16) ddtxp,R=ddtx0+Lcosθddtθ,ddtyp,R=ddty0Lsinθddtθ,ddtxp,L=ddtx0Lcosθddtθ,ddtyp,L=ddty0+Lsinθddtθ.(16)

The linear velocity of the ATV in the robot frame, which is the average of the linear velocities of the two wheels, is the linear velocity of each driving wheel in the robot frame.

(17) v=vR+vL2=ρ2ddtφR+φL,(17)

and the angular velocity of the ATV is

(18) ω=vRvL2L=ρ2LddtφRφL.(18)

Considering the case that the material port qrm corresponds to point A (implying that v=ddtqrm), the linear and angular velocities of the ATV can be expressed as follows:

(19) ddtxrmyrmθrm=12L00ρρddtφ,φ=φRφL.(19)

The translational and angular velocities can also be obtained in the inertial frame as follows:

(20) ddtqI=ddtxIyIθI=12Lcosθcosθsinθsinθρρddtφ.(20)

The forward kinematic model of the TAR is represented by EquationEquation (20).

Given the contact points’ velocities and the corresponding constraint equations can be rewritten as follows with q=x0,y0,θ,φR,φLT:

(21) Λqddtq=0,Λq=sinθcosθ000cosθsinθLρ0cosθsinθL0ρ.(21)

4.1.2. Dynamic modeling of the ATV system

For the ATV that is moving in a two-dimensional plane, the potential energy is constant, and hence, the dynamic model of such a vehicle can be presented as follows:

(22) ddt∂Ttq˙∂Tt∂q=Qnopott+ΛTqtλ.(22)

The dynamic model is necessary for both the creation of motion control algorithms and the simulation analysis of the ATV motion. The following equations of motion can describe the dynamics in the presence of non-holonomic restrictions:

(23) MATVqtd2qtdt2+CATVqt,dqtdtddtqt=BATVqtuATVtΛqtλt+ϑddtqt,qt,t,(23)

where MATV:R5R5×5 is the inertia matrix, CATV:R5R5×5 is the centripetal and Coriolis matrix, BATV:R5R2 is the input matrix, and uATV is the input vector. The term ϑ:R5×R5×R+ represents all the uncertainties and the perturbations affecting the ATV motion. Furthermore, the expressions for the matrices MATV, CATV, and BATV: are the following:

(24) MATVq=m0mdsθ000mmdcθ00mdsθmdcθI00000Iω00000Iω,CATVq,q˙=00mdddtθcθ0000mdddtθsθ00000000000000000,\breakBATVq=1000001000,(24)

where m=mc+2mw is the total mass of the ATV and is equal to the mass of the MR without the driving wheels and actuators (mc) plus the mass of each driving wheel with the actuator (mw), I=Ic+mcd2+2mwL2+2Im is the total equivalent inertia; here Ic is the moment of inertia of the MR about the vertical axis through the centre of mass, Iw is the moment of inertia of each driving wheel with a motor about the wheel axis, and Im is the moment of inertia of each driving wheel with a motor about the wheel diameter. Introducing the relationship between q and φ, the dynamics of q and φ are related by

(25) ddtq=Sqddtφ,Sq=0.5ρcθρsθρL20ρcθρsθρL02.(25)

Given that S is in the null space of the restriction matrix ΛT, then

(26) d2dt2q=ddtSqddtφ+Sqd2dt2φ.(26)

The substitution of (26) in (23) leads to

(27) MATVqtddtSqtddtφt+Sqtd2dt2φt+CATV\breakqt,dqtdtddtqt=BATVqtuATVtΛqtλt\break+ϑddtqt,qt,t.(27)

Multiplying both sided by ST and using its relationship with matrix Λ, one gets

(28) MATVqtddtSqtddtφt+Sqtd2dt2φt+CATV\breakqt,dqtdtddtqt=BATVqtuATVtΛqtλt\break+ϑddtqt,qt,t.(28)

This form represents the model to be controlled simultaneously with the robotic arm. Notice that (28) can be represented as follows

(29) ddtφat=φbt,ddtφbt=Fφat,φbt+GqatuRMt+Ξqt,ddtqt,t.(29)

Here φa=ϕ and φb=ddtϕ, while F:R5×R5R2 and G:R5R2×2 defines the inner dynamics of the ATV system with Fq,ddtq=Mφ1qCφq,ddtqφb and Gq=Mφ1φa. The term Ξ:R5×R5×R+R2 represents the effect of uncertainties and perturbations induced by the arm and some non-modelled elements in the recycling facility. The term Ξq,ddt,t satisfies the following inclusion (implying the corresponding bounds for perturbations as well as uncertainties), according to the mechanical structure of the robotic arm:

(30) Ξq,ddtq,tΞ0,Ξ0>0.(30)

4.2. Robotic arm

The proposed robotic arm device defines six active joints (degrees of freedom). The Euler-LaGrange equations, a theory of analytical mechanics, are used to establish such a device’s mathematical structure (considering the joint motion’s mechanical dynamics). The aforementioned model’s time dependence establishes the acceleration dynamical model in the chosen generalised coordinates ζa and ζb as

(31) ddtζat=ζbt,ddtζbt=fζat,ζbt+gaζatuRMt+ξζat,ζbt,t,(31)

whereas ζb relates to the time derivative of the generalised coordinates, the variable ζa refers to the generalised coordinates that describe the motion of the manipulator articulations. In this case, f:Z×ZR6, defines the drifted dynamics associated to the manipulator device; the control associated dynamics are characterised by ga:ZR6×6; and the dynamics associated with the control are defined by ξ:××RR6 represents a modelling error’s impact on the manipulator dynamics and an abstraction of external perturbations; uRMR6 corresponds to the control action’s impact on the manipulator dynamics. ξζb,ζa,t satisfies the following inclusion (implying the corresponding bounds for perturbations as well as uncertainties), according to the mechanical structure of the robotic arm:

(32) ξζb(t),ζa(t),tξ0,ξ0>0.(32)

In view of (32) notice that ξζb(t),ζa(t),tLn.

Taking into account the fundamentals of the Euler-LaGrange approach, both vector functions f, and ga are described as:

(33) fζa,ζb=M1ΛTζaλ+Cζa,ζbζb,gaζa=M1ζaBζa.(33)

Taking into account the Euler-LaGrange basis, the condition gbζaΓgb0,gb0>0 for all ζa and the corresponding positive matrix Γ, using the Frobenius norm.

The mechanical robotic arm establishes motion restrictions for ζa and ζb. Without loss of generality, the next class of asymmetric constraints is considered in this study. Such motion limitations are represented as:

(34) Ω=ζa,t|<ζa,i<ζa,i<ζa,i+<+.(34)

The feasible positions for the manipulator end-effector could be calculated using direct kinematics and the corresponding Jacobian in light of the motion coordinate restrictions taken into account in (34). It should be noted that the control issue can be resolved by considering the end-effector’s motion, which may be connected to the joint dynamics when the inverse kinematics of the AM is used.

4.3. Integrated dynamics of the mobile robot

Based on the individual models of the ATV and the robotic arm, we can present the aggregated model represented as follows using the following definitions ςaT=φaT,ζaT and ςbT=φbT,ζbT:

(35) fζa,ζb=M1ΛTζaλ+Cζa,ζbζb,gaζa=M1ζaBζa.(35)

Here, we used the notation uσ(t) to establish the application of the variable control form σ(t) when the task scenario changes from the approach to the object to its handling towards the deployment. Hence, σ(t)=1 in the first scenario, while σ(t)=2 when the second state happens.

4.4. On the reference trajectories for the mobile manipulator

To define the problem to be solved in this study, introduce the variable Δa corresponding to the tracking error between the trajectories of the combined robot (ATV and RM) and a suitable set of references ςaR8. The reference dynamics satisfies the following differential equation

(36) ddtςat=ςbt,ddtςbt=Ψσtςat,ςbt,t.(36)

with ςaR8 and its derivative ςbR8. The initial conditions are given vectors in (36). The function Ψσ(t):R8×R8×R+R8 is a Lipschitz function that defines the reference acceleration within each stage of the robotic motion for the ATV and the RM. Notice that this function is proposed to change depending on the mobile manipulator’s active mode, given that the referred trajectories may not be differentiable in the edges, even if they are continuous.

Various methods can address the challenge of designing reference trajectories within each continuous domain. This issue is commonly referred to as trajectory planning for robotic systems. Among the widely used approaches is the Bezier polynomial, which involves defining a set of controls to determine a Bezier curve with points from P0 to Pν, where ν denotes its order (e.g. ν=1 for linear, ν=2 for quadratic, etc.). The first and last control points always correspond to the curve’s endpoints, representing the initial conditions after resetting and the achieved final state in each domain. However, the intermediate control points, if present, typically do not lie on the curve. This paper explores an alternative approach to generating reference trajectories. The concept involves constructing the reference trajectory by interpolating sigmoid functions, following the methodology proposed in (Cruz, Luviano-Juárez, and Chairez Citation2014). These functions are differentiable multiple times, allowing for the regulation of their transient trajectory from one steady state to another. Each component of the reference trajectories can be estimated using the following sigmoidal representation. The specific form of the sigmoid function employed in this article is:

(37) ςat=|ςa,it|i=1,...,ν,ςa,it=ai1+biettk.i+ci,∀tTi,Ti+Ti,w,(37)

where the scalar parameters ai, bi, and ci are present, and tk,i denotes the sigmoid function’s inflection point. The reader is directed to the study given in (Cruz, Luviano-Juárez, and Chairez Citation2014) for more information on the design method for the reference trajectories using sigmoid functions.

By using a derivation procedure, hi(t)=aibiet(biet1)×1+biet3, one can easily obtain the expression of each component for the reference function δ based on the structure of the sigmoid function proposed in (37). In the vector field δ, the ith component is the function δit.

5. Problem formulation

Based on the integrated dynamics presented in (35) and the reference trajectories (36), the dynamics of the tracking error Δa is governed by the following differential equations:

(38) ddtΔat=Δbt,ddtΔbt=Ψσtςat,ςbt,tFςat,ςbt\break+Gςatuσtt+ηςat,ςbt,t.(38)

hich have a practically stable equilibrium point on the origin within each time window for each continuous mode. According to the definition of the hybrid system presented in (1), shows the configuration of the mobile manipulator throughout the interaction with the object to be handled. The sense of the vertices is characterised by the scenarios shown here and the edges shown as the interconnection lines. The set U is characterised by both controllers operating in each scenario and the dynamics FG, which are defined by the ordinary differential EquationEquation (38). The set of guards and resets is defined by the limits illustrated as ellipsoids in the centre of the referred figure, with resets defined by null jumps, given the reference trajectories’ continuity (and the no necessary differentiability on the edges).

Figure 3. Operating diagram for the MR considering the definition of the hybrid system used in this study.

Figure 3. Operating diagram for the MR considering the definition of the hybrid system used in this study.

According to the mobilisation problem, the edges of the hybrid system are defined by E=E1Δσ(t)=1,E2Δσ(t)=2. Here Δσ(t)=1 and Δσ(t)=2 define the tracking error for the corresponding scenario characterised with σ(t)=1 or σ(t)=2. The detected edges are

(39) matrixE1Δσt=1=||Δσt=1||=ρr,1,breakE2Δσt=2||Δσt=2||ρr,2.cr(39)

Here ρr,1 defines the maximum distance from where the manipulator could reach each object to be handled, while ρr,2 establishes the distance from where the handled object could be released at the position where it will be processed in the recycling process.

The control set U is defined by both controllers applied for the selected scenarios U=u1Δσ(t)=1,u2Δσ(t)=2 with

(40) u1(t|k)=G1ςa(t)Ψ1ςa(t),ςb(t),t|kFςa(t),ςb(t)W1Δσ(t)=1(t|k)u2t|k=G1ςa(t)Ψ2ςa(t),ςb(t),t|kFςa(t),ςb(t)W2Δσ(t)=2(t|k).(40)

Here V1Δσ(t)=1(t) and V2Δσ(t)=2(t) are given by

(41) W1Δσt=1(t|k)=KP,1(t|k)Δt+KD,1(t|k)ddtΔt+KI,1(t|k)\breakτ=0tΔτW2Δσt=2(t|k)=KP,2(t|k)Δt+KD,2(t|k)ddtΔt\break+KI,2(t|k)τ=0tΔτ+KII,2(t|k)τ=0ts=0τΔsdsdτ.(41)

The gains KP,1R8×8, KD,1R8×8, and KI,1R8×8 that operate in the scenario related to pure tracking trajectory without payload position and the gains KP,2R8×8, KD,2R8×8, KI,2R8×8, and KII,2R8×8 working for solving the tracking in the presence of payload are state dependent. The index kN refers to the kth object to be handled in the recycling procedure.

The laws used to obtain the evolution of such gains are the following:

(42) W1Δσt=1(t|k)=KP,1(t|k)Δt+KD,1(t|k)ddtΔt\break+KI,1(t|k)τ=0tΔτW2Δσt=2(t|k)=KP,2(t|k)Δt\break+KD,2(t|k)ddtΔt+KI,2(t|k)τ=0tΔτ+KII,2(t|k)τ=0ts=0τΔsdsdτ.(42)

The initial conditions for the adjustable matrices are KP,1(0|k)=KP,1(tk1|k), KD,1(0|k)=KD,1(tk1|k), KI,1(0|k)=KI,1(tk1|k), KP,2(0|k)=KP,2(tk1|k), KD,2(0|k)=KD,2(tk1|k), KI,2(0|k)=KI,2(tk1|k), and KII,2(0|k)=KII,2(tk1|k). These selections imply no jumps for the evolution of the gains.

The matrices ΛP,1R8×8, ΛD,1R8×8, ΛI,1R8×8, ΛP,2R8×8, ΛD,2R8×8, ΛI,2R8×8, and ΛII,2R8×8 are positive definite, constant and symmetric. The initial conditions for (42) KP,1(0|k), KD,1(0|k), KI,1(0|k), KP,2(0|k), KD,2(0|k), KI,2(0|k), and KII,2(0|k) are giving as part of the resetting process on the scenario change. Besides, the time varying gains are K˜II,σ(t|k)=KII,σ(t|k)KII(k), K˜I,σ(t|k)=KI,σ(t|k)KI(k), K˜D,σ(t|k)=KD,σ(t|k)KD(k), and K˜P,σ(t|k)=KP,σ(t|k)KP(k).

The matrices P1(k)R8×8, P1(k)=P1T(k) and P2(k)R8×8, P2(k)=P2T(k) satisfy the following Riccati-like matrix equations:

(43) PkAk+BKσk+Ak+BKσkTPk+PkBΣ11\breakBTPk+QC=0,ΥT(tk1|k)Pk+ΩΥ(tk1|k)Pk+QD=0,(43)

Here, the matrices that define the Riccati-like matrix EquationEquations (43) are defined by

(44) PkAk+BKσk+Ak+BKσkTPk+PkBΣ11BTPk\break+QC=0,ΥT(tk1|k)Pk+ΩΥ(tk1|k)Pk+QD=0,(44)

The adaptive PID controller (including its variation with extended uncertainties and perturbations compensation term) is considered in this study because it is regarded as one of the best ways to control electromechanical devices due to its capacity to approach reference trajectories and reject bounded perturbations. A desirable characteristic of any controller forcing the motion of a mobile robot, such as the robotic manipulator, is the ability to reduce the energy consumed during the operation in both scenarios. This is achieved by including the adaptive gains in (41).

6. Main result on the control design method

This section presents the main result of this study regarding the controller’s design, the differential equation’s existence for the control gains, and the reason that explains the existence of the stable trajectories for the hybrid system.

6.1. Main theorem of the study

The next Theorem summarises the main results obtained in this manuscript.

Theorem 6.1.

Consider the tracking error dynamics given in (38), regulated with the law uUv proposed in (40) adjusted with the gains given in (42). Consider the assumptions regarding the admissible sets of external perturbations and uncertainties given in (30) and (32). If there exists positive definite matrices QC and QD(k) such that the matrix inequality (43) has a positive definite solution P(k), then the trajectories of the tracing error are stable in the Lyapunov sense with respect to the zero trajectories, that is

(45) limκk=1κ1Πk12ΔT(tk1)QD(k)Δ(tk1)λmaxBTI+Ω1Bη0,(45)

where Πk1=tktk1 and Ω is a positive definite matrix of appropriate dimensions.

The controller proposed in Theorem 1 assumes that both the articulation and the angular velocity for each joint can be measured simultaneously. Notice that the expression (45) can be reduced only if the matrix P(k) is large in some sense.

The proof of this Theorem is presented in the Appendix of this manuscript.

6.2. Optimization of convergence region

The boundedness result proposed in Theorem 1 is a consequence of the non-matched uncertainties and the presence of external perturbations that may affect the orthosis movement. The invariant ellipsoid method can minimise the bound of the convergence region (Ordaz and Poznyak Citation2015a, Citation2015b; Polyakov Citation2010).

Definition 5.

(Polyakov and Poznyak Citation2009) The proposed ellipsoid

(46) E(P1)=zRn:zTP1z1,P1>0,(46)

is characterised with a centre placed at the origin and the configuration positive definite matrix P1 which is said to be invariant for the trajectories of the system (38) if

  1. the initial condition z(0)E(P1) implies z(t)E(P1) for all t0,

  2. the initial condition z(0)E(P1) implies z(t)E(P1) as t.

The potential discovery of an invariant ellipsoid characterised by the matrix P1, with the maximum dimensions, facilitates the minimal deviation of all conceivable trajectories from the origin. While this understanding aligns seamlessly with the outcome elucidated in (43), it necessitates the establishment of a metric or criterion for determining the magnitude of the matrix P(k). In this context, the trace operator serves as the chosen criterion, wherein trP(k)max or trP(k)1min.

In Theorem 1, the existence of the invariant set, where the trajectories of Δ(tk1) converge, has already been established for each evaluation period Πk1.

In accordance with the definition outlined in (43), the configuration matrix P(k) adheres to the concept of an invariant ellipsoid when taking into account the inequality (45). To determine the solution for the minimal invariant ellipsoid, the following Lemma provides sufficient conditions for designing the gain controller.

Lemma 1.

If the tupple (α:=λmaxBTI+Ω1Bη0, P(k)) is a solution of the optimisation problem trP(k)max, subjected to P(k)>0, α>0 and

(47) P(k)A(k)+BKσ(k)+A(k)+BKσ(k)TP(k)+QCP(k)P(k)BΣ11BT0,(47)

Then, the corresponding controller proposed in (40) guarantees that any trajectory of the system (38) converges to a quasi-minimal ellipsoid E(P(k)) within each scenario for the mobile robot.

The proof of this Lemma follows straightforwardly from the application of the Schur complement on (43).

6.3. Output-based adaptive PID controller

The proposed solution for addressing the output-based tracking problem consists of an output feedback controller, comprising a robust exact differentiator (RED) as suggested in (Levant Citation1998) for obtaining velocity estimation. Additionally, a modified version of the adaptive proportional-derivative controller (APD) is employed to track reference trajectories. The velocity estimation is accomplished using this differentiator based on the Super-Twisting algorithm (STA). Employing the STA, the PID control is then selected as part of the control strategy.

(48) u1(t|k)=G1ςatΨ1ςat,ςbt,t|kFςat,ςˆbtW1Δˆσt=1(t|k),u2t|k=G1ςatΨ2ςat,ςbt,t|kFςat,ςˆbtW2Δˆσt=2(t|k),(48)

where Δˆ is the estimated vector of the tracking error. The application of the super-twisting algorithm (STA) as a robust and exact differentiator (RED) is based on the following description: If w1t=rt, where r(t)R is the signal to be differentiated, and w2(t)=r˙(t) represents its derivative, the following auxiliary equation is obtained under the assumption r¨(t)r+.

(49) ddtw1t=w2t;ddtw2t=r¨t.(49)

The previous differential equation is a state representation of the signal r(t). The STA algorithm to obtain the derivative of rt looks like

(50) ddtw¯1(t)=w¯2(t)λ1|Δw(t)|,1/2sign(Δw(t)),ddtw¯2(t)=λ2sign(Δw(t)),Δw=w¯1(t)w1(t),d(t)=ddtw¯1(t)(50)

where λ1,λ2>0 are the STA gains. Here d(t) is the output of the differentiator (Levant Citation1998). In this equation,

(51) sign(z):=1ifz>01,1]ifz=01ifz<0(51)

To implement the control method proposed in this article, the algorithm shown in was developed. The results of this algorithm are showing in the next section.

Figure 4. Employed algorithm flow.

Figure 4. Employed algorithm flow.

7. Numerical results

In this section, the results are obtained by implementing the scenario for recycling car wheels in Coppeliasim with the proposed mobile robot. To configure the scenario, Matlab and Coppeliasim were set to work in supervised synchronous mode by the API function for Matlab. The declaration of commands and functional programming was developed in Matlab. On the other hand, the speed of the robot’s active joints was measured using virtual instrumentation in Coppeliasim.

Furthermore, those results compare setting an automatic control algorithm with extended state feedback or Proportional Integral Derivative (PID) control and with a law that considers the implementation of adaptive gains over a configuration of disturbance rejection control form. The initial gains setting for the robotic device are shown in .

Table 1. Initial gain’s condition of the robotic device.

As has been mentioned, the task of taking one wheel to the recycling band is developed in four stages. The first stage corresponds to the period when the mobile robot approaches the object. The second one corresponds to the period when the object is lifted. The third refers to the case when the lifted object is moved towards the conveyor. The fourth and last stage corresponds to the period when the object is released.

The application of inverse kinematics allowed us to construct the set of reference trajectories for all the joints in the Kuka robot to reach the wheel and then leave it in the recycling band. Moreover, the difference between the wheel and the home position is computed to orientate the robot and move it close to the wheel.

Each stage of the Kuka robot is executed in an elapsed time of 120 s. Those events’ trajectories and movements are shown in . Both controllers show asymptotic convergence for the tracking errors. This fact can be appreciated in , demonstrating that tracking errors are monotonically decreasing, specifically in those joints that change their position between the stages and in the case of those joints that have to maintain the position, the error is ± 2°, like in the case of q1,3,5,7.

Figure 5. Trajectory tracking for each DoF of the Kuka robot.

Figure 5. Trajectory tracking for each DoF of the Kuka robot.

After analysing the trajectory tracking of each joint, it is possible to observe that each stage change converges to the desired position about 60 s after the start of the corresponding stage. During stage 1, developed by the manipulator, both control laws reflect a similar performance. However, during stages 2 and 3, it is possible to observe a better performance in tracking the trajectories by the proposed controller (ADRC). Although during stage 2, there is no change in the manipulator dynamics, the motion of the daughter’s joints affects the parent joints, which is why slight position changes are observed in the joints q1,3,5,7 for 120[sec]t240[sec] (see ).

Figure 6. Temporal evolution of the error for each DoF of the Kuka robot.

Figure 6. Temporal evolution of the error for each DoF of the Kuka robot.

Finally, during the stage where the rin is loaded by the manipulator (third stage), it is possible to observe a more noticeable change in the proposed control law (ADRC) performance. The graphs corresponding to the error for 240[sec]t360[sec] reflect a lower error when the system has the ADRC control (magenta line). Specifically, the rin-attracting joint (q 7) exhibits lower error at the beginning and during that stage.

As part of the controllers’ performance demonstration, both the Mean Square Error (MSE) and the Error Integral (EI) per stage, controller, and joint were calculated and are presented in the and , respectively.

Table 2. Comparison of the mean square error values for the different control laws at each stage for the Kuka robot.

Table 3. Comparison of the error integral values for the different control laws at each stage for the Kuka robot.

For the first stage, the ADRC controller has an increase close to 0.5% in both EI and MSE compared to the PID type controller; this performance is consistent with the system dynamics not being modified, and the controller gains are adequate.

On the other hand, the second stage shows a 2% decrease in EI and MSE when the ADRC controller is used compared to the PID-type controller. Finally, during the third stage, it is possible to observe a better compensation of the manipulator dynamics modification, which is reflected in a decrease of almost 3% in the EI and MSE when using the ADRC controller compared to the PID controller.

As with the Kuka robot part (), an analysis was performed when the robotic system was moved from its ‘home’ state to a point where the rin to be picked up is achievable. From this stage, the error plot (see ) and the control plot (see ) were obtained. In this case, it is possible to observe how the error converges to zero at a time close to 55 [sec] when the ADRC control law is applied, while when the traditional PID control law is applied, it does not converge to zero until 60 [sec].

Figure 7. Temporal evolution of the error for the PLMR.

Figure 7. Temporal evolution of the error for the PLMR.

An analysis of the control applied for each DoF of the manipulator was also performed. The corresponding graphs are presented in . With them, it is possible to notice that both for the joints that have a change of position in some stage and for those that do not, a lower energy expenditure is required when the ADRC controller is used than when the classical control law is used. This is accentuated in the first 20 s of each stage.

Figure 9. Temporal evolution of the control for each DoF of the Kuka robot.

Figure 9. Temporal evolution of the control for each DoF of the Kuka robot.

Figure 8. Temporal evolution of the control for the PLMR.

Figure 8. Temporal evolution of the control for the PLMR.

The Mean Square Control (MSC) and Control Integral (CI) per stage, controller, and joint were calculated to confirm the decrease in energy requirement. They are presented in and , respectively.

Table 4. Comparison of the mean square control values for the different control laws at each stage for the Kuka robot.

Table 5. Comparison of the control integral values for the different control laws at each stage for the Kuka robot.

For the first stage, even though there is an increase of about 0.05% in the error when the system acts under the ADRC law, this law presents a decrease in both the MSC and the CI of about 0.05% compared to the system when it has the PID control law. During the second stage, it can be seen a 4.1% of decrease in CI and MSC when the ADRC controller is used compared to the PID-type controller. Finally, during the third stage, it is possible to observe a better compensation of the manipulator dynamics modification, which is reflected in a decrease of almost 5% in the CI and MSC when using the ADRC controller compared to the PID controller.

The numerical simulations show that ADRC and PID algorithms may exhibit similar error tracking for each DoF. Notice that the comparison of the transient evolution demonstrates the benefit of the ADRC controller showing the tracking error’s convergence, mitigating the change in the manipulator dynamics produced by the task of moving the wheel.

In addition, a significant reduction of the control signal and the tracking error due to ADRC law highlights the improvement enforced with the application of the adaptive form, despite the change in the robot’s dynamics due to the proposed task.

8. Conclusions

This paper introduces a set of successful controllers for robotic manipulators with joint restrictions, employing a novel adaptive gain strategy to address the tracking trajectory problem. The controllers encompass state feedback, output feedback, and a robust version of state feedback that accounts for limited knowledge of manipulator dynamics that can be used in recycling facilities working purely autonomously. All of them utilise state-dependent gains to enforce a better tracking of the reference trajectories. Applying a controlled Lyapunov function establishes the zero trajectory of the tracking error space as asymptotically stable in the Lyapunov sense. Several crucial factors must be considered for the potential implementation of the proposed controller, taking into account processing for the control implementation. Processing speed is a critical consideration for most controllers, but in the case of the proposed controller, higher processing speeds enhance its performance. Therefore, it is imperative to implement this class of controllers on high-speed processors.

Simulated results, incorporating physically inspired joint motion limitations, confirm the satisfaction of the proposed predefined joint motion. Additionally, the efficiency of the proposed adaptive controller enables effective management of external perturbations, as evidenced in the simulated results.

Notably, the presented results affirm the advantages of the adaptive controller compared to similar designs, such as the traditional proportional-integral-derivative control form. Moreover, meeting limitations with an arbitrary structure opens up possibilities for deriving novel alternatives to control complex robotic devices in diverse recycling industries where autonomous manipulators could solve the problem by implementing a hybrid system formulation. The suggested controller can be applied to accomplish more intricate tasks where the robotic device may have complex motion restrictions and velocities.

Acknowledgments

The authors would like to thank the Tecnologico de Monterrey Challenge-Based Research Program project ID IJXT070-22TE60001 and Programa para la Vinculación de Empresas con Instituciones de Educación Superior y Centros de Investigación Comecyt-EdoMex número de proyecto Vinculacion/2023/009.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The data that support the findings of this study are available from the corresponding author, [author initials], upon reasonable request.

Additional information

Funding

The work was supported by the Instituto Tecnologico de Estudios Superiores de Monterrey, Challenge-Based Research Program [ID IKXT025-22UI60001].

Notes on contributors

Karen Mendoza-Bautista

Karen Mendoza-Bautista received a B.S. degree in biomedical engineering from the National Polytechnic Institute (IPN), Mexico City, Mexico, in 2019 and a master’s degree in Robotics and Advanced Manufacturing, Center of Investigation and Advanced Researching (CINVESTAV) campus Saltillo, IPN, in 2022. Actually, she is studying for a PhD at the Technological Institute of Monterrey (ITESM) in Engineering Sciences. She is also working as a part-time professor on the ITESM campus in Guadalajara. Her current research interests include robot control theory, surgical robots, adaptive control, and vision systems applied to robots. She has published about 3 papers in recognized technical journals.

Mariel Alfaro-Ponce

Mariel Alfaro-Ponce is an assistant professor in the Biomedical Engineering Program at Tecnológico de Monterrey Ciudad de Mexico; she received a bachelor’s degree in Biomedical Engineering, a Master of Science in Microelectronic Engineering, and a Ph.D. in Computer Science from the Instituto Politecnico Nacional, Mexico. Her research interests include artificial intelligence, rehabilitation devices, and intelligent bioinstrumentation. She has been a member of the National System of Researchers of Mexico (SNI-Level I). From 2022 until now, is the head of the Manufacturing Processes for Advanced Materials CDMX research unit.

Isaac Chairez

Isaac Chairez earned the B.S. degree in biomedical engineering from the National Polytechnic Institute (IPN), Mexico City, Mexico, in 2002, and the master’s and Ph.D. degrees from the Department of Automatic Control, Center of Investigation and Advanced Researching (CINVESTAV), IPN, in 2004 and 2007, respectively. He is currently with the National School of Sciences and Engineering of the Tecnológico de Monterrey and the Professional Interdisciplinary Unit of Biotechnology, IPN. He has published over 230 contributions in indexed journals and 300 in international conferences. He has published two books on the applications of neural networks on diverse disciplines. His current research interests include neural networks, fuzzy control theory, nonlinear control, adaptive control, and game theory.

References

  • Ames, A. D., E. A. Cousineau, and M. J. Powell 2012. “Dynamically Stable Bipedal Robotic Walking with Nao via Human-Inspired Hybrid Zero Dynamics”. Proceedings of the 15th acm international conference on hybrid systems: Computation and control, Beijin, China, April 17–19, 135–144.
  • Branicky, M. S., V. S. Borkar, and S. K. Mitter 1994. “A Unified Framework for Hybrid Control”. Proceedings of 1994 33rd IEEE Conference on Decision and Control, Lake Buena Vista, FL, USA, December 14–16.
  • Capuzzi, S., and G. Timelli. 2018. “Preparation and Melting of Scrap in Aluminum Recycling: A Review.” Metals 8 (4): 249. https://doi.org/10.3390/met8040249.
  • Crampin, M. 1981. “On the Differential Geometry of the Euler-Lagrange Equations, and the Inverse Problem of Lagrangian Dynamics.” Journal of Physics A: Mathematical and General 14 (10): 2567. https://doi.org/10.1088/0305-4470/14/10/012.
  • Cruz, D., A. Luviano-Juárez, and I. Chairez. 2014. Output sliding mode controller to regulate the gait of Gecko-inspired robot. Memorias del XVI Congreso Latinoamericano de Control Automático, CLCA, Cancún, Quintana Roo, México, Ocotober 14–17. Mexico City.
  • Ezeah, C., J. A. Fazakerley, and C. L. Roberts. 2013. “Emerging Trends in Informal Sector Recycling in Developing and Transition Countries.” Waste Management 33 (11): 2509–2519. https://doi.org/10.1016/j.wasman.2013.06.020.
  • Grimaud, G., N. Perry, and B. Laratte. 2016. “Life Cycle Assessment of Aluminium Recycling Process: Case of Shredder Cables.” Procedia Cirp 48:212–218. https://doi.org/10.1016/j.procir.2016.03.097.
  • Koutsoukos, X. D., P. J. Antsaklis, J. A. Stiver, and M. D. Lemmon. 2000. “Supervisory Control of Hybrid Systems.” Proceedings of the IEEE 88 (7): 1026–1049. https://doi.org/10.1109/5.871307.
  • Lennartson, B., M. Tittus, B. Egardt, and S. Pettersson. 1996. “Hybrid Systems in Process Control.” IEEE Control Systems Magazine 16 (5): 45–56.
  • Levant, A. 1998. “Robust exact differentiation via sliding mode technique.” Automatica 34 (3): 379–384. https://doi.org/10.1016/S0005-1098(97)00209-4.
  • Lu, Y., B. Yang, Y. Gao, and Z. Xu. 2022. “An Automatic Sorting System for Electronic Components Detached from Waste Printed Circuit Boards.” Waste Management 137:1–8. https://doi.org/10.1016/j.wasman.2021.10.016.
  • Ordaz, P., and A. Poznyak. 2015a. “‘Kl’-Gain Adaptation for Attractive Ellipsoid Method.” IMA Journal of Mathematical Control and Information 32 (3): 447–469. https://doi.org/10.1093/imamci/dnt046.
  • Ordaz, P., and A. Poznyak. 2015b. “‘Kl’-Gain Adaptation for Attractive Ellipsoid Method.” IMA Journal of Mathematical Control and Information 32 (3): 447–469. https://doi.org/10.1093/imamci/dnt046.
  • Ortega, R., A. Loria, R. Kelly, and L. Praly 1994. “On Passivity-Based Output Feedback Global Stabilization of Euler-Lagrange Systems”. Proceedings of 1994 33rd IEEE Conference on Decision and Control, Lake Buena Vista, FL, USA, December 14–16. https://doi.org/10.1109/CDC.1994.410898.
  • Polyakov, A. 2010. “Invariant Ellipsoid Method for Time-Delayed Predictor-Based Sliding Mode Control System”. 2010 11th International Workshop on Variable Structure Systems (VSS), Mexico City, Mexico, June 26–28. https://doi.org/10.1109/VSS.2010.5544685.
  • Polyakov, A., and A. Poznyak 2009. “Minimization of the Unmatched Disturbances in the Sliding Mode Control Systems via Invariant Ellipsoid Method”. 2009 ieee control applications,(cca) & intelligent control,(isic), St. Petersburg, Russia, July 08–10. https://doi.org/10.1109/CCA.2009.5280842.
  • Poznyak, A. 2008. Advanced Mathematical Tools for Automatic Control Engineers: Volume 1: Deterministic Systems, 775. Amsterdam: Elsevier Science.
  • Schlesinger, M. E. 2006. Aluminum recycling, 248. Boca Raton, Fl, USA: CRC press.
  • Schmitz, C. 2006. Handbook of Aluminium Recycling, 454. Essen, Germany: Vulkan-Verlag GmbH.
  • Shinzato, M. C., and R. Hypolito. 2005. “Solid Waste from Aluminum Recycling Process: Characterization and Reuse of Its Economically Valuable Constituents.” Waste Management 25 (1): 37–46. https://doi.org/10.1016/j.wasman.2004.08.005.
  • Spong, M. W., and M. Vidyasagar. 1989. Robot Dynamics and Control, wa343. New York, USA: John Wiley & Sons.
  • Yoshikawa, T., and A. Sudou. 1993. “Dynamic Hybrid Position/Force Control of Robot Manipulators-On-Line Estimation of Unknown Constraint.” IEEE Transactions on Robotics and Automation 9 (2): 220–226. https://doi.org/10.1109/70.238286.

Appendix: Proof of the main theorem

Proof. The application of the proposed controller in (40) over the tracking error dynamics presented in (38) leads to the following equivalent differential equation:

(52) ddtΔa(t)=Δb(t),ddtΔb(t)=WσΔ(t|k)ηςa(t),ςb(t),t,(52)

where WσΔ(t|k) could take the values of W1Δσ(t)=1(t|k) or W2Δσ(t)=2(t|k) depending on the active scenario.

The substitution of the selected forms for W1 and W2 in (41) implies the following equivalent dynamics:

(53) ddtΔαt=ΔβtddtΔβt=ΔatddtΔat=ΔbtddtΔbt=KP,σ(t|k)Δat+KD,σ(t|k)Δbt+crKI,σ(t|k)Δβt+KII,σ(t|k)Δαtηςat,ςbt,t.(53)

This differential equation is valid with KII,1(t|k)0,∀t0.

The expression presented in (53) is equivalent to (using the expressions of K˜P,σ(t|k), K˜D,σ(t|k), K˜I,σ(t|k), and K˜II,σ(t|k)) to the following differential form with ΔT=ΔαT,ΔβT,ΔaT,ΔbT:

(54) ddtΔt=Ak+BKσkΔtBηςat,ςbt,t+BK˜σ(t|k)Δt,(54)

here K˜II,σ(t|k)=KII,σ(t|k)KII(k), K˜I,σ(t|k)=KI,σ(t|k)KI(k), K˜D,σ(t|k)=KD,σ(t|k)KD(k), and K˜P,σ(t|k)=KP,σ(t|k)KP(k).

Consider the mixed Lyapunov candidate function (valid for each domain for continuous time and that we can analyse after each scenario change) given by:

(55) V1Δ(t),K˜σ(t|k)=ΔT(t)P(k)Δ(t)+trK˜σT(t|k)ΛC1K˜σ(t|k),V2Δ(tk),K˜σ(tk|k)=ΔT(tk)P(k)Δ(tk)+trK˜σT(tk|k)ΛD1K˜σ(tk|k),(55)

here we keep the notation Δ(t) in the definition of the function V1 to emphasise the continuous-time analysis. In contrast, in the case of V2, we considered the values of t=tk corresponding to the moments corresponding to the change of scenario.

The time derivative of the first function in (55), V1Δ(t),K˜σ(t|k), that is

(56) ddtV1Δt,K˜σ(t|k)=2ΔTtPkddtΔt+2trK˜σT(t|k)ΛC1ddtK˜σ(t|k),(56)

The substitution of the dynamics corresponding to Δ(t) in the derivative of V1Δ(t),K˜σ(t|k) leads to:

(57) ddtV1Δt,K˜σ(t|k)=2ΔTtPkAk+BKσkΔtBηςat,ςbt,t+BK˜σ(t|k)Δt+2trK˜σT(t|k)ΛC1ddtK˜σ(t|k)(57)

The application of a symmetrisation operator and the relationship between the inner product and the trace operator has the following upper estimation for the time derivative of V1Δ(t),K˜σ(t|k)

(58) ddtV1Δt,K˜σ(t|k)=2ΔTtPkAk+BKσkΔtBηςat,ςbt,t+BK˜σ(t|k)Δt+2trK˜σT(t|k)ΛC1ddtK˜σ(t|k)(58)

The previous differential inclusion was obtained after applying the so-called lambda inequality XTY+YTXXTΣ1X+YTΣY, XYRρ×l with 0<Λ=ΛTRρ×ρ (Poznyak Citation2008).

Taking the assumption on the existence of positive definite solutions for the Riccati equation depending on P1 and P2 presented in the Theorem description and considering the class of adjustment methods for the gains KII,σ(t|k), KI,σ(t|k), KD,σ(t|k), and KP,σ(t|k), (all of them included in Kσ(t|k)) presented in (42), one gets that last inclusion can be transformed to: ddtV1Δt,K˜σ(t|k)αQCddtV1Δt,K˜σ(t|k)+λmaxΣ1η0.

The estimation of the integral operator applied to the previous inclusion, and by the comparison Lemma, it can be proven that V1Δt,K˜σ(t|k)V1Δtk1,K˜σ(k1|k)eαQCttk1+λmaxΣ1η0αQC1eαtTi+Te, which is valid ∀t[tk1,tk). Notice that this analysis is valid within each scenario where the continuous dynamics is valid.

Considering the structure of the proposed continuous form of the Lyapunov-like equation, one may confirm that ∀t[tk1,tk) the following inequality is valid

(59) ||Δt||2V1Δtk1,K˜σ(k1|k)λminPkeαQCttk1+λmaxΣ1η0λminPkαQC1eαtTi+Te(59)

Now, taking into consideration the evolution of matrix P(k) for the kth object to be handled, consider the study of the candidate Lyapunov function V2Δ(tk),K˜σ(tk|k) in discrete sense.

The second part of the proof will take advantage of the explicit discretisation applied to the dynamics of the tracking error Δ, implying that (54) can be presented as follows:

(60) Δ(tk)Δ(tk1)=Πk1A(k)+BKσ(k)Δ(tk1)Πk1Bηςa(t),ςb(t),t+Πk1BK˜σ(tk1|k)Δ(tk1)(60)

From the expression (60), we may estimate that

(61) Δ(tk)=Υ(tk1|k)Δ(tk1)Πk1Bηςa(t),ςb(t),t(61)

Now, using these discrete dynamics on the candidate Lyapunov function V2Δ(tk),K˜σ(tk|k), one has to estimate the variation of such function at two consecutive times when the scenario changes ΔV2Δ(tk),K˜σ(tk|k)=V2Δ(tk),K˜σ(tk|k)V2Δ(tk1),K˜σ(tk1|k)

(62) ΔV2Δ(tk),K˜σ(tk|k)=ΔT(tk)P(k)Δ(tk)ΔT(tk1)P(k)Δ(tk1).(62)

This function is proposed considering that there are no jumps on the gains on the boundary between scenarios. Then, based on the proposed Lyapunov candidate function, one has the following estimate considering the discrete dynamics (61):

(63) ΔV2Δ(tk),K˜σ(tk|k)ΔT(tk1)ΥT(tk1|k)P(k)+ΩΥ(tk1|k)P(k)Δ(tk1)+Πk12ηTςa(t),ςb(t),tBTI+Ω1Bηςa(t),ςb(t),t(63)

Using the Rayleigh inequality and the upper bound for the admissible class of external uncertainties and modelling imprecision, the following upper bound is valid

(64) ΔV2Δ(tk),K˜σ(tk|k)Πk12λmaxBTI+Ω1Bη0+ΔT(tk1)ΥT(tk1|k)P(k)+ΩΥ(tk1|k)P(k)Δ(tk1)(64)

Taking the assumption on the existence of positive definite solutions for the discrete Riccati equation establishing the evolution of matrix P(k) at each change of scenario leads to

(65) ΔV2Δ(tk),K˜σ(tk|k)Πk12λmaxBTI+Ω1Bη0ΔT(tk1)QD(k)Δ(tk1)(65)

Hence, one may notice that ΔV2Δ(tk),K˜σ(tk|k)<0 if Πk12λmaxBTI+Ω1Bη0ΔT(tk1)QD(k)Δ(tk1)

This last result completes the proof.