ORIGINAL ARTICLE
24(3); 215-220
doi: 10.1016/j.jksus.2011.02.001

Adaptive control of inventory systems with unknown deterioration rate

Department of Statistics and Operations Research, College of Science, King Saud University, P.O. Box 2455, Riyadh 11451, Saudi Arabia
Department of Computer Engineering, College of Computer and Information Sciences, King Saud University, P.O. Box 51178, Riyadh 11543, Saudi Arabia

*Corresponding author: abdefoul@ksu.edu.sa (A. Foul)

Disclaimer:
This article was originally published by Elsevier and was migrated to Scientific Scholar after the change of Publisher.

Peer review under responsibility of King Saud University.

Available online 15 February 2011

Abstract

In this paper, a continuous-time model of a production inventory system in which a manufacturing firm produces a single product selling some and stocking the remaining is considered. Model reference adaptive control with feedback is applied to track the output of the system (the inventory level) toward the inventory goal level. The theory is illustrated by the presentation of the results of computer simulation studies of this particular system.

Keywords

Production planning
Inventory
Deterioration
Optimal control
Adaptive control

1 Introduction

Applications of optimal control theory to management science in general, and to production planning in particular, are proving to be quite fruitful; see Sethi and Thompson (2000). Several control techniques have accordingly been applied to production planning problems. For example,

  • Self-tuning control: Hedjar et al. (2007) study how to use control and identification methods for controlling the system when its parameters are not all a priori known to the designer.

  • Receding-horizon control: Hedjar et al. (2005) applied a discrete-time technique in which the control action is obtained by repeatedly solving on-line open-loop optimization problems at each time step.

  • Predictive control: Hedjar et al. (2004) used a j-step ahead predictor to predict the tracking error. An identification algorithm is incorporated to estimate the model parameters in the case where they are unknown.

In this paper we apply yet another optimal control technique, called model reference adaptive control (MRAC), in which the performance specifications are given in terms of a model (or targets, or goals); see, for example, Sastry and Bodson (1989). The goals represent the ideal state of the process. Adaptive control is therefore similar to self-tuning control in that it also attempts to overcome unknown or varying system dynamics while achieving adequate tracking performance.

In order to apply MRAC to the problem of production planning, we will use another concept from optimal control theory, that of feedback control. This concept was first applied by James Watt to control the speed of his steam engine about 200 years ago. Since then, the number of industrial applications has increased to the point that most automatic control systems today include feedback control. As we progress, we will occasionally make use of notions from the theory of optimal control that may not be familiar to management scientists. Each time this proves necessary, we will define such notions for the novice.

In the next section we introduce the notation and formally describe the system. We assume that the firm of interest adopts a continuous-review (instead of a periodic-review) policy. In Section 3 we derive the optimal control variable when all system parameters are known, while in Section 4 we derive the optimal control variable when not all system parameters are known. Illustrative examples are provided in both Sections 3 and 4.

2 Model assumptions and notation

We consider the classical production planning problem in which a manufacturing firm produces a single item, selling some units and stocking the remaining ones. We assume that the firm has set an inventory goal level and a production (or control) goal rate. We also assume that a proportion of the units in stock deteriorate at a constant rate, which may be known or unknown. Item deterioration is of great importance in inventory theory; see Goyal and Giri (2001). In this paper, we apply an optimal control technique, called model reference adaptive control (MRAC), to the aforementioned production planning problem, where the firm adopts a continuous-review policy, that is, the inventory level is monitored continuously. To state the model we use the following notation:

  • I(t): inventory level at time t,

  • P(t): production rate at time t (P(t) ⩾ 0),

  • D(t): demand rate at time t,

  • θ: constant deterioration rate (θ > 0),

  • Id(t): inventory goal level at time t,

  • Pd(t): production goal rate at time t (Pd(t) ⩾ 0).

The interpretation of the goal rates is as follows:

  • The inventory goal level Id(t) is a safety stock that the firm wants to keep on hand at time t.

  • The production goal rate Pd(t) is the most efficient rate desired by the firm at time t.

Given a reference model in terms of goals Id for the inventory level I (state variable) and Pd for the production rate P, the objective of the control problem is to choose an appropriate production rate P such that all the functions involved are bounded and I tracks Id.

Since demand occurs at rate D, production occurs at the controllable rate P, and deterioration occurs at a constant rate θ, it follows that the inventory level I evolves at each instant of time t according to the state equation

$$\dot{I} = -\theta I + P - D. \tag{2.1}$$

Letting $u = P - D$ (which will be considered in the sequel as the control variable), the state equation (2.1) becomes

$$\dot{I} = -\theta I + u. \tag{2.2}$$

The pair $(I_d, u_d)$ satisfies the differential equation

$$\dot{I}_d = -a_d I_d + b_d u_d, \tag{2.3}$$

where the parameters $a_d$ and $b_d$ are selected by the firm. In what follows, we give some explanation of how $a_d$ and $b_d$ are chosen.
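To make the state equation concrete, the following is a minimal sketch that integrates (2.1) with a forward Euler scheme. The function name `simulate_inventory`, the step size, and the constant rates used in the usage lines are assumptions chosen for illustration; they do not come from the paper.

```python
def simulate_inventory(theta, production, demand, i0, t_end, dt=0.01):
    """Integrate dI/dt = -theta*I + P(t) - D(t) with forward Euler."""
    steps = int(round(t_end / dt))
    inventory = i0
    for n in range(steps):
        t = n * dt
        u = production(t) - demand(t)          # control variable u = P - D, as in (2.2)
        inventory += dt * (-theta * inventory + u)
    return inventory

# Hypothetical constant rates: P = 6, D = 5, theta = 0.1.
# With constant u = 1 the level should settle near u / theta = 10.
final_level = simulate_inventory(0.1, lambda t: 6.0, lambda t: 5.0, i0=0.0, t_end=200.0)
print(round(final_level, 3))
```

With constant rates the inventory settles where deterioration balances net inflow, which is why the deterioration rate matters for choosing the control.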

In the analysis of process dynamics, the process variables and controls are functions of the time t.

Taking the Laplace transform of both sides of (2.3), we have

$$I_d(s) = \frac{b_d}{s + a_d}\, u_d(s) \tag{2.4}$$

for any bounded piecewise continuous goal $u_d$. $I_d$ and $u_d$ are measured at each time $t$.
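The step from (2.3) to (2.4) can be spelled out as follows. The derivation assumes a zero initial condition, $I_d(0) = 0$, which is the usual convention when defining a transfer function but is not stated explicitly in the text.

```latex
\begin{aligned}
\mathcal{L}\{\dot{I}_d\}(s) &= s I_d(s) - I_d(0) = s I_d(s), \\
s I_d(s) &= -a_d I_d(s) + b_d u_d(s), \\
(s + a_d)\, I_d(s) &= b_d u_d(s)
\quad\Longrightarrow\quad
I_d(s) = \frac{b_d}{s + a_d}\, u_d(s).
\end{aligned}
```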

The parameter ad, which is chosen to be positive (ad > 0) in order to ensure the stability of the reference model, represents the rate of convergence of the desired inventory level.

Since the final value of the desired inventory level is given by $\lim_{t \to \infty} I_d(t) = \frac{b_d}{a_d} \lim_{t \to \infty} u_d(t)$, or $I_d(\infty) = \frac{b_d}{a_d} u_d(\infty)$, one can choose the parameter $b_d$ to tune the gain $\frac{b_d}{a_d}$ between the input and output of the reference model. The parameter $b_d$ is also chosen to be positive ($b_d > 0$) in order to preserve the sign between the input and the output of the reference model.
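As a quick numerical illustration of this final-value property, the sketch below evaluates the closed-form response of the reference model to a constant goal control. The function name `reference_response` and the parameter values are illustrative assumptions.

```python
import math

def reference_response(a_d, b_d, u_d, t, i_d0=0.0):
    """Closed-form solution of dI_d/dt = -a_d*I_d + b_d*u_d for a constant u_d."""
    gain = b_d / a_d                           # steady-state gain of the reference model
    return gain * u_d + (i_d0 - gain * u_d) * math.exp(-a_d * t)

a_d, b_d, u_d = 0.1, 1.0, 1.0                  # illustrative values
for t in (0.0, 10.0, 50.0, 100.0):
    print(t, round(reference_response(a_d, b_d, u_d, t), 4))
```

The printed values approach the gain times the goal control, and the rate at which they do so is governed by the choice of a_d.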

Further, the form of Eq. (2.4) allows us to break the transform of the output variable (namely the desired inventory level Id) into the product of two terms: the fraction, known as the transfer function, and the transform of the input variable (namely the desired production rate ud). The transfer function and its parameters characterize the process and determine how the output variable responds to the input variable.

As we mentioned in Section 1, our aim in this paper is to illustrate an application of the MRAC technique by applying it to a management science problem, namely the production planning problem. In Hedjar et al. (2004, 2005, 2007), a reference model (Id, ud) is given and the objective is to obtain the state and control (I, u) that minimize a performance index defined as the sum of the penalty costs incurred when (I, u) deviate from their respective goals (Id, ud). In the MRAC technique, no objective function is designed. The reference model (Id, ud) is still given and the goal is to determine the control u so that the state variable I tracks its goal Id.

The next section treats the case where the deterioration rate θ is known. Then, in the following section we will be dealing with the case when θ is unknown.

3 Model reference control

For I to track Id for any goal control ud, the control variable u should be chosen so that the transfer function from the input ud to the output I is equal to that of the reference model. As mentioned in Section 1, we propose the following feedback control

$$u = -kI + l u_d, \tag{3.1}$$

where the feedback gain $k$ and the feedforward gain $l$ need to be calculated. Substituting (3.1) into (2.2), we get

$$\dot{I} = -(\theta + k) I + l u_d, \tag{3.2}$$

so that the transfer function from the input $u_d$ to the output $I$ is given by

$$\frac{I(s)}{u_d(s)} = \frac{l}{s + \theta + k}. \tag{3.3}$$

From (2.4), the transfer function of the model is given by

$$\frac{I_d(s)}{u_d(s)} = \frac{b_d}{s + a_d}. \tag{3.4}$$

These transfer functions are equal provided we choose

$$l = b_d \quad \text{and} \quad k = a_d - \theta. \tag{3.5}$$

Therefore, it suffices to choose

$$u = (\theta - a_d) I + b_d u_d, \tag{3.6}$$

and hence the production rate is $P = D + (\theta - a_d) I + b_d u_d$. The control variable (3.6) guarantees that the transfer function $\frac{I(s)}{u_d(s)}$ is equal to that of the reference model. Such transfer function matching guarantees that $I(t) = I_d(t)$ for all $t \ge 0$ when $I(0) = I_d(0)$, or that $|I(t) - I_d(t)| \to 0$ exponentially fast when $I(0) \ne I_d(0)$, for any bounded goal control $u_d$.
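The exponential convergence claimed above can be checked directly in the time domain. The short derivation below uses only (2.2), (2.3) and (3.6); the symbol $e$ for the tracking error $I - I_d$ is introduced here for convenience.

```latex
\begin{aligned}
\dot{I} &= -\theta I + u = -\theta I + (\theta - a_d) I + b_d u_d = -a_d I + b_d u_d, \\
\dot{e} &= \dot{I} - \dot{I}_d = \left(-a_d I + b_d u_d\right) - \left(-a_d I_d + b_d u_d\right) = -a_d e, \\
e(t) &= e(0)\, e^{-a_d t}.
\end{aligned}
```

Hence $e(t) \equiv 0$ when $I(0) = I_d(0)$, and otherwise the error decays at the rate $a_d$ chosen by the firm, independently of $\theta$ and of the demand $D$.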

Simulation example: Let us assume, for example, that items in stock deteriorate at a rate of $\theta = 0.01$, so that

$$\frac{I(s)}{u(s)} = \frac{1}{s + \theta} = \frac{1}{s + 0.01}. \tag{3.7}$$

Also, let $a_d = 0.1$ and $b_d = 1$, so that

$$\frac{I_d(s)}{u_d(s)} = \frac{b_d}{s + a_d} = \frac{1}{s + 0.1}.$$

According to Eq. (3.5), the feedforward gain is 1 and the feedback gain is $k = 0.09$. The optimal control is $u = -0.09 I + u_d$, and the production rate is $P = D - 0.09 I + u_d$. Note that, from the reference model transfer function, we have $\lim_{t \to \infty} I_d = \frac{b_d}{a_d} u_d = 10 u_d$. Now if we want, for instance, the inventory level $I$ to tend to, say, 10, we just need to choose $u_d = 1$. As an illustration, let us assume a seasonal demand of rate $D(t) = 5 + 5 \sin(\pi t / 64)$. A simulation was conducted for 100 units of time and the results are depicted in Fig. 1. The inventory level, which tracks the inventory goal level perfectly, is shown on the left, while the production and demand rates are shown on the right.
Figure 1: Results of model reference control simulation.
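A simulation of this kind can be sketched in a few lines. The parameter values and demand profile are those of the example; the forward Euler scheme, the step size, the zero initial levels, and the function names are assumptions made for this sketch and are not taken from the paper.

```python
import math

THETA, A_D, B_D, U_D = 0.01, 0.1, 1.0, 1.0     # values from the example above

def demand(t):
    """Seasonal demand rate D(t) = 5 + 5 sin(pi t / 64)."""
    return 5.0 + 5.0 * math.sin(math.pi * t / 64.0)

def simulate(i0=0.0, i_d0=0.0, t_end=100.0, dt=0.01):
    """Forward Euler integration of the plant and the reference model."""
    inv, goal = i0, i_d0
    history = []
    for n in range(int(round(t_end / dt))):
        t = n * dt
        u = (THETA - A_D) * inv + B_D * U_D    # control law (3.6)
        production = demand(t) + u             # P = D + u
        history.append((t, inv, goal, production))
        inv += dt * (-THETA * inv + u)         # plant (2.2)
        goal += dt * (-A_D * goal + B_D * U_D) # reference model (2.3)
    return history, inv, goal

history, final_inv, final_goal = simulate()
print(round(final_inv, 3), round(final_goal, 3))
```

Starting the inventory away from the goal, for example `simulate(i0=5.0)`, shows the gap closing at the rate set by the reference model rather than by the deterioration rate.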

4 Model reference adaptive control

A situation virtually always met in practice occurs when the deterioration rate is unknown. When the parameter θ is unknown, the control (3.1) cannot be implemented. Therefore, instead of (3.1), we propose the control

$$u = -k(t) I + l(t) u_d, \tag{4.1}$$

where $k(t)$ and $l(t)$ are the estimates, at time $t$, of the feedback gain $k$ and the feedforward gain $l$ of (3.1), respectively, and search for an adaptive law to generate $k(t)$ and $l(t)$ online. Therefore, we can view the problem as an online identification problem of the unknown constants $k$ and $l$. We start with the state equation (2.2), which we express in terms of $k$ and $l$ by adding and subtracting the desired input terms $-kI + l u_d$ to obtain

$$\dot{I} = -\theta I + u = -\theta I + u \pm (-kI + l u_d) = -(\theta + k) I + l u_d + kI - l u_d + u.$$

From (3.5), one gets

$$\dot{I} = -a_d I + b_d u_d + (kI - l u_d + u),$$

i.e., using the Laplace transform,

$$I(s) = \frac{b_d}{s + a_d}\, u_d(s) + \frac{1}{s + a_d} \left[ k I(s) - l u_d(s) + u(s) \right]. \tag{4.2}$$

Because $I_d(s) = \frac{b_d}{s + a_d} u_d(s)$ is known and bounded, we express (4.2) in terms of the tracking error defined as $e(s) \triangleq I(s) - I_d(s)$, i.e.,

$$e(s) = \frac{1}{s + a_d} \left[ k I(s) - l u_d(s) + u(s) \right]. \tag{4.3}$$

Substituting $u = -k(t) I + l(t) u_d$ in (4.3) and defining the parameter errors $\Delta k(t) \triangleq k(t) - k$ and $\Delta l(t) \triangleq l(t) - l$, we have

$$e(s) = \frac{1}{s + a_d} \left[ -\Delta k\, I(s) + \Delta l\, u_d(s) \right],$$

or, in the time domain,

$$\dot{e} = -a_d e + (-\Delta k\, I + \Delta l\, u_d). \tag{4.4}$$

The differential equation (4.4), which relates the tracking error to the parameter errors, is a significant step in deriving the adaptive laws for updating $k(t)$ and $l(t)$. We assume that the structure of the adaptive law is given by

$$\dot{k} = f_1(e, I, u_d, u) \quad \text{and} \quad \dot{l} = f_2(e, I, u_d, u), \tag{4.5}$$

where the functions $f_1$ and $f_2$ are to be designed.

4.1 Lyapunov approach

Late in the nineteenth century, Lyapunov (Khalil, 2002; Sastry and Bodson, 1989) developed an approach to stability analysis, known as the direct method, that is widely used at the present time. In a nutshell, this method consists of exhibiting a positive scalar function $V$ such that $\dot{V} \le 0$. Such a function is called a Lyapunov function, and its existence implies that the origin is a stable equilibrium point. The Lyapunov function is not unique; rather, many different Lyapunov functions may be found for a given system. Likewise, the inability to find a satisfactory Lyapunov function does not mean that the system is unstable. In the context of adaptive control, the use of the Lyapunov approach allows us not only to analyze the stability properties of the system but also to design an adaptive law for the two control gains. First, we recall below the Lyapunov theorem, which allows us to differentiate between the stability and the asymptotic stability of a system.

Theorem (Khalil, 2002)

Let $x = 0$ be an equilibrium point for a general nonlinear system modeled by $\dot{x} = f(x, t)$, where $t \in \mathbb{R}_+$ and $x \in \mathbb{R}^n$.

Let $D \subset \mathbb{R}^n$ be a domain containing the equilibrium point $x = 0$, and let $V : D \to \mathbb{R}$ be a continuously differentiable function such that $V(0) = 0$ and $V(x) > 0$ in $D - \{0\}$:

  • If $\dot{V}(x) \le 0$ in $D$, then $x = \underline{0}$ is stable. This means that given $\varepsilon > 0$, $\exists\, r \in (0, \varepsilon]$ such that $B_r = \{ x \in \mathbb{R}^n : \|x\| < r < \varepsilon \} \subset D$.

  • If $\dot{V}(x) < 0$ in $D - \{\underline{0}\}$, then $x = \underline{0}$ is asymptotically stable, i.e., $\forall\, x(0) \in D$, $\|x(0)\| < \delta \Rightarrow x(t) \to 0$ as $t \to \infty$.

Using this theorem, we will show that our closed loop system is stable.

Consider the function

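$$V(e, \Delta k, \Delta l) = \frac{e^2}{2} + \frac{\Delta k^2}{2 \gamma_1} + \frac{\Delta l^2}{2 \gamma_2}, \tag{4.6}$$

where $\gamma_1, \gamma_2 > 0$ and $\Delta l$ denotes the error in the feedforward gain, as a Lyapunov candidate for the system described by (4.4) and (4.5). The time derivative $\dot{V}$ along any trajectory of (4.4) and (4.5) is given by

$$\dot{V} = -a_d e^2 - \Delta k\, e I + \Delta l\, e u_d + \frac{\Delta k}{\gamma_1} f_1 + \frac{\Delta l}{\gamma_2} f_2 = -a_d e^2 + \left( \frac{f_1}{\gamma_1} - e I \right) \Delta k + \left( \frac{f_2}{\gamma_2} + e u_d \right) \Delta l. \tag{4.7}$$

The indefinite terms in (4.7) disappear if we choose $f_1 = \gamma_1 e I$ and $f_2 = -\gamma_2 e u_d$. Therefore, the adaptive law is

$$\dot{k} = \gamma_1 e I \quad \text{and} \quad \dot{l} = -\gamma_2 e u_d, \tag{4.8}$$

which leads to

$$\dot{V} = -a_d e^2 \le 0. \tag{4.9}$$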
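The cancellation of the indefinite terms can be checked numerically. The sketch below evaluates the time derivative of the Lyapunov candidate along (4.4) under the adaptive law (4.8) at random points and compares it with the closed form (4.9). The names `v_dot`, `dk` and `dl` (the errors in the feedback and feedforward gains, respectively) and the sampling ranges are assumptions made for this check.

```python
import random

def v_dot(e, dk, dl, inv, u_d, a_d, g1, g2):
    """Derivative of V = e^2/2 + dk^2/(2*g1) + dl^2/(2*g2) along (4.4) and (4.8)."""
    e_dot = -a_d * e - dk * inv + dl * u_d     # error dynamics (4.4)
    dk_dot = g1 * e * inv                      # adaptive law (4.8); the true gain is constant
    dl_dot = -g2 * e * u_d
    return e * e_dot + dk * dk_dot / g1 + dl * dl_dot / g2

random.seed(0)
max_gap = 0.0
for _ in range(1000):
    e, dk, dl, inv, u_d = (random.uniform(-5, 5) for _ in range(5))
    a_d, g1, g2 = (random.uniform(0.1, 2.0) for _ in range(3))
    gap = abs(v_dot(e, dk, dl, inv, u_d, a_d, g1, g2) + a_d * e * e)
    max_gap = max(max_gap, gap)
print(max_gap)
```

The largest gap is at the level of floating-point rounding, which is consistent with the derivative depending only on the tracking error and not on the parameter errors.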

4.2 Stability analysis

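The inventory system closed by the time-varying feedback (4.1) can be represented by

$$\dot{e} = -a_d e + \Delta l\, u_d - \Delta k\, I, \qquad \Delta \dot{k} = \gamma_1 e I, \qquad \Delta \dot{l} = -\gamma_2 e u_d,$$

where $\Delta k$ and $\Delta l$ denote the errors in the feedback and feedforward gains, respectively. The above closed-loop system can be written in matrix form as $\dot{x} = A(t, I, u_d)\, x(t)$, where $x = [\, e \;\; \Delta k \;\; \Delta l \,]^T$ represents the state of the inventory system in closed loop and

$$A(t, I, u_d) = \begin{bmatrix} -a_d & -I & u_d \\ \gamma_1 I & 0 & 0 \\ -\gamma_2 u_d & 0 & 0 \end{bmatrix}$$

is the state matrix.

The Lyapunov function can be rewritten as

$$V(e, \Delta k, \Delta l) = V(x) = \frac{e^2}{2} + \frac{\Delta k^2}{2 \gamma_1} + \frac{\Delta l^2}{2 \gamma_2} = \frac{1}{2} x^T \Gamma x > 0,$$

since the matrix

$$\Gamma = \begin{bmatrix} 1 & 0 & 0 \\ 0 & \frac{1}{\gamma_1} & 0 \\ 0 & 0 & \frac{1}{\gamma_2} \end{bmatrix}$$

is positive definite; that is, $V(x) > 0$ in $\mathbb{R}^3 - \{\underline{0}\}$, where $\underline{0} = [\, 0 \;\; 0 \;\; 0 \,]^T$ is the equilibrium point, or origin, of the closed-loop system. We have $\dot{V}(x) = -a_d e^2 \le 0$.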

Note that:

  • When $e \ne 0$, $\dot{V} < 0$ and the state $x(t)$ moves toward the equilibrium point for as long as the tracking error is nonzero, until the state reaches the origin, in which case $e = 0$, $\Delta k = 0$ (i.e., $k(t) = k$) and $\Delta l = 0$ (i.e., $l(t) = l$).

  • When $e = 0$ before $x(t)$ reaches the equilibrium point, $\dot{V} = 0$ and, according to the theorem given above, $x(t)$ converges to $x_0 = [\, 0 \;\; \bar{k} \;\; \bar{l} \,]^T$, where $\bar{k}$ and $\bar{l}$ are constants. Thus, we have the tracking error $e = 0$, $I(t) = I_d$, $\Delta k = \bar{k}$ (i.e., $k(t) = k + \bar{k}$) and $\Delta l = \bar{l}$ (i.e., $l(t) = l + \bar{l}$).

This shows that our system is stable, though not asymptotically stable. Although in control theory it is usually preferable to have asymptotic stability ($x(t) \to \underline{0}$) instead of stability, in our case stability ($e \to 0$) is sufficient since the tracking performance is achieved ($I(t) \to I_d$).

Implementation: The MRAC control law (4.1) and (4.8) is easily implemented. The adaptive gains $\gamma_1$ and $\gamma_2$ can be chosen to be any positive real numbers; see Sastry and Bodson (1989). Note, however, that these values affect the transient performance of the closed-loop system: large $\gamma_1$, $\gamma_2$ lead to faster convergence. The initial values of the two gain estimates in (4.1) are chosen to be a priori guesses of the corresponding unknown parameters. A small initial parameter error usually leads to better transient behavior. The reference model and the input $u_d$ are designed so that $I_d$ describes the optimal trajectory to be followed by the state equation.

Simulation example: Consider the data of the example in the previous section, except for the deterioration rate $\theta$, which is now assumed to be unknown. We also take

$$I_d = \begin{cases} 10, & 0 \le t < 100, \\ 5, & 100 \le t \le 200, \end{cases} \qquad u_d = \begin{cases} 1, & 0 \le t < 100, \\ 0.5, & 100 \le t \le 200, \end{cases}$$

and $\gamma_1 = \gamma_2 = 0.2$. A simulation was conducted for 200 units of time. First we show in Fig. 2 the convergence of the feedback gain estimate (left) and of the feedforward gain estimate (right) to constant values, as anticipated by the stability analysis.

Figure 2: Convergence results of model reference adaptive control simulation.

Fig. 3 shows the variations of the inventory level $I(t)$ and the inventory goal level $I_d$ (left) and the variations of the production rate $P(t)$ and demand rate $D(t)$ (right). The figure shows that, after a transient period, the inventory level $I(t)$ tracks the inventory goal level $I_d(t)$ perfectly.

Figure 3: Variation results of model reference adaptive control simulation.
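A simulation along these lines can be sketched as follows. The deterioration rate, reference model parameters, adaptive gains and goal control are those of the example; everything else is an assumption made for this sketch: the fourth-order Runge-Kutta integrator and its step size, the zero initial inventory and goal levels, the initial guesses 0 and 0.5 for the two gain estimates, the names `k_hat` and `l_hat` for those estimates, and generating the goal level from the reference model (2.3) rather than prescribing it as piecewise constant. The true deterioration rate is used only to simulate the plant and to evaluate the Lyapunov function; the controller never reads it.

```python
THETA = 0.01                 # true deterioration rate, unknown to the controller
A_D, B_D = 0.1, 1.0          # reference model parameters
GAMMA1 = GAMMA2 = 0.2        # adaptive gains

def goal_control(t):
    """Piecewise-constant goal control u_d from the example."""
    return 1.0 if t < 100.0 else 0.5

def derivatives(t, state):
    inv, goal, k_hat, l_hat = state
    u_d = goal_control(t)
    e = inv - goal                         # tracking error
    u = -k_hat * inv + l_hat * u_d         # adaptive control (4.1)
    return (
        -THETA * inv + u,                  # plant (2.2)
        -A_D * goal + B_D * u_d,           # reference model (2.3)
        GAMMA1 * e * inv,                  # adaptive law (4.8), feedback gain
        -GAMMA2 * e * u_d,                 # adaptive law (4.8), feedforward gain
    )

def rk4_step(t, state, dt):
    """One classical fourth-order Runge-Kutta step."""
    def shifted(base, slope, h):
        return tuple(b + h * s for b, s in zip(base, slope))
    s1 = derivatives(t, state)
    s2 = derivatives(t + dt / 2, shifted(state, s1, dt / 2))
    s3 = derivatives(t + dt / 2, shifted(state, s2, dt / 2))
    s4 = derivatives(t + dt, shifted(state, s3, dt))
    return tuple(
        x + dt / 6 * (a + 2 * b + 2 * c + d)
        for x, a, b, c, d in zip(state, s1, s2, s3, s4)
    )

def lyapunov(state):
    """Lyapunov function (4.6); uses the true gains from (3.5) for analysis only."""
    inv, goal, k_hat, l_hat = state
    dk = k_hat - (A_D - THETA)
    dl = l_hat - B_D
    return (inv - goal) ** 2 / 2 + dk ** 2 / (2 * GAMMA1) + dl ** 2 / (2 * GAMMA2)

def simulate(state=(0.0, 0.0, 0.0, 0.5), t_end=200.0, dt=0.01):
    trajectory = [(0.0, state)]
    for n in range(int(round(t_end / dt))):
        t = n * dt
        state = rk4_step(t, state, dt)
        trajectory.append((t + dt, state))
    return trajectory

trajectory = simulate()
t_final, (inv, goal, k_hat, l_hat) = trajectory[-1]
print(f"t={t_final:.0f}  I={inv:.3f}  I_d={goal:.3f}  k_hat={k_hat:.3f}  l_hat={l_hat:.3f}")
```

In a run of this sketch the gain estimates need not settle at the true gains; what the stability analysis guarantees is that the Lyapunov function does not increase and that the tracking error dies out, which is what the final printed levels are meant to illustrate.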

5 Conclusion

We have shown in this paper how to use an optimal control technique known as ‘Model reference adaptive control with feedback’ to solve a production-inventory planning problem. Simulations have been conducted to validate the results obtained. As a future research direction, we suggest the use of this technique on other inventory models, particularly those involving more than one differential equation, and on problems from other fields, such as economics, finance, etc.

References

  1. Goyal, S.K., Giri, B.C. Recent trends in modeling of deteriorating inventory. European Journal of Operational Research. 2001;134:1-16.
  2. Hedjar, R., et al. Predictive control of periodic-review production inventory systems with deteriorating items. TOP. 2004;12(1):193-208.
  3. Hedjar, R., et al. Receding horizon control of a hybrid production system with deteriorating items. Nonlinear Analysis: Special Series on Hybrid Systems and Applications. 2005;63:405-422.
  4. Hedjar, R., et al. Self-tuning optimal control of periodic-review production inventory systems with deteriorating items. Advanced Modeling and Optimization. 2007;9(1):91-104.
  5. Khalil, H.K. Nonlinear Systems (third ed.). New Jersey: Prentice-Hall; 2002.
  6. Sastry, S., Bodson, M. Adaptive Control: Stability, Convergence, and Robustness. Englewood: Prentice-Hall; 1989.
  7. Sethi, S.P., Thompson, G.L. Optimal Control Theory: Applications to Management Science and Economics (second ed.). Dordrecht: Kluwer Academic Publishers; 2000.