Actas de congresos
Function Approximation For A Production And Storage Problem Under Uncertainty
Registro en:
780390458
Ieee International Conference On Mechatronics And Automation, Icma 2005. , v. , n. , p. 665 - 670, 2005.
2-s2.0-27744510838
Autor
Arruda E.F.
Do Val J.B.R.
Almudevar A.
Institución
Resumen
In this work, we present an approximate value iteration algorithm for a production and storage model with multiple production stages and a single final product, subject to random demand. We use linear function approximation schemes in subsets of the state space and represent a few key states in a look-up table form. We obtain some promising results and perform sensitivity analysis with respect to the parameters of the algorithm for the benchmark problem studied. © 2005 IEEE.
665 670 Davis, M.H.A., (1993) Markov Models and Optimization, , London: Chapman and Hall Sethi, S.P., Yan, H., Zhang, H., Zhang, Q., Optimal and hierarchical controls in dynamic stochastic manufacturing sytems: A review (2002) Manuf. & Serv, Ops. Management, 4 (2), pp. 133-170 Yin, K.K., Liu, H., Yin, G.G., Stochastic models and numerical solutions for production planning with applications to the paper industry (2003) Computers & Chemical Engineering, 27, pp. 1693-1706 Si, J., Barto, A., Powell, W., Wunsch, D., (2004) Handbook of Learning and Approximate Dynamic Programming, , Piscataway-NJ: John Wiley & Sons-IEEE Press Bertsekas, D.P., Tsitsiklis, J.N., (1996) Neuro-dynamic Programming, , Belmont: Athena Scientific Sutton, R.S., Barto, A.G., (1998) Reinforcement Learning: An Introduction, , Cambridge: MIT Press Arruda, E.F., Almudevar, A., Do Val, J.B.R., Stability and optimally of a discrete production and storage model with uncertain demand (2004) Proceedings of the 43th IEEE Conference on Decision and Control, pp. 3354-3360. , Nassau Gordon, G., Stable function approximation in dynamic programming (1995) Proceedings of IMCL'95 B. III, L.C., Residual algorithms: Reinforcement learning with function approximation (1995) International Conference on Machine Learning, pp. 30-37. , [Online], Available: citeseer.csail.mit.edu/baird95residual.html Reynolds, S.I., The stability of general discounted reinforcement learning with linear function approximation (2002) Proceedings of the UK Workshop on Computational Intelligence, pp. 139-146. , Birmingham-UK Weiring, M.A., Convergence and divergence in standard and averaging reinforcement learning (2004) Proc. 15th European Conf. on Machine Learning, pp. 477-488. , Pisa-Italy Golub, G.H., Van Loan, C.F., (1996) Matrix Computations, 3rd Ed., , Baltimore: Johns Hopkins University Press