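Title: Adaptive Dynamic Programming - A New Tool for Learning Control
Time: 3:00 p.m., Wednesday, October 8, 2008
Venue: New Main Building, Room E706
Speaker: Derong Liu (刘德荣)
IEEE Fellow; recipient of the "Overseas Outstanding Young Scholars Fund"; selected for the Chinese Academy of Sciences "Hundred Talents Program"; Professor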
Department of Electrical and Computer Engineering
University of Illinois at Chicago
Email: dliu@ece.uic.edu
Adaptive Dynamic Programming (ADP) has received increasing attention recently. An ADP scheme is a design that approximates dynamic programming in the general case, i.e., it approximates optimal control over time in noisy, nonlinear environments. Many practical engineering problems can be formulated as cost minimization or maximization problems, and dynamic programming is a very useful tool for solving them. However, running dynamic programming is often computationally untenable because of the backward numerical process its solution requires. Over the years, progress has been made in providing approximate solutions to dynamic programming. The idea is to use neural networks to approximate the cost function, and thereby the dynamic programming solution. The methodology is a very useful tool for building intelligent agents/controllers in almost any environment.
This talk will review the theoretical development of ADP. Details of the training of the neural networks used in the present design will also be presented. The pole balancing (inverted pendulum) problem will be used as the benchmark to show the applicability of ADP.
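As a rough illustration of the idea in the abstract, the cost-function approximation can be written out as follows. The notation is assumed for this sketch and is not taken from the talk: a deterministic discrete-time system x_{k+1} = F(x_k, u_k), a stage cost U(x_k, u_k), a discount factor 0 < \gamma \le 1, and an approximating network \hat{J}(x; w) with weights w. The noisy case mentioned above would add an expectation over the next state.

\[
J^*(x_k) = \min_{u_k} \left\{ U(x_k, u_k) + \gamma \, J^*(x_{k+1}) \right\}
\]

\[
e_k = \hat{J}(x_k; w) - \left[ U(x_k, u_k) + \gamma \, \hat{J}(x_{k+1}; w) \right]
\]

The first equation is the Bellman optimality equation that dynamic programming solves backward in time. In a typical approximate scheme, the weights w are instead adjusted forward in time to reduce the residual e_k, for example by gradient descent on e_k^2 / 2.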
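Derong Liu: IEEE Fellow, currently a tenured Full Professor in the Department of Electrical and Computer Engineering and the Department of Computer Science at the University of Illinois. He graduated from the University of Notre Dame in 1994 with a Ph.D. degree in electrical engineering. From 1993 to 1995, he worked at the General Motors Research and Development Center. From 1995 to 1999, he was an Assistant Professor in the Department of Electrical and Computer Engineering at Stevens Institute of Technology. Since 1999, he has been with the Department of Electrical and Computer Engineering at the University of Illinois at Chicago, where he has served successively as Assistant Professor, tenured Associate Professor, and tenured Full Professor of electrical and computer engineering and of computer science. In 2005, he was elected an IEEE Fellow for his contributions to nonlinear dynamical systems and recurrent neural networks. In 2008, he was selected for the Chinese Academy of Sciences "Hundred Talents Program" and appointed a research professor at the Institute of Automation.
While studying at the University of Notre Dame, he was supported by the university's Michael J. Birck Fellowship (1990-1991). While teaching at Stevens Institute of Technology, he received its Harvey N. Davis Distinguished Teaching Award (1997). At the University of Illinois, he received the University Scholar Award (2006-2009). In 1999, he received the Faculty Early Career Development (CAREER) award from the U.S. National Science Foundation. In 2008, he received the "Overseas Outstanding Young Scholars Fund" from the National Natural Science Foundation of China. He was a member of the Conference Editorial Board of the IEEE Control Systems Society from 1995 to 2000, an Associate Editor of the IEEE Transactions on Circuits and Systems from 1997 to 1999, an Associate Editor of the IEEE Transactions on Signal Processing from 2001 to 2003, and the Letters Editor of the IEEE Transactions on Neural Networks from 2006 to 2008. He is currently an Associate Editor of Automatica, the IEEE Transactions on Neural Networks, the IEEE Computational Intelligence Magazine, and the IEEE Circuits and Systems Magazine, and the Editor of the IEEE Computational Intelligence Society's Electronic Letter. In 2005, he was elected a member of the AdCom of the IEEE Computational Intelligence Society.
His current research covers intelligent control theory and applications, artificial neural networks, fuzzy systems, bioinformatics, power system operation and control, and wireless communications and wireless networks. Since 1992, he has published more than 60 papers in international journals and more than 110 papers in international conferences, and has published eight books with co-authors.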
Derong Liu received the Ph.D. degree in electrical engineering from the University of Notre Dame, Notre Dame, IN, in 1994. From 1993 to 1995, he was a Staff Fellow with General Motors Research and Development Center, Warren, MI. From 1995 to 1999, he was an Assistant Professor in the Department of Electrical and Computer Engineering, Stevens Institute of Technology, Hoboken, NJ. He joined the University of Illinois at Chicago in 1999, where he is now a Full Professor of electrical and computer engineering and of computer science. Since 2005, he has been Director of Graduate Studies in the Department of Electrical and Computer Engineering, University of Illinois at Chicago. He has published seven books (four research monographs and three edited volumes). He is currently the Editor of the IEEE Computational Intelligence Society's Electronic Letter, an Associate Editor of the IEEE Transactions on Neural Networks, an Associate Editor of the IEEE Computational Intelligence Magazine, an Associate Editor of the IEEE Circuits and Systems Magazine, and an Associate Editor of Automatica. He is an elected AdCom member of the IEEE Computational Intelligence Society (2006-2008). He received the Faculty Early Career Development (CAREER) award from the National Science Foundation (1999) and the University Scholar Award from the University of Illinois (2006-2009). He is a Fellow of the IEEE.