Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Filters








48,130 Hits in 2.8 sec

Iterative learning control for performance optimisation

Bing Chu
2016 2016 American Control Conference (ACC)  
Iterative learning control (ILC) is a popular design methodology to achieve high performance trajectory tracking of systems operating in a repetitive manner.  ...  This paper further extends the applicability of ILC by showing that ILC can solve a more general problem of optimising some system performance for which trajectory tracking is just a special case.  ...  CONCLUSION Iterative learning control is a popular control design method for high performance trajectory tracking of systems operating in a repetitive manner.  ... 
doi:10.1109/acc.2016.7525233 dblp:conf/amcc/Chu16 fatcat:c3slfsqbcvhl5i3t4zf5trdkfm

An extremum seeking approach to sampled-data iterative learning control of continuous-time nonlinear systems**This research was supported in part by the Institute for Mathematics and its Applications with funds provided by the National Science Foundation and the Australian Research Council

Sei Zhen Khong, Dragan Nešić, Miroslav Krstić
2016 IFAC-PapersOnLine  
Iterative learning control (ILC) of continuous-time nonlinear plants with periodic sampled-data inputs is considered via an extremum seeking approach.  ...  Abstract: Iterative learning control (ILC) of continuous-time nonlinear plants with periodic sampled-data inputs is considered via an extremum seeking approach.  ...  Iterative learning control for reference tracking Iterative learning control (ILC) is a recursive learning based algorithm for solving the reference tracking problem (Moore, 1993; Moore et al., 1992;  ... 
doi:10.1016/j.ifacol.2016.10.292 fatcat:c3xfpgzi7zf63p2r4d57753f2i

Research on Parameter Optimization of Dynamic Priority Scheduling Algorithm Based on Improved Reinforcement Learning

Shanshan Meng, Qiang Zhu, Fei Xia
2020 IET Generation, Transmission & Distribution  
Aiming to solve this problem, a dynamic priority scheduling algorithm based on improved reinforcement learning (RL) is proposed for parameter optimisation.  ...  A scheduling algorithm optimised by RL can be better applied to industrial control and power system resource scheduling, which not only improves control efficiency but reduces scheduling costs.  ...  The number of learning iterations of the improved Qlearning algorithm is reduced from 7867 for basic Q-learning to 116 iterations.  ... 
doi:10.1049/iet-gtd.2019.1468 fatcat:mlpojwx3yzeo3cqh4ajopfqfh4

Control as Hybrid Inference [article]

Alexander Tschantz, Beren Millidge, Anil K. Seth, Christopher L. Buckley
2020 arXiv   pre-print
CHI thus provides a principled framework for harnessing the sample efficiency of model-based planning while retaining the asymptotic performance of model-free policy optimisation.  ...  Here, we unify these approaches by casting model-free policy optimisation as amortised variational inference, and model-based planning as iterative variational inference, within a 'control as hybrid inference  ...  Leveraging these insights, we propose control as hybrid inference (CHI), a framework for combining amortised and iterative inference in the context of control.  ... 
arXiv:2007.05838v1 fatcat:5cjuh5sdt5futfxwqxhktpq67u

Adaptive Model Predictive Control by Learning Classifiers [article]

Rel Guzman, Rafael Oliveira, Fabio Ramos
2022 arXiv   pre-print
This is then integrated into a model predictive path integral control framework yielding robust controllers for a variety of challenging robotics tasks.  ...  Stochastic model predictive control has been a successful and robust control framework for many robotics tasks where the system dynamics model is slightly inaccurate or in the presence of environment disturbances  ...  This controller learning capability can be achieved with data-driven approaches for MPC optimisation (Görges, 2017) .  ... 
arXiv:2203.06783v2 fatcat:iwatvipiavcedlrihad2kchrhq

Fingerprint Policy Optimisation for Robust Reinforcement Learning [article]

Supratik Paul, Michael A. Osborne, Shimon Whiteson
2019 arXiv   pre-print
The central idea is to use Bayesian optimisation (BO) to actively select the distribution of the environment variable that maximises the improvement generated by each iteration of the policy gradient method  ...  Our experiments show that FPO can efficiently learn policies that are robust to significant rare events, which are unlikely to be observable under random sampling, but are key to learning good policies  ...  Acknowledgements We would like to thank Binxin Ru for sharing the code for FITBO, and Yarin Gal for the helpful discussions.  ... 
arXiv:1805.10662v3 fatcat:iyyzn3o4uja3bkk6tzhve2p2ke

Testing the Limits of SMILES-based De Novo Molecular Generation with Curriculum and Deep Reinforcement Learning [article]

Maranga Mokaya, Fergus Imrie, Willem P van Hoorn, Aleksandra Kalisz, Anthony R Bradley, Charlotte M Deane
2022 bioRxiv   pre-print
To help overcome these issues, we propose a new curriculum learning-inspired, recurrent Iterative Optimisation Procedure that enables the optimisation of generated molecules for seen and unseen molecular  ...  Deep reinforcement learning methods have been shown to be potentially powerful tools for de novo design.  ...  Iterative optimisation procedure for diversity control For de novo design tools to be effective, it should be possible to control the specificity of the generated molecules.  ... 
doi:10.1101/2022.07.15.500218 fatcat:rul72ktb5zdppeqcy7qtoornwi

Follow the Gradient: Crossing the Reality Gap using Differentiable Physics (RealityGrad) [article]

Jack Collins, Ross Brown, Jürgen Leitner, David Howard
2021 arXiv   pre-print
We demonstrate RealitGrad on a dynamic control task for a serial link robot manipulator and present results that show its efficiency and ability to quickly improve not just the robot's performance in real  ...  We propose a novel iterative approach for crossing the reality gap that utilises live robot rollouts and differentiable physics.  ...  [16] applied Iterative Linear Quadratic Control, on the linearized dynamics of a cartpole system for direct trajectory optimisation.  ... 
arXiv:2109.04674v1 fatcat:wh2pivygvbcj3e5ihwkkedyt2y

Bayesian functional optimisation with shape prior [article]

Pratibha Vellanki, Santu Rana, Sunil Gupta, David Rubin de Celis Leal, Alessandra Sutti, Murray Height, Svetha Venkatesh
2020 arXiv   pre-print
We demonstrate the effectiveness of our approach for short polymer fibre design and optimising learning rate schedules for deep networks.  ...  We develop a novel Bayesian optimisation framework for such functional optimisation of expensive black-box processes.  ...  We apply our algorithm for learning rate schedule optimisation and found that an optimised learning rate schedule can even make SGD to perform better than both the method proposed by (Vien, Zimmermann  ... 
arXiv:1809.07260v2 fatcat:yreg3cb75bcwxhe77cmecy7a64

Learning control for best dynamic performance

P. Kiriazov
2006 Journal of Biomechanics  
In our study, a conceptual framework for efficient control synthesis in voluntary, ballistic-like movements is proposed.  ...  A very important question is how that large-scale neuro-musculo-skeletal systems are efficiently controlled in dynamic motion tasks.  ...  We propose a control learning approach that optimises the performance within a minimum number of test movements. It has the following main steps: 1.  ... 
doi:10.1016/s0021-9290(06)83276-8 fatcat:paikjmnbrnhwfnm4gczlzlhapm

FEREBUS: a high-performance modern Gaussian process regression engine

Matthew J. Burn, Paul L. A. Popelier
2023 Digital Discovery  
FEREBUS is a highly optimised Gaussian process regression (GPR) engine, which provides both model and optimiser flexibility to produce tailored models designed for domain specific applications.  ...  B. acknowledges the MRC National Productivity Investment Fund (NPIF) for the award of a PhD studentship.  ...  For both UAPSO and IAPSO, a learning automaton is used to select the control parameters.  ... 
doi:10.1039/d2dd00082b fatcat:yeeyhlvhird6bdz36qkq6xvwp4

Optimisation of Structured Neural Controller Based on Continuous-Time Policy Gradient [article]

Namhoon Cho, Hyo-Sang Shin
2022 arXiv   pre-print
performance provided by machine learning techniques.  ...  Such a hybrid paradigm for fixed-structure control synthesis is particularly useful for optimising adaptive nonlinear controllers to achieve improved performance in online operation, an area where the  ...  Specifically, this study addressed application of the CTPG method for performance optimisation of structured control systems prescribing a wellknown structure for the actual control input variable while  ... 
arXiv:2201.06262v4 fatcat:z5avwt3g4zhrrowe4vnynakkga

A Data-Driven Approach to Iterative Learning Control via Convex Optimization

Achille Nicoletti, Michele Martino, Davide Aguglia
2020 IET Control Theory & Applications  
A new data-driven iterative learning control methodology is presented which uses the frequency response data of a system in order to avoid the problem of unmodelled dynamics associated with low-order parametric  ...  A convex optimisation problem is formulated to design the learning filters such that the convergence criterion is minimised.  ...  Iterative learning control (ILC) (which was first proposed in [18] ) seeks to address the tracking problem by implementing a learning algorithm where the desired performance is achieved by incorporating  ... 
doi:10.1049/iet-cta.2018.6446 fatcat:w4pqi5nncncr7bnx4itvprirz4

Optimisation of hand posture stimulation using an electrode array and iterative learning control

Timothy Exell, Christopher Freeman, Katie Meadmore, Ann-Marie Hughes, Emma Hallewell, Jane Burridge
2013 Journal of Automatic Control  
Nonlinear optimisation-based search algorithms have been developed for the precise stimulation of muscles in the wrist and hand, to enable stroke patients to attain predefined gestures.  ...  Initial performance results from unimpaired subjects show the successful reproduction of six reference hand postures using the system.  ...  The approach taken utilises iterative learning control (ILC), a control technique that was developed for industrial systems that repetitively complete the same movement.  ... 
doi:10.2298/jac1301001e fatcat:rlfny4vtuzaybhzc3e3ncrs2qq

Amortized variance reduction for doubly stochastic objectives [article]

Ayman Boustati, Sattar Vakili, James Hensman, ST John
2020 arXiv   pre-print
We propose a new approach in which we use a recognition network to cheaply approximate the optimal control variate for each mini-batch, with no additional model gradient computations.  ...  We illustrate the properties of this proposal and test its performance on logistic regression and deep Gaussian processes.  ...  For each period, we iteratively sample a gradient value then perform an optimisation step on the recognition network.  ... 
arXiv:2003.04125v1 fatcat:wqqkktmx7rapvilro4socxpywu
« Previous Showing results 1 — 15 out of 48,130 results