A curious robot: An explorative-exploitive inference algorithm

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

We propose a sequential learning algorithm with a focus on robot
control. It is initialised by a teacher who directs the robot
through a series of example solutions of a problem. Left alone, the
control chooses its next action by prediction based on a variable
order Markov chain model selected to minimise a MDL criterion based
on generalised code length La of the past robot-environment
interaction. The user specifies the parameter a and as a
result, the robot can be directed towards exploratory behaviour if
confidence in the teacher is low (a<0), and towards
goal-seeking exploitive behaviour if confidence in the teacher is
high (a>0). The novelty of the proposed method lies in the
use of generalised code length in the MDL model selection criterion.
Original languageEnglish
Title of host publicationRoboMat 07 : Coimbra, Portugal, 17-19 September, 2007
EditorsHelder Araujo, Maria Isabel Ribeiro
Number of pages6
PublisherCIM (Centro Internacional de Matematica)
Publication date2007
Pages51-57
ISBN (Print)9789899501133
Publication statusPublished - 2007
EventWorkshop of Robotics and Mathematics (RoboMat 2007) - Coimbra, Portugal
Duration: 17 Sep 200719 Sep 2007

Conference

ConferenceWorkshop of Robotics and Mathematics (RoboMat 2007)
LandPortugal
ByCoimbra
Periode17/09/200719/09/2007

ID: 2030803