Download PDFOpen PDF in browser

Generalizing Robot Imitation Learning with Invariant Hidden Semi-Markov Models

EasyChair Preprint no. 746

16 pagesDate: January 22, 2019

Abstract

Generalizing manipulation skills to new situations requires extracting invariant patterns from demonstrations. For example, the robot needs to understand the demonstrations at a higher level while being invariant to the appearance of the objects, geometric aspects of objects such as its position, size, orientation and viewpoint of the observer in the demonstrations. In this paper, we propose an algorithm that learns a joint probability density function of the demonstrations with invariant formulations of hidden semi-Markov models to extract invariant segments (also termed as sub-goals or options), and smoothly follow the generated sequence of states with a linear quadratic tracking controller. The algorithm takes as input the demonstrations with respect to different coordinate systems describing virtual landmarks or objects of interest with a task-parameterized formulation, and adapt the segments according to the environmental changes in a systematic manner. We present variants of this algorithm in latent space with low-rank covariance decompositions, semi-tied covariances, and non-parametric online estimation of model parameters under small variance asymptotics; yielding considerably lower sample and model complexity in contrast to deep learning approaches. The algorithm allows a Baxter robot to learn a pick-and-place task while avoiding a movable obstacle based on only 4 kinesthetic demonstrations.

Keyphrases: adaptive systems, Hidden Markov Models, Imitation Learning

BibTeX entry
BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:
@Booklet{EasyChair:746,
  author = {Ajay Kumar Tanwani and Jonathan Lee and Brijen Thananjeyan and Michael Laskey and Sanjay Krishnan and Roy Fox and Ken Goldberg and Sylvain Calinon},
  title = {Generalizing Robot Imitation Learning with Invariant Hidden Semi-Markov Models},
  howpublished = {EasyChair Preprint no. 746},
  doi = {10.29007/d3d4},
  year = {EasyChair, 2019}}
Download PDFOpen PDF in browser