Diversity Is All You Need: Learning Skills without a Reward Function

https://arxiv.org/abs/1802.06070

# Abstract
- Learn **skills** by maximizing information using maximum entropy policy
- Train typical reinforcement learning with best **skill** after unsupervised learning

# 1. Introduction
- **Skill** is just a policy
- Key Idea is discriminability of **skills**
	- Skills has to be distinguishable
	- Skills has to be as diverse as possible

# 2. Related Work
- Three important distinction of paper
	1. Using maximum entropy policies to force skills to be diverse
	2. Fix distribution **p(z)**
	3. Watches every **states**

Paper says that maximizing diversity is better than specific reward on complex behaviors

# 3. Diversity is all you need
![image](https://user-images.githubusercontent.com/2807595/36885448-4220cc4e-1e2a-11e8-8bb7-191228ba04a5.png)
![image](https://user-images.githubusercontent.com/2807595/36885451-4a67acd8-1e2a-11e8-81b6-28f1ee5a64c9.png)

## 3.1. How it works

**H[a|s]** = **MI(a,z|s)** from continuous action space

**F(Θ)** = **H[a|s,z]** + **H[z]** - H[z|s]
- **H[a|s,z]**: skill act randomly
- **H[z]**: **p(z)** to have high entropy
- **H[z|s]**: infer z from current state

## 3.2. Implementation
![image](https://user-images.githubusercontent.com/2807595/36885525-c9d356d4-1e2a-11e8-8214-c8523f81209c.png)

## 4. What skills are learned?
![image](https://user-images.githubusercontent.com/2807595/36885531-d493e606-1e2a-11e8-8f8b-eedb507bef87.png)
(alpha with 0.01 is best discriminative illustration)

# Question
- Is this model similar to random forest?
- What is critic network?
- What is M-Projection?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Diversity Is All You Need: Learning Skills without a Reward Function #7

Abstract

1. Introduction

2. Related Work

3. Diversity is all you need

3.1. How it works

3.2. Implementation

4. What skills are learned?

Question

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Diversity Is All You Need: Learning Skills without a Reward Function #7

Description

Abstract

1. Introduction

2. Related Work

3. Diversity is all you need

3.1. How it works

3.2. Implementation

4. What skills are learned?

Question

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions