Learning the way of life from Reinforcement Learning

Photo by Andrea De Santis on Unsplash

RL in a nutshell

Photo by KDnuggets

Value / Q

  • What’s the value of my current situation?
  • What can I expect from it?
  • What’s the value of the next possible situation and which one should I choose?
  • Am I doing well?
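
The questions above map fairly directly onto a state value V(s) ("what is my current situation worth?") and an action value Q(s, a) ("what is each next move worth, and which one should I pick?"). Here is a minimal sketch of that idea. The four-state corridor, the reward of 1 at the far end, and the discount factor of 0.9 are all assumptions made up for illustration, not something taken from the article.

```python
# Hypothetical toy world: a corridor of states 0..3; reaching state 3 ends the episode.
GAMMA = 0.9          # assumed discount factor: how much future reward counts today
N_STATES = 4
ACTIONS = (-1, +1)   # step left, step right


def step(state, action):
    """Deterministic move along the corridor; returns (next_state, reward)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    reward = 1.0 if next_state == N_STATES - 1 else 0.0
    return next_state, reward


def q_value(values, state, action):
    """Value of taking `action` in `state`: immediate reward plus discounted future value."""
    next_state, reward = step(state, action)
    return reward + GAMMA * values[next_state]


def value_iteration(sweeps=50):
    """Repeatedly back up each state's value from its best available action."""
    values = [0.0] * N_STATES
    for _ in range(sweeps):
        for s in range(N_STATES - 1):  # the terminal state keeps value 0
            values[s] = max(q_value(values, s, a) for a in ACTIONS)
    return values


values = value_iteration()
# "What's the value of the next possible situation?"
q_table = {(s, a): q_value(values, s, a) for s in range(N_STATES - 1) for a in ACTIONS}
# "Which one should I choose?"
best_action = {s: max(ACTIONS, key=lambda a: q_table[(s, a)]) for s in range(N_STATES - 1)}

print(values)
print(best_action)
```

In this toy setup the values grow as the goal gets closer, so "am I doing well?" becomes a question of whether your value is rising from one step to the next.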

Action & Policy

Exploration

Backpropagation

tl;dr

  1. It’s all about learning. An agent that beat the Go champion learned from scratch through millions of rounds of trial and error.
  2. Explore to step outside your comfort zone.
  3. Know your value function.
  4. Examine your policy: it’s a habit that determines your fate.
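
To see how exploration, the value function, and the policy fit together, here is a rough sketch of tabular Q-learning with an epsilon-greedy policy. Everything in it is an assumption chosen for illustration: the five-state corridor, the reward of 1 at the far end, and the values of ALPHA, GAMMA, and EPSILON. It is one simple way to show the loop, not the method behind any particular Go-playing agent.

```python
import random

# Hypothetical toy world: a corridor of states 0..4; reaching state 4 ends the episode.
N_STATES = 5
ACTIONS = (-1, +1)                     # step left, step right
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.2  # assumed learning rate, discount, exploration rate


def step(state, action):
    """Deterministic move; returns (next_state, reward, done)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def choose_action(q, state, rng):
    """Epsilon-greedy policy: mostly follow the habit, sometimes try something else."""
    if rng.random() < EPSILON:
        return rng.choice(ACTIONS)     # exploration
    best = max(q[(state, a)] for a in ACTIONS)
    return rng.choice([a for a in ACTIONS if q[(state, a)] == best])  # random tie-break


def train(episodes=500, seed=0):
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}  # start knowing nothing
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = choose_action(q, state, rng)
            next_state, reward, done = step(state, action)
            future = 0.0 if done else GAMMA * max(q[(next_state, a)] for a in ACTIONS)
            # Nudge the estimate toward what was just experienced.
            q[(state, action)] += ALPHA * (reward + future - q[(state, action)])
            state = next_state
    return q


q = train()
# The learned greedy policy: the "habit" each state ends up with.
policy = {s: max(ACTIONS, key=lambda a: q[(s, a)]) for s in range(N_STATES - 1)}
print(policy)
```

Setting EPSILON to 0 in this sketch removes the random exploratory moves, which is a quick way to experiment with how much the agent depends on trying things it does not yet believe in.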

etherhyun
