This document provides an introduction to reinforcement learning (RL) and RL for brain-machine interfaces (RL-BMI). It outlines key RL concepts like the environment, value functions, and methods for achieving optimality including dynamic programming, Monte Carlo, and temporal difference methods. It also discusses eligibility traces and provides an example of an online/closed-loop RL-BMI architecture. References for further reading on the topics are included.