Expected Return - What Drives A Reinforcement Learning Agent In An Mdp