Dabbling in Reinforcement Learning