Beyond No Regret: Instance-Dependent PAC Reinforcement Learning