Sample-Efficient Deep Reinforcement Learning From Single Agent To Multiple Agents