Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use