Statistical Learning under Heterogeneous Distribution Shift