Multi-Task Averaging: Theory and Practice