Reinforcement learning for query-based multi-document extractive summarisation