Towards Open-Set Computer Vision With Language Guidance