Towards identifying interpretable, manipulable and composable representations for controlling data generation