Multimodal Image Reasoning & Instruction API