Generates a response based on provided images and text using a vision-language model.
animuslabs/Qwen2-VL-NSFW-Vision-1.2animuslabs/Qwen2-VL-NSFW-Vision-1.2)content array in each message can contain:
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.