If you use the Bing chat interface and say "Can you draw me a picture of X?", it responds with "I’m sorry, but I’m not able to draw pictures. Is there anything else I can help you with?" followed immediately by "Your image is taking a while to generate. Check your image creation progress at Image Creator."
Looks like they might be using an LLM for the chat responses that isn't aware it has the ability to draw images, and in parallel another model that decides what to draw and show to the user.
I think this has to do with the verb "draw". The LLM is just saying it cannot draw. The image generation is likely a function it "calls". The LLM probably thinks of the image generator as a tool it uses, a separate entity from itself.
> The LLM probably thinks of the image generator as a tool it uses
I don’t think it’s correct to describe the LLM as "thinking" in this instance, not because of the usual philosophical objections, but because I suspect it is a bad heuristic for designing these kinds of prompts.
Probably. I’ve had limited success getting LLMs (trained on chat/instruct data) to output special codes indicating they’re communicating with a separate system (e.g. Google, Stable Diffusion), then capturing that output, running it through the external system, and feeding the result back to the user. Roughly the pattern sketched below.
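A minimal sketch of what I mean, with everything made up: the [[IMAGE: ...]] marker syntax and the generate_image backend are hypothetical stand-ins, not anything Bing actually uses.

    import re

    # Hypothetical special code the model is prompted to emit when it wants an image.
    TOOL_CALL = re.compile(r"\[\[IMAGE:(.*?)\]\]")

    def handle_turn(llm_output, generate_image):
        # If the model emitted the marker, dispatch the prompt to the image
        # backend (e.g. a Stable Diffusion API) and splice the result back
        # into the user-facing reply; otherwise pass the text through.
        match = TOOL_CALL.search(llm_output)
        if match is None:
            return llm_output
        prompt = match.group(1).strip()
        image_url = generate_image(prompt)
        return TOOL_CALL.sub(f"(image generated: {image_url})", llm_output)

    # Example: handle_turn("Sure! [[IMAGE: a cat in a hat]]", lambda p: "https://example.com/img.png")

The failure mode in the parent comment would then just be the chat model answering in text that it "can't draw" while the orchestration layer independently spots (or is told about) the image request and kicks off generation anyway.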