Text is a lot easier on bing/dalle 3, for stable diffusion, its better to just do it manually, or use controlnet. There are loras like this one though: https://civitai.com/models/176555/harrlogos-xl-finally-custom-text-generation-in-sd
But its mostly for logos, and still takes a bit of prompting to get it to write what you want.
They change the output. specifically wanted the loras to change the output design so it looks more like a mass of creatures fused together like the thing. For reference, here is the original output from bing:
And here is the bodhor lora on civitai:
edit: I linked the wrong lora before. here is the correct one, I think. I downloaded it some time ago. https://civitai.com/models/51288/body-horror-creatures
Prompt for the main image is:
gritty graphic novel art, a warrior holding up his hand which has turned into the head of a fire breathing dragon and shouting “forefathers one and all, bear witness!” speech bubble