The issue isn't objective quality or realism, it's sticking to a specific style consistently.
_Everyone_ (and their grandmother) can instantly tell a ChatGPT generated image, it has a very distinct style - and in my experience no amount of prompting will make it go away. Same for Grok and to a smaller degree Google's stuff.
What the industry needs (and uses) is something they can feed a, say, wall texture into and the AI workflow will produce a summer, winter and fall variant of that - in the exact style the specific game is using.
And stablediffusion-web-ui before that and others, yes.
When googling, txt2img and img2img, or txt2video img2video etc. (for video) are useful terms, since they encapsulate the usage in a few terms. One could search img2video comfyui workflows, for example.
I thought it would be useful for the conversation to provide these terms, not mentioned before in the thread.
_Everyone_ (and their grandmother) can instantly tell a ChatGPT generated image, it has a very distinct style - and in my experience no amount of prompting will make it go away. Same for Grok and to a smaller degree Google's stuff.
What the industry needs (and uses) is something they can feed a, say, wall texture into and the AI workflow will produce a summer, winter and fall variant of that - in the exact style the specific game is using.