I'm tired of seeing articles run these "same prompt" tests as if it's an apples-to-apples comparison. Different models need somewhat different prompting to get their best results. A prompt that gives a great result in Midjourney probably won't give a great result in Flux or SDXL, for example.
Please start comparing with...