{"id":6405,"date":"2025-02-23T18:27:04","date_gmt":"2025-02-23T18:27:04","guid":{"rendered":"https:\/\/robertjwallace.com\/?p=6405"},"modified":"2025-02-24T23:59:26","modified_gmt":"2025-02-24T23:59:26","slug":"ai-image-generation-still-a-ways-to-go","status":"publish","type":"post","link":"https:\/\/robertjwallace.com\/es\/ai-image-generation-still-a-ways-to-go\/","title":{"rendered":"Generaci\u00f3n de im\u00e1genes con IA: todav\u00eda queda un largo camino por recorrer."},"content":{"rendered":"<p class=\"\">A principios de 2024, la generaci\u00f3n de im\u00e1genes con IA ha alcanzado una fascinante paradoja. A primera vista, la tecnolog\u00eda parece casi m\u00e1gica, capaz de crear impresionantes escenas fotorrealistas que fusionan realidad e imaginaci\u00f3n. Tomemos, por ejemplo, la tarea de generar el dise\u00f1o de un coche cl\u00e1sico h\u00edbrido: al combinar un Mustang del 67 con un MGA Roadster del 57, la IA puede producir im\u00e1genes con una atenci\u00f3n excepcional al detalle, sobre fondos monta\u00f1osos perfectamente renderizados con una iluminaci\u00f3n espectacular. El cromo brilla, las curvas fluyen y el sol poniente proyecta las sombras perfectas sobre los acantilados.<\/p>\n\n\n\n<!--more-->\n\n\n\n<p class=\"\">Yet upon closer inspection, the limitations become apparent. While AI excels at certain elements &#8211; like the mechanical precision of car bodies or the natural textures of landscapes &#8211; it still struggles with human subjects. In the classic car challenge, many generators either omitted the requested driver entirely or produced unconvincing human figures that break the illusion of photorealism. These inconsistencies reveal that while AI image generation has made tremendous strides, it remains a technology in development, particularly when it comes to naturally integrating human elements into complex scenes.<\/p>\n\n\n\n<p class=\"\">Here is  a comparison of several free AI image generators, that show both the impressive capabilities and persistent challenges of this rapidly evolving technology. All used the prompt: &#8220;A hybrid car that combines elements of a 1967 Ford Mustang and a 1957 MGA Roadster. The design features the sleek, muscular stance of the Mustang with the graceful curves and open-top design of the MGA. The car is shown fully (not cropped) with a young woman driving. The driver&#8217;s position and the steering wheel are clearly on the left side of the car, with the top of the steering wheel visible. The car is driving on a winding mountain road, with lush greenery and rugged cliffs in the background under a setting sun, creating a dramatic and adventurous atmosphere.&#8221;<\/p>\n\n\n\n<p class=\"\">Here&#8217;s a list of AI image generators mentioned in the article, <\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"\">Deepimg.AI<\/li>\n\n\n\n<li class=\"\">Perchance<\/li>\n\n\n\n<li class=\"\">DALL E3<\/li>\n\n\n\n<li class=\"\">DeepAI<\/li>\n\n\n\n<li class=\"\">Microsoft Designer<\/li>\n\n\n\n<li class=\"\">FlatAI<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\">Perchance<\/h2>\n\n\n\n<p class=\"\"><a href=\"https:\/\/perchance.org\/ai-photo-generator\">https:\/\/perchance.org\/ai-photo-generator<\/a><\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"768\" height=\"512\" data-src=\"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-11.png\" alt=\"\" class=\"wp-image-6406 lazyload\" data-srcset=\"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-11.png 768w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-11-300x200.png 300w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-11-600x400.png 600w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-11-272x182.png 272w\" data-sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 768px; --smush-placeholder-aspect-ratio: 768\/512;\" \/><\/figure>\n\n\n\n<p class=\"\">While a nice image, there is no driver and not much, if any, influence from a 1957 MGA roadster.  The site did offer multiple images, but the interface was very crude looking.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">DALL E3<\/h2>\n\n\n\n<p class=\"\"><a href=\"https:\/\/openai.com\/index\/dall-e-3\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">https:\/\/openai.com\/index\/dall-e-3<\/a><\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" width=\"1024\" height=\"585\" data-src=\"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-12-1024x585.png\" alt=\"\" class=\"wp-image-6407 lazyload\" data-srcset=\"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-12-1024x585.png 1024w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-12-300x171.png 300w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-12-768x439.png 768w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-12-1536x878.png 1536w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-12.png 1792w\" data-sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1024px; --smush-placeholder-aspect-ratio: 1024\/585;\" \/><\/figure>\n\n\n\n<p class=\"\">This image does a better job with merging the Mustang and MGA, but the driver shows that AI still has difficulties with generating realistic humans.  Also I was not pleased with the Mustang logo&#8217;s in the image.  However, there was an edit option that let me prompt: remove the mustang logos and text.  <\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" width=\"1024\" height=\"585\" data-src=\"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-13-1024x585.png\" alt=\"\" class=\"wp-image-6408 lazyload\" data-srcset=\"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-13-1024x585.png 1024w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-13-300x171.png 300w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-13-768x439.png 768w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-13-1536x878.png 1536w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-13.png 1792w\" data-sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1024px; --smush-placeholder-aspect-ratio: 1024\/585;\" \/><\/figure>\n\n\n\n<p class=\"\">This removed the logo and text, but gave me a new image that was far worse than the first. The car was less of a combination between a Mustang and a MGA.  The driver still is unrealistic and is not positioned correctly.  I tried one more time, asking &#8220;No, keep the first image, just remove the logo and text&#8221;.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" width=\"1024\" height=\"585\" data-src=\"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-14-1024x585.png\" alt=\"\" class=\"wp-image-6409 lazyload\" data-srcset=\"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-14-1024x585.png 1024w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-14-300x171.png 300w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-14-768x439.png 768w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-14-1536x878.png 1536w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-14.png 1792w\" data-sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1024px; --smush-placeholder-aspect-ratio: 1024\/585;\" \/><\/figure>\n\n\n\n<p class=\"\">And Dall E told me &#8220;Here is the updated version of the original image with the logos and text removed. Let me know if you need further modifications!&#8221;  To which I responded &#8220;ARRRGGGG!!!!!&#8221;<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">DeepAI<\/h2>\n\n\n\n<p class=\"\"><a href=\"https:\/\/deepai.org\/machine-learning-model\/text2img\">https:\/\/deepai.org\/machine-learning-model\/text2img<\/a><\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"640\" height=\"640\" data-src=\"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-15.png\" alt=\"\" class=\"wp-image-6410 lazyload\" data-srcset=\"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-15.png 640w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-15-300x300.png 300w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-15-150x150.png 150w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-15-600x600.png 600w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-15-100x100.png 100w\" data-sizes=\"(max-width: 640px) 100vw, 640px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 640px; --smush-placeholder-aspect-ratio: 640\/640;\" \/><\/figure>\n\n\n\n<p class=\"\">This did a good job rendering the car, but still without much MGA influence.  They solved the unrealistic human issue by obscuring the driver.  But overall I like this one.  It reminds me of driving through the canyon on the way to the John Day river in Oregon.<br><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Microsoft Designer<\/h2>\n\n\n\n<p class=\"\"><a href=\"https:\/\/designer.microsoft.com\/image-creator\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">https:\/\/designer.microsoft.com\/image-creator<\/a><\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" width=\"1024\" height=\"585\" data-src=\"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-16-1024x585.png\" alt=\"\" class=\"wp-image-6411 lazyload\" data-srcset=\"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-16-1024x585.png 1024w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-16-300x171.png 300w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-16-768x439.png 768w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-16-1536x878.png 1536w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-16.png 1792w\" data-sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1024px; --smush-placeholder-aspect-ratio: 1024\/585;\" \/><\/figure>\n\n\n\n<p class=\"\">I think they hired the same model as Dall E3 :), but other than that this is not too bad.  The rendering is good, and there are elements from both the Mustang and MGA.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Deepimg.AI<\/h2>\n\n\n\n<p class=\"\"><a href=\"https:\/\/deepimg.ai\/ai-image-generator\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">https:\/\/deepimg.ai\/ai-image-generator\/<\/a><\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" width=\"1024\" height=\"701\" data-src=\"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-17-1024x701.png\" alt=\"\" class=\"wp-image-6412 lazyload\" data-srcset=\"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-17-1024x701.png 1024w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-17-300x205.png 300w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-17-768x525.png 768w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-17.png 1216w\" data-sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1024px; --smush-placeholder-aspect-ratio: 1024\/701;\" \/><\/figure>\n\n\n\n<p class=\"\">This is one of the best, in my opinion.  The rendering is very good, the driver is the most realistic.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">FlatAI<\/h2>\n\n\n\n<p class=\"\"><a href=\"https:\/\/flatai.org\/ai-image-generator-free-no-signup\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">https:\/\/flatai.org\/ai-image-generator-free-no-signup\/<\/a><\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" width=\"1024\" height=\"585\" data-src=\"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-18-1024x585.png\" alt=\"\" class=\"wp-image-6413 lazyload\" data-srcset=\"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-18-1024x585.png 1024w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-18-300x171.png 300w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-18-768x439.png 768w, https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-18.png 1344w\" data-sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 1024px; --smush-placeholder-aspect-ratio: 1024\/585;\" \/><\/figure>\n\n\n\n<p class=\"\">This is also very good.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Conclusions<\/h2>\n\n\n\n<p class=\"\">As this comparison demonstrates, AI image generation has reached an intriguing inflection point in early 2024. While each platform showed remarkable capabilities in certain areas &#8211; particularly in rendering vehicles, landscapes, and atmospheric lighting &#8211; they also revealed consistent limitations that highlight the technology&#8217;s current state of development. Deepimg.AI and FlatAI emerged as standout performers, producing the most cohesive and convincing results, especially in their handling of human subjects &#8211; traditionally one of AI&#8217;s greatest challenges.<\/p>\n\n\n\n<p class=\"\">The varying success in merging the distinctive characteristics of the 1967 Mustang and 1957 MGA Roadster serves as a microcosm of AI&#8217;s current capabilities. While some generators produced beautiful vehicles, they often favored one model&#8217;s features over the other, suggesting that AI still struggles with truly creative hybrid designs that require deep understanding of multiple reference points.<\/p>\n\n\n\n<p class=\"\">Perhaps most tellingly, the experiment revealed the importance of iterative refinement in AI image generation. As seen with DALL-E 3&#8217;s attempts at logo removal, current AI systems can be surprisingly rigid when asked to make selective modifications to their outputs. This highlights a key area for future development: the ability to maintain desired elements while precisely adjusting others.<\/p>\n\n\n\n<p class=\"\">For users seeking to leverage these tools effectively, this comparison suggests that success lies in choosing the right platform for specific needs and being strategic with prompts. While no single generator proved perfect across all criteria, each demonstrated unique strengths that could be valuable in different contexts. As these technologies continue to evolve, we can expect to see improvements in their ability to handle complex requests and generate more consistent, customizable results.<\/p>","protected":false},"excerpt":{"rendered":"<p>In early 2024, AI image generation has reached a fascinating paradox. At first glance, the technology seems almost magical &#8211; capable of creating stunning, photorealistic scenes that blend reality with imagination. Take, for instance, the task of generating a hybrid classic car design: when prompted to combine a &#8217;67 Mustang with a &#8217;57 MGA Roadster, &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/robertjwallace.com\/es\/ai-image-generation-still-a-ways-to-go\/\" class=\"more-link\">Continuar leyendo<span class=\"screen-reader-text\"> &#8220;AI image generation, still a ways to go.&#8221;<\/span><\/a><\/p>","protected":false},"author":1,"featured_media":6413,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"nf_dc_page":"","_eb_attr":"","footnotes":""},"categories":[143],"tags":[],"class_list":["post-6405","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-computer-stuff"],"featured_image_src":"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-18-600x400.png","featured_image_src_square":"https:\/\/robertjwallace.com\/wp-content\/uploads\/2025\/02\/image-18-600x600.png","author_info":{"display_name":"Bob","author_link":"https:\/\/robertjwallace.com\/es\/author\/admin\/"},"_links":{"self":[{"href":"https:\/\/robertjwallace.com\/es\/wp-json\/wp\/v2\/posts\/6405","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/robertjwallace.com\/es\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/robertjwallace.com\/es\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/robertjwallace.com\/es\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/robertjwallace.com\/es\/wp-json\/wp\/v2\/comments?post=6405"}],"version-history":[{"count":2,"href":"https:\/\/robertjwallace.com\/es\/wp-json\/wp\/v2\/posts\/6405\/revisions"}],"predecessor-version":[{"id":6416,"href":"https:\/\/robertjwallace.com\/es\/wp-json\/wp\/v2\/posts\/6405\/revisions\/6416"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/robertjwallace.com\/es\/wp-json\/wp\/v2\/media\/6413"}],"wp:attachment":[{"href":"https:\/\/robertjwallace.com\/es\/wp-json\/wp\/v2\/media?parent=6405"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/robertjwallace.com\/es\/wp-json\/wp\/v2\/categories?post=6405"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/robertjwallace.com\/es\/wp-json\/wp\/v2\/tags?post=6405"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}