Please Select Your Location
Australia
Österreich
België
Canada
Canada - Français
中国
Česká republika
Denmark
Deutschland
France
HongKong
Iceland
Ireland
Italia
日本
Korea
Latvija
Lietuva
Lëtzebuerg
Malta
المملكة العربية السعودية (Arabic)
Nederland
New Zealand
Norge
Polska
Portugal
Russia
Saudi Arabia
Southeast Asia
España
Suisse
Suomi
Sverige
台灣
Ukraine
United Kingdom
United States
Please Select Your Location
België
Česká republika
Denmark
Iceland
Ireland
Italia
Latvija
Lietuva
Lëtzebuerg
Malta
Nederland
Norge
Polska
Portugal
España
Suisse
Suomi
Sverige
<< Back to Blog

Goodbye DALL·E 3, ChatGPT Gets Image Creation Upgrade with GPT-4o

VIVE POST-WAVE Team • April 30, 2025

7-minute read

OpenAI has rolled out a major update for ChatGPT's image generation! Last night, while using ChatGPT, I noticed its image creation capabilities had improved significantly. In the middle of our conversation, it suddenly offered, "How about I draw that for you?" It felt like it was showing off a new feature, and the quality was impressive. This morning, I stumbled upon the news—OpenAI officially announced that GPT-4o is replacing DALL-E 3 as the built-in image generation model for ChatGPT, along with several practical upgrades. According to reports, this feature is available for ChatGPT's free, Plus, Team, and Pro versions, allowing all users to experience enhanced image creation capabilities. Let's unpack this exciting announcement!

GPT-4o Image


Why GPT-4o Replaces DALL-E 3 for Image Generation

Previously, ChatGPT's image generation relied heavily on DALL-E 3, an AI designed specifically for creating images. However, OpenAI has now decided to let GPT-4o take over this task. While DALL-E 3 was powerful, it was a standalone image generation system, lacking integration with ChatGPT's conversational abilities. Now, GPT-4o not only understands the context of conversations but can also generate images that better meet the needs communicated in the dialogue.

Moreover, this update enhances image details significantly. DALL-E 3 was often criticized for errors in hand and facial details, but GPT-4o has been fine-tuned through feedback from "human trainers." OpenAI recruited over a hundred human annotators to review AI-generated images, pointing out errors like unnatural finger arrangements, facial distortions, and subtle proportion issues. This "Reinforcement Learning from Human Feedback" (RLHF) technique makes AI-generated images more aligned with human aesthetics and intuition.


New Features in GPT-4o Image Generation: Beyond Transparent Backgrounds, It Understands Brand Guidelines

This update makes ChatGPT more like a real design assistant, not just a simple image generation tool. GPT-4o brings several core upgrades:

1. Transparent Background Images:

This is one of the most exciting features, allowing you to directly output images with no background (PNG files), perfect for designing logos, e-commerce images, and social media content.

2. Upload Brand Design Guidelines:

Users can upload their brand guidelines, allowing the AI to generate images that match the brand's tone, such as color and style guidelines. This ensures more consistency in the selection of image assets for businesses, aligning with their brand style guide.

3. Enhanced Image Editing:

GPT-4o's image editing capabilities are now more flexible than ever, overcoming many limitations of the DALL-E 3 version. It can now handle local modifications more precisely, such as changing backgrounds, adjusting lighting, enhancing details, and even fixing errors in the image without affecting other elements. More importantly, GPT-4o has a better understanding of the relationship between objects and environments, making the editing results more natural rather than just pixel replacements. Additionally, it supports more detailed brush control, allowing users to adjust image details more specifically.

However, AI image generation is still not perfect. In OpenAI's tests, when users uploaded a photo of a living room and asked the AI to rearrange the furniture, GPT-4o could change the scene's layout but might make mistakes like "missing a window." GPT-4o still has room for improvement in understanding spatial structures. Nevertheless, such detail errors are already less frequent than with DALL-E 3.


Creators and AI Training Data: Balancing Technological Development and Copyright Protection

Finally, we must address a sensitive topic: copyright and ethical issues, which are of utmost concern to creators. To balance this issue, OpenAI has introduced an "opt-out" mechanism in this update, allowing creators to choose not to have their works included in AI training data.

According to OpenAI, GPT-4o's image training data primarily comes from "publicly available data" and licensed materials from partners like Shutterstock. They also explained that webmasters can use the "robots.txt" setting to prevent AI from crawling images on their websites, ensuring content is not used for AI training. On the other hand, OpenAI emphasizes that they have implemented content protection mechanisms to prevent AI from generating images that "directly mimic" the style of specific artists. This means that even if the AI has learned certain artistic styles, it will avoid closely resembling a particular artist's specific brushstrokes and compositions when generating images. Hmm... but remember the "Snoopy incident"? It seems this regulation is still not perfect.


This upgrade takes ChatGPT's "creative assistant" function to the next level, making it more natural. Notably, the updates make AI image generation more aligned with commercial needs, and the "AI feel" is less pronounced. Overall, I'm optimistic about the improvement and can't wait to dive deeper into testing its capabilities.