March 28, 2025
Reverse engineering GPT-4o image gen via Network tab – here’s what I found
I'm very intrigued by this new model. I've been working in the image generation space a lot, and I want to understand what's going on.

I found some interesting details when I opened the network tab to see what the BE was sending. I tried a few different prompts; let's take this one as a starter:

"An image of happy dog running on the street, studio ghibli style"

Here I got four intermediate images, as follows:

https://preview.redd.it/ew32o34z4ere1.png?width=2048&format=png&auto=webp&s=0ea0551b7cb262d4b167911201011e43fa9d5fe6

We can see:

- The BE is actually returning the image as we see it in the UI
- It's not