"I built a simple React/Python app that takes screenshots of websites and converts them to clean HTML/Tailwind code.
It uses GPT-4 Vision to generate the code, and DALL-E 3 to create placeholder images.
To run it, all you need is an OpenAI key with GPT vision access.
I’m quite pleased with how well it works most of the time. Sometimes, the image generations can be hilariously off. See here for a replica of Taylor Swift’s Instagram page: https://streamable.com/70gow1 I initially had a hard time getting it to work on full page screenshots. GPT4 would code up the first couple of sections and then, get lazy and output placeholder comments for the rest of the page. With some prompt engineering, full page screenshots work a whole lot better now. It’s great for landing pages."