author
Jun 4, 2022 · edited Jun 4, 2022 · Author

All right. My proposed operationalization of this is that on June 1, 2025, if either of us can get access to the best image-generating model at that time (I get to decide which), or convince someone else who has access to help us, we'll give it the following prompts:

1. A stained glass picture of a woman in a library with a raven on her shoulder with a key in its mouth

2. An oil painting of a man in a factory looking at a cat wearing a top hat

3. A digital art picture of a child riding a llama with a bell on its tail through a desert

4. A 3D render of an astronaut in space holding a fox wearing lipstick

5. Pixel art of a farmer in a cathedral holding a red basketball

We generate 10 images for each prompt, just like DALL-E 2 does. If at least one of the ten images has the scene correct in every particular on at least 3 of the 5 prompts, I win; otherwise you do. Loser pays winner $100, and whatever the result is, I announce it on the blog (probably in an open thread). If we disagree, Gwern is the judge. If Gwern doesn't want to do it, Cassander is; if Cassander doesn't want to do it, we figure something else out. If we can't get access to the SOTA language model, we look at the rigged public demos and see if we can agree that one of us is obviously right, and if not, then no money changes hands.
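
For concreteness, the win condition above can be sketched in a few lines of Python. This is only an illustration: the function name `proposer_wins` and the nested-list layout of the judgments are assumptions made for the example, and the judging itself (whether an image is "correct in every particular") still has to be done by a person.

```python
from typing import Sequence

IMAGES_PER_PROMPT = 10  # images generated per prompt, as stated in the bet
PROMPTS_NEEDED = 3      # prompts that must succeed, out of the 5 listed


def proposer_wins(results: Sequence[Sequence[bool]]) -> bool:
    """Return True if the bet's proposer wins.

    results[i][j] is True when image j for prompt i was judged correct
    in every particular (hypothetical layout, filled in by a human judge).
    """
    # A prompt counts as passed if at least one of its images is fully correct.
    prompts_passed = sum(1 for images in results if any(images))
    return prompts_passed >= PROMPTS_NEEDED
```

Under this reading, a single fully correct image out of ten is enough for a prompt to count, and three such prompts settle the bet.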

Does that work for you?
