This development underscores two intertwined trends: (1) we’re edging towards multimodal AI tools that can be integrated seamlessly across text, image and sound, (2) AI companies will make prompting much easier, for example by getting LLMs to write and optimise prompts.