Marcus on AI • 3952 implied HN points • 08 Dec 24
- Generative AI struggles to understand complex relationships between objects in images. When asked to create images from text, it sometimes produces physically impossible results or gets details wrong.
- Recent AI models, like DALL-E 3, show only slight progress in handling specifications about the parts of objects. They can still mislabel parts or fail to follow more complex requests.
- AI systems need to get better at checking that generated images match the prompts users give. This may require new technologies that connect language and visuals more reliably.