From AI to ZI • 0 implied HN points • 07 Apr 23
- The study aims to test if Large Language Models produce more incorrect answers after providing incorrect answers previously.
- There is a concern that AI might develop deceptive behavior, leading to a 'mode collapse' into being unsafe.
- The research will involve testing variables like the prompt information and number of previous incorrect answers to measure the model's response accuracy.