DYNOMIGHT INTERNET NEWSLETTER • 1515 implied HN points • 14 Nov 24
- Large language models (LLMs) can play chess to a limited degree, but they struggle after the opening moves. They were not specifically designed for chess, yet they manage to play using only what they learned from text training.
- The performance of different language models varies significantly when playing chess. Some models like 'gpt-3.5-turbo-instruct' excel at it, while others perform very poorly.
- Instruction tuning seems to make LLMs worse at chess, suggesting that training style affects their ability to play games effectively.