Generalization or Memorization? Brittleness Testing for Chess-Trained Language Models Paper • 2605.17565 • Published 11 days ago