ML heavy, coding realistic examples with 1-1 online meeting. So far I like the experience and culture here, no arguements. However, there are certain parts that could be improve. The goal is to create a chess-playing language model with under 1 million parameters, which is roughly the number of neurons in a honey bee's brain. At this scale, efficiency and clever architecture choices are key! We are not targetting superhuman performance, but rather exploring how well small models can learn the rules of chess, the goal being (only) to play legal moves.