March 30, 2025

ikayaniaamirshahzad@gmail.com

So, I created a kind of new benchmark for reasoning. I guess there is not enough training data to overfit this, and it’s quite hard to do.

Leave a Comment