IIT Delhi and FSU Jena find AI models excel at basic tasks but struggle with scientific reasoning, highlighting limits for ...