‘Brutal’ math test raises the bar for AI


Peer Reviewed

Abstract

Model-stumping benchmark shows human experts remain on top—for now