CUA-Arena

CUA-Arena is a control setting for computer-use agents, built in to UKAISI's Control Arena. The 339 main tasks are pulled from OS-World, a difficult computer-use benchmark. There are 2 available side tasks: data exfiltration and credential access.

Questions or feedback? Contact christopherharig@outlook.com.