Articles

News · 1 min

Is safety ‘dead’ at xAI?

Elon Musk is “actively” working to make xAI’s Grok chatbot “more unhinged,” according to a former employee.

Anthony Ha
55 views
Academic · 1 min

BotzoneBench: Scalable LLM Evaluation via Graded AI Anchors

arXiv:2602.13214v1. Abstract: Large Language Models (LLMs) are increasingly deployed in interactive environments requiring strategic decision-making, yet systematic evaluation of these capabilities remains …

Lingfeng Li, Yunlong Lu, Yuefei Zhang, Jingyu Yao, Yixin Zhu, KeYuan Cheng, Yongyi Wang, Qirui Zheng, Xionghui Yang, Wenxin Li
9 views