Other than Qwen3 series I would like to see some of the following https://github.com/WildEval/ZeroEval/issues/23 - Kimi K2 and Kimi-Dev - Recent update to DeepSeek v3 and DeepSeek R1 - Llama4 series of models - Phi-4 - Some of the other reinforcement models similar to Sky-T1