Thank you for uploading your training script. Just one question: I am new to verl, so I could be wrong, but as far as I know an AgentLoop returns AgentLoopOutput. However, the process_item of the fold agent returns DataProto. Could there be a mismatch here?