Information
January 21, 2025: How it works. Our SWE-Bench-Verified agent uses: o1 with reasoning_mode high for all agent steps and editing logic. A GPT-4o based memory ...
Last Updated: 2025-04-16
Detailed Ratings