Democratizing Reinforcement Learning for LLMs
-
Updated
Jun 5, 2026 - Python
Democratizing Reinforcement Learning for LLMs
RepoMaster: The open-source AI agent that masters GitHub. It turns any code repository into a powerful tool, achieving a new level of autonomous task-solving. An open alternative to Claude-Code.
SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasoning paths via Revision, Recombination, and Refinement, expanding the search space and escaping local optima. On SWE-bench Verified, it achieves SOTA performance
Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with cost-aware α metric.
Container-free RL framework for training software engineering agents
Your AI-powered SWE teammate, built into your git workflow
Agentic Harness for the LLVM Compiler
An autonomous AI coding agent with novel innovations in tool state management and code editing, running in a Docker sandbox with a persistent shell, parallel tools, and a VNC desktop.
Autonomous Software Engineering Agents — self-healing, self-diagnosing development team powered by Claude Code and A2A protocol
[ACL 2026 Findings] SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context
cli-first harness agnostic agent orchestration tool
A curated list of training & evaluation environments for LLM/VLM agents (SWE-Gym, GEM, RAGEN, AgentGym, WebArena, OSWorld, ToolBench…). Updated weekly.
Reducing bottlenecks for Software Engineers while working with background agents on the go
Lawful Good Rust Agentic Operating System — production-grade agent harness with WASM sandboxing, heterogeneous inference, SurrealDB memory, sentinel guardrails, and 7-phase autonomous execution (Algorithm v6.3.0)
Drive Cursor and any ACP-compatible agent from your terminal — with persistent sessions and full automation support.