multi-agent-debate - In-Depth Technical Topic Analysis

aiming-lab / AutoResearchClaw

Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞

paper-generation scientific-discovery autonomous-research llm-agents multi-agent-debate citation-verification self-evolving openclaw metaclaw

Updated Jun 3, 2026
Python

deeplearning-wisc / debate-or-vote

Official code repo for NeurIPS 2025 Spotlight paper, "Debate or Vote: Which Yields Better Decisions in Multi-Agent LLMs?"

multi-agent neurips llm multi-agent-debate

Updated Oct 15, 2025
Python

Multi-Agent-LLMs / mallm

Framework: Multi-Agent LLMs For Conversational Task-Solving (MALLM)

multi-agent debate multi-agent-debate

Updated Apr 14, 2026
Python

focuslead / ai-council-framework

Research-backed methodology for multi-AI collaborative decision-making with structured debate, consensus synthesis, and bias reduction

multi-model bias-reduction prompt-engineering multi-agent-debate ai-council llm-consensus anti-sycophancy

Updated Feb 3, 2026

DA2I2-SLM / DAR

Source code for the paper: Hear Both Sides: Efficient Multi-Agent Debate via Diversity-Aware Message Retention

multi-agent-systems ai-research vllm llm-inference multi-agent-debate

Updated Apr 16, 2026
Python

tjhavranek / research-audit-duel-protocol

Human-in-the-loop adversarial workflows for high-stakes research audit: from ChatGPT-Gemini duels to 4-model MAD.

gemini meta-analysis grok claude peer-review large-language-models chatgpt multi-agent-debate adversarial-evaluation research-audit

Updated May 30, 2026

dayeonki / cultural_debate

Code for "Multiple LLM Agents Debate for Equitable Cultural Alignment" [ACL 2025 Oral]

multi-agent-systems large-language-models multi-agent-debate cultural-alignment

Updated Apr 4, 2026
Python

bssm-oss / CodeAgora

Code review, but with 5 models arguing first.

cli typescript static-analysis opencode openai developer-tools code-review code-quality sarif claude multi-provider github-actions llm ai-code-review muti-agent hallucination-detection multi-agent-debate mcp-server

Updated Jun 5, 2026
TypeScript

A brutally fault-tolerant Mixture-of-Agents (MoA) pipeline built in pure Python. Designed to orchestrate chaotic, round-robin LLM proxy endpoints through a rigorous 4-stage Agentic Workflow (Generate ➔ Cross-Critique ➔ Rebuttal ➔ Judge). Built to eradicate hallucination and guarantee absolute accuracy in complex, multi-step reasoning tasks.

asyncio chain-of-thought multi-agent-debate llm-orchestration mixture-of-agents cloudflare-ai-gateway

Updated Mar 10, 2026
Python

tjhavranek / mad-research

Three Claude Code skills for working with Codex CLI: codex-bridge (one-shot Codex calls), mad-build (Claude+Codex collaboration with cross-review), and mad-research (three-stream adversarial audit of papers, grants, reports with anonymized cross-critique and fresh-Codex synthesis).

codex claude peer-review ai-tools multi-agent-debate claude-code adversarial-evaluation research-audit

Updated Jun 3, 2026

vibhu1233 / autoresearch

Enable autonomous AI agents to optimize LLM training code through iterative experiments and improve models overnight without manual intervention

productivity ai gpu iteration artificial-general-intelligence multi-agent-systems multi-agent-system paper-generation scientific-discovery ai-agent multi-agent-debate self-evolving deepresearch kernel-optimization local-ai-agents openclaw experiment-execution

Updated Jun 5, 2026

sat048 / llm_multi_agent

Research paper on how agentic debate pipelines can be constructed to reduce hallucinations in LLMs with open-source and commercial models

multi-agent-systems large-language-models multi-agent-debate llm-hallucination agentic-pipelines

Updated Dec 9, 2025
HTML

thada2402 / AutoResearchClaw

Generate research papers autonomously by chatting with OpenClaw, using Python 3.11+, with a self-evolving framework and extensive test coverage.

latex reproducible-research open-science scientific-workflows codex academic-writing ai-research paper-generation autonomous-research llm-agents multi-agent-debate self-evolving agent-skills claude-code openclaw autoresearch nightly-mvp researchclaw paper-pipeline

Updated Jun 5, 2026
Python

p3nchan / workspace-redesign

AI Agent Workspace Redesign: A structured multi-agent debate methodology for managing AI agent workspaces (memory, file organization, protection tiers, boot sequences)

architecture workspace methodology ai-agents multi-agent-debate

Updated Mar 6, 2026

ramtinz / multi-agent-debate-protocols

Supporting code for the study on multi-agent debate protocols

diversity machine-learning ai algorithms simulation consensus debate multi-agent-systems ai-agents streamlit llm ollama multi-agent-debate

Updated Mar 27, 2026
Python

Santhoshi-Ravi / minerva

NeurIPS paper code - Evaluating and enhancing Large Language Models (LLMs) using mathematical datasets through innovative Multi-Agent Debate Architecture, without traditional fine-tuning or Retrieval-Augmented Generation techniques. This project explores advanced strategies to boost LLM capabilities in mathematical reasoning.

llm llms-benchmarking multi-agent-debate

Updated Dec 6, 2024
Jupyter Notebook

JZtt-kyle / making-debate

Multi-LLM debate orchestrator that drives ChatGPT, Claude, and DeepSeek web UIs (no API keys) through a 5-phase loop: propose → critique → revise → synthesize → ratify-or-veto. Editorial dark UI.

react typescript browser-automation ai-agents claude chrome-devtools-protocol playwright editorial-design chatgpt multi-llm deepseek multi-agent-debate llm-council

Updated May 12, 2026
TypeScript

tommilifeless973 / autoresearch-builder

Build autonomous experiment loops that edit files, run tests, and keep only improvements for any project type

react productivity cms builder angular ai skill iteration wysiwyg landing-pages claude paper-generation scientific-discovery autonomous-agent multi-agent-debate claude-code self-improving-systems karpathy-inspired openclaw

Updated Jun 5, 2026
Shell

doresternutatory21 / ex_autoresearch

Build autonomous ML research in Elixir: design, train, and iterate GPT models across GPUs with fault-tolerant BEAM concurrency

infrastructure productivity ai lua management cheatsheet corona coronasdk devops-tools karpathy claude scientific-discovery autonomous-research autonomous-agent googlers multi-agent-debate citation-verification self-evolving claude-code

Updated Jun 5, 2026

hillolkallol / twelve-angry-agents

Run your decisions through a jury of 12 AI minds before you commit.

python agent edge-ai llm offline-llm langgraph multi-agent-debate agentic-ai edge-llm gemma4

Updated Apr 15, 2026
Python

Here are 24 public repositories matching this topic...