tachibot-mcp

Stop AI Hallucinations Before They Start Run models from OpenAI, Google, Anthropic, xAI, Perplexity, and OpenRouter in parallel. They check each other's work, debate solutions, and catch errors before you see them.

GitHub

TachiBot MCP

Multi-Model AI Orchestration Platform

51 AI tools. 7 providers. One protocol.

Orchestrate Perplexity, Grok, GPT-5, Gemini, Qwen, Kimi K2.5, and MiniMax M2.5 from Claude Code, Claude Desktop, Cursor, or any MCP client.

Get Started · View Tools · Documentation

If TachiBot helps your workflow, a star goes a long way.

What's New in v2.15.0

`/blueprint` Skill — Multi-Model Implementation Planning

New skill that creates bite-sized TDD implementation plans using a 7-step multi-model council:

/blueprint add OAuth with refresh tokens

Pipeline: Grok search → Qwen+Kimi analysis → Kimi decompose → GPT pre-mortem critique → Gemini final judgment → bite-sized TDD output (exact files, test-first steps, commit points).

Bridges planner_maker's multi-model intelligence with the writing-plans execution format.

31 Prompt Engineering Techniques (was 22)

Added 9 research-backed techniques for coding and decision-making:

Technique	Source	Category
`reflexion`	Shinn et al. 2023	Engineering
`react` (ReAct)	Yao et al. 2022	Engineering
`rubber_duck`	Hunt & Thomas 2008	Engineering
`test_driven`	Beck 2003	Engineering
`scot` (Structured CoT)	Li et al. 2025 (+13.79% HumanEval)	Structured Coding
`pre_post` (Contracts)	Empirical SE 2025	Structured Coding
`bdd_spec` (Given/When/Then)	BDD 2025	Structured Coding
`least_to_most`	Zhou et al. 2022	Research
`pre_mortem`	Klein 2007	Decision

Techniques are embedded directly in tool system prompts for automatic application.

MiniMax M2.5 Upgrade

minimax_code — SWE-Bench 80.2%, per-task TECHNIQUE tags (SCoT, reflexion, rubber_duck), per-task temperatures
minimax_agent — ReAct + least-to-most decomposition protocol, HALT criteria

Enhanced Skills

/breakdown — now uses least_to_most ordering + pre_mortem failure analysis
/judge — first judge now runs pre-mortem ("assume this FAILED")
/decompose — deep-dives include pre/post contracts per sub-problem
/prompt — auto-recommend flow with 30-intent matching guide, 13 categories

Skills (Claude Code)

TachiBot ships with 9 slash commands for Claude Code. These orchestrate the tools into powerful workflows:

Skill	What it does	Example
`/blueprint`	Multi-model planning → bite-sized TDD steps	`/blueprint add OAuth with refresh tokens`
`/judge`	Multi-model council - parallel analysis with synthesis	`/judge how to implement rate limiting`
`/think`	Sequential reasoning chain with any model	`/think grok,gemini design a cache layer`
`/focus`	Mode-based reasoning (debate, research, analyze)	`/focus architecture-debate Redis vs Pg`
`/breakdown`	Strategic decomposition with pre-mortem	`/breakdown refactor payment module`
`/decompose`	Split into sub-problems, deep-dive each one	`/decompose implement collaborative editor`
`/prompt`	Recommend the right thinking technique (31 available)	`/prompt why do users churn`
`/algo`	Algorithm analysis with 3 specialized models	`/algo optimize LRU cache O(1)`
`/tachi`	Help - see available skills, tools, key status	`/tachi`

Skills automatically adapt to your configured API keys. Even with just 1-2 providers, all skills work.

Getting started? Type /tachi to see what's available.

Key Features

Multi-Model Intelligence

51 AI Tools across 7 providers — Perplexity, Grok, GPT-5, Gemini, Qwen, Kimi, MiniMax
Multi-Model Council — planner_maker synthesizes plans from 5+ models into bite-sized TDD steps
Smart Routing — Automatic model selection for optimal results
OpenRouter Gateway — Optional single API key for all providers

Advanced Workflows

YAML-Based Workflows — Multi-step AI processes with dependency graphs
Prompt Engineering — 31 research-backed techniques (including SCoT, ReAct, Reflexion)
Verification Checkpoints — 50% / 80% / 100% with automated quality scoring
Parallel Execution — Run multiple models simultaneously

Tool Profiles

Profile	Tools	Best For
Minimal	12	Quick tasks, low token budget
Research Power	31	Deep investigation, multi-source
Code Focus	29	Software development, SWE tasks
Balanced	39	General-purpose, mixed workflows
Heavy Coding (default)	45	Max code tools + agentic workflows
Full	51	Everything enabled

Developer Experience

Claude Code — First-class support
Claude Desktop — Full integration
Cursor — Works seamlessly
TypeScript — Fully typed, extensible

Quick Start

Installation

npm install -g tachibot-mcp

Setup

Gateway Mode (Recommended) — 2 keys, all providers:

{
  "mcpServers": {
    "tachibot": {
      "command": "tachibot",
      "env": {
        "OPENROUTER_API_KEY": "sk-or-xxx",
        "PERPLEXITY_API_KEY": "pplx-xxx",
        "USE_OPENROUTER_GATEWAY": "true"
      }
    }
  }
}

Direct Mode — One key per provider:

{
  "mcpServers": {
    "tachibot": {
      "command": "tachibot",
      "env": {
        "PERPLEXITY_API_KEY": "your-key",
        "GROK_API_KEY": "your-key",
        "OPENAI_API_KEY": "your-key",
        "GOOGLE_API_KEY": "your-key",
        "OPENROUTER_API_KEY": "your-key"
      }
    }
  }
}

Get keys: OpenRouter | Perplexity

See Installation Guide for detailed instructions.

Tool Ecosystem (51 Tools)

Research & Search (6)

perplexity_ask · perplexity_research · perplexity_reason · grok_search · openai_search · gemini_search

Reasoning & Planning (9)

grok_reason · openai_reason · qwen_reason · qwq_reason · kimi_thinking · kimi_decompose · planner_maker · planner_runner · list_plans

Code Intelligence (8)

kimi_code · grok_code · grok_debug · qwen_coder · qwen_algo · qwen_competitive · minimax_code · minimax_agent

Analysis & Judgment (11)

gemini_analyze_text · gemini_analyze_code · gemini_judge · jury · gemini_brainstorm · openai_brainstorm · openai_code_review · openai_explain · grok_brainstorm · grok_architect · kimi_long_context

Meta & Orchestration (5)

think · nextThought · focus · tachi · usage_stats

Workflows (9)

workflow · workflow_start · continue_workflow · list_workflows · create_workflow · visualize_workflow · workflow_status · validate_workflow · validate_workflow_file

Prompt Engineering (3)

list_prompt_techniques · preview_prompt_technique · execute_prompt_technique

Advanced Modes (bonus)

Challenger — Critical analysis with multi-model fact-checking
Verifier — Multi-model consensus verification
Scout — Hybrid intelligence gathering

Example Usage

Multi-Model Planning

// Create a plan with multi-model council
planner_maker({ task: "Build a REST API with auth and tests", mode: "start" })
// → Grok searches → Qwen analyzes → Kimi decomposes → GPT critiques → Gemini synthesizes

// Execute with checkpoints
planner_runner({ plan: planContent, mode: "step", stepNum: 1 })
// → Automatic verification at 50%, 80% (kimi_decompose), and 100%

Task Decomposition

kimi_decompose({
  task: "Migrate monolith to microservices",
  depth: 3,
  outputFormat: "dependencies"
})
// → Structured subtasks with IDs, parallel flags, acceptance criteria

Code Review

kimi_code({
  task: "review",
  code: "function processPayment(amount, card) { ... }",
  language: "typescript"
})
// → SWE-Bench 76.8% quality analysis

Deep Reasoning

focus({
  query: "Design a scalable event-driven architecture",
  mode: "deep-reasoning",
  models: ["grok", "gemini", "kimi"],
  rounds: 5
})

Documentation

Setup Guides

Contributing

Contributions welcome! See CONTRIBUTING.md for guidelines.

Like what you see?

Star on GitHub — it helps more than you think.

Website · Docs · npm · Issues

AGPL-3.0 — see LICENSE for details.

Made with care by @byPawel

Multi-model AI orchestration, unified.

Related Servers

Scout Monitoring MCP

sponsor

Put performance and error data directly in the hands of your AI assistant.

Alpha Vantage MCP Server

sponsor

Access financial market data: realtime & historical stock, ETF, options, forex, crypto, commodities, fundamentals, technical indicators, & more

MockMCP

Create mock MCP servers instantly for developing and testing agentic AI workflows.

Docker

Run and manage docker containers, docker compose, and logs

Loop MCP Server

Enables LLMs to process array items sequentially with a specific task.

Codelogic

Utilize Codelogic's rich software dependency data in your AI programming assistant.

RefactorMCP

Automated refactoring tools for C# code transformation using Roslyn.

CRAN Package README MCP Server

Fetch comprehensive information about CRAN packages, including READMEs, metadata, and search functionality.

Claude Memory MCP Server

A persistent memory server for Large Language Models, designed to integrate with the Claude desktop application. It supports tiered memory, semantic search, and automatic memory management.

MCPControl

Programmatically control Windows mouse, keyboard, window management, screen capture, and clipboard operations.

Lanhu MCP

⚡ Boost Requirement Analysis Efficiency by 200%! The World's First Team Collaboration MCP Server Designed for the AI Coding Era. Automatically analyzes requirements, generates full-stack code, and downloads design assets.

PostHog MCP

Integrates with PostHog for feature flag management and error tracking.