improve-codebase-architecture

Temukan peluang pendalaman dalam basis kode, berdasarkan bahasa domain di CONTEXT.md dan keputusan di docs/adr/. Gunakan ketika pengguna ingin meningkatkan arsitektur, menemukan peluang refactoring, mengonsolidasikan modul yang terikat erat, atau membuat basis kode lebih mudah diuji dan dinavigasi oleh AI.

npx skills add https://github.com/mattpocock/skills --skill improve-codebase-architecture

Improve Codebase Architecture

Surface architectural friction and propose deepening opportunities — refactors that turn shallow modules into deep ones. The aim is testability and AI-navigability.

This command is informed by the project's domain model and built on a shared design vocabulary:

  • Run the /codebase-design skill for the architecture vocabulary (module, interface, depth, seam, adapter, leverage, locality) and its principles (the deletion test, "the interface is the test surface", "one adapter = hypothetical seam, two = real"). Use these terms exactly in every suggestion — don't drift into "component," "service," "API," or "boundary."
  • The domain language in CONTEXT.md gives names to good seams; ADRs in docs/adr/ record decisions this command should not re-litigate.

Process

1. Explore

Read the project's domain glossary (CONTEXT.md) and any ADRs in the area you're touching first.

Then use the Agent tool with subagent_type=Explore to walk the codebase. Don't follow rigid heuristics — explore organically and note where you experience friction:

  • Where does understanding one concept require bouncing between many small modules?
  • Where are modules shallow — interface nearly as complex as the implementation?
  • Where have pure functions been extracted just for testability, but the real bugs hide in how they're called (no locality)?
  • Where do tightly-coupled modules leak across their seams?
  • Which parts of the codebase are untested, or hard to test through their current interface?

Apply the deletion test to anything you suspect is shallow: would deleting it concentrate complexity, or just move it? A "yes, concentrates" is the signal you want.

2. Present candidates as an HTML report

Write a self-contained HTML file to the OS temp directory so nothing lands in the repo. Resolve the temp dir from $TMPDIR, falling back to /tmp (or %TEMP% on Windows), and write to <tmpdir>/architecture-review-<timestamp>.html so each run gets a fresh file. Open it for the user — xdg-open <path> on Linux, open <path> on macOS, start <path> on Windows — and tell them the absolute path.

The report uses Tailwind via CDN for layout and styling, and Mermaid via CDN for diagrams where a graph/flow/sequence reliably communicates the structure. Mix Mermaid with hand-crafted CSS/SVG visuals — use Mermaid when relationships are graph-shaped (call graphs, dependencies, sequences), and hand-built divs/SVG when you want something more editorial (mass diagrams, cross-sections, collapse animations). Each candidate gets a before/after visualisation. Be visual.

For each candidate, render a card with:

  • Files — which files/modules are involved
  • Problem — why the current architecture is causing friction
  • Solution — plain English description of what would change
  • Benefits — explained in terms of locality and leverage, and how tests would improve
  • Before / After diagram — side-by-side, custom-drawn, illustrating the shallowness and the deepening
  • Recommendation strength — one of Strong, Worth exploring, Speculative, rendered as a badge

End the report with a Top recommendation section: which candidate you'd tackle first and why.

Use CONTEXT.md vocabulary for the domain, and the /codebase-design vocabulary for the architecture. If CONTEXT.md defines "Order," talk about "the Order intake module" — not "the FooBarHandler," and not "the Order service."

ADR conflicts: if a candidate contradicts an existing ADR, only surface it when the friction is real enough to warrant revisiting the ADR. Mark it clearly in the card (e.g. a warning callout: "contradicts ADR-0007 — but worth reopening because…"). Don't list every theoretical refactor an ADR forbids.

See HTML-REPORT.md for the full HTML scaffold, diagram patterns, and styling guidance.

Do NOT propose interfaces yet. After the file is written, ask the user: "Which of these would you like to explore?"

3. Grilling loop

Once the user picks a candidate, run the /grilling skill to walk the design tree with them — constraints, dependencies, the shape of the deepened module, what sits behind the seam, what tests survive.

Side effects happen inline as decisions crystallize — run the /domain-modeling skill to keep the domain model current as you go:

  • Naming a deepened module after a concept not in CONTEXT.md? Add the term to CONTEXT.md. Create the file lazily if it doesn't exist.
  • Sharpening a fuzzy term during the conversation? Update CONTEXT.md right there.
  • User rejects the candidate with a load-bearing reason? Offer an ADR, framed as: "Want me to record this as an ADR so future architecture reviews don't re-suggest it?" Only offer when the reason would actually be needed by a future explorer to avoid re-suggesting the same thing — skip ephemeral reasons ("not worth it right now") and self-evident ones.
  • Want to explore alternative interfaces for the deepened module? Run the /codebase-design skill and use its design-it-twice parallel sub-agent pattern.

Lebih banyak skill dari mattpocock

grill-me
mattpocock
Wawancarai pengguna secara terus-menerus tentang suatu rencana atau desain hingga mencapai pemahaman bersama, selesaikan setiap cabang pohon keputusan. Gunakan saat pengguna ingin menguji ketahanan rencana, diuji habis-habisan tentang desain mereka, atau menyebut "grill me".
researchcommunicationproject-management
grill-with-docs
mattpocock
Sesi memanggang yang menguji rencana Anda terhadap model domain yang ada, mempertajam terminologi, dan memperbarui dokumentasi (CONTEXT.md, ADR) secara langsung saat keputusan mengkristal. Gunakan saat pengguna ingin menguji ketahanan rencana terhadap bahasa proyek dan keputusan yang telah didokumentasikan.
developmentdocumentresearch
teach
mattpocock
Ajari pengguna keterampilan atau konsep baru, di dalam ruang kerja ini.
communicationproductivity
tdd
mattpocock
Pengembangan berbasis pengujian dengan siklus merah-hijau-refaktor. Gunakan ketika pengguna ingin membangun fitur atau memperbaiki bug menggunakan TDD, menyebutkan "merah-hijau-refaktor", menginginkan pengujian integrasi, atau meminta pengembangan berbasis pengujian terlebih dahulu.
developmenttesting
to-prd
mattpocock
Ubah konteks percakapan saat ini menjadi PRD dan publikasikan ke pelacak isu proyek. Gunakan saat pengguna ingin membuat PRD dari konteks saat ini.
developmentdocumentproject-management
handoff
mattpocock
Ringkas percakapan saat ini menjadi dokumen serah terima untuk diambil alih oleh agen lain.
communicationproject-managementdocument
diagnose
mattpocock
Putaran diagnosis terstruktur untuk bug sulit dan regresi performa. Reproduksi → minimalkan → hipotesis → instrumentasi → perbaiki → uji regresi. Gunakan saat pengguna mengatakan "diagnosis ini" / "debug ini", melaporkan bug, mengatakan ada yang rusak/melempar error/gagal, atau menjelaskan regresi performa.
developmenttestingcode-review
to-issues
mattpocock
Memecah rencana, spesifikasi, atau PRD menjadi isu-isu yang dapat diambil secara independen di pelacak isu proyek menggunakan irisan vertikal tracer-bullet. Gunakan saat pengguna ingin mengonversi rencana menjadi isu, membuat tiket implementasi, atau memecah pekerjaan menjadi isu-isu.
developmentproject-management