minimal-run-and-audit

Rigor Run skill for README-first deep learning repo reproduction. Use when the task is specifically to capture or normalize evidence from the selected smoke test or documented inference or evaluation command and write standardized `repro_outputs/` files, including patch notes when repository files changed. Do not use for training execution, initial repo intake, generic environment setup, paper lookup, target selection, hidden scientific-meaning changes, or end-to-end orchestration by itself.

npx skills add https://github.com/lllllllama/rigorpilot-skills --skill minimal-run-and-audit

Use this as the Rigor Run skill. The installed slug remains minimal-run-and-audit for compatibility.

Use the shared operating principles in ../../references/agent-operating-principles.md; this skill should make run evidence auditable without turning every command into a rigid protocol.

When to apply

After a reproduction target and setup plan exist.
When the main skill needs execution evidence and normalized outputs.
When a smoke test, documented inference run, documented evaluation run, or other short non-training verification is appropriate.
When the user already knows what command should be attempted and wants execution plus reporting only.

When not to apply

During initial repo scanning.
When the environment or assets are still too undefined for execution to be meaningful.
When the task is a literature lookup rather than repository execution.
When the user is still deciding which reproduction target should count as the main run.

Clear boundaries

This skill owns normalized reporting for an attempted command.
It may receive execution evidence from the main skill or a thin helper.
It does not choose the overall target on its own.
It does not perform broad paper analysis.
It does not own training startup, resume, or long-running training state.
It should not normalize risky code edits into acceptable practice.
It must not hide changes that alter evaluation, preprocessing, checkpoints, metrics, or other scientific meaning.
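
One way to keep the last boundary visible is to flag changed files whose paths suggest they touch scientific meaning. The sketch below is only a path-keyword heuristic: the function name, the keyword list, and the example paths are all hypothetical, and a match means "surface this for review", not "this change is wrong".

```python
# Hypothetical keyword list derived from the categories named above.
SENSITIVE_KEYWORDS = ("eval", "preprocess", "checkpoint", "metric")


def flag_scientific_changes(changed_paths):
    """Map each changed path to the sensitive keywords found in it.

    Paths with no keyword hit are left out of the result. This is a
    coarse filter for reporting, not a substitute for reading the diff.
    """
    flagged = {}
    for path in changed_paths:
        lowered = path.lower()
        hits = [kw for kw in SENSITIVE_KEYWORDS if kw in lowered]
        if hits:
            flagged[path] = hits
    return flagged
```

Anything returned by such a check would belong in the changelog and patch notes rather than being silently accepted.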

Input expectations

selected reproduction goal
runnable commands or smoke commands
environment and asset assumptions
optional patch metadata
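
The four inputs above could be carried in a small structure like the following. This is a minimal sketch: `RunRequest`, its field names, and the `problems` check are assumptions made for illustration and are not part of the skill's scripts.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class RunRequest:
    """Hypothetical container for the inputs listed above."""

    goal: str                                   # selected reproduction goal
    commands: list[str]                         # runnable or smoke commands
    assumptions: dict[str, str] = field(default_factory=dict)  # environment and asset assumptions
    patch_metadata: Optional[dict] = None       # optional patch metadata

    def problems(self) -> list[str]:
        """Return reasons the request is not ready to run (empty if none)."""
        issues = []
        if not self.goal.strip():
            issues.append("no reproduction goal selected")
        if not any(c.strip() for c in self.commands):
            issues.append("no runnable command provided")
        return issues
```

A request with a non-empty `problems()` list corresponds to the "When not to apply" cases: the target or command is not settled yet, so there is nothing to execute or report.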

Output expectations

execution result summary
standardized repro_outputs/ files
SCIENTIFIC_CHANGELOG.md for changed scientific meaning and evidence status
COMPARABILITY_REPORT.md for README/paper/baseline comparability
clear distinction between verified, partial, and blocked states
PATCHES.md when repo files changed
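
As a rough illustration of these outputs, the sketch below runs one command, records the evidence, and assigns one of the three states. Several things here are assumptions, not the behavior of scripts/run_command.py or scripts/write_outputs.py: the function name, the `run_record.json` file, the placement of PATCHES.md under `repro_outputs/`, and the state mapping (exit code 0 is treated as verified, a non-zero exit or timeout as partial, a command that cannot start as blocked). A real run would also need to check that the expected outputs appeared before calling anything verified.

```python
import json
import subprocess
import sys
from pathlib import Path


def run_and_record(command, out_dir, repo_files_changed=False, timeout=600.0):
    """Run `command`, write a JSON evidence record, and return it as a dict."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    record = {"command": list(command)}
    try:
        proc = subprocess.run(command, capture_output=True, text=True, timeout=timeout)
        record["returncode"] = proc.returncode
        record["stdout_tail"] = proc.stdout[-2000:]
        record["stderr_tail"] = proc.stderr[-2000:]
        # Assumed mapping: clean exit -> verified, anything else -> partial.
        record["status"] = "verified" if proc.returncode == 0 else "partial"
    except subprocess.TimeoutExpired:
        record["status"] = "partial"   # started but did not finish in time
        record["error"] = "timeout"
    except OSError as exc:
        record["status"] = "blocked"   # the command could not be started
        record["error"] = repr(exc)

    # Hypothetical file name for the raw evidence record.
    (out_dir / "run_record.json").write_text(json.dumps(record, indent=2))

    # Patch notes are only created when repository files changed.
    patches = out_dir / "PATCHES.md"
    if repo_files_changed and not patches.exists():
        patches.write_text("PATCHES\n\nList each changed file and the reason here.\n")
    return record
```

For example, `run_and_record([sys.executable, "-c", "print('ok')"], "repro_outputs")` would record a verified state under these assumptions and leave PATCHES.md unwritten.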

Notes

Use references/reporting-policy.md, ../../references/research-rigor-principles.md, scripts/run_command.py, and scripts/write_outputs.py.
