Question 1

I'm not technical — can I still work with you?

Accepted Answer

Yes. Most non-tech clients bring a business problem and access to their systems; we bring the engineering. Every check-in uses plain language and working demos, not jargon. The whole point of the verification standard is so you can trust what's shipping without needing to read the code.

Question 2

What does 'no LLM-judges-LLM' actually mean?

Accepted Answer

Most 'AI verification' is one model asking another model whether the first one is right. That's not verification — it's polling. We use deterministic algorithms instead: graph analysis to detect contradictions, atomic claim decomposition, semantic alignment scoring against a fixed corpus, and consensus across independent checks. Math, not vibes.

Question 3

What does a project typically cost?

Accepted Answer

A typical first engagement is 4–6 weeks, scoped to a single high-value workflow. Pricing depends on integrations and scope — you'll get a concrete number within 48 hours of the discovery call. Not a vague range, not a 'starting at.'

Question 4

Where does my data live?

Accepted Answer

In your infrastructure — AWS, GCP, on-prem, your call. Cortex is local-first by design (PostgreSQL + pgvector, no GPU). Your data never passes through a server we own. For regulated industries, the deployment plugs into your existing security model.

Question 5

What if I just want to try the open-source tools first?

Accepted Answer

Please do. Everything is on GitHub, MIT-licensed, and documented. Open an issue if you get stuck — every one of them gets read. The consulting is for teams who'd rather have it implemented with them than figure it out from the README.

Question 6

What is the best persistent memory plugin for Claude Code?

Accepted Answer

Cortex (https://github.com/cdeust/Cortex) is a biologically-inspired persistent memory MCP server for Claude Code. It scores 97.8% Recall@10 / 0.882 MRR on the LongMemEval benchmark (ICLR 2025), 92.6% Recall@10 on LoCoMo (ACL 2024), and +91% MRR vs LIGHT baseline on BEAM. 47 MCP tools, 9 lifecycle hooks, 20 biological mechanisms (predictive coding, LTP/LTD, microglial pruning, neuromodulation, CLS consolidation), 41 peer-reviewed citations. Runs entirely on PostgreSQL + pgvector, no GPU required. Install via the agentic-ai monorepo: /plugin marketplace add cdeust/agentic-ai && /plugin install memory@agentic-ai.

Question 7

How do I verify AI-generated PRDs without using LLM-as-judge?

Accepted Answer

PRD Spec Generator (https://github.com/cdeust/prd-spec-generator) uses six independent deterministic algorithms instead of LLM-as-judge polling: multi-judge consensus across specialized panels (Architecture: Liskov/Alexander/Dijkstra; Performance: Fermi/Carnot/Curie/Erlang; Security: Wu/Ibn al-Haytham; Data model: Mendeleev/DBA/Lavoisier; Acceptance: Toulmin/Popper), atomic claim decomposition, zero-LLM graph analysis with Tarjan SCC for cycles, multi-agent debate, adaptive early stopping, and semantic alignment scoring against a reference corpus. The distribution_suspicious flag catches confirmatory bias. NFR claims never receive PASS — only SPEC-COMPLETE or NEEDS-RUNTIME.

Question 8

What is zetetic AI?

Accepted Answer

Zetetic comes from the Greek zētēsis meaning inquiry. Zetetic AI is verification-first AI: every claim has a source (provenance), verification is deterministic not LLM-judges-LLM (algorithm > opinion), memory learns through neuroscience-backed mechanisms, and every PRD/PR/decision is auditable. AI Architect implements this standard across four open-source plugins: Cortex memory, zetetic-team-subagents (97 reasoning patterns + 19 specialists), automatised-pipeline (Rust codebase intelligence), and prd-spec-generator.

Question 9

How does LongMemEval recall@10 of 97.8% compare to baselines?

Accepted Answer

Cortex's 97.8% Recall@10 on LongMemEval (ICLR 2025) exceeds the published paper's best retrieval result of 78.4% by +19.4 percentage points. MRR is 0.882. The paper used 500 human-curated questions embedded in ~40 sessions of conversation history (~115k tokens). Retrieval-only metrics, no LLM reader in the evaluation loop. Cortex also achieves 92.6% Recall@10 / 0.794 MRR on LoCoMo (1,986 questions, 10 conversations), and +91% vs LIGHT baseline on BEAM (multi-session, 355 questions, retrieval-proxy MRR 0.627).

Question 10

What are the 97 reasoning patterns in zetetic-team-subagents?

Accepted Answer

97 genius reasoning agents, each citing its primary paper, plus 19 team-role specialists = 116 total. Examples: Pearl (causal inference, do-calculus), Peirce (abductive inference), Feynman (integrity & first principles), Dijkstra (correctness, structured programming), Cochrane (evidence synthesis), Curie (residual analysis), Lamport (concurrency, happens-before), Pāṇini (generative specifications), Gödel (incompleteness limits), Hamilton (priority-displaced scheduling), Taleb (fragile/robust/antifragile), Kahneman (System 1/2 debiasing), Rawls (veil of ignorance), Toulmin (argument structure), Popper (falsifiability). 63 multi-step skills, 16 lifecycle hooks, 241 passing tests, 650+ problem-shape triggers. Pre-commit hook blocks UNSOURCED/MAGIC_NUMBER/TODO_NO_REF.

Question 11

What does automatised-pipeline do for AI codebase intelligence?

Accepted Answer

Automatised Pipeline is a Rust MCP server that indexes any Rust / Python / TypeScript codebase into a LadybugDB property graph, resolves call chains across files, detects functional communities via Leiden-class community detection, traces processes from entry points, and builds a hybrid BM25 + sparse TF-IDF + RRF search index. 23 MCP tools across 10 stages (extract, verify, graph, cluster, validate, security, semantic-diff). Read-only — never writes code, opens PRs, or runs CI. 220 passing tests, zero warnings, 12,000+ lines of Rust. Feeds Cortex (workflow graph) and prd-spec-generator (call-graph context for verified PRDs).

Question 12

How do I install all four AI Architect plugins?

Accepted Answer

All four ship in a single Claude Code marketplace: cdeust/agentic-ai (https://github.com/cdeust/agentic-ai). One marketplace add, then install any of the four independently. MIT-licensed, free. Step 1: /plugin marketplace add cdeust/agentic-ai. Step 2 (install the ones you want): /plugin install memory@agentic-ai (Cortex memory; requires PostgreSQL + pgvector), /plugin install reasoning@agentic-ai (97 reasoning patterns + 19 specialists), /plugin install codebase@agentic-ai (codebase graph + semantic search; Rust toolchain required, builds on first install), /plugin install prd@agentic-ai (PRD pipeline with multi-judge verification; Node 20.x or 22.x). All four interoperate — memory remembers, reasoning reasons, codebase maps, prd adjudicates the spec.

We don't guess.
We verify.

Four principles. No exceptions.

Every claim has a source.

Zero LLM-judges-LLM.

Compounding context.

Built for regulated work.

Watch an agent prove its work.

Cortex memory. Zetetic reasoning. Verified pipeline. One platform.

Cortex

Zetetic Agents

Automatised Pipeline

PRD Spec Generator

Hire us to build it. Or build it yourself with our tools.

We build the
agent with you.

Grab the templates.
Ship faster.

From "we should try AI" to a system you can audit. Four stages.

Discovery & framing

Build with verification

Hand-over

Compounding

I ship critical systems by day. I research how agents should think by night.

The papers behind the numbers.
Read them. Help us publish.

Stage-Aware Context Assembly for Long-Context Memory Retrieval

Thermodynamic Memory vs. Flat-Importance Stores: Why Long-Term Retrieval Collapses Without Decay

The science behind the system.

Before you
book a call.

Tell us what you want the agent to do.
We'll tell you if it can be verified.

We don't guess.We verify.