Available for Contract & Consulting
AI Platform Engineering — Built, Not Theorised
25+ years building, securing, and scaling enterprise systems. Author of a 65,000+ line production-grade agentic platform with Temporal workflow orchestration, hybrid RAG, MCP tooling, and cryptographic audit trails. I bring the same engineering rigour to every engagement.
Background
I've scaled Fortune 500 network backbones, maintained 40Gbps global streaming infrastructure at 99.99% availability, delivered £325K annual AWS cost reductions in FCA-regulated fintech, and built 24x7 operations teams from the ground up.
I designed and built ACE — a 40+ microservice agentic collaboration platform running Temporal workflows, LangGraph orchestration, hybrid vector/BM25 retrieval, and a custom governance layer for AI agent operations. 65,000+ lines of production-grade async Python, not prototypes.
Services
AI Agent & LLM Integration
Production deployment of AI agents with Temporal workflow orchestration, MCP tool servers, and multi-provider LLM integration (Claude, GPT-4). HITL controls and governance boundaries built in from day one.
RAG & Knowledge Systems
Hybrid retrieval pipelines: pgvector HNSW + BM25 full-text with RRF fusion and cross-encoder reranking. Document ingestion via Celery task queues, embedding with sentence-transformers, and knowledge graphs (Apache AGE).
Platform Engineering
Design and build internal platforms. 40+ microservice architectures on Docker with NATS JetStream event bus, asyncpg connection pooling, Redis caching, and full observability (Prometheus, Grafana, Loki, OpenTelemetry).
Stabilisation & SRE
Reliability engineering for production systems. Circuit breakers, dead-letter queues, incident management, SLO frameworks, capacity planning, and on-call process design. £325K annual AWS cost reduction at YouLend.
Security & Audit
Cryptographic audit trails with SHA-256 hash chains and Merkle roots. mTLS with CA lifecycle management, Ed25519 signing, JWT with JTI revocation, and append-only ledgers enforced at the database layer.
Network & Infrastructure
Enterprise network architecture, cloud migration, hybrid environments. BGP, peering, transit, zero-trust, and 25+ years of deep Linux systems expertise from Fortune 500 backbone to fintech production.
What I’ve Built — ACE Platform
65,000+ lines of async Python across 40+ microservices. Not a weekend project — a production-shaped agentic platform with real security, real observability, and real governance.
Temporal Workflow Orchestration
Production Temporal deployments. Long-running workflows with deterministic event replay, failure recovery, and observable execution history. Not a cron job — a state machine.
Hybrid RAG Pipeline
pgvector HNSW + PostgreSQL BM25 full-text search, fused via Reciprocal Rank Fusion. Cross-encoder reranking (BAAI/bge-reranker-v2-m3). Freshness and authority boosting. Sub-second retrieval.
20+ MCP Tools
Full Model Context Protocol server with SSE transport. Tools for document search, SME index, AIOps knowledge base, agent control, and ledger queries. JSON-RPC 2.0 compliant.
Cryptographic Audit Ledger
Append-only PostgreSQL ledger with SHA-256 hash chains and Merkle roots. DB-level DELETE/TRUNCATE denial. Circuit breakers on NATS publish. Dead-letter queue with retry tracking.
Multi-Agent Orchestration
Team topologies (parallel, pipeline, coordinator-worker, mesh), loop execution with stuck detection, and multi-tier capability assessment.
LangGraph + Celery Pipelines
LangGraph StateGraph for document lifecycle orchestration. Celery distributed task queues for 5-stage ingestion with dead-letter handling and exponential backoff retry.
Technology — Verified in Production Code
AI & Agentic
Temporal Workflows
LangGraph
MCP Server (SSE)
Pydantic AI
Anthropic Claude API
OpenAI API
A2A Protocol
Multi-Agent Orchestration
HITL Controls
Retrieval & Knowledge
pgvector (HNSW)
sentence-transformers
Cross-Encoder Reranking
Hybrid RAG (Vector + BM25 + RRF)
Apache AGE (Knowledge Graph)
Celery Task Pipelines
MinIO / S3
Platform & Infrastructure
Python / FastAPI / asyncio
PostgreSQL / asyncpg
NATS JetStream
Redis
Docker (40+ services)
Alembic Migrations
SQLAlchemy 2.0 (async)
Pydantic v2 / Settings
Linux (RHEL/Ubuntu)
AWS
Azure
Kubernetes
Terraform
CI/CD
Observability & Security
Prometheus
Grafana
Loki
OpenTelemetry
structlog
Circuit Breakers
mTLS / PKI / CA
OAuth2 / OIDC
JWT (jose / JTI revocation)
Ed25519 Signing
SHA-256 Hash Chains
Vault / OpenBAO
Event Sourcing
AIOps Flywheels
SRE
BGP / Enterprise Networking
Engagement
How I Work
- Discovery call to understand your situation, constraints, and goals
- Scoped engagement: advisory, hands-on build, or embedded team augmentation
- Transparent delivery with clear milestones and knowledge transfer
- Available for short-term engagements or ongoing retainer
Based In
United Kingdom. Available for remote engagements globally and on-site within the UK.