HubSecure is a multi-tenant FinTech compliance platform I co-built for BCV Group. The hardest problem wasn't the blockchain KYC or the multi-tenant architecture — it was making AML screening fast enough to be useful.

The manual process

Before AI: a compliance officer receives a new customer application. They manually search the customer's name (and name variations) against sanctions lists, PEP databases, and adverse media. They cross-reference addresses, dates of birth, and known associates.

Average time: 4 hours per customer. For a platform processing hundreds of applications per month, that's 5+ full-time compliance officers doing nothing but name-matching.

What we built

An AI agent pipeline that:

  1. Ingests the application — name, DOB, nationality, address, known associates
  2. Generates name variations — transliterations, common misspellings, alias patterns (this is where LLMs shine — they understand that "Mohammed" has 30+ valid Latin spellings)
  3. Screens against 1.7M+ records — OFAC SDN, EU sanctions, UN consolidated list, PEP databases, adverse media
  4. Scores risk — weighted scoring based on match confidence, record severity, and jurisdiction
  5. Generates a structured report — the compliance officer gets a pre-filled assessment with evidence links, not a raw data dump

Architecture decisions

Why not traditional fuzzy matching? We tried it first. Levenshtein distance and Soundex catch simple typos but miss transliteration variants entirely. "Abdulrahman" vs "Abd al-Rahman" vs "Abdul Rahman" — these are the same person, and traditional fuzzy matching scores them as different names.

An LLM with the right prompt and a few-shot examples handles this effortlessly. It understands naming conventions across Arabic, Cyrillic, Chinese, and Thai transliteration systems.

Why agents, not a single LLM call? The screening has distinct phases that benefit from different models and retry strategies. Name variation generation uses a fast model (Haiku). Sanctions matching is deterministic (database lookup). Risk scoring uses a reasoning model (Sonnet). Each step can fail and retry independently.

Why not a third-party screening API? Cost and control. Enterprise AML screening APIs charge $2-5 per check. At volume, that's significant. More importantly, the client needs to customize scoring weights per jurisdiction and add custom screening rules. A third-party API is a black box.

Results

  • Screening time: 4 hours → 45 seconds
  • False positive rate: Lower than manual screening (the AI is better at name transliteration than most humans)
  • Cost per screening: ~$0.03 (LLM API calls + compute)
  • Human review: Still required for high-risk flags, but the AI does 90% of the work

The lesson for other products

If your team has humans doing repetitive pattern-matching work — screening, categorization, triage, data extraction — that's not "too complex for AI." That's exactly where AI agents earn their place.

The key is building it as an augmentation layer, not a replacement. The compliance officer still makes the final call. The AI just eliminated 3 hours and 55 minutes of mechanical work.

Share this article

X LinkedIn

Continue reading

Why Your LLM App Needs a Graph

Stateless chatbots are a dead end. Here is how I am using graph-based architectures to build reliable AI agents that can actually reason.

Why startups hire a senior developer in Bangkok

Timezone overlap with Europe, US mornings, and APAC. Senior rates without SF/London pricing. And someone who ships while you sleep.

Why I'm Ditching Traditional Serverless for the Edge

Cold starts and opaque pricing are killing developer joy. Here is why I am moving new workloads to Cloudflare Workers and keeping the heavy lifting in containers.

← Back to all articles