Turns inbound RFQ emails into priced, ready-to-send quotation drafts. The operator goes from "type everything" to "review and send".
Watches a shared sales inbox. When email arrives the system decides whether it is a Request For Quotation, parses the customer and the line items, matches each line to a product in your catalog, looks up any prior quotation for the same customer in your ERP, prepares a draft quotation, and hands the operator a dashboard with scored matches. The operator reviews, fixes anything that looks wrong, and sends.
Before: sales reps copy-pasted item descriptions from emails into spreadsheets. Looking up SKUs took minutes per line, and some RFQs had 80 lines. Customers waited 2–7 days for a quote; many bought elsewhere. There was no visibility into the backlog: stuck RFQs were forgotten until the customer chased them. After: median time-to-quote dropped to under 30 minutes for cases the system handles cleanly, sustained over 107 days of live operation as of writing.
A worker polls the shared sales mailbox every 30 seconds. New emails get stored and queued for analysis.
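The polling step can be sketched as a small loop with the mailbox and queue dependencies injected (the function names and signatures here are assumptions for illustration; real implementations would talk to IMAP and Postgres):

```typescript
// Hypothetical sketch of the mailbox poller. fetchNewEmails and enqueue
// are injected so the loop itself stays testable.
type Email = { id: string; subject: string; body: string };

async function pollOnce(
  fetchNewEmails: () => Promise<Email[]>,
  enqueue: (e: Email) => Promise<void>,
): Promise<number> {
  const emails = await fetchNewEmails();
  for (const e of emails) {
    await enqueue(e); // store the email and queue it for analysis
  }
  return emails.length;
}

// Production wiring: run one poll every 30 seconds.
function startWorker(
  fetchNewEmails: () => Promise<Email[]>,
  enqueue: (e: Email) => Promise<void>,
): ReturnType<typeof setInterval> {
  return setInterval(() => void pollOnce(fetchNewEmails, enqueue), 30_000);
}
```

Injecting the two side-effecting calls keeps the 30-second cadence and the per-email handling separately testable.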
An LLM decides if the email is an RFQ. If yes, a second LLM call extracts the customer, the line items (description, quantity, specs), the project name, and the due date — all into a strict JSON schema.
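The exact schema is not shown in the source; a plausible shape for the extraction payload, with a minimal guard before the result is queued, might look like this (all field names are assumptions):

```typescript
// Hypothetical shape of the strict JSON schema the extraction call fills.
type RfqExtraction = {
  customer: string;
  projectName: string | null;
  dueDate: string | null; // ISO 8601 date, if the email states one
  lineItems: {
    description: string;
    quantity: number;
    specs: Record<string, string>; // e.g. { material: "316L", size: "DN50" }
  }[];
};

// The LLM returns raw text; parse and minimally validate before queuing.
function parseExtraction(raw: string): RfqExtraction {
  const data = JSON.parse(raw) as RfqExtraction;
  if (!data.customer || !Array.isArray(data.lineItems)) {
    throw new Error("extraction missing required fields");
  }
  return data;
}
```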
For each line item: first try an exact SKU match. If there is none, run a vector search across the ~3,500 product variants, take the top 20 candidates, and have an LLM score each against a structured rubric. Pick the best, record its 0–100 score, and lock the match if the score is ≥ 85.
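The cascade above can be sketched as follows; the LLM scoring step is stubbed out as an injected function, and all names are illustrative rather than taken from the real codebase:

```typescript
// Sketch of the three-stage match cascade (hypothetical names; the LLM
// rubric scorer is passed in as a plain function).
type Variant = { sku: string; embedding: number[] };

function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

// Stage 2: vector search narrows ~3,500 variants to k candidates.
function topCandidates(query: number[], catalog: Variant[], k = 20): Variant[] {
  return [...catalog]
    .sort((x, y) => cosine(query, y.embedding) - cosine(query, x.embedding))
    .slice(0, k);
}

function matchLine(
  lineSku: string | null,
  lineEmbedding: number[],
  catalog: Variant[],
  scoreWithLlm: (v: Variant) => number, // Stage 3: rubric score, 0-100
): { sku: string; score: number; locked: boolean } | null {
  // Stage 1: an exact SKU match short-circuits the pipeline.
  if (lineSku) {
    const exact = catalog.find((v) => v.sku === lineSku);
    if (exact) return { sku: exact.sku, score: 100, locked: true };
  }
  const scored = topCandidates(lineEmbedding, catalog).map((v) => ({
    sku: v.sku,
    score: scoreWithLlm(v),
  }));
  if (scored.length === 0) return null;
  const best = scored.reduce((a, b) => (b.score > a.score ? b : a));
  return { ...best, locked: best.score >= 85 }; // lock only at 85+
}
```

The ordering matters for cost: the exact match is free, the cosine pass is cheap, and only the 20 survivors ever reach the LLM.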
Search the ERP (read-only) for recent quotations from the same customer. If a similar one exists, surface it next to the new draft so the operator can see context.
Dashboard shows scored matches, easy wins, and gaps. Operator fixes any wrong matches, the system learns from the override, then sends a PDF quotation via email.
Live operation
107 days
Median time-to-quote
< 30 min
Catalog size
~3,500 variants
Daily LLM cost
$0.15–$0.40
Cache hit rate
~70%
These are not mockups. Every screenshot below is from the system running in production.






For anyone evaluating the system from an engineering angle: why these choices, and what was traded off.
Job queue, app state, embeddings, caches, audit log — all in one database. One backup, one connection pool, one metric set.
Embeddings find the right neighborhood fast and cheap. LLMs pick the right answer from 20 candidates. Putting them in series gives both.
Every match is scored 0–100. Auto-select fires only at 85 and above; below 30 the line is flagged "no match". Operators sort by score and triage from the bottom up.
Never write. A wrong write into the ERP costs far more than any benefit. Sync runs every 6 hours; staleness is acceptable.
Re-running a classification, a match, or a send must not create duplicates. The job table enforces this with a unique index.
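The idea can be sketched in a few lines; in Postgres it is a unique index on something like (job_type, entity_id) plus INSERT ... ON CONFLICT DO NOTHING, and here an in-memory Set stands in for the index (names are assumptions):

```typescript
// Idempotent enqueue sketch. The Set plays the role of the unique index;
// a second insert with the same (jobType, entityId) key is a no-op.
type Job = { jobType: string; entityId: string };

function makeQueue() {
  const seen = new Set<string>();
  const jobs: Job[] = [];
  return {
    enqueue(job: Job): boolean {
      const key = `${job.jobType}:${job.entityId}`;
      if (seen.has(key)) return false; // duplicate: ON CONFLICT DO NOTHING
      seen.add(key);
      jobs.push(job);
      return true;
    },
    size: () => jobs.length,
  };
}
```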
If I rebuilt this today I would skip the embeddings-as-JSON-arrays approach and use pgvector from day one. The cosine math in JS works at this scale but pgvector would let the catalog grow without re-architecting. I would also add a built-in evaluation harness so changes to the matcher can be measured before they ship.
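For a sense of what that migration buys: with pgvector the in-process cosine loop collapses into one indexed SQL query. Table and column names below are assumptions, not the real schema:

```typescript
// Schema side (run once):
//   CREATE EXTENSION IF NOT EXISTS vector;
//   ALTER TABLE product_variants ADD COLUMN embedding vector(1536);
// Query side: <=> is pgvector's cosine-distance operator, so ordering by it
// returns the nearest neighbors first.
const topKQuery = `
  SELECT sku,
         1 - (embedding <=> $1::vector) AS cosine_similarity
  FROM product_variants
  ORDER BY embedding <=> $1::vector
  LIMIT $2
`;
```

With an HNSW or IVFFlat index on the embedding column, this stays fast well past 3,500 rows.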
Share the workflow and the systems you use today. Within 24 hours we reply with scope, KPIs, timeline, and a SAR estimate.
Start now