Question 1

How does the OCR layer handle Swiss-German content?

Accepted Answer

Swiss-German is handled as DE at the OCR step. The layer does not require a separate Schweizerdeutsch model. In practice, the documents that hit production are written in standard DE with occasional Swiss-German notes — supplier names, place names, handwritten Belege. The two-pass pattern keeps both the raw text and the structured zones, so the downstream extractor can resolve ambiguities without us tuning a CH-only model.

Question 2

Is OCR accuracy the same on scans as on digital PDFs?

Accepted Answer

No, and we do not pretend otherwise. Digital PDFs are easier — text is already encoded, the structured pass just confirms layout. Scans, photos and mixed-quality PDFs are where two-pass earns its keep: raw text and structured zones disagree more often, and the downstream extractor picks per field. We do not publish a single accuracy number because it is not meaningful across input shapes.

Question 3

How does the OCR layer feed downstream extraction?

Accepted Answer

The OCR pass outputs raw text plus structured zones. Both go into the next queue step — classification, then field extraction in OpenAI JSON mode. The contract is explicit: the extractor sees text and zones, decides per field which one to trust, and writes a typed JSON payload. The OCR layer never makes business decisions; it produces inputs for the extractor that does.

Question 4

Can the OCR run in near-real-time, or only in batch?

Accepted Answer

Both. The OCR step lives inside a staged Laravel job queue. For nightly batches — supplier datasheet imports, archive ingestion — workers scale horizontally on Docker. For portal uploads where a user is waiting, the same step runs at higher priority and returns within seconds for typical Swiss documents. Same code path, different queue priority — no fork between batch and online.

Question 5

What does the cost model look like for OCR?

Accepted Answer

We do not resell OCR by the page. We bill the engagement — discovery, integration, and run support — and pass the underlying Mistral OCR usage through at cost. For customers who already have an extractor and only want the OCR layer wired in, the engagement is short. For full IDP rollouts the OCR cost is folded into the broader S001 quote and rarely the dominant line item.

Question 6

Can we keep the OCR layer and swap the extractor later?

Accepted Answer

Yes. The OCR contract — raw text plus structured zones — is stable and provider-neutral. Customers who start with our S001.1 or S001.2 extractor can later swap the extraction model behind the same OCR layer, or vice versa. The two passes are designed to outlast the choice of downstream model.

Question 7

Does the OCR layer handle handwriting?

Accepted Answer

Within Mistral OCR's published handwriting envelope, yes — handwritten margin notes on supplier datasheets, signed delivery notes, handwritten Belege. We do not claim handwriting recognition as a headline feature. In production, handwriting fields almost always route to HITL review downstream; the OCR pass just gives the reviewer a clean candidate to confirm or correct.

Question 8

Where does OCR data live and is Swiss residency supported?

Accepted Answer

Default deployment is EU-hosted. For Swiss data-residency workloads, the OCR step runs on Swiss-resident servers or on customer premises. Where no public model endpoint may be reached, we wire in the Apertus sovereign-LLM track for the extractor and keep Mistral OCR on a Swiss-hosted gateway. Every OCR pass is logged with model ID and version for audit downstream.

OCR Software for Swiss Docs

OCR Software, productized

How we deliver it

Sample collection and input audit

Two-pass OCR on your inputs

Contract with the downstream extractor

Queue, scale and hand-off

Selected engagements

Why two-pass OCR, not a single model call

Two passes give the extractor a choice

OCR is a layer, not a product

Built against real Swiss documents

Frequently Asked Questions