All posts
    AP AutomationData QualityInvoice ExtractionGlobal AP

    The quiet data-quality tax on global AP: dates, currency, and silently-wrong fields

    June 13, 20256 min readBy Founder, EZ Cloud

    The errors that hurt an AP team aren't the ones that throw an exception. A blank field gets noticed. A garbled total gets queried. The dangerous failures are the ones that produce a plausible wrong answer — a date that's valid but means the wrong day, an amount that parsed cleanly but dropped a digit. Those don't stop the pipeline. They flow straight through to the ledger and surface weeks later as a missed discount, an early payment, or a reconciliation that won't tie out.

    Once you're processing invoices across multiple countries, currencies, and suppliers, this becomes a steady, low-grade tax on accuracy. It's worth naming the specific places it leaks in, because each has a disciplined fix.

    Dates: the DD/MM versus MM/DD trap

    The single most expensive ambiguity in global AP is the date. 03/07/2025 is the 7th of March to an American supplier and the 3rd of July to an Australian or British one. There is nothing in the digits themselves that tells you which — and OCR will happily extract either reading with full confidence.

    This matters because the date drives money decisions. An invoice date sets the start of payment terms; a due date sets when cash moves. Misread one by a few months and you've either paid early (lost float) or paid late (lost a discount, damaged the supplier relationship, maybe incurred a late fee). The invoice looked fine the whole way through.

    Disciplined handling means refusing to guess from the digits alone:

    • Anchor to the supplier's locale, not your own. The format should be inferred from where the supplier operates, not from the processing team's default. A US-default parser quietly mangles every European and Australian invoice it sees.
    • Use disambiguating signals when the day exceeds 12. 25/07/2025 can only be DD/MM — that's a free, unambiguous anchor that tells you the supplier's convention for the rest of their invoices.
    • Prefer text months where present. "7 March 2025" or "Mar 7 2025" removes the ambiguity entirely; favour that field over a numeric one when both appear.
    • Normalize to one internal format immediately. Convert every date to an unambiguous ISO form (YYYY-MM-DD) at the point of extraction, so nothing downstream ever re-guesses.

    And critically — when the format genuinely can't be resolved, flag it for a human rather than picking a reading. A held invoice costs minutes; a wrong payment date costs real money and trust.

    Currency and amounts: symbols, separators, and junk

    Amount fields fail differently. The number is usually right; what corrupts it is everything around the number.

    • Thousands and decimal separators invert by locale. 1.234,56 is twelve hundred in much of Europe; 1,234.56 is the same value in the US and UK. Parse one with the other's rules and you're off by orders of magnitude.
    • Currency symbols are not the currency. $ could be USD, AUD, CAD, or several others; a misattributed currency code at posting is a silent FX error that no amount check will catch.
    • Junk characters ride along. OCR pulls in stray glyphs, trailing footnote marks, a misread separator, a space inside the number. A naive parser either chokes or — worse — truncates the value at the bad character and posts the fragment.
    • Negatives and credits hide in formatting. Parentheses, a trailing minus, or a "CR" suffix all mean a credit. Strip the formatting without preserving the sign and a credit note posts as a charge.

    The right discipline mirrors the date approach: strip and normalize deliberately, never permissively. Remove known formatting characters by rule, interpret separators by the supplier's locale, carry the currency as an explicit code rather than inferring it from a symbol, and validate the cleaned number against the invoice's own totals where they exist (lines summing to subtotal, subtotal plus tax equalling total). When the arithmetic doesn't reconcile, that's the signal to hold — not to post the best guess.

    Why "silently wrong" is the real risk

    The throughline is that extraction confidence and extraction correctness are not the same thing. A field can be read with high confidence and still be wrong, because the model that read it didn't know the supplier's locale or didn't reconcile the value against the rest of the invoice. The teams that stay accurate at scale treat every amount and date field as a value to be validated, not merely captured — and they keep a deliberate review path for the genuinely ambiguous cases instead of letting the pipeline pick. This is the same principle behind treating extraction as something to be checked against live ERP state, which we discussed in matching an OCR'd supplier name to the Oracle vendor master: the safest pipelines escalate to a human at the exact point confidence runs out, rather than guessing through it.

    It compounds with everything downstream. A wrong date breaks payment timing; a wrong amount breaks three-way matching and duplicate detection; a wrong currency breaks the ledger. Get the boring fields right and the clever automation on top of them — matching, duplicate detection, touchless posting — actually becomes trustworthy.

    Where EZ Cloud fits

    EZ Cloud handles dates and amounts as locale-aware, validated values rather than raw OCR output — inferring date format from the supplier's region, normalizing separators and stripping junk by rule, carrying currency as an explicit code, and reconciling totals before anything posts. When a field genuinely can't be resolved with confidence, it routes to review rather than guessing, so the quiet errors that survive most pipelines get caught at the door. For finance teams running AP across multiple countries and currencies, that disciplined, locale-aware handling is what keeps "automated" from quietly meaning "automatically wrong" — and it's part of the native ERP integration approach we build on.

    See it against your Oracle AP

    Book a 30-minute walkthrough — we'll run a real exception from supplier email to Oracle posting, on Fusion or EBS.