Informal use is already here
Staff use ChatGPT, Copilot, and browser AI to summarize reports, clean exports, draft variance explanations, and reconcile faster. It is invisible by default, and banning it just moves it further out of sight.
Your team is probably already using AI — informally, and inside the accounting, ERP, and banking software you already pay for. Most of that use is sensible people saving time. The question is whether anyone can see it, whether outputs are checked before they matter, and who owns the number when it is wrong.
None of this is a scandal. It is what adoption looks like when useful tools arrive before workflow rules do. But finance is where the numbers travel: into payments, hedges, covenants, board packs, and filings.
Staff use ChatGPT, Copilot, and browser AI to summarize reports, clean exports, draft variance explanations, and reconcile faster. It is invisible by default, and banning it just moves it further out of sight.
Accounting, ERP, and banking platforms now ship AI features inside licences you already hold. Few teams have reviewed what those features touch, retain, or are permitted to do.
Once an AI-assisted figure is pasted into the workbook, it looks identical to a human one. If nobody flags it, nobody reviews it — and nobody can answer for it later.
The failure mode that matters in finance is not a crash. It is a fluent, plausible, wrong number that nobody thought to reconcile back to source.
AI genuinely accelerates finance work: summarizing, classifying, drafting, comparing files, spotting anomalies. That value is real, and teams that refuse it outright will simply fall behind teams that use it well.
But when researchers test AI systems against realistic, end-to-end enterprise finance workflows — messy spreadsheets, emails, cross-file retrieval, validation, reporting — the results argue for supervision, not blind trust.
In the Finch benchmark of real-world enterprise finance and accounting workflows, the strongest model tested passed 38.4% of workflows — despite spending an average of 16.8 minutes per task. The tasks are the exact texture of month-end: multi-file, multi-step, judgment-laden work.
That is a warning about unattended reliance, not a verdict on usefulness. Model capability will keep improving. What does not change: an AI system cannot be the accountable party for a payment, a hedge, or a filed number.
Source: Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows (arXiv:2512.13168). Figures reviewed July 2026.
Thirteen questions, about three minutes. It measures the five things that make AI use in finance controllable: visibility, data exposure, validation, audit trail, and ownership.
Answer for your finance, accounting, and treasury team as it actually operates — not as the policy binder says it should. “We don’t know” is a legitimate answer, and it is scored honestly: in finance, not knowing is itself a finding.
This scorecard runs entirely in your browser. No AI is involved in scoring — the logic is fixed, documented below, and the same for everyone. Nothing is sent to Bastion, or anyone else, unless you request the written report at the end.
Every answer carries a fixed risk weight of 0–3 points, set in advance and identical for everyone. Question 1 is not scored; it sets the materiality of your result (how much of your AI use touches cash, payments, reconciliations, FX, or reporting).
We publish the logic because that is the standard we think any tool touching finance decisions should meet. If a vendor cannot show you how their AI scores, ranks, matches, or decides — that is one of the questions below.
Before relying on AI features inside accounting, ERP, banking, or treasury systems, ask for evidence — not the brochure. A vendor with real controls will answer these quickly. Hesitation is also an answer.
Bastion is independent: not a software vendor, not a reseller, and not paid to recommend anyone’s AI. Our work is finance-workflow judgment — the same discipline we apply to FX, payments, and treasury process.
Bastion uses AI every week in its own research and publication workflow: drafting under fixed validation rules, independent review, and human sign-off before anything reaches a client. We built these controls for our own work before recommending them to anyone else.
The highest-stakes AI use in finance is where numbers drive decisions: cash forecasts, FX exposure summaries, payment workflows, reconciliations, reporting. That is exactly the workflow territory Bastion already reviews in its treasury and FX advisory work.
The goal is not to ban AI. It is a short set of working rules: where AI is allowed, where it is advisory-only, where it is prohibited, what gets validated, what gets logged, and who owns it — so the team keeps the speed and the company keeps control.
AI use in finance is one of the areas the Free Treasury Audit now covers: alongside cash visibility, FX exposure, payment controls, and reporting, we look at where AI already touches your numbers and whether the workflow around it holds up.
It depends on what goes in and what relies on what comes out. Drafting a memo is low risk. Pasting bank statements, forecasts, customer data, or payment details into a public AI tool — or relying on AI-produced figures without reconciling them to source — is a control gap, not a technology question. Set written rules for what data may enter which tools, and validate any output that feeds a decision.
If anyone on the team uses AI — including features inside accounting, ERP, or banking software — then yes, a short one. A useful finance AI policy fits on a page: which tools are approved, what data may never leave the company, which outputs need review before use, and who owns exceptions. Generic company-wide AI policies rarely address finance data specifically.
Vendor scale is not the same as control evidence. The practical questions are what data the feature processes and retains, whether it can act or only recommend, whether its actions and overrides are logged, and whether outputs trace back to source records. Those answers vary by vendor, by module, and by release — and features are often switched on by default.
AI can accelerate the drafting and data-shaping work, but benchmarks of realistic enterprise finance workflows show even frontier systems fail the majority of end-to-end tasks. Treat AI output as a draft from a capable new hire: useful, fast, and reconciled to source before anything — a hedge, a payment, a board number — relies on it.
Start with visibility, not prohibition. Give staff a safe channel to disclose the AI shortcuts they already use — amnesty first, rules second. Then map what touches material work (cash, payments, reconciliations, FX, reporting), set validation and logging expectations for those workflows, and name an owner. Banning AI without visibility just pushes use further out of sight.