The Upstream Problem No One Talks About
Anthony Uyende · February 24, 2026 · 5 min read
The work before the work
Every accounting firm knows the feeling. Tax season opens and the inbox floods. Clients send bundles — 400-page PDFs mixing bank statements, T4s, receipts, and invoices. Some send photos of crumpled receipts. Others forward entire email chains and say "everything's in there."
Before a single journal entry gets made, someone at the firm has to open every attachment, split every PDF, rename every file, figure out what's missing, and chase whatever isn't there.
This isn't bookkeeping. This isn't tax prep. This is the upstream layer — and it's where the majority of unproductive time disappears.
The hidden cost
The numbers are brutal. A mid-size firm managing 150 clients, each sending around 11 documents per month, processes roughly 19,800 documents per year. At 2–3 minutes per document, that's 660 to 990 hours annually — nearly a full-time employee — before accounting even begins.
And it gets worse. This work doesn't scale with revenue. It scales with document volume. More clients means more chaos, not more margin.
"Sometimes it's not even worth starting a file when too many documents are missing. You waste time beginning it, then come back to it two weeks later. The client thinks you're working on it, but you haven't even opened their envelope."
— Nader El-Masri, CPA
This pattern repeats across every firm we've spoken to. Senior staff end up doing work that shouldn't require their expertise, simply because it can't be delegated without context.
Why existing tools miss this
The accounting software market is mature. QuickBooks, Xero, Dext, Hubdoc, DataSnipper — each tool optimizes a step inside the workflow. They assume documents arrive clean, structured, and correctly labeled.
They don't.
The problem isn't extraction accuracy. OCR has been good enough for years. The problem is that a 400-page PDF containing six different document types, some duplicated, some missing context, needs to be understood before any tool can process it.
That understanding — which document goes where, what's missing, what the firm's specific treatment rules are — has lived in people's heads. It's never been captured as infrastructure.
What upstream automation looks like
Solving the upstream problem doesn't mean replacing accountants. It means removing the friction that sits between "client sends documents" and "accountant reviews clean data."
That means:
- Absorbing messy inputs without requiring clients to change behavior
- Splitting bundled documents automatically, detecting where one ends and another begins
- Classifying by type using the firm's own categories and rules
- Flagging what's missing before anyone opens the file
- Learning from corrections so the same mistake doesn't happen twice
This isn't a feature inside an accounting tool. It's a new layer — the infrastructure that makes everything downstream work better.
The firms that figure this out first won't just save time. They'll fundamentally change how they scale — by making document volume a solved problem instead of an ongoing tax on every engagement.
Ready to bring order to document chaos?
See how firms like yours use Coalesc to process client documents faster.
Book a call