Extract, reconcile, and trace every transaction across bank statements — with transfer detection, flow-of-funds analysis, and check-to-statement matching. One normalized dataset with page-level provenance.
Matter evidence
Operating · Q1
—Queued
Checks & deposit slips
—Queued
Brokerage statement
—Queued
0 of 3 complete
Evidence in → structured ledger → transfer & flow analysis → case-ready export
Built for forensic work — not adapted from a bookkeeping tool. Financial investigations overview
or
Browse FilesNeed to process bank statements, invoices, or receipts at scale? Sign up to DocuClipper
Built for investigation-grade work
< 1 day
data prep on most engagements
6+ banks
normalized into a single dataset
Every row
traced back to its source PDF page
Three steps. Same day.
Bank statements, card statements, check images, deposit slips — any mix of formats, any number of institutions. Upload a single matter or a high-volume batch.
Transactions are extracted, normalized to a consistent schema, and matched across accounts. Checks are linked to statement lines. Missing periods are flagged. You get clean rows, not raw PDFs.
Categorize activity, follow the flow of funds across entities, review flagged items, and export an Excel-ready dataset where every row points back to its source page.
How the product behaves under the hood — inventory, classification, transfer logic, funds views, and documentary matching — beyond generic speed-and-accuracy marketing copy.
Treat the upload set like evidence custody: know which institution–account–period combinations are represented, which months never arrived, and where filenames or date ranges disagree with the transaction data inside.
What you are indexing
institution, product (checking / card / LOC) account_last4, currency, statement_period_end file_hash, ingest_timestamp, page_count balance_begin, balance_end, reconcile_status
Apply a repeatable taxonomy across the normalized ledger: keyword and pattern rules, amount thresholds, and counterparty-driven tags — so two analysts do not silently diverge on how payroll, transfers, or cash deposits are labeled.
Typical rule inputs
description_raw, normalized_payee amount, debit_credit, transaction_type category, rule_id, applied_at
Identify internal movements across accounts you already have: paired credits and debits within configurable date and amount tolerances, including ACH references, wires, and card payment patterns — so internal noise is separated from third-party spend.
Pairing heuristics (examples)
|amount_A + amount_B| ≤ tolerance |date_A − date_B| ≤ N business days memo_token_overlap OR reference_id_match flag: unmatched_outflow / possible_external_account
Move from flat transaction grids to structured views of how cash moved among entities and accounts: aggregate by period, counterparty cluster, or transfer chain so you can explain the story without rebuilding pivot logic for every engagement.
Views built on normalized data
monthly_net_by_account top_counterparties_by_category transfer_edges(A→B, amount, date) source_page_index per transaction row
Ingest front/back check images and deposit tickets through OCR, extract check number, date, amount, payee/payer fields, and link each image record to the bank-reported line when amounts, dates, and reference data align.
Linked artifact metadata
check_image_id, check_number, amount matched_transaction_id, match_score bank_line_date, bank_line_description
Controls that sit alongside the core engine — so the dataset you analyze is balanced, traceable, and consistent across staff.
Roll extracted transaction totals back to opening and closing balances from the PDF so you know which files reconciled cleanly before you rely on them in a report.
Surface outflows that never pair to an inflow inside your workspace — a short list of transfers to investigate or to support additional document requests.
Cash-flow trends, counterparty rollups, and timeline-oriented cuts that sit on top of the same normalized schema as the technical modules above.
Every exported row retains a pointer to the source PDF page so workpapers and exhibits answer “where did this number come from” without ad-hoc screenshots.
Different column layouts and date formats land in one consistent transaction model so filters, rules, and transfer logic run once across the whole matter.
Upload high page counts across many accounts; processing is built for litigation-scale volumes rather than one-off conversions.
Civil, criminal, regulatory, or internal — if banking records are part of the evidence, DocuClipper shortens the path to answers.
The difference shows up on day one of every engagement.
| Feature | DocuClipper | Manual / generic tools |
|---|---|---|
| Data intake time per engagement | Under a day for most matters | 3–5 days of manual entry |
| Statement formats supported | PDF, scanned images, native bank exports | Whatever the analyst can copy-paste |
| Cross-account, multi-entity tracing | Normalized schema + flow-of-funds views | Fragile cross-tab spreadsheets |
| Missing period / gap detection | Automatic flags before analysis starts | Often discovered late — or not at all |
| Categorization consistency | Saved rules applied across all matters | Each analyst re-labels from scratch |
| Deliverable traceability | Each row linked to source PDF page | Screenshots, notes, and hope |
| Check & deposit slip linkage | OCR + proposed match to statement line | Side-by-side PDFs and manual pairing |
Dedicated pages for each subsystem — inventory and analysis through tracing, categorization, checks, and flow of funds.
Coverage, reconciliation, and review-oriented cuts.
Rules, tags, and consistent labels across matters.
Pairing logic and line-level trails.
Aggregated movement across accounts and entities.
OCR from check and slip images into structured fields.
"Data preparation is usually handled in less than a day, which lets me put my energy into the actual analysis."
Forensic CPA · Independent litigation support practice
"DocuClipper lets us do a rapid assessment of scope on day one — before we've committed to a billing estimate."
Managing Director · Regional forensic accounting firm
Straight answers on data intake, multi-bank normalization, checks, and exports.