Guide
How to Digitise 10 Years of Paper Files (Without Losing Your Mind)
The backlog is the part of going paperless everyone dreads. Our how to go paperless guide covers the full process. This article tackles the hardest step: what to do with years of accumulated paper.
The First Rule: Don't Scan Everything
The biggest mistake in any backlog digitisation project is treating it as a scanning project. It's not — it's a document management project that happens to involve some scanning.
Before a single page goes through a scanner, you need to decide what's worth keeping. Most businesses find that 30–50% of their paper backlog is redundant — duplicate copies, outdated versions, documents past their retention period, personal files that shouldn't be there, junk mail that somehow accumulated for a decade.
Scanning redundant documents wastes time, wastes storage, and clutters your new digital system from day one.
Step 1 — Triage Your Backlog
Sort everything into four categories before you scan anything:
- Scan and keep digital only: Documents still within retention period, referenced occasionally, not legally required as originals
- Scan and retain physical: Documents where originals have legal value (signed deeds, some contracts, wet-signature documents)
- Archive without scanning: Documents legally required to be kept but never referenced (very old records approaching end of retention period — cheaper to archive than scan)
- Shred: Everything past retention period, duplicates, personal files, junk
This triage step is tedious but it determines the scope of the scanning project. Most teams find it reduces the scanning volume by 40–60%.
Step 2 — Prioritise by Business Value
Don't digitise chronologically. Digitise by business value:
- Active files first: Documents referenced regularly. These deliver the most immediate benefit — staff can find them digitally from day one.
- Compliance-critical records second: Documents needed for audits, legal proceedings, or regulatory inspection. These reduce risk most urgently.
- Historical archive last: Old records rarely accessed. These can be a slow background project over weeks or months.
Step 3 — Set Up Your Digital Structure First
Don't start scanning until your SharePoint or DMS folder structure and metadata schema are defined and tested. Scanning without a destination structure creates a digital version of the paper pile — same chaos, different format.
Define: folder hierarchy, metadata fields per document type, naming conventions, retention labels. Scan 50 documents as a test to validate the structure works before processing thousands.
Step 4 — Choose Your Scanning Approach
DIY batch scanning
Use a high-speed ADF scanner (35–60ppm) with a 50–100 sheet ADF. Remove staples, unfold documents, batch by document type. Scan in batches of 50–100 pages, applying metadata per batch. For a 20-person office with 5 filing cabinets: expect 40–80 hours of scanning work.
Dedicated scanning project days
Block out specific days — one department at a time. Clear the decks, assign someone to the scanner, work through systematically. More effective than an open-ended "scan when you have time" approach that drags on for months.
Scanning bureau
For large volumes (10,000+ pages), professional scanning bureaus are often more cost-effective than staff time. Typical rates: £0.03–0.08/page for straightforward documents, including OCR and basic indexing. A 50,000-page archive: £1,500–4,000. The bureau handles preparation, scanning, OCR, and returns structured digital files ready to import into your DMS.
Bureau advantages: fast (days not months), consistent quality, no staff time diverted, GDPR-compliant handling (reputable bureaus are ISO 27001 certified). Disadvantages: cost, security consideration of sending documents off-site, some quality control required on the output.
Step 5 — Verify and Index as You Go
Don't scan everything and then try to index it all at the end. Index as you go — apply metadata to each batch immediately after scanning. Catch errors while the documents are still on the desk in front of you.
Random quality checks: after every 200 pages scanned, verify 10 random documents. Check image quality, correct filing, metadata accuracy. Catching a systematic problem early prevents hours of remediation work later.
Step 6 — Secure Disposal
Once documents are scanned and verified, originals that don't need physical retention should be securely shredded — not put in a recycling bin. Documents containing personal data require GDPR-compliant disposal. Options: in-house cross-cut shredder (suitable for small volumes), confidential waste sacks collected by a shredding company (most cost-effective for medium volumes), on-site shredding service (most secure for large volumes or sensitive material).
Need help structuring your backlog project?
Find a UK DMS specialist who can scope and manage it for you.
Find a Consultant →Going Paperless Guide →