April 29th, 2026

New

Fixed

Duplicate Detection — Find, Review, and Remove Copies Across Your Files

New

One of the most-requested features since launch: proper duplicate detection across files. When you upload a Sent folder and an Inbox, or two overlapping date-range exports, the same email typically appears in more than one file. RedactBox now automatically finds those copies using each email's Message-ID header and provides a wizard to review and remove them — without touching the copy you want to keep.


How you'll know

When duplicates exist in a project, a banner appears above the email list summarising how many items are affected. Counts grow live as each file finishes parsing, so you don't have to wait for every PDF to generate before seeing the first number.

  • Banner — live count of items with copies, with a Review duplicates button

  • Inline chips — every affected email row shows an Original or Duplicates chip so you can see at a glance which copies are keepers

  • Filter — turn on Needs duplicate review in the filter panel to narrow the list to unresolved groups

The Manage Duplicates wizard

Open it from the banner or Tools → Manage Duplicates. Three tabs cover every state of a group:

  • Active — a three-step wizard: scan results, review the groups, confirm

  • Resolved — everything you've already handled, with a per-row Undo

  • Dismissed — groups you chose to ignore, with a Restore action

Inside the Review step, you can:

  • Resolve a single group from its inline Resolve button

  • Select multiple groups and resolve them together

  • Preview any email inline before making the call

  • Dismiss a group you don't want to act on now

  • Undo all with an inline two-click confirm

Quick-resolve chip

You don't have to open the full wizard for every group. Click the yellow Duplicates chip beside an email, and a popover shows which file is being kept, why (Smart pick, Newest, or Oldest badge), and a Resolve duplicate button. Tick Also approve the original if you want the kept copy marked Processed in one go.

Smart pick, Newest, Oldest

A new Duplicates section in Settings lets you choose how RedactBox picks which copy to keep:

  • Smart pick (recommended) — keeps whichever copy has the most text and attachments. Usually, the most complete version, since emails lose content as they're forwarded or quoted.

  • Keep newest — always keeps the most recent copy

  • Keep oldest — always keeps the earliest copy. Useful for compliance workflows where the original send date matters.

Changing keep mode immediately re-assigns the Original across every unresolved group. Already-resolved groups are left alone.

Auto-approve the kept original

A second setting controls what happens to the copy you keep. Off by default — the kept copy stays in its current state, and you approve it yourself. On — it's marked Processed automatically the moment you resolve the group. Per-action override from the popover is always available.

Re-emerged groups

If you dismiss a group and later upload a file containing another copy of the same email, RedactBox flags the group as re-emerged on the Active tab so it comes back to your attention. No manual restore needed.

Export integration

Duplicates you trashed during review are automatically excluded from every export — ZIP, PDF, bulk. Only the copy you kept appears in the output. Dismissed groups have no effect on exports; all copies stay in.


Also in this release

Exports rebuilt for bigger projects — large projects no longer time out or stall:

  • Filter before you export — pick only the emails that match your current filter

  • Branded summary PDF included in every export ZIP

  • The progress bar only appears for exports of 200+ emails; smaller exports just finish

  • Cancel actually cancels — no ghost completions in the background

Other improvements

  • Uploads up to 3 GB now work end-to-end

  • PST files no longer crash during import

  • Exclude / negation operators in email filters (does not contain, is not, does not equal)

  • Bigger caps on word lists and bulk redaction payloads

  • Auto-redact → All Emails is out of beta — BETA tag removed

Fixes

  • Word list redaction shows correct aggregate counts with proper progress and working undo

  • Stripe subscription upgrades and downgrades handle more edge cases cleanly

  • Quick filters in Triage no longer flash through stale data when switching files

  • The "file processed" toast no longer fires on files already processed before you opened the app


All of the information above is also available in our Knowledge Centre!

https://help.redactbox.co.uk/en

As always, hit the feedback portal to tell us what's missing or what could be sharper.

https://feedback.redactbox.co.uk/