Source Archiving Policy

UKPoliticsDecoded Standards & Governance Hub

1. Purpose

This policy explains how UKPoliticsDecoded captures, preserves, and publishes source material used in explainers, analysis, and public interest documentation. Archiving ensures long term access to evidence, protects against link rot, and maintains a verifiable record of government and public‑sector information.

The policy applies to all content archived through the UKPoliticsDecoded platform and its dedicated archiving service at archive.ukpoliticsdecoded.uk.

2. Scope

The policy covers:

  • what types of sources are archived
  • how sources are captured and preserved
  • how authenticity and integrity are maintained
  • how archived material is made available
  • how takedown requests are handled
  • legal and ethical considerations

It does not cover internal editorial processes, which are defined separately in the Editorial Standards Policy.

3. What We Archive

3.1 Eligible Material

We archive publicly accessible content that is relevant to:

  • government policy announcements
  • legislation, statutory instruments, and parliamentary records
  • regulator publications and guidance
  • official data releases
  • ministerial speeches and statements
  • public interest journalism used for context
  • research and reports cited in explainers

3.2 Material We Do Not Archive

We avoid archiving:

  • private or paywalled content
  • personal data not intended for public release
  • social media posts from private accounts
  • copyrighted material where archiving would breach fair use or licensing rules
  • content that poses ethical or legal risks

4. How Sources Are Captured

4.1 Multi Layer Capture

Each source is archived using a combination of methods to ensure durability, readability, and evidentiary integrity:

SingleFile Snapshot

full visual representation of the page

Screenshot

above the fold visual evidence

Wget HTML

raw source code for long term preservation

Readability or Mercury extraction

clean, accessible text

PDF

used for formal citation when appropriate

Media capture

for embedded video or audio

HTTP headers

server metadata and redirect chain

Archive.org

external verification when available

4.2 Why We Use Multiple Formats

Visual Accuracy

Screenshots and snapshots preserve appearance

Machine Readability

HTML and text extraction for searchability

Forensic Verification

Headers and metadata for authenticity

Accessibility

Clean text for screen readers and reformatting

Long term Resilience

Multiple formats protect against technology changes

This layered approach ensures that each source can be independently verified even if the original page changes or is removed.

5. When We Archive

5.1 Automatic Archiving

A source is archived automatically when:

  • it is cited in an explainer or analysis
  • it is likely to change over time (e.g., GOV.UK pages)
  • it forms part of a developing policy story
  • it is a government announcement or ministerial statement

5.2 Proactive Archiving

We may archive proactively when:

  • a source is at risk of removal
  • a page is part of a significant public interest event
  • a document is expected to be updated or replaced

6. Authenticity and Integrity

6.1 Preservation of Original Content

Archived material is preserved exactly as captured. We do not:

  • edit archived pages
  • alter text or layout
  • remove or modify metadata

If a source changes, a new snapshot is created.

6.2 Metadata and Audit Trail

Timestamp

When the capture was made

URL

Original source location

Capture Methods

Which tools were used

HTTP Headers

Server response data

File Hashes

Cryptographic verification

Extractor List

Processing method record

This provides a transparent, verifiable chain of custody.

7. Public Access

7.1 Default Openness

Open Access

Archived sources are made publicly accessible to support transparency and public understanding.

7.2 Access Restrictions

Access may be restricted when:

  • content contains personal data
  • the original publisher removed the content due to error
  • legal or ethical concerns apply
  • a valid takedown request is received

Where access is restricted, the reason is documented.

8. Takedown Requests

8.1 Grounds for Consideration

We consider takedown requests when archived content:

  • contains personal data
  • was published in error by the original source
  • is defamatory or unlawful
  • breaches copyright or licensing restrictions

8.2 Review Process

Requests are reviewed individually, balancing:

  • public interest
  • legal obligations
  • ethical considerations
  • the integrity of the archive

8.3 How to Request Removal

How to Request Removal

Requests can be submitted via the contact page on UKPoliticsDecoded.uk.

9. Legal Basis

9.1 Fair Dealing and Public Interest

Archiving is conducted under:

  • fair dealing for reporting, research, and review
  • public interest documentation
  • preservation of government and public records
  • legitimate interest in maintaining evidence for transparency

9.2 Copyright

We do not archive material where doing so would breach copyright law or licensing terms.

10. Updates to This Policy

This policy is reviewed periodically to reflect:

  • changes in UK law
  • updates to archiving technology
  • evolving editorial standards
  • public feedback

Revisions are logged on the Standards site.