Critically Endangered small

Since 1994, people have set up collections of digital content on the web with software now outdated. Those collections are valuable but lose funding and care as institutions re-configure their tasks and individuals retreat from their task due to retirement or (as volunteers) to old age.

Group: Web

Trend in 2021:

Consensus Decision

Added to List: 2019

Towards greater risk

Previous classification: Critically Endangered

Imminence of Action

Action is recommended within twelve months, detailed assessment is a priority.

Significance of Loss

The loss of tools, data or services within this group would impact on people and sectors around the world.

Effort to Preserve

Loss seems likely: by the time tools or techniques have been developed the material will likely have been lost.


Academic and institutional website from the first decade of the web containing details of research projects and interests as well as research data.

‘Practically Extinct’ in the Presence of Aggravating Conditions

Inaccessible to web archive; bespoke code; insufficient documentation; uncertain intellectual property right.

‘Endangered’ in the Presence of Good Practice

Secured by web archive; documentation and rights information published alongside

2021 Review

This entry was added in 2019. There are overlaps with the entry with the ‘Semi-Published Research Data’ entry, but it is a separate entry to distinguishing ‘current’ and ‘legacy’ collections with different risk profiles: in this case the fact that materials of legacy web collections are no longer actively maintained increased the classification to critically endangered. Loss has already occurred, and future loss can be prevented through approaches such as web archiving and code preservation, but risk has become greater due to security issues posed by hosting legacy technology software and services which have prompted disposal of content imminently without adequate review or selection.

Additional Jury Comments

The Internet Archive and other national web archiving bodies have copies of a lot of websites that would fit into this category but by no means all. There’s also a distinction between the software or code used to deliver the user experience and the data. Such code is secondary to the content.

This issue can be intensified by the legacy IT Infrastructure in cases where much of the content is hosted there, as security concerns may lead to disposal of content imminently IT. In these scenarios, their imminence of action becomes more urgent given the security issues posed by hosting legacy technology/software/etc.

Scroll to top