The Daily Sydney

Sydney news, every day

News

Sydney's Duplicate Image Problem: How the City Stacks Up Against London, Singapore and New York

From Parramatta council records to State Library archives, Sydney is quietly grappling with a digital housekeeping crisis that bigger cities solved years ago.

By Sydney News Desk · Published 5 July 2026, 5:16 am

3 min read

Sydney's Duplicate Image Problem: How the City Stacks Up Against London, Singapore and New York
Photo: Photo by Talha Resitoglu on Pexels

Sydney's public institutions are sitting on tens of millions of duplicate digital images — redundant photographs, scanned documents and heritage records stored multiple times across fragmented government servers — and the bill for fixing it is rising faster than the storage costs themselves. The issue landed back on the agenda this month after a City of Sydney Council audit flagged the problem in its digital asset management review, drawing comparisons with infrastructure overhauls already completed in London, Singapore and parts of New York.

The timing matters. NSW is mid-way through an ambitious digitisation push tied to the state's broader public sector modernisation agenda. The State Library of New South Wales on Macquarie Street has been scanning fragile colonial-era materials since 2019. The City of Parramatta Council — which manages one of the fastest-growing local government areas in the country — accelerated its own records digitisation program after the 2022 merger consolidation work wrapped up. Both institutions, along with dozens of smaller councils and state agencies, now hold sprawling image repositories that were built quickly and without a unified deduplication standard. The result is predictable: the same photograph of, say, the 1890s Parramatta riverbank sitting in four different folders across three different systems, each copy incurring storage, licensing and retrieval costs.

What Other Cities Did First

London's approach is the most cited benchmark. The Greater London Authority consolidated its visual assets under a single digital asset management platform, a process completed in stages between 2021 and 2023 across more than 30 borough councils. Singapore's National Archives ran a parallel deduplication program from 2020, using perceptual hashing — a technique that identifies near-identical images even when file names, formats or metadata differ — to cull redundant files from its national digitisation archive. New York City's Department of Records and Information Services published a public-facing deduplication policy in 2022 after an internal review found roughly 18 per cent of its digitised photograph collection was duplicated at least once.

Sydney has no equivalent published policy. A spokesperson for the NSW Department of Customer Service, which oversees the state's digital infrastructure strategy, has not responded to requests for comment before deadline. The City of Sydney Council confirmed its digital asset review is ongoing but declined to provide figures ahead of a formal report expected later this quarter.

The practical consequences go beyond wasted storage. When emergency services, planners or journalists submit freedom of information requests for historical imagery — say, aerial shots of Western Sydney flood corridors or before-and-after photographs of the Barangaroo foreshore — duplicate files slow retrieval, inflate response times and sometimes return contradictory metadata. Infrastructure NSW has flagged image data integrity as a secondary concern in its Metro West project documentation, though the agency has not attributed specific costs to the problem publicly.

The Local Fix Is Fragmented

Several Sydney institutions are moving independently. The Powerhouse Museum, which relocated its main collection to Parramatta's Riverside Precinct, began a deduplication audit of its digitised object photography in early 2026 as part of a broader collection management overhaul. The State Archives NSW at Kingswood in Western Sydney is understood to be trialling automated deduplication tools across its post-1990 scanned document holdings, though the program has no confirmed public timeline or budget allocation.

What Sydney lacks — and what London, Singapore and New York built — is a cross-agency coordination framework. Without one, each institution re-solves the same technical problem at its own expense. A 2024 report by the Australian Library and Information Association estimated that Australian cultural institutions collectively spend between $40 million and $60 million annually on redundant digital storage, though that figure covers all file types, not images alone.

The path forward is clearer than the timeline. Institutions looking to act before any state-level policy emerges have three practical options: adopt open deduplication standards like those published by the Digital Preservation Coalition, run perceptual hashing tools across legacy repositories before migrating to new platforms, or join one of the existing cross-council digital asset consortia already operating in Victoria and Queensland. NSW has neither mandated nor funded any of those options. Until it does, Sydney's digital image libraries will keep growing — and so will the waste inside them.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Sydney

This article was produced by the The Daily Sydney editorial desk and covers news in Sydney. See our editorial standards for how we use AI.

The Daily Sydney brief

The day's Sydney news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Sydney and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Sydney news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Sydney and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Sydney

More in News

Enjoyed this story? Get tomorrow's briefing free.