De-anonymising public data
Jinfo Blog
9th February 2010
By Anne Jordan
Item
âDe-anonymiseâ does not yet appear in the Oxford English dictionary, but it may be a word the editors of that venerable authority on the English language may want to review for addition. I recently came across the word in an article about how public data may be misused by criminals to discover personal level data, potentially limiting the roll-out of recent government data initiatives. Criminal activity on the web is nothing new. Last weekend the website of Tata Consultancy Services, one of Indiaâs largest software and services companies was hacked and a âFor Saleâ message posted. Google and Twitter have also suffered breaches in the past. Whether a joke, for political ends, or criminal purposes, these events can be embarrassing and potentially commercially damaging. A recent article has reported another risk for website owners to guard against, and particularly sites hosting public data sets, such as the UK local and national data initiatives, the Greater London Authorityâs Datastore and the UK governmentâs data.gov.uk. These have been launched since the New Year and welcomed in LiveWire postings by myself and Michele Bate at http://digbig.com/5bbbmn and http://digbig.com/5bbbmq. The article in The Guardian (http://digbig.com/5bbbnr) looks at how statistical "de-anonymisation" techniques might limit the roll-out of such public data initiatives. Computer scientists in the US have discovered ways to "re-identify" the names of people included in supposedly anonymous datasets. The example cited is a movie rental company but there are more serious implications. The discovery that lists can be "de-anonymised" needs to be included in the debate about how information is released and where to draw the line. Dr Ian Brown, of the Oxford Internet Institute believes the discovery raises concerns about initiatives such as Data.gov.uk. He says: "they are looking at releasing crime reports down to street level. You have to think about how people might be able to link that back to individuals."About this article
- Blog post title: De-anonymising public data
- Link to this page
- View printable version
What's new at Jinfo?
Community session
11th December 2024
2025 strategic planning; evaluating research reports; The Financial Times, news and AI
5th November 2024
How are information managers getting involved with AI? Navigating privacy, ethics, and intellectual property
- 2025 strategic planning; evaluating research reports; The Financial Times, news and AI
5th November 2024 - All recent Jinfo Subscription content
31st October 2024 - End-user training best practice research
24th October 2024
- Jinfo Community session (TBC) (Community) 23rd January 2025
- Clinic on contracting for AI (Community) 11th December 2024
- Discussing news and AI strategies with the Financial Times (Community) 21st November 2024
Learn more about the Jinfo Subscription