Researchers: Your ‘Anonymous Data’ May Not Be As Anonymous After All -- Security Today

Researchers: Your ‘Anonymous Data’ May Not Be As Anonymous After All

Americans could be signing over the keys to their identity when filling out medical forms that promise to “anonymize” their information, according to a new algorithm developed by scientists.

By Haley Samsel
Jul 25, 2019

When most Americans sign agreements allowing their medical records or personal information to be used for research, they are told that their data will be “anonymized” — in other words, it cannot be traced back to them. Residents who fill out Census Bureau forms, providing data that determines how government funds are distributed and may become public, are told the same thing.

But, according to research published in the journal Nature Tuesday, your data may not be as anonymous as you thought. Scientists at the Imperial College London and Université Catholique de Louvain in Belgium have come up with a computer algorithm that can identify 99.98 percent of Americans from “almost any available data set with as few as 15 attributes,” including gender, ZIP code or marital status, The New York Times reported.

In making the algorithm public, the researchers made a difficult choice in alerting the world to the massive amount of personal information already available via data sets that are bought and sold without regulation in many parts of the globe. Usually, the flaw is reported to a country or company, but the data privacy problem is so prevalent that the authors decided to publish it widely.

“It’s always a dilemma,” Yaniv Erlich, chief scientific officer at MyHeritage, a consumer genealogy service, told the Times. “Should we publish or not? The consensus so far is to disclose. That is how you advance the field: Publish the code, publish the finding.”

The finding poses a major issue for security experts tasked with protecting consumer data, particularly when it comes to medical and health data sets. Usually, researchers “de-identify” individuals by removing attributes, substituting fake values or by releasing only parts of anonymized data.

But this isn’t enough to protect people from being identified, either as individuals or part of a household data set, according to the study’s authors.

“We need to move beyond de-identification,” Alexandre de Montjoye, a computer and lead author of the paper, told the Times. “Anonymity is not a property of a data set, but is a property of how you use it.”

The balance between encouraging scientific research and potentially exposing the personal information of hundreds of millions of people to cybercriminals is extremely tricky, and the data gathered about individuals is never completely private, according to the researchers.

“You cannot reduce risk to zero,” Erlich said.

de Montjoye told the Times that medical professionals are now asking patients to sign forms letting them know that their medical data could be shared with other hospitals and a system that might give his information to universities, government agencies and private companies. One form he saw as a patient even said that he could be identified through the data he signed over.

“We are at a point where we know a risk exists and count on people saying they don’t care about privacy,” he said. “It’s insane.”

About the Author

Haley Samsel is an Associate Content Editor for the Infrastructure Solutions Group at 1105 Media.

Featured

How Commercial and Multifamily Trends are Changing Hardware Decisions

Commercial and mixed-use developments require precise hardware selection to prevent premature maintenance costs without overbuilding low-traffic openings. Read Now
- Access Control
- Physical Security
- Risk Management
Mega-Event Security Requires a Comprehensive Security Program

Modern stadium security operates like a temporary city command center, merging regional transit data and AI video analytics to ensure frictionless fan experiences. Read Now
In An Ever-changing Marketplace

After five decades covering security technology, adapting to digital transformation requires shifting from static print reporting to interactive, multi-format digital engagement. Read Now
- Dealers and Integrators
- Corporate
- Security Staffing
Why Edge and On-Prem GenAI Matter

Deploying natural-language Generative AI directly on edge devices protects critical infrastructure by accelerating forensic search without creating cloud cyber exposures. Read Now
Can Cloud Solutions Improve Security and Monitor Activities?

Migrating physical security IoT devices to hybrid cloud architectures shifts maintenance burdens while shielding networks against escalating device hacking threats. Read Now
- Physical Security
- Cybersecurity
- Network Centric

Artificial Intelligence

New Products

Compact IP Video Intercom

Viking’s X-205 Series of intercoms provide HD IP video and two-way voice communication - all wrapped up in an attractive compact chassis.
FEP GameChanger

Paige Datacom Solutions Introduces Important and Innovative Cabling Products GameChanger Cable, a proven and patented solution that significantly exceeds the reach of traditional category cable will now have a FEP/FEP construction.
AITX ROAMEO™ and SARA™

Deploy a unified, self-coordinating outdoor security presence by combining ROAMEO’s autonomous patrol capabilities with SARA’s proactive event assessment and real-time response.

Security Today eNews

Sign up today for essential industry news and product information that can help you stay afloat in the fast-paced world of security.

Email Address*Country*

Please type the letters/numbers you see above.