Limit data analysis by design

As nearly every human interaction now generates some form of data, systems should be designed to limit the invasiveness of data analysis by all parties in the transaction and networking.

Industry is gaining insights into and intelligence on our lives that were previously possessed by powerful Intelligence Agencies, and tomorrow their potential may exceed them. In the future, industry giants will have more insight into the world than the most powerful intelligence agencies. What they know and represent about us will have significant effects on individuals, groups, and whole societies.

What's the problem?

As a result of design choices in modern technologies, individual and collective behaviour is increasingly traceable. Metadata and logs, and other forms of observed data are generated of every interaction. The growing stores of data that companies and governments hold about individuals and groups is now automatically generated from human behaviour. This is at odds with how most users understand privacy as being about what they knowingly and overtly disclose to companies.

Powerful institutions with access to data now have unprecedented population-level knowledge about individuals, groups, communities, and whole nations and markets. With this knowledge they will have insight and intelligence on patterns of behaviour and other trends. They may identify customary behaviours and activities, as well as deviations. Even as these categories become divorced from the individual pieces of personal data, they provide powerful insights into how groups, societies and markets function. And they will likely be kept secret or understandable to the few. While monopolies are traditionally measured in terms of market power, this raises the question of how the data economy needs new ways to measure what qualifies as dominance in the marketplace.

Why this matters

In the future, industry giants will have more insight into the world than the most powerful intelligence agencies. What they know and represent about us will have significant effects on individuals, groups, and whole societies.

What we would like to see

We should be able to know the metadata and other observed and derived data that is generated through our interactions, and where this data leaks to and who has access, e.g. in WhatsApp and SMS or financial transactions, what does which provider have access to and what does that allow them to infer?

Individuals are asked for their consent when their data is to be used to generate analytics for purposes beyond their own direct advantage and legitimate interest, even if the data that is taken from their use is de-identified or anonymised.

Individuals should be able to filter-out metadata and other observed data and prevent processing on platforms, e.g. removing photo metadata and processing on platforms unless an individual wishes for metadata to be disclosed.

Where systems are de facto compulsory and it is impossible for individuals to object, they should be able to be pseudonymous and they must be able to represent themselves as is in their interests, which in inflexible systems would mean the ability to lie and fabricate data. The exception to this would be when systems have a legitimate and specific purpose, as those systems would minimise data, which should be the default.

Even as the user-interface of devices and services disappears and background processing takes more data than under knowledgeable consent, we need more transparency of the data processing on devices and the data emerging from devices and services. Just as firewalls are able to identify and interfere with flows of data from computers, we want to see innovations that give individuals controls over data emerging from other technologies, whether from scripts running on websites to IoT devices calling home.

What this will mean

Competition law would have to consider dominance of a company through the knowledge is has on individuals activities, intelligence and insight it possesses on individuals and groups and whole societies, and the choices it made through the design of systems.

User-generated content systems will permit users to control data disclosure, including through the restraint and even fabrication of observed data. A location tracking and communications tool should allow the user to mis-represent their location to others, instead of relying only on system-generated GPS data – and exceptions to this rule must be clear, e.g. gaming. By design these systems must allow for reduction of observed, derived and inferable data, e.g. in photos and posts.

Platforms should limit ability of third parties to conduct unlawful surveillance, and these third parties should not be able to collect personal data (e.g. photo location) except when necessary and proportionate to a legitimate aim. They should also inform users what data is accessible to third parties, how, and under what circumstances.

Essential reform actions

Regulators will need to broaden their remits around data, intelligence, and power, e.g. competition regulators need to reflect upon data, data protection regulators need to increase the scope of their work to consider analytics, anonymous data, group privacy.

Stronger controls on social media to prevent the generation of SOCMINT, and stronger rules on access by third parties.

Examples

Researchers scraped videos of transgender vloggers off YouTube without their knowledge to train facial recognition software

Examples

Facebook expands advertising transparency

Examples

Behavioural biometrics flag fraud but invade privacy

Examples

Consumer behaviour closely predicts politices, race, income, education, gender

Examples

Choice of iOS or Android can predict credit-worthiness

Examples

Polar social site reveals work and home locations of military personnel

Examples

Austria proposes to seize asylum seekers' devices to check their identities and travel routes

Examples

Smartphone app monitors mental health status

Impact Stories

Advocacy

Privacy International’s submission to the UN Special Rapporteur on the right to education

The October 2024 report on Artificial Intelligence in education referenced this submission but also adopted our recommendation that companies providing AI systems to educational institutions “make their technologies fully auditable by any third party”.

Long Read

The Identity Gatekeepers and the Future of Digital Identity

Updated on 7 October 2020 PI’s engagement with Yoti (a UK-based digital identity provider) resulted in improvements of the company’s privacy policy, which now includes a clearer description of how users’ personal data (including photo and passport data collected by the app) are processed.

Impact Case Study

Fighting data-exploitative business models

What is the problem Business models of lots of companies is based on data exploitation. Big Tech companies such Google, Amazon, Facebook; data brokers; online services; apps and many others [collect, use and share huge amounts of data about us](https://privacyinternational.org/long-read/4398

Impact Case Study

Image of human body with dots of data being extracted

The fight for the global standard in data protection law

What happened Strong and effective data protection law is a necessary safeguard against industry and governments' quest to exploit our data. A once-in-a-generation moment arose to reform the global standard on data protection law when the European Union decided to create a new legal regime. PI had

Report and Analysis

Long Read

Generic silver blister pill packets on a red background

News

23rd May 2025

Limit data analysis by design

What's the problem?

Why this matters

What we would like to see

What this will mean

Essential reform actions

Examples

Researchers scraped videos of transgender vloggers off YouTube without their knowledge to train facial recognition software

Facebook expands advertising transparency

Behavioural biometrics flag fraud but invade privacy

Consumer behaviour closely predicts politices, race, income, education, gender

Choice of iOS or Android can predict credit-worthiness

Polar social site reveals work and home locations of military personnel

Austria proposes to seize asylum seekers' devices to check their identities and travel routes

Smartphone app monitors mental health status

Impact Stories

Privacy International’s submission to the UN Special Rapporteur on the right to education

The Identity Gatekeepers and the Future of Digital Identity

Fighting data-exploitative business models

What is the problem Business models of lots of companies is based on data exploitation. Big Tech companies such Google, Amazon, Facebook; data brokers; online services; apps and many others [collect, use and share huge amounts of data about us](https://privacyinternational.org/long-read/4398

The fight for the global standard in data protection law

Report and Analysis

All Eyes on my Period? Period tracking apps and the future of privacy in a post-Roe world

Enter the Fediverse

Your future AI Assistant still needs to earn your trust

News

Are AI Assistants built for us or to exploit us? and other questions for the AI industry

The US border surveillance expansion has global implications

Generative AI won't take over the world, surveillance capitalism already has

Afghanistan: What Now After Two Decades of Building Data-Intensive Systems?

Op-ed: The Unequal Application of Advertising Transparency

Use of 2FA information for commercial purposes is unacceptable

Related Content

All Eyes on my Period? Period tracking apps and the future of privacy in a post-Roe world

Are AI Assistants built for us or to exploit us? and other questions for the AI industry

Enter the Fediverse

Your future AI Assistant still needs to earn your trust