Website privacy policies are around four times longer than they were two decades ago, according to new research by a cyber security expert at De Montfort University Leicester (DMU).
The research involved gathering data from privacy policies on some of the world’s most visited websites, as well as examining historical versions of webpages stored on the Internet Archive’s Wayback Machine.
This data was then analysed using the machine learning algorithm BERT, which examines large amounts of human language data to identify patterns.
“Privacy policies are notorious for being lengthy documents that are hard to understand and it is well-known that most users do not read privacy policies, but almost all users tick the box to agree with them.
“I think, from a user’s point of view, these policies are fundamentally broken.”
The research was conducted by Dr Isabel Wagner (pictured)
Her analysis also showed that policies published today are harder to read and require more access to user data for the organisations that write them.
According to the Flesch reading ease scale, which measures the readability of text, privacy policies written in 2021 had scores similar to academic papers written for the likes of the Harvard Law Review.
“We found concerning developments in the data practices described in policies, such as increased collection and sharing of sensitive data and lack of choice,” added Dr Wagner.
“It is especially concerning that these data practices are obscured in lengthy policies that require university education to understand, and that would take users more than one hour per day to read.”
As a result of the study, Dr Wagner suggests that until policies are significantly simplified, machine learning could help users navigate through the extensive jargon.
“If the user’s browser could automatically label what privacy policies say, using our machine learning approach, then the browser could also match this against the user’s preferences and display a user-friendly summary,” added Dr Wagner.
Posted on Wednesday 23rd February 2022