Our Insights

Thought Leadership and Industry Trends

Home 9 Insights 9 Data Privacy 9 The Evolving Role of AI in Protecting Data Privacy

The Evolving Role of AI in Protecting Data Privacy

Nov 19, 2021

Global privacy expert Jonathan Armstrong of Cordery addressed the role of analytics and AI in identifying and protecting data in our recent webinar, Global Data Privacy Update: GDPR Walks the Walk. This blog, adapted from his comments, is the second of a 3-part series. To start with Part 1, click here.

Growing tension between AI and data privacy compliance

First, from a regulatory point of view, we need to be clear about what AI is versus a very sophisticated algorithm that’s running a logical process. Of course, either can be useful in areas like eDiscovery, and all sorts of other ways. The first AI case that we had with the UK regulator was a project for a hospital to predict serious illness from patients based on patterns that had been observed in other patients.

There’s undoubtedly some good to be done under AI, but we’ve had a number of warnings now, both from the UK regulator in the Google health case that I’ve talked about. And then more recently, a campaign from the Spanish regulator, who persuaded the Italian regulator to join some investigations looking initially at least at food delivery apps. We’ve had a couple of cases that illustrate some of the issues.

AI under fire the EU

The first case is over a food delivery app, a Spanish entity owned by Glovo. And the Italian subsidiary is quite a popular app in Italy, with some 19,000 riders running around Italy, delivering food off that app and a second app that’s UK-based called Deliveroo. It has around 8,000 riders in Italy, again, running round delivering food. And the cases are somewhat similar. The fines are somewhat similar, 2.6 million euros for Glovo’s Foodinho and 2.5 million euros for Deliveroo.

Effectively, they were allocating riders to jobs on the basis of an algorithm that they called AI. And the regulator, the Italian Data Protection Authority, said they weren’t transparent in how that was running and that their algorithm wasn’t fair. It took too much data from the riders. They said that it was justified to take some geolocation data but not constant geolocation data.

They found it hard to justify things like capturing battery levels. And I know that’s been something that we’ve highlighted as a concern in the past. A couple of other corporations have gotten into trouble for this. My understanding is that it’s as simple as many developers just taking a standard set of data, either from Android or from Apple and that includes battery data. Some people don’t even want the data, but they just get it as part of the standard package.

But the more fundamental concern I think here was that at least one of the apps scored riders over whether they worked Friday, Saturday or Sundays. And the delivery company said that was necessary because they were busy times and wanted to incentivize people who would turn up and work when they were busy.

“Fine-plus” cases are catching on

The concern is that the Sabbath for most major religions falls on Friday, Saturday or Sunday. So, you may be discriminating against a Jewish rider or a Catholic rider based on them not wanting to work because they wanted to observe their Sabbath.

What we’re increasingly seeing under GDPR is these “fine-plus” cases – Italy has been the pioneer of that. The Telecom Italia mobile case starts off with a fine of 20 million euros plus do the following five things. And these are “fine-plus” cases as well in that the regulator fined both operations but also dictated what that algorithm should look like going forward. They have to do a data protection impact assessment. They have to be prepared to justify the lines in the code, if you like, or the AI parameters that are set there. And you can’t replicate human bias in machines.

Transparency sounds great, until it comes to source code

I think for the eDiscovery world, I think that’s somewhat instructive as well, because oftentimes we set our keywords, we set our rules, we set the engine in a way to go and do its stuff. And sometimes we teach the engine and sometimes in a more sophisticated setting, the engine learns itself, but we still have to know what’s going on. And one of the real challenges in this area is of course, a lot of people developing these applications don’t want to say how it’s built, because that’s their secret sauce. And if they’re a startup corporation for example, they want to hold on to that so that they can use that to attract customers and attract funding.

We’re going to see a real fight, I think, in AI in the next year or so as regulators get tougher on AI and want the source code to be released, want the coding criteria to be released. Developers will resist that and data subjects will sit in the middle and say, somebody made bad decisions about me and nobody’s being transparent.

This is complicated by the fact that we’re in a world at the moment where conspiracy theorists thrive. If you’re not transparent about AI, people assume or just make up the criteria that’s being used. My prediction is they’re going to be solid business reasons behind people being more transparent as well, so that they can say, no, you weren’t just excluded on the basis of your religion, you’re excluded because you always delivered 30 minutes later than Giuseppe. And the more transparent we are about how we’re making those decisions, maybe the less pushback we’ll get in this conspiracy-oriented world.

Click here to read Part III: 5 Proactive Steps Toward GDPR Compliance.

About the Author

Jonathan Armstrong

Qualified as a lawyer in the UK in 1991, Jonathan has focused on technology, risk and governance matters for more than 20 years. His practice includes advising multinational companies on matters involving risk, compliance and technology across Europe. He has handled legal matters in more than 60 countries involving emerging technology, corporate governance, ethics code implementation, reputation, internal investigations, marketing, branding and global privacy policies. Jonathan is recognized as one of the most influential figures in risk, data security, and compliance in the UK and internationally. For more, visit the Cordery website.

There are no upcoming events at this time

Cookie	Duration	Description
cookielawinfo-checbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
bcookie	2 years	This cookie is set by linkedIn. The purpose of the cookie is to enable LinkedIn functionalities on the page.
lang	session	This cookie is used to store the language preferences of a user to serve up content in that stored language the next time user visit the website.
lidc	1 day	This cookie is set by LinkedIn and used for routing.

Cookie	Duration	Description
_ga	2 years	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.
_gcl_au	3 months	This cookie is used by Google Analytics to understand user interaction with the website.
_gid	1 day	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visted in an anonymous form.
_hjFirstSeen	30 minutes	This is set by Hotjar to identify a new user’s first session. It stores a true/false value, indicating whether this was the first time Hotjar saw this user. It is used by Recording filters to identify new user sessions.
oktgid	1 year	This cookie is used for storing the visitor ID of the user who clicked on an okt.to link.
oktsid		This cookie is used for storing the session ID of the user who clicked on an okt.to link.
pardot	past	The cookie is set when the visitor is logged in as a Pardot user.
vuid	2 years	This domain of this cookie is owned by Vimeo. This cookie is used by vimeo to collect tracking information. It sets a unique ID to embed videos to the website.

Cookie	Duration	Description
_fbp	3 months	This cookie is set by Facebook to deliver advertisement when they are on Facebook or a digital platform powered by Facebook advertising after visiting this website.
bscookie	2 years	This cookie is a browser ID cookie set by Linked share Buttons and ad tags.
fr	3 months	The cookie is set by Facebook to show relevant advertisments to the users and measure and improve the advertisements. The cookie also tracks the behavior of the user across the web on sites that have Facebook pixel or Facebook social plugin.
IDE	1 year 24 days	Used by Google DoubleClick and stores information about how the user uses the website and any other advertisement before visiting the website. This is used to present users with ads that are relevant to them according to the user profile.
test_cookie	15 minutes	This cookie is set by doubleclick.net. The purpose of the cookie is to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	This cookie is set by Youtube. Used to track the information of the embedded YouTube videos on a website.

Cookie	Duration	Description
_dc_gtm_UA-109542572-2	1 minute	No description
_hjAbsoluteSessionInProgress	30 minutes	No description
_hjid	1 year	This cookie is set by Hotjar. This cookie is set when the customer first lands on a page with the Hotjar script. It is used to persist the random user ID, unique to that site on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID.
_hjIncludedInPageviewSample	2 minutes	No description
_hjTLDTest	session	No description
AnalyticsSyncHistory	1 month	No description
CONSENT	16 years 8 months 26 days 9 hours 2 minutes	No description
UserMatchHistory	1 month	Linkedin - Used to track visitors on multiple websites, in order to present relevant advertisement based on the visitor's preferences.

Our Insights

Thought Leadership and Industry Trends

The Evolving Role of AI in Protecting Data Privacy

Growing tension between AI and data privacy compliance

AI under fire the EU

“Fine-plus” cases are catching on

Transparency sounds great, until it comes to source code

Jonathan Armstrong

Our Blog

Sign Up for Our Newsletter

About CDS

Contact Us