UK’s ICO Opens Formal Investigation into Perplexity’s Data Scraping Practices
The UK ICO has opened a formal investigation into Perplexity’s scraping of paywalled content, mirroring EU complaints filed in January 2026.
Perplexity, the AI search company that built its reputation on summarizing the web so you don’t have to read it, is now facing serious questions about which parts of the web it was actually allowed to read. The UK Information Commissioner’s Office has opened a formal investigation into the company’s data practices, focusing specifically on whether Perplexity scrapes content from paywalled publications without explicit permission from those publishers.
The probe puts Perplexity squarely in the crosshairs of UK data protection law — and it doesn’t arrive in isolation. EU privacy complaints targeting the same behavior were filed in January 2026, meaning the company is now fighting regulatory fires on two fronts simultaneously. For a startup that raised hundreds of millions of dollars on the promise of being a better search engine, this is an expensive moment to be explaining yourself to multiple regulators.
What the ICO Is Actually Looking At
The investigation centers on a specific and thorny question: does systematically scraping content that sits behind paywalls — content that publishers explicitly restrict — violate UK data protection rules and consent requirements inherited from GDPR frameworks? Publishers argue it clearly does. Their business model depends on controlling access to their journalism, and an AI tool that ingests and reproduces that content without a licensing agreement essentially freeloads off years of reporting investment.

The ICO’s involvement signals that this isn’t just a copyright dispute between Perplexity and a few newspaper groups — it’s a data protection issue with regulatory teeth. Under UK GDPR, processing data without a lawful basis is a prosecutable offense, not just an awkward conversation. The ICO has the authority to issue fines, impose operational restrictions, and in serious cases, order a company to stop processing data entirely.
Perplexity’s likely defense — that web scraping is a standard industry practice, that its crawler respects robots.txt signals, or that its use falls under some version of research exemption — has already been tested by the company in response to earlier publisher complaints, with mixed results. Regulators tend to be less impressed by technical compliance arguments when the underlying behavior strips revenue from identifiable third parties.
A Coordinated Regulatory Squeeze

The timing of the UK investigation, following closely behind the EU complaints filed in January, suggests something more coordinated than coincidence. European and UK data regulators have maintained close working relationships post-Brexit, and it would be naive to assume both sets of authorities landed on Perplexity independently. The convergence is probably the point.
This pattern is familiar from earlier rounds of tech regulation. When multiple jurisdictions pile in within weeks of each other, it typically precedes enforcement rather than just extended dialogue. The Q2 2026 window cited as a potential timeline for restrictions in the UK feels plausible given the ICO’s recent pace on high-profile investigations.
Publishers — particularly those in the UK news industry who’ve watched their subscription revenue get undercut for years — will be watching this case closely. If the ICO rules that scraping paywalled content without consent is unlawful, every AI company that’s been treating the open web as a free-for-all training and retrieval buffet will need to revisit its data sourcing strategy.
What’s Next for Perplexity
Perplexity now has to engage formally with the ICO, which means documentation requests, legal responses, and the kind of regulatory attention that tends to distract a growth-stage company from, well, growing. The EU process runs in parallel, and the outcomes don’t have to match — different legal frameworks, different timelines, potentially different conclusions.
What’s harder to dismiss is the broader signal. AI companies have operated under a loose assumption that if content is technically reachable on the internet, it’s fair game to process. Regulators in both the UK and EU are actively contesting that assumption — and Perplexity is currently the test case. How this investigation resolves will set a precedent that affects every AI search tool, every training pipeline, and every startup that’s currently scraping its way to a product demo.
For now, Perplexity is in the uncomfortable position of being a case study in real time. The company hasn’t collapsed under publisher pressure before, but those were lawsuits. This is a data protection regulator with statutory authority — a different kind of problem entirely.


