Max Harlow

11 Jun 2025
Britain counts the mounting cost of taxing wealthy 'non-doms'
Bloomberg News

A look at the effects of the Labour government’s tax changes targeting the wealthy, and whether they are likely to achieve their aim of bringing in more tax revenue. I analysed filings from Companies House to identify when and how business leaders had changed their country of residence over the previous four years.
25 Jun 2024
Labour diverts activists away from Lib Dem target seats
Financial Times

Revealed Labour’s electoral strategy in the runup to the 2024 general election which would likely maximise Conservative losses in the south of England. I scraped and analysed data from Labour’s volunteering website, which directs activists to where they should campaign.

CSV Match

Finds fuzzy matches between CSV files. Based on Textmatch, a Python library I also maintain. Has been used by news organisations including the Wall Street Journal who used it to match up officials’ shareholding declarations with names of companies their agency had oversight of and the New Humanitarian who used it to identify a company the United Nations had a contract with who was also on its own sanctions list.
Ship Overviewer

Processes ship tracking data and generates a summary of where the vessel has been, and identifies any gaps. It can also highlight where data has changed, which can be used to spot where transponder data has been spoofed.

21 May 2022
How to be a (better) data editor
Dataharvest 2022Mechelen, Belgium
As data journalism has become mainstream, more data editor positions have been created. But what makes a good data editor? In this panel we discussed what it takes to do the job effectively, the different things it can involve, and the different routes to getting there. With Marie-Louise Timcke, Jan Strozyk, Helena Bengtsson, Eva Belmonte, and Dominik Balmer, moderated by me.
23 Mar 2022
Investigative data journalism
Journalism by Numbers, Birkbeck UniversityLondon, UK
Guest lecture covering the origins of investigative data journalism, the nature of data in investigations, where it comes from, plus what code is and how it is used in the newsroom to do this kind of work.

24 May 2025
Scraping the unscrapable: advanced approaches to deal with complex sites and evade anti-scraping systems
Dataharvest 2025Mechelen, Belgium
Scraped data is often the backbone of an investigation, but some websites are more difficult to scrape than others. This session covers best practices for dealing with tricky sites, including coping with captchas, using proxy and other scraping services, plus the tradeoffs and costs of these approaches.
7 Mar 2025
Finding needles in haystacks with fuzzy matching
Nicar 2025Minneapolis, USA
Fuzzy matching is a process for linking up names that are similar but not quite the same. It can be an important part of data-led investigations, identifying connections between key people and companies that are relevant to a story. This class covers how it fits into the investigative process, and includes a practical introduction to using the CSV Match tool I developed.