How Neo4j Was Used to Analyze the Panama Papers Dataset – Interview with Mar Cabra of the ICIJ

09 Aug, 2019



Bryce Merkl Sasaki interviews Mar Cabra, Editor of the Data & Research Unit at the International Consortium of Investigative Journalists (ICIJ), during GraphConnect Europe 2016. In April 2016, the ICIJ used the Neo4j graph database and the graph visualization tool Linkurious to coordinate the cross-border investigation of a 2.6 TB leaked dataset from Panamanian law firm Mossack Fonseca, dubbed “the Panama Papers.” The investigation involved more than 370 reporters from more than 100 media organizations in 76 countries and required a number of open source big data tools to process and analyze the connections among the dataset’s 11.5 million documents. Mar discusses the tools and approaches her three-developer team used to make this dataset accessible and digestible to non-technical reporters and seasoned data journalists alike. Watch this video to see Mar’s complete presentation, “How the ICIJ Used Neo4j to Unravel the Panama Papers”: https://www.youtube.com/watch?v=S20XMQyvANY