Discovering these connections would have been impossible if the information had been structured into a tabular format, according to Eifrem. The ICIJ used Neo4j and other tools to detect connections that were otherwise invisible — like the shared address between Gunnlaugsson and his wife, Anna Sigurlaug Pálsdóttir. Pierre Romera, chief technology officer of the ICIJ, told Business Insider: “Most of the leaks we get are not structured since they are raw documents. “With the Paradise Papers, those documents represented 1.4 TB of data and were gathered from different sources. Putting them in a single one database was a challenge for us. With Neo4j and [visualisation tool] Linkurious, and after a few weeks of research, we were able to propose to our 382 journalists a way to explore the data and also to share visualisations from stories they were working on. It’s surprising how intuitive a graph database can be for non-tech savvy people. Thanks to this approach, we could both investigate and prepare the future releases.”Read more here.
Keywords: