Load CSV in the Real World
Load CSV is an incredibly agile and useful tool for getting datasets large and small into Neo4j. In this live-coding session, Nicole will demonstrate the process of downloading a raw .csv file from the Internet and importing it into Neo4j. This will include cleaning the .csv file, visualizing a data model, and writing the Cypher query that will import the data. This presentation is meant to make Neo4j users aware of common obstacles when dealing with real-world data in .csv format, along with best practices when using LOAD CSV.Follow along!
We’ve pulled together everything you will need to do this on your own machine. (Note that Nicole uses a 16GB machine. If you have less RAM and particularly if you are on Windows, check out the links in the Further Reading section of this post.)Things you need
Things you optionally need
- Sublime Text 2 (the editor used in the webinar) along with the Cypher plugin.
LOAD_CSV.cql
. A handful of example queries for asking questions of the data are located in example_queries.cql
in the Github Repo.
Further reading:
Cypher: Avoiding the Eager by Mark Needham
Load CSVs with Success By Michael Hunger
Load CSVs into Neo4j Quickly and Successfully By Michael Hunger
Speaker: Nicole White, Data Scientist, Neo Technology
