Free O’Reilly Book: Graph Algorithms in Apache Spark and Neo4j
7 min read

Graph Algorithms Are the Key to Predictive Analytics
Graph analytics use relationships to reveal structural and predictive insights hiding in your data. Whether you are building dynamic network models, mitigating risk and fraud or forecasting real-world behavior. Graph algorithms provide one of the most potent approaches to analyzing connected data because their mathematical calculations are specifically built to operate on relationships. Relationships tend to be one of the strongest predictors of future behavior. Because graph algorithms use the connections inherent within data (i.e relationships), they reveal structures other types of analytics miss yet prove to be some of the most predictive elements in data. By integrating graph algorithms into our analytics, we can better infer behavior and improve our forecasts.Results of several graph algorithms applied to the Game of Thrones dataset.
Here’s What You’ll Learn When You Pick Up the Book
Graph Algorithms: Practical Examples in Apache Spark & Neo4j is for developers and data scientists looking to acquire graph algorithms skills to develop more intelligent solutions and enhance machine learning models. With hands-on examples of how to use graph algorithms in Apache Spark and Neo4j, the book also includes sample codes and tips for over 20 practical graph algorithms that cover optimal pathfinding, importance through centrality and community detection using methods like clustering and partitioning. With Graph Algorithms, you’ll learn how to:-
- Reveal more predictive elements in your data We take you through the fundamental concepts of graph analytics and how graph algorithms are built to extract contextually relevant information. You’ll learn how algorithms calculate important measures and get guidance on when to use them.
- Use structural patterns hidden in your data We discuss the impact of subtle parameters in popular algorithms like PageRank and Louvain Modularity with tips on how to avoid common pitfalls. You’ll see working code and sample datasets for Spark and Neo4j.
- Improve machine learning predictions We cover how algorithms like Label Propagation can be used for preprocessing data and semi-supervised learning. We walk through a machine learning workflow for link prediction, combining Neo4j for feature extraction and Spark MLlib for training a model using a random forest classifier. And demonstrate increased accuracy, precision and recall in our model.