It’s All in the Relationships: 15 Rules of a Native Graph Database

Learn 15 aspects of a native graph database.

Like a complex system grid or an air-traffic-control map, a graph database is represented as a network of nodes and connections called a labeled property graph. The nodes, which appear as circles or squares, represent entities such as people, products, companies or orders.

In Neo4j, the connections between database nodes are called relationships, and those relationships are as important as the nodes they connect.

Each relationship is directional and knows its beginning and ending node, and each node knows about all other nodes with which it has an inbound or outbound relationship – an advantage known as index-free adjacency.

native graph database

Native Graph Storage and Processing

But a property graph model is a lot more than a picture. It actually describes how you access and query information in the database.

To maximize efficiency and performance, a graph database platform must store and access data as it appears in the database’s property graph. A database platform that employs such an approach is known as a native graph database.

Graph Imposters Pay the Price

For datasets that are connected, complex or changing, non-native graph database approaches are inherently flawed with performance, integrity, ease-of-use and scalability risks.

Relational databases bolt on extra processing layers that translate graph queries into relational-table-based storage and processing models used by the underlying database technologies. The result is considerably slower performance and application responsiveness that often renders graph applications completely unusable.

Most NoSQL database technologies add similar translation layers and inefficiencies to create graph applications, again resulting in sluggish performance and even risking corrupt or unsuccessful graph query results.

15 Rules of the Native Graph Database

Graph database management systems must model, manage and access data and their relationships entirely through native data storage and graph processing methods. To be a native graph DBMS, a technology must conform with these basic rules.

Relational and NoSQL databases break the rules by layering graph capabilities atop non-graph data and are plagued by performance, latency, consistency and data-corruption problems.

  1. Native Storage and Modeling
    Store and model data as a graph of relationships instead of in rows and columns, indexed records, or in any other structure.
  2. Native Graph Management
    Manage data and relationships entirely through native graph capabilities, and not through a graph-logic layer that sits atop a non-graph storage or processing foundation.
  3. First-Class Relationships
    Treat relationships among graph data elements as first-class database elements, complete with directional and quantifying properties used by the graph database engine.
  4. Real-Time Availability
    Query data in real time regardless of the volume or complexity of its underlying relationships.
  5. Index-Free Adjacency
    Link every data element directly to its incoming and outgoing relationships, making it possible to traverse millions of records per second.
  6. Comprehensive Data Management
    Fully handle the retrieval, insertion, modification and deletion of data and underlying relationships.
  7. Discrete Management
    Add and modify data and relationships without having to make changes to the existing database schema, data or relationships.
  8. Cypher Support
    Fully support Cypher, the open-standard query language embraced by industry leaders.
  9. Nonsubversion
    Prohibit attempts to access or modify data via bypassing openCypher or subverting integrity rules and constraints enforced by the graph database.
  10. ACID Transactions
    Ensure that all transactions are ACID – i.e., follow the rules of Atomicity, Consistency, Isolation and Durability – to guarantee graph and data consistency.
  11. Consistent Reads
    Ensure that users who read and re-read data always see the same data unless others perform intervening updates.
  12. Consistent Writes
    Ensure that users who write and update data always see the latest data unless others perform intervening updates.
  13. Integrity Independence
    Store data-integrity constraints in the graph data catalog, not in application programs.
  14. Data independence
    Applications are logically unaffected when underlying graph data storage representations or access methods change.
  15. Seamless Presentation
    Store and display data relationship graphs in a unified manner to provide users a seamless view of the entire graph model of the database, regardless of where the data is stored.

Is Relational Technology Dead?

Does all this mean that the days of relational database technology are coming to a close? Certainly not.

Relational approaches are appropriate for tabular data with static schemas, but not for the demands of highly connected or changing datasets. And for queries that traverse multiple levels through graph data – such as friends-of-friends-of-friends queries – the resulting swarm of table index lookups relegates relational performance to unacceptable levels.

In stark contrast, graph databases use index-free adjacency to traverse millions of data records with sub-second response times, even when those queries mine data several layers deep.

Want to learn more on how relational databases compare to their graph counterparts? Get The Definitive Guide to Graph Databases for the RDBMS Developer, and discover when and how to use graphs in conjunction with your relational database.

Get the Ebook