Video Summary and Transcription
This Talk is about real-time data updates for Neo4j using GraphQL subscriptions. The Neo4j GraphQL library provides automatic schema generation, read queries, mutations, and resolvers. It handles subscriptions using WebSockets and is completely agnostic. The library also has built-in support for events and relationships, allowing for real-time updates and easy scaling. The Talk concludes with details about subscriptions using AMQP and a showcase of Neo4j GraphQL subscriptions in a project called neo place.
1. Introduction to Neo4j and GraphQL Subscriptions
I'm Thomas Wiese, a software engineer at Neo4j in the GraphQL team. Today, I'll talk about real-time data updates for Neo4j using GraphQL subscriptions. Neo4j is a heavily scalable database without a schema, based on nodes and relationships forming a graph. The query language for Neo4j is called Cipher, which is easy to learn and has powerful graph pattern matching capabilities. We have built an open source library in TypeScript that integrates Neo4j and GraphQL, allowing clients to send queries that are translated to Cypher and handled automatically in the background.
Hi all, I'm Thomas Wiese, I'm a software engineer at Neo4j in the GraphQL team. And today I'll be talking about real-time data updates for Neo4j using GraphQL subscriptions.
First things first, we have to talk about the database that we use, and it's Neo4j. And Neo4j is also a company, but mostly known for its database, a database that could actually scale very heavily and it doesn't have any schema. All you need is nodes and relationships which form a graph. As you can see one here, a movie graph with relationships and nodes. And the nodes are the bubbles and the relationships are the arrows. And the most important thing to note here is that both nodes and relationships can contain properties. So they're first class citizens.
And of course, if we have a database, we need a way to query that database or a query language. And that's what Neo4j built as well. It's called Cipher. And Cipher got a lot of inspiration from SQL or SQL. But it added some more things on top because we have to query graphs. So we have to do things like graph pattern matching. And one example, a simple example on top is here on the top row where you can see a match where we match a person who acted in a movie. And we have the arrows pointing or indicating a relationship. And this way, we can actually read it quite intuitively. And it's believe me, believe it or not, it's actually very easy to learn. And it has a lot of the power that SQL already has is built in here. So where clauses match, et cetera, et cetera. And we call this ASCII or it's almost ASCII art, what we do here.
So that we have, now we have a database, a really amazing database. And of course the topic of this conference, GrowthQL. And wouldn't it be great if we could just get those two together, right? Could we do something like this here? Yeah, of course, we can do that. Actually, we already built it for you. And we started two years ago building an open source library written in TypeScript for your convenience and all the library DOS or the centerpiece of the libraries, as you can see here in the image. It provides an API or it's the base for an API for a GrowthQL API, where the GrowthQL clients can send their GrowthQL queries to the API. Then the library in the background translates on the fly this query to a Cypher query, and sends that to the database and the response back to the API and then back to the client. And that's all handled for you automatically in the background.
2. Automatic Schema Generation
You don't have to do anything here, but we need an API, right? We have to define the API somehow. Usually, you would create your types, write read queries, mutations, and resolvers. But that's painful. I want automatic schema generation.
You don't have to do anything here, but we need an API, right? We have to define the API somehow. And usually you would go along with this or something like this, right? You create your types, your type movie here with a field title. And then you have to do this cumbersome stuff here. You have to write read queries. You have to write all these mutations. They add, change, remove, et cetera, et cetera. You even have to write the resolvers for all of that. I mean, how painful is that? No way. I'm not going to do that.
3. Automatic Generation and GraphQL Query Translation
I want automatic schema generation, right? That's exactly what we did with the Neo4j GraphQL library. You only need to define your types, and the library will handle the rest. It provides read queries, aggregates, connections like the relay connection, and even cursor-based pagination. The best part is that the resolvers are already written, and mutations are included too. Let's see how it works with a sample query to get movies released in 1999 and their actors.
I want automatic schema generation, right? I mean, you do the boilerplate, I do automatic schema generation. And that's exactly what we did with the Neo4j GraphQL library. So the only thing you have to provide to make your GraphQL API ready to be used is to define your types or the type definitions, which is take a very simple one here to get started. A movie with a title, all you need.
And then the library in the background for you will create this here. You will get the read queries out of the box. You haven't done anything, but it's all available for you. So you can do read of movies, filtering, where for instance. We have aggregates already ready for you, again with filtering built in. We have connections, like the relay connection, specification is implemented here. And where you can do cool things like cursor-based pagination, all of that out of the box.
And you know what's the best part of it? The resolvers are already written. You don't even have to write that as well. And it gets better. Wow. Mutations, they're also included. You can create, delete, update movies, all that. All these nice things. Again, filtering wherever you need it is already included. And again, the resolvers, they're actually already built in. You don't have to do anything. But that's kind of interesting. So let's have a look at how all of this works on that query level. So something you're familiar with to start with. We have a GraphQL query where we want to get all the movies which were released in 1999. We want to get the title and the name of all the actors that activated it. So far, so good.
The Neo4j library, GraphQL library, sorry, will then automatically translate this when it sees this or gets this GraphQL request to this Cypher query. And I'm going to walk you through it real quick. So on the top, we match all the movies.
4. GraphQL Subscriptions and Neo4j GraphQL Library
One GraphQL query matches one cypher query. The N plus one problem doesn't exist. We use WebSockets for real-time data updates. The Neo4j GraphQL library handles subscriptions. It's completely agnostic and can be built upon. Let's head over to the code and define the type definitions.
Then we get or filter out only the movies with a certain release date, 1999 in this case. And then in this call sub-query, the brackets, what we do is we do a relationship traversal from the actor or the movie to the actor. And we get all the names for it. And then we pack it all up, all the actor names and everything in a form that the GraphQL client can consume or that it actually expects. Do you know what's the best part about this? One GraphQL query matches one cypher query. Do you see what's missing? The N plus one problem, it just doesn't exist. We took care of it. You don't have that problem. We need one query resource in one call to the database and it's all done. Isn't that cool?
Now that you know all the basics there are to know, let's go on to the actual cool stuff, subscriptions. Just on the highest possible level, what are GraphQL subscriptions? Two entities are involved, the client and the server, where the client subscribes at the server for a particular event, and by doing that they open up a permanent connection. In the GraphQL specification, there is no clear definition of what the protocol, like on the transport layer, what the protocol is supposed to be here. It's actually up to the server. What we've seen is that most actually use WebSockets or the WebSocket protocol to do just that and in the demo that I'm going to show in a while, we do exactly that. We use WebSockets. So once the connection is established, the server then has the possibility to send or push through events to the client without the client having to do anything, right? No busy pulling, no nothing like that and we can send as many events as we want, which is pretty cool. So we have our real-time data updates. Just what the title has promised. Let's go a level deeper and have a look at how it's implemented using the Neo4j GraphQL library where the library is the instance in the center and there will be a couple of more of those pictures so it will always be the same, the instance in the center.
So in this setup here we have a client on the right hand side who subscribed to an event and then another client, the one on the left-hand side, sent a mutation. Think of creating a movie, right? Sent a mutation, it gets translated to a query, on the fly, in the Neo4j graphical library. It's executed against the database and then it comes back to the Neo4j graphical library which then informs all the subscribed clients that this event happened, right? Pretty straightforward to this point. There's one thing that we have to keep in mind here is if someone were to make a change to the Neo4j database there at the bottom of the image from the outside, so not using the Neo4j graphical API, we wouldn't detect that. That's a limitation right now, we know of that, but we actually work on it, so stay tuned. It might actually be fixed going forward soon enough. The other thing is this is completely agnostic, there's absolutely nothing that prevents you from building on top of this. But I think now I gave you enough context to see all of this in action. You can look at this demo or at the demo code later on from my colleague Andres, he built these amazing demos, not just the one I show you now, but several of those. But we're going to have a look at it, so let's head over to the code and look at some code. So, as you saw before, the first thing we have to do is to define the type definitions.
5. Type Definitions and Relationships
We define two types, movie and person, and specify their fields. We use a relationship directive to form the relationship between the movie and the person, specifying the type of relationship, direction, and properties. Relationships in Neo4j can hold data, and we specify this in GraphQL by adding an interface and providing roles for the movie.
And we're going to use a slightly bigger type definition than just the type movie. So the first thing we do is define two types, type movie and type person, and give them some fields, as we specified here. And then you see a relation or a directive that you haven't really seen before, which is called a relationship. And it does exactly that. It forms or specifies the relationship between the movie and the person. In the directive, we can specify what type of relationship it is, acted in in this case, the direction, and the properties. Because remember, in Neo4j, the relationships, they're first class citizens, they can hold data. And that's how we specify it in GraphQL. We specify an interface. And then we can add any data we want here, which is providing some roles, because you have to have a role in a movie, right? As a person. That forms our base.
6. Server Code and Subscriptions
Let's briefly look at the server code. Subscription is a plugin that can be developed by the community. For the demo, we use event emitters from Node.js. You define a plugin, get your type definitions, create a Neo4j driver, pass them to the Neo4j GraphQL class, and include the plugin. Then do the server setup with an express server and WebSocket server. Create a schema with resolvers and pass it to an Apollo server. The root types include query and mutation, but we're interested in subscriptions.
So let's briefly look at the server code, because that can be quite interesting for a lot of people as well. And the first thing to note here is subscription is a plugin. So we let people develop their own plugin. If the community, someone in the community, maybe you want to develop your own plugin, you're free to do that. We have provided a bunch of them for you already, but more on that later.
For the demo, we use something very simple. We use event emitters from Node.js to get it done in a fast and quick manner for this demo. So you define a plugin, we have some for you, so no need to write them yourself. But here, this is a demo, I want to point out a couple of things. Then what you got to do is you have to get your type definitions, of course. You have to create a driver, a Neo4j driver, to access your database. I started that in the background for the demo, so that's all running. No worries here. Then, as a first important point is we have to pass those to the Neo4j GraphQL class, which we then instantiate later on. And also include the plugin to make sure to the GraphQL API, or the Neo4j GraphQL instance that is running to show subscriptions are on, we're on here. And then we have to do the entire server setup. We have to get an express server, WebSocket server on top of that. I'm just leaving out a bunch of details, which you can look at your own convenience. They're not really adding to the demo right now, but it's not really hard. Don't worry. Then we have to create a schema including all the resolvers. And we just pass that schema to an Apollo server, because it's just a convenient way to do that, to use Apollo. We then we can start it. I did that already. So we're not really losing any valuable time.
So if we head over to the code, the first thing you're probably going to notice is over here, the root types. I mentioned those before. We have to query and mutation where the auto generated content is ready, create, update. But we're not interested in that right now. We want to have a look at subscriptions, right? So subscriptions comes with a lot of this automatically generated goodness out of the box for you.
7. Built-in Support for Events and Relationships
We have built-in support for creating, deleting, and updating movies and persons. We also have built-in support for relationships. For example, if you create a movie and an actor connection, it's automatically handled. Let's start with a simple case: creating a subscription for a movie created event. We can specify what information we want to return from the event. Then, we create a movie using a mutation, and we receive the subscription event along with the regular mutation response. We can see the event details and the movie data. The same concept applies to relationship events, where we can see the event details, the movie and actor involved, and the roles of the actor.
So we have, for instance, movie created deleted and updated. Same for a person. And again, we have where clauses or filtering already built in for you. The same we also have for relationships, right? Because they're first class citizens. So if you create a movie and then actor connection, this is built in for you. You can just use it. And that's what we're going to do now.
But I start with a simple case, so we all get our heads around it quite easily. So we can create a subscription, a movie created with an event and a timestamp. And then we want to know from that particular event that we get back from the movie what exactly we want to return. So what has been. And this is just standard GraphQL so I can add whatever I want here. And I know there's a tagline. So I'm going to add it. Then I hit subscribe. And I see it listening. What do we have to do now? Create a movie, right? Let's do that. So we create a movie. And we have a mutation create movie story of toys. And if we execute that, we see we already have a subscription in. But we also have the regular response from the mutation that comes back. And we have subscription events here like create. We saw the event that is create the timestamp and the movie in exactly the form that we requested. And this is subscription, right? We want to see multiple of those. Look at that too. And of course, if I add another one, it will all come in a natural flow. And I didn't have to do anything, right? Except listening to it. So, let's stop this example and hop over to the relationship one.
So, if there's a movie relationship created event, we do the same again. We want to see which event it is, the field name of it, then which movie got created, which actor node got created, or which actor, and the roles that actor had.
8. Real-time Data Updates and More Details
Let's hit subscribe and create a movie and an actor. It's amazing how fast it works. We can see the movie and actor created and connected in real-time. That concludes our demo, and now let's move on to more details.
And again, here we can do whatever GraphQL offers us at this point. Let's hit subscribe. Again, it's listening. That's amazing. And now let's create a movie and an actor, right? So, we create gum forest with an actor called Kevin Bacon. And he apparently was a runner in that movie. We have to see that, right? So, let's store it. And again, it's here, right? We have connect as an event type relationship field name gum forest is here. And of course, if I just create another Kevin Bacon, E, look at that, he's here as well. So, that all works out of the box. And it's extremely fast, as you can see, it's already here. So, that concludes our demo part. And I'm going to show you some more details in the background. So, let's head back into the slides. And now they're loaded.
9. Subscriptions and Scaling with Neo4j GraphQL
If you create a movie with a GraphQL query, the Neo4j library will generate the cipher query for you. When subscriptions are enabled, extra data is added to the cipher query to maintain data consistency. Scaling is handled by pushing events to a broker, which broadcasts them to all running instances. We have built a plugin that simplifies the implementation and setup.
So, prior to... So, if you don't have subscription turned on, so just regular GraphQL, it quotes. If you create a movie on the left-hand side, the hot fuzz with a GraphQL query, the Neo4j library will create this cipher query for you. So, a call subquery again in the brackets, create the movie, add the title, or set the title and return it in a fashion that a GraphQL client can understand. But then, as before in the demo, if we turn on subscriptions, we have to add some extra data.
The good thing is... And that's how it's supposed to be. The mutation on the left-hand side for GraphQL query didn't change at all. But we have to add some extra information to the cipher query, which is this metadata here with an event to create. As we've seen before, the type name, the timestamp, all the things you've seen in action before, we add that into the cipher query. Because then, at the bottom, you see there in the return statement, we return it back after it has hit the database. Why do we do that? This means we can keep consistency in the database without actually changing anything. We have one mutation that changes the database, but then we can have a multitude of subscribers on the other end that are consuming this event.
So, we have absolute data consistency in the database, and we can subscribe or send the event to as many subscribers as possible or as they're subscribed without having an impact on the data or the database. But that's actually the point. So many thousands of subscribers, is that going to work? Do we not need to horizontally scale all of that? In this case, your graphical library at some point, it cannot handle everything. So, we have to horizontally scale it. Which means we will end up with situations like this here, where a client subscribes to an instance where the mutation on the left-hand didn't even occur. Isn't there an amazing link here? I mean, how does this work? But obviously, rest assured, otherwise I wouldn't talk about it. This is fixed. We already built all of that for you. So, this will work, which means your use case will scale. Both your Neo4j database and your Neo4j GraphQL API will scale with your use case. And we do that, as you can see here, by after we do the mutation, the Cypher query, and it's back in the GraphQL library, we will push an event there, number three, to a broker, which can be Kafka, which can be a Redis, or a RabbitMQ. And that broker then, in its turn, will broadcast or fan out at the event to all the running instances. And those will then send the event to their subscribers. So, scaling's solved, right? And now you may say, well, Thomas, cool, that's a nice promise, but this must be very hard to implement and set up and all of that. Actually not, it's super easy. Look at that. Remember the plugin system from before? Well, we've already built a plugin for you that handles just that case.
10. Subscriptions AMQP and Neo Place
Subscriptions AMQP allows you to easily define connection details and credentials to your AMQP broker. Currently in beta, it will soon be generally available. We created neo place, inspired by Reddit's rplace, to showcase our GraphQL subscriptions. Built with Neo4j and Oracle databases, a broker, and our Neo4j GraphQL library, neo place offers real-time updates. Visit the open source library on GitHub and connect with our community on Discord. Thank you for your support!
It's called Subscriptions AMQP, and all you have to do is define the connection details and credentials to your AMQP broker. And yeah, sorry, you still have to set up that one. There's many ways to do that. So you only have to provide us the credentials or the plugin of us. And you pass that plugin to the Neo4j GraphQL instance or class as you did before and as you saw in the demo. And off you go, that's all you need to do. Currently Subscriptions are still in beta but they will very soon be generally available and then you can get started with this.
But to round things off, I want to talk about something really cool. So this you may have seen somewhere before. It's on Reddit and it was in 2017 on April Fool's Day. It's rplace and this allowed users to set one pixel or one color. Then they have to wait for 15 minutes and set another pixel and so on and so forth for a lot of users. We've created this really, really amazing community art and we thought, this is the coolest thing to show or to showcase our GraphQL subscriptions. And we did just that, we created neo place for you. You can actually access it right now under this link, under this bit.ly link or the QR code here that is visible. And we took the same ideas but built it with the things that I just highlighted before with Neo4j database in the cloud and Oracle database. And we used a broker to make horizontal scaling and Breeze. We of course use our Neo4j GraphQL library and the GraphQL API. And of course, I mean, GraphQL subscriptions for a real-time update of when people set the pixels as well. So go here, try it out and have fun, draw something amazing, and enjoy the power of GraphQL, GraphQL subscriptions, and Neo4j.
And that's actually all I have for you today. You can head over to the open source library under GitHub. You can see here on the link where you can find us. We have all the plugins there, a lot of things to take in. Open source for you, available all the time. And this also is a great opportunity to say thank you to our community. You can always reach out to us on Discord under this link. We really had a lot of time and inspiration from our community and they helped us build the product that is there now. And if we wouldn't have you as a community, we wouldn't be here right now presenting this wonderful product. So thank you very much and see you next time.
Comments