English versionEN

Enlist the Help of Your Schema for Caching!

Based on our experience running GraphCDN we’ve seen schemas that make it easier to cache and others that might throw some stones in your way. Let me share how to avoid common pitfalls and stay clear of the boulders and let your schema help you with caching GraphQL responses.

This talk has been presented at GraphQL Galaxy 2021, check out the latest edition of this Tech Conference.

FAQ

Marco Locher is part of the GraphCDN team.

Consistent naming for cacheable types is crucial because most clients and GraphQL tooling default to using standard fields like ID or _ID for caching. Inconsistent naming can lead to difficulties in caching and purging objects from the cache when backend modifications occur.

Schema awareness allows the cache to make smarter decisions based on the types returned by queries. This includes optimizing cache storage, handling fragments and interfaces efficiently, and potentially returning partial results while fetching missing fields in the background, enhancing performance and user experience.

Duplicating types in the schema to add a single field can lead to inefficiencies in the cache, as it may need to store the data twice depending on the presence of the extra field. This reduces cache efficiency and increases resource consumption.

It is recommended to use standardized specifications like the cursor connection specification for handling metadata, especially in contexts like pagination. This avoids extending types with fields that are only required in specific contexts, thus maintaining efficiency and consistency in the cache.

Splitting queries can be beneficial as it allows for caching public data that can be reused across different users, enhancing cache effectiveness. This approach separates public and private data, allowing public parts of queries to be cached and reused, thus optimizing performance and resource utilization.

graphql

Marko Locher

6 min

09 Dec, 2021

Comments

Video Summary and Transcription

In this lightning talk, the speaker discusses best practices for caching GraphQL APIs. They emphasize the importance of consistent naming and configuration for caching fields. Keeping types consistent and making the cache aware of the schema can improve efficiency. The speaker also suggests splitting queries to optimize caching and reduce server round trips.

Available in Español: ¡Utiliza tu esquema para el almacenamiento en caché!

1. Caching Best Practices

Short description:

In this lightning talk, I'd like to give you some pointers on how your schema can help when caching GraphQL APIs, both via document caches like GraphCDN, but also normalized caches like they're implemented in clients like Apollo, Client, or Urcl. The very first item that I'd like to talk about is having consistent naming for the fields you want to cache. It is important to configure your clients accurately to use the appropriate fields. Keeping your types consistent is also crucial to avoid duplication and make your cache more efficient. Additionally, making your cache aware of your schema unlocks additional functionality and allows for smarter decisions based on the types returned by your queries. Lastly, consider splitting queries in some cases to optimize caching and reduce round trips to the server.

Hello, everybody. My name is Marco Locher and I'm part of the GraphCDN team. If you're interested in GraphCDN or caching GraphQL in general, I hope you didn't miss Max's talk on how to etch GraphQL APIs earlier today.

The very first item that I'd like to talk about might seem very obvious, but having ideas on the types that you want to cache and having a consistent naming for those fields is quite important. We've seen a couple of projects from our own users where changing those had a ton of impact. If you can stick to ID or underscore ID, most clients and related GraphQL tooling will use those fields by default. However, if you can't because you're already using other names for those fields in your legacy APIs or it's not easy, you can configure your clients to use the appropriate fields. It is very important to make sure that that configuration is accurate. On GraphCDN, for example, you would configure those fields as so-called key fields, and they define how you can find those objects in the cache again, and more importantly, how you can purge them if you make modifications on your backend. Similarly, we would recommend that you stick to globally unique IDs in your application, for example, using UUIDs or something similar, but if that's not something that you can accommodate, there are workarounds for that as well that most clients implement.

The second item is keeping your types consistent. When working with our users, we have seen schemas where types are duplicated to add a single field. However, they were otherwise identical and they were representing the same data. This will make your cache less efficient as it will now need to store the data twice, depending on whether that extra field is present or not. Similarly, if you want to enrich the data with metadata that might only be required in a very specific context, thinking about search results, for example, where you want to show the search string in a highlighted way, we would recommend implementing a concept like the cursor connection specification for that metadata, instead of extending your type with fields that are only required in a very specific context. Even though the cursor connection spec is mainly aimed at handling pagination, it lends itself very well to being extended for other similar use cases like the one that I talked about just now.

Very important, as well, is to make sure that your cache is aware of your schema. Every cache will offer some functionality, whether it's aware of the schema or not. However, if you make your cache aware of the schema of your data, it will unlock additional functionality that wouldn't be possible otherwise. It can make smarter decisions based on what types are returned by your queries, something that is especially important in the context of fragments or when you're using interfaces. Having your cache aware of your schema will also allow it to return partial results based on already cached data if the missing fields are designated as optional. And while your app is already displaying some information to the user, the cache fetches the missing fields in the background. Without that knowledge, that would have been a query that would have been returned by your API directly and the cache would not have been involved with that at all.

And lastly, if you're using a document cache like RefCDN, in some cases, you might actually be better splitting your queries instead of submitting just a single one. I know GraphQL is known for its flexibility and for the fact that you can customize each query to get you exactly the data that you need. But in some cases, splitting queries and having more than one might actually be beneficial. For example, let's take a look at a query that fetches a list of the most recent articles from a blog as well as a list of recommendations based on the currently logged in user. A document-based cache like RefCDN, for example, takes a look at the whole response that you get. And if it is tied to a specific user, then it would only be able to use that cache data for that specific user in the future again. However, if you split that query into a public part and a more private part, the public part can be reused for every single user, no matter whether they are logged in or not, no matter where they're based. And especially when working with both a document-based cache and a normalized cache as part of your GraphQL client, the client will not be impacted by that either, since it will already have most of the data or even all of the data required in its local cache and would not require a round trip to the server at all.

Thank you very much for listening in. I hope there were some valuable takeaways for all of you. If you have any questions, I'm happy to answer them during the Q&A session or you can ping me on Twitter as well.

Available in other languages:

Check out more articles and videos

We constantly think of articles and videos that might spark Git people interest / skill us up or help building a stellar career

From GraphQL Zero to GraphQL Hero with RedwoodJS

GraphQL Galaxy 2021

32 min

From GraphQL Zero to GraphQL Hero with RedwoodJS

Top Content

Tom Preston-Werner

GitHub cofounder, RedwoodJS author

Tom Pressenwurter introduces Redwood.js, a full stack app framework for building GraphQL APIs easily and maintainably. He demonstrates a Redwood.js application with a React-based front end and a Node.js API. Redwood.js offers a simplified folder structure and schema for organizing the application. It provides easy data manipulation and CRUD operations through GraphQL functions. Redwood.js allows for easy implementation of new queries and directives, including authentication and limiting access to data. It is a stable and production-ready framework that integrates well with other front-end technologies.

frameworks graphql redwoodjs builders and founders

Local State and Server Cache: Finding a Balance

Vue.js London Live 2021

24 min

Local State and Server Cache: Finding a Balance

Top Content

Natalia Tepluhina

GitLab

This Talk discusses handling local state in software development, particularly when dealing with asynchronous behavior and API requests. It explores the challenges of managing global state and the need for actions when handling server data. The Talk also highlights the issue of fetching data not in Vuex and the challenges of keeping data up-to-date in Vuex. It mentions alternative tools like Apollo Client and React Query for handling local state. The Talk concludes with a discussion on GitLab going public and the celebration that followed.

graphql vue server cache

Batteries Included Reimagined - The Revival of GraphQL Yoga

GraphQL Galaxy 2021

33 min

Batteries Included Reimagined - The Revival of GraphQL Yoga

Uri Goldshtein

Founder of The Guild, the largest open source group in GraphQL ecosystem.

Envelope is a powerful GraphQL plugin system that simplifies server development and allows for powerful plugin integration. It provides conformity for large corporations with multiple GraphQL servers and can be used with various frameworks. Envelope acts as the Babel of GraphQL, allowing the use of non-spec features. The Guild offers GraphQL Hive, a service similar to Apollo Studio, and encourages collaboration with other frameworks and languages.

graphql react server components

Rock Solid React and GraphQL Apps for People in a Hurry

GraphQL Galaxy 2022

29 min

Rock Solid React and GraphQL Apps for People in a Hurry

Ryan Chenkie

Founder @ CourseLift

The Talk discusses the challenges and advancements in using GraphQL and React together. It introduces RedwoodJS, a framework that simplifies frontend-backend integration and provides features like code generation, scaffolding, and authentication. The Talk demonstrates how to set up a Redwood project, generate layouts and models, and perform CRUD operations. Redwood automates many GraphQL parts and provides an easy way for developers to get started with GraphQL. It also highlights the benefits of Redwood and suggests checking out RedwoodJS.com for more information.

react graphql

Adopting GraphQL in an Enterprise

GraphQL Galaxy 2021

32 min

Adopting GraphQL in an Enterprise

Shruti Kapoor

Lead Front End Engineer @ Slack

Today's Talk is about adopting GraphQL in an enterprise. It discusses the challenges of using REST APIs and the benefits of GraphQL. The Talk explores different approaches to adopting GraphQL, including coexistence with REST APIs. It emphasizes the power of GraphQL and provides tips for successful adoption. Overall, the Talk highlights the advantages of GraphQL in terms of efficiency, collaboration, and control over APIs.

graphql enterprise

Step aside resolvers: a new approach to GraphQL execution

GraphQL Galaxy 2022

16 min

Step aside resolvers: a new approach to GraphQL execution

Benjie

GraphQL Technical Steering Committee

GraphQL has made a huge impact in the way we build client applications, websites, and mobile apps. Despite the dominance of resolvers, the GraphQL specification does not mandate their use. Introducing Graphast, a new project that compiles GraphQL operations into execution and output plans, providing advanced optimizations. In GraphFast, instead of resolvers, we have plan resolvers that deal with future data. Graphfast plan resolvers are short and efficient, supporting all features of modern GraphQL.

graphql api development

Workshops on related topic

Build a Headless WordPress App with Next.js and WPGraphQL

React Summit 2022

173 min

Build a Headless WordPress App with Next.js and WPGraphQL

Top Content

Workshop

Kellen Mace

In this workshop, you’ll learn how to build a Next.js app that uses Apollo Client to fetch data from a headless WordPress backend and use it to render the pages of your app. You’ll learn when you should consider a headless WordPress architecture, how to turn a WordPress backend into a GraphQL server, how to compose queries using the GraphiQL IDE, how to colocate GraphQL fragments with your components, and more.

next.js wordpress graphql

Build with SvelteKit and GraphQL

GraphQL Galaxy 2021

140 min

Build with SvelteKit and GraphQL

Top Content

Workshop

Scott Spence

Have you ever thought about building something that doesn't require a lot of boilerplate with a tiny bundle size? In this workshop, Scott Spence will go from hello world to covering routing and using endpoints in SvelteKit. You'll set up a backend GraphQL API then use GraphQL queries with SvelteKit to display the GraphQL API data. You'll build a fast secure project that uses SvelteKit's features, then deploy it as a fully static site. This course is for the Svelte curious who haven't had extensive experience with SvelteKit and want a deeper understanding of how to use it in practical applications.

Table of contents:
- Kick-off and Svelte introduction
- Initialise frontend project
- Tour of the SvelteKit skeleton project
- Configure backend project
- Query Data with GraphQL
- Fetching data to the frontend with GraphQL
- Styling
- Svelte directives
- Routing in SvelteKit
- Endpoints in SvelteKit
- Deploying to Netlify
- Navigation
- Mutations in GraphCMS
- Sending GraphQL Mutations via SvelteKit
- Q&A

graphql svelte

Relational Database Modeling for GraphQL

GraphQL Galaxy 2020

106 min

Relational Database Modeling for GraphQL

Top Content

Workshop

Adron Hall

In this workshop we'll dig deeper into data modeling. We'll start with a discussion about various database types and how they map to GraphQL. Once that groundwork is laid out, the focus will shift to specific types of databases and how to build data models that work best for GraphQL within various scenarios.
Table of contentsPart 1 - Hour 1 a. Relational Database Data Modeling b. Comparing Relational and NoSQL Databases c. GraphQL with the Database in mindPart 2 - Hour 2 a. Designing Relational Data Models b. Relationship, Building MultijoinsTables c. GraphQL & Relational Data Modeling Query Complexities
Prerequisites a. Data modeling tool. The trainer will be using dbdiagram b. Postgres, albeit no need to install this locally, as I'll be using a Postgres Dicker image, from Docker Hub for all examples c. Hasura

database graphql

Build and Deploy a Backend With Fastify & Platformatic

JSNation 2023

104 min

Build and Deploy a Backend With Fastify & Platformatic

Top Content

WorkshopFree

Matteo Collina

Platformatic allows you to rapidly develop GraphQL and REST APIs with minimal effort. The best part is that it also allows you to unleash the full potential of Node.js and Fastify whenever you need to. You can fully customise a Platformatic application by writing your own additional features and plugins. In the workshop, we’ll cover both our Open Source modules and our Cloud offering:- Platformatic OSS (open-source software) — Tools and libraries for rapidly building robust applications with Node.js (https://oss.platformatic.dev/).- Platformatic Cloud (currently in beta) — Our hosting platform that includes features such as preview apps, built-in metrics and integration with your Git flow (https://platformatic.dev/).
In this workshop you'll learn how to develop APIs with Fastify and deploy them to the Platformatic Cloud.

node.js cloud graphql fastify

Building GraphQL APIs on top of Ethereum with The Graph

GraphQL Galaxy 2021

48 min

Building GraphQL APIs on top of Ethereum with The Graph

Workshop

Nader Dabit

The Graph is an indexing protocol for querying networks like Ethereum, IPFS, and other blockchains. Anyone can build and publish open APIs, called subgraphs, making data easily accessible.

In this workshop you’ll learn how to build a subgraph that indexes NFT blockchain data from the Foundation smart contract. We’ll deploy the API, and learn how to perform queries to retrieve data using various types of data access patterns, implementing filters and sorting.

By the end of the workshop, you should understand how to build and deploy performant APIs to The Graph to index data from any smart contract deployed to Ethereum.

graphql ethereum api development

Hard GraphQL Problems at Shopify

GraphQL Galaxy 2021

164 min

Hard GraphQL Problems at Shopify

Workshop

5 authors

At Shopify scale, we solve some pretty hard problems. In this workshop, five different speakers will outline some of the challenges we’ve faced, and how we’ve overcome them.

Table of contents:
1 - The infamous "N+1" problem: Jonathan Baker - Let's talk about what it is, why it is a problem, and how Shopify handles it at scale across several GraphQL APIs.
2 - Contextualizing GraphQL APIs: Alex Ackerman - How and why we decided to use directives. I’ll share what directives are, which directives are available out of the box, and how to create custom directives.
3 - Faster GraphQL queries for mobile clients: Theo Ben Hassen - As your mobile app grows, so will your GraphQL queries. In this talk, I will go over diverse strategies to make your queries faster and more effective.
4 - Building tomorrow’s product today: Greg MacWilliam - How Shopify adopts future features in today’s code.
5 - Managing large APIs effectively: Rebecca Friedman - We have thousands of developers at Shopify. Let’s take a look at how we’re ensuring the quality and consistency of our GraphQL APIs with so many contributors.

case study scalability graphql