
Art & Entropy: Introducing Chaos to Your Frontend

CTO @ Prolong, and a teacher and speaker in my spare time. I like to learn from other fields of study and apply those lessons to ours.


Chaos Engineering is a current trend which involves studying the behavior of a system in the face of external events that are often unlikely, but in this case provoked (server or load-balancer crash, loss of DNS, etc.).

The disorder thus generated provides a wealth of information on how our systems work, enabling us to improve their robustness.

But strangely enough, all the books, talks and tutorials on Chaos Engineering overlook an important component of our systems. And yet, if there's one area where unpredictability, inconsistency and the need for resilience are central concerns, it's the frontend.

💥Chaos, frontend, and Japanese ancestral art 👘: 3 notions that at first glance have nothing in common, but which together open up new perspectives in the development of our applications.

This talk was presented at React Summit 2024.

FAQ

Kintsugi is the ancient Japanese art of repairing ceramics and porcelain using gold-sprinkled lacquer. It treats breakage and repair as part of an object's history, rather than something to be disguised.

Kintsugi has evolved from its origins in 15th-century Japan to blend with contemporary art. Examples include the work of Jan Vormann, who uses Lego bricks to repair walls, and Rachel Sussman, who uses gold lacquer to repair asphalt.

Netflix regularly crashes parts of its infrastructure, sometimes several times a day, to ensure that everything is working properly. This practice has led them to develop highly advanced automated restoration processes.

The benefits of chaos engineering include discovering and identifying system weaknesses, gaining a better understanding of interdependencies, building confidence in system stability, and implementing disaster recovery protocols.

Yes, chaos engineering can be applied to frontend development, although it is less common. The goal is to ensure that the frontend remains resilient and functional despite disruptions, similar to how backend chaos engineering is practiced.

Methods to create disruptions in a frontend application include simulating slow or failed HTTP requests, using pseudo-localization to test language handling, manipulating timers, and altering navigation history. These can help identify potential weaknesses in the frontend.

Pseudo-localization is a method that replaces text with an altered version containing more letters and characters while remaining readable. It helps developers test how their applications handle different languages and text lengths.

The Chaos Frontend Toolkit is a browser extension and npm library created to help developers experiment with chaos engineering in the frontend. It includes various tools to simulate disruptions and test the resilience of web applications.

Chaos engineering is a practice, invented by Netflix in 2011, that involves intentionally disturbing a system to observe how it reacts and to identify weaknesses. It aims to improve system resilience by learning through controlled experiments.

Implementing chaos engineering in production for frontend applications is challenging because the disruptions are more visible to users. It is generally better suited for test and staging environments.

web development

Thibaud Courtoison

16 min

18 Jun, 2024


Video Summary and Transcription

Welcome to the talk, Art & Entropy, Introducing Chaos in Your Front-End. Chaos engineering is a practice invented by Netflix in 2011 to observe how a system reacts to intentional disturbance. Applying chaos engineering to the frontend is experimental but necessary, as a broken frontend can negatively impact the user experience. Intentional perturbations can be induced in several areas of the frontend, starting with HTTP requests, which can be disrupted by a slow 3G network or unstable Wi-Fi. Tools like the Chaos Frontend Toolkit can be used to experiment with chaos engineering in the frontend and embrace breakage as part of the application's story.


1. Art & Entropy: Introducing Chaos in Your Front-End

Short description:

Welcome to the talk, Art & Entropy, Introducing Chaos in Your Front-End. Have you heard of Kintsugi? It's the ancient Japanese art of repairing ceramics and porcelain using gold-sprinkled lacquer. In the early days of Kintsugi, collectors would intentionally break their own ceramics and have them repaired. Kintsugi can depart from its original medium to adapt to a new support. Is our web app perfect, bug-free, on all browsers, in all responsive sizes? No. But does it need to be perfect? No, it just needs to be resilient, to work no matter what. We are going to use chaos engineering to make sure that our apps are resilient. Chaos engineering is a practice invented by Netflix in 2011. The aim is to observe how a system reacts to intentional disturbance. The benefits of chaos engineering include discovering and identifying weaknesses in a controlled environment and understanding the interdependencies of the components of our systems.

Hi everyone, and welcome to the talk, Art & Entropy, Introducing Chaos in Your Front-End. My name is Thibaud, and you can find me on Twitter with the hero name Pseudo. Let's get started.

Have you heard of Kintsugi? It's the ancient Japanese art of repairing ceramics and porcelain using gold-sprinkled lacquer. As a philosophy, Kintsugi treats breakage and repair as part of an object's history, rather than something to be disguised. In the early days of Kintsugi, around the 15th century in Japan, collectors were so fond of the result that they would intentionally break their own ceramics and have them repaired. Over the years, Kintsugi has evolved and blended with contemporary art. Examples include the work of Jan Vormann, who uses Lego bricks to repair walls in the urban landscape, and the work of Rachel Sussman, who uses gold lacquer to repair asphalt. In this way, Kintsugi can depart from its original medium to adapt to a new support. A new support, and why not the web?

Because yes, let's ask ourselves: is our web app perfect, bug-free, on all browsers, in all responsive sizes? No. But does it need to be perfect? No, it just needs to be resilient, to work no matter what. It can be broken and functional, broken and beautiful, like ceramics repaired with Kintsugi. Sounds good. So, how do we make sure that our apps are resilient? Well, we are going to use chaos engineering to do just that. Maybe you've heard the term before. Invented by Netflix in 2011, this practice draws a parallel with chaos theory in science, which studies the evolution of entropy, or disorder, in a system. The aim is to observe how a system reacts to intentional disturbance. For example, we intentionally crash a server and watch how the infrastructure reacts. Will it pass requests on to other servers? Will it restart a server and redirect traffic? How quickly? Basically, we break stuff and then we watch. The good thing is, instead of waiting for things to blow up to check that our infrastructure will hold up, we do it ourselves. And that's how we improve, by learning how our infrastructure reacts to chaos.

And where it reaches the next level is that, at Netflix, they do this in production. They crash their infrastructure regularly, sometimes several times a day, to make sure that everything is working properly. This has prompted them to set up highly advanced automated restoration processes, to the point where they are able to restart entire regions of the infrastructure in just a few seconds, without human intervention. These are the benefits of chaos engineering. First of all, it helps us discover and identify weaknesses in a controlled environment. We know what we broke, so we can revert or fix it easily if things go sideways during the experiment. It also gives us a better understanding of the interdependencies of the components of our systems. Interdependencies.

2. Applying Chaos Engineering to the Frontend

Short description:

In a previous company, we discovered the interdependencies of our system when an unresponsive infrastructure caused user login requests to stack up indefinitely. The benefits of chaos engineering include confidence and implementing a disaster recovery protocol. Applying chaos engineering to the frontend is experimental but necessary, as a broken frontend can negatively impact the user experience. Chaos engineering is applied in four steps: defining the nominal state, making a hypothesis, creating perturbations, and comparing states. Disruptions on the frontend can be created in several areas, starting with HTTP requests, which can suffer from a slow 3G network or unstable Wi-Fi.

For example, in a previous company, we had an old legacy server which was generating PDFs but also, and everyone had forgotten this at the time, performing GeoIP lookups for security purposes. One day, the infrastructure on which this server was running became unresponsive. That started a chain reaction where user login requests, which relied on GeoIP, also became unresponsive. That's when we realized that there were no defined timeouts on those calls, which, in turn, meant that user login requests were stacking up indefinitely, effectively DDoSing our entire infrastructure. And, unfortunately, that's how we rediscovered the interdependencies of our system.

The third benefit of chaos engineering is confidence. Would you rather wait to be called at 7am on a Sunday morning because your application is down, or would you rather break it yourself on a Tuesday in the early afternoon and see that everything is working all right and that you can sleep peacefully on the weekend? And finally, it will force us to implement a disaster recovery protocol. We won't wait for an accident to happen before we think about solutions.

The thing is, in all the resources that exist on the subject of chaos engineering, books, documentation, it's all about infrastructure. So, I said to myself, why not apply it to the frontend this time? Because no matter how resilient your infrastructure is, how many load balancers you have, how many redundancies you have, if your frontend is broken, the users don't care. Their whole experience on your app will be negative. So, as I said, there are no resources on the subject of chaos engineering applied to the frontend, nor are there any all-in-one tools or toolboxes for doing so. So, what comes next in this talk is experimental. Let's see how far we can push this subject. Let's start with the basics of chaos engineering.

It is applied in four steps. First, we define the nominal state of our system. For example, the user is able to log in, or the user is able to watch the last season of The Witcher. Second, we make a hypothesis: we assume that the nominal state continues during the experiment. The user is still able to log in, the user is still able to watch the last season of The Witcher. We use two groups for that, a control group and a test group. Third, we intentionally create a perturbation that reproduces a real event, for example, a server crash. And finally, we compare the states of the two groups and try to disprove the hypothesis put forward earlier. Are our users still able to log in and watch the last season of The Witcher? If they aren't, then we have just identified a flaw in the resiliency of our system.

Our aim is to apply this chaos engineering experiment to the frontend. For steps one, two, and four, it's not too different from classic chaos engineering. So we are going to look into step three and how we can create disruptions on the frontend. There are a few areas of perturbation that we can look at. The first one is HTTP requests. Perturbations here can take the form of a slow 3G network, unstable Wi-Fi, an unresponsive CDN, etc.
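To make the four steps described above concrete, here is a minimal sketch in TypeScript. It is not code from the talk: the `ChaosExperiment` shape, the `runExperiment` function, and the three verdict names are all hypothetical, and the checks are kept synchronous for brevity (real probes would most likely be asynchronous).

```typescript
// Hypothetical names for illustration only; nothing here comes from the talk.
type Verdict = "resilient" | "weakness-found" | "inconclusive";

interface ChaosExperiment<System> {
  name: string;
  // Step 1: the nominal state, expressed as a check that must pass.
  isNominal: (system: System) => boolean;
  // Step 3: apply a perturbation and return a function that undoes it.
  perturb: (system: System) => () => void;
}

interface ExperimentResult {
  name: string;
  controlNominal: boolean;
  testNominal: boolean;
  verdict: Verdict;
}

function runExperiment<System>(
  experiment: ChaosExperiment<System>,
  control: System,
  test: System,
): ExperimentResult {
  // Step 2 (hypothesis): the test group stays nominal despite the perturbation.
  const undo = experiment.perturb(test);
  try {
    // Step 4: compare the control group with the perturbed test group.
    const controlNominal = experiment.isNominal(control);
    const testNominal = experiment.isNominal(test);
    let verdict: Verdict;
    if (!controlNominal) {
      verdict = "inconclusive"; // the baseline itself is broken
    } else if (testNominal) {
      verdict = "resilient"; // we failed to disprove the hypothesis
    } else {
      verdict = "weakness-found"; // hypothesis disproved
    }
    return { name: experiment.name, controlNominal, testNominal, verdict };
  } finally {
    // Keep the experiment bounded: always undo the perturbation.
    undo();
  }
}
```

Returning an undo function from `perturb` is a design choice of this sketch: it keeps the perturbation bounded even if a check throws.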

3. Intentional Perturbations and Localization

Short description:

GitHub is an example of an application that handles slow or no response well. Even without CSS and JavaScript, it remains functional. To induce perturbations in our app, we can add random delays and failures to HTTP requests. Localization can be challenging due to language and design differences, leading to broken elements for certain user groups.

The real-world perturbation can be summarized as a really slow HTTP response or just no response at all. One example of an application which handles this very well is GitHub. All of its CSS and JavaScript is hosted on a CDN at github.githubassets.com. We can simulate what happens if the CDN fails by blocking all network requests to that domain in Chrome. And here's what it looks like. So here I'm on the React repository and I'm looking into the different folders and files to find what I am looking for. And yeah, here I am able to find the file I want and all the code associated with it. And so we can see two things. First of all, well, it's not pretty. But second, it's working. We can actually go through the repository and look at the code. And this is a good example of resiliency. Even with no external CSS and no JavaScript, GitHub is working.

And so here is how we could intentionally induce this perturbation in our app: we can proxy XHR and fetch. This can be done in a few lines of code. For example, we give each request one chance in two of getting a random delay and one chance in a hundred of failing completely. We put that in the app, and we'll quickly see if our app doesn't handle delays or errors well.

A second area of perturbation is localization. This can be tricky to handle well because of right-to-left languages, Latin fonts, spacing, etc. In the real world, as well as right-to-left languages, there are also verbose languages. Let's take an example. I just developed a nice button. It's been approved by the designer. Let's ship it to production. But wait. Here is what Romanian users will actually see. I gave my button a fixed width and now it's broken. But the issue is I don't speak Romanian. I speak French and just enough English to give this talk.
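Coming back to the HTTP perturbation described above, the following is a minimal sketch of a fetch proxy, not the code shown in the talk. The names `HttpChaosOptions`, `pickHttpPerturbation`, and `withHttpChaos` are hypothetical, and the 2-second maximum delay is an assumption, since the talk only says "random delay". XHR is left out for brevity.

```typescript
// Hypothetical names; the probabilities mirror the talk's example.
interface HttpChaosOptions {
  delayProbability: number; // the talk uses one chance in two
  failureProbability: number; // the talk uses one chance in a hundred
  maxDelayMs: number; // assumed upper bound for the random delay
}

type HttpPerturbation =
  | { kind: "fail" }
  | { kind: "delay"; delayMs: number }
  | { kind: "none" };

// Pure decision function, so it can be tested with a fake random source.
function pickHttpPerturbation(
  options: HttpChaosOptions,
  random: () => number = Math.random,
): HttpPerturbation {
  if (random() < options.failureProbability) {
    return { kind: "fail" };
  }
  if (random() < options.delayProbability) {
    return { kind: "delay", delayMs: Math.round(random() * options.maxDelayMs) };
  }
  return { kind: "none" };
}

// Wraps any promise-returning request function, such as fetch.
function withHttpChaos<Args extends unknown[], Result>(
  request: (...args: Args) => Promise<Result>,
  options: HttpChaosOptions,
  random: () => number = Math.random,
): (...args: Args) => Promise<Result> {
  return async (...args: Args): Promise<Result> => {
    const perturbation = pickHttpPerturbation(options, random);
    if (perturbation.kind === "fail") {
      // fetch rejects with a TypeError on network errors, so we mimic that.
      throw new TypeError("Chaos: simulated network failure");
    }
    if (perturbation.kind === "delay") {
      const delayMs = perturbation.delayMs;
      await new Promise<void>((resolve) => {
        setTimeout(resolve, delayMs);
      });
    }
    return request(...args);
  };
}

// In a browser, outside production, it could be installed like this:
// window.fetch = withHttpChaos(window.fetch.bind(window), {
//   delayProbability: 0.5,
//   failureProbability: 0.01,
//   maxDelayMs: 2000,
// });
```

Keeping the random decision in a separate pure function makes the behavior reproducible in tests, where `Math.random` can be replaced by a fixed sequence.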

4. Intentional App Breakage and Tools

Short description:

We can intentionally break our app's localization by using pseudo-localization, which alters the text with more letters and characters. Timers can be perturbed too, since browsers may throttle them and cause delays. Manipulating the navigation history can simulate users moving backward and forward. Additional perturbations include simulating double clicks, checking for accessibility issues, and testing mobile and tablet viewports. There are tools available for chaos engineering in the frontend.

So, how can we intentionally break our app's localization while still having a good developer experience? Well, we can use what is called pseudo-localization. This method replaces the text with an altered version that has more letters and characters while still being readable by a human. And to do that, we can use the pseudo-localization npm package by Tryggvi Gylfason, which can apply it automatically to the app.

So, here is an example of what it would look like on the React website. As we can see, the text is longer than the original, with accents and other glyphs. And yet, the interface handles it pretty well. The buttons don't overflow and the menu takes the space it needs to be shown.
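To illustrate the idea, here is a small hand-written pseudo-localization function. It is a sketch of the general technique only, not the implementation or API of the npm package mentioned above; the `PSEUDO_MAP` table and the `pseudoLocalize` name are hypothetical. Vowels are doubled and accented so the text gets longer, and the result is wrapped in brackets, a common convention that makes truncated strings easy to spot.

```typescript
// Hypothetical mapping: accented look-alikes, with vowels doubled to lengthen text.
const PSEUDO_MAP: Record<string, string> = {
  a: "áá", e: "éé", i: "íí", o: "öö", u: "üü",
  A: "ÁÁ", E: "ÉÉ", I: "ÍÍ", O: "ÖÖ", U: "ÜÜ",
  c: "ç", n: "ñ", y: "ý",
};

function pseudoLocalize(text: string): string {
  // Replace mapped letters and leave every other character untouched.
  const altered = text.replace(/[A-Za-z]/g, (letter) => PSEUDO_MAP[letter] ?? letter);
  // Brackets mark the start and end, so clipped text is visible at a glance.
  return `[${altered}]`;
}

// pseudoLocalize("Save") returns "[Sáávéé]"
```

A fixed-width button that fits "Save" but clips "[Sáávéé]" is a good hint that it would also break in a more verbose language.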

Another area of perturbation is timers. We all assume that one second is equal to one second, right? And yet, that's not exactly true in Browserland. In some cases, browsers can choose to throttle timers to reduce CPU and battery usage. This means that if your app expects a setTimeout callback to be called precisely after a specific amount of time, that callback can be delayed by up to a minute, which may break your feature. You can reproduce this perturbation by proxying setTimeout and setInterval to intentionally add delay to the timers. If your app doesn't handle this, you will quickly notice the issue.
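A minimal sketch of such a timer proxy follows. It is not the code from the talk: `TimerHost`, `jitterDelay`, and `installTimerChaos` are hypothetical names, and the behavior (a coin flip between adding and removing a fixed jitter, clamped at zero) is one reading of the approach. Only `setTimeout` is wrapped here; `setInterval` could be wrapped the same way.

```typescript
// Minimal view of an object that owns setTimeout (window fits this shape in a browser).
type TimerCallback = () => void;

interface TimerHost {
  setTimeout: (callback: TimerCallback, delayMs?: number) => unknown;
}

// Pure helper: a coin flip between adding and removing the jitter, never below zero.
function jitterDelay(
  requestedMs: number,
  jitterMs: number,
  random: () => number = Math.random,
): number {
  const signedJitter = random() < 0.5 ? jitterMs : -jitterMs;
  return Math.max(0, requestedMs + signedJitter);
}

// Replaces host.setTimeout with a jittered version and returns a restore function.
function installTimerChaos(
  host: TimerHost,
  jitterMs: number = 500,
  random: () => number = Math.random,
): () => void {
  const original = host.setTimeout;
  host.setTimeout = (callback, delayMs = 0) =>
    // Call with the host as `this`, since browsers reject detached calls.
    original.call(host, callback, jitterDelay(delayMs, jitterMs, random));
  return () => {
    host.setTimeout = original;
  };
}

// In a browser: const restore = installTimerChaos(window); ... restore();
```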

So, here in this code, there is a one-in-two chance of either adding or removing 500 milliseconds. And the fourth area is the history. We often see the user's journey as a linear and unidirectional path through our app. But we often forget that it is quite frequent for users to travel back and forth in the history to perform actions. It can also happen by accident. And there is nothing quite as frustrating as losing your data in a big form when that happens. Again, we can manually create this perturbation with a couple of lines of code. Here, every minute, there is one chance in a hundred of randomly going back or forth in the navigation history.
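The history perturbation described above could look roughly like the sketch below. The names `pickHistoryMove`, `HistoryLike`, and `startHistoryChaos` are hypothetical; only the one-minute interval and the one-in-a-hundred probability come from the talk, and the even split between back and forth is an assumption.

```typescript
type HistoryMove = -1 | 0 | 1;

// Pure decision: usually stay put, otherwise go back (-1) or forward (1).
function pickHistoryMove(
  moveProbability: number,
  random: () => number = Math.random,
): HistoryMove {
  if (random() >= moveProbability) {
    return 0;
  }
  return random() < 0.5 ? -1 : 1;
}

// The only part of the History API this sketch needs; window.history fits it.
interface HistoryLike {
  go(delta: number): void;
}

// Checks once per interval and returns a function that stops the experiment.
function startHistoryChaos(
  target: HistoryLike,
  intervalMs: number = 60000,
  moveProbability: number = 0.01,
  random: () => number = Math.random,
): () => void {
  const timerId = setInterval(() => {
    const move = pickHistoryMove(moveProbability, random);
    if (move !== 0) {
      target.go(move);
    }
  }, intervalMs);
  return () => clearInterval(timerId);
}

// In a browser: const stop = startHistoryChaos(window.history); ... stop();
```

Forms that lose their content when this fires are the ones that need draft persistence or a confirmation prompt.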

So, those are the four areas of perturbation. But there can be even more. Why not simulate a double click for every click of our users? Why not turn our app black and white with a single line of CSS to check for accessibility issues? Or let's go crazy. Why not force mobile or tablet viewports, even on desktop, to make sure that our apps are working there as well? So, yeah. By now, you must be looking at me like that. What we've seen looks interesting, but you may be lost on how to actually do that in your web apps. The good news is I lied. Earlier in this talk, I said that there were no all-in-one tools for chaos engineering in the frontend. But that's not true.

5. Chaos Frontend Toolkits and Embracing Breakage

Short description:

I created the Chaos Frontend Toolkit, a browser extension and npm library, to experiment with chaos engineering. Control and balance are essential in chaos engineering: setting boundaries and ensuring the right level of disruption. Implementing chaos engineering in production may not be feasible for the frontend, but it can be applied in test and staging environments. Let's embrace breakage and repair as part of our application's story.

Because I want you to experiment with chaos engineering, I created the Chaos Frontend Toolkit, which is a browser extension and npm library that includes all the areas of perturbation I mentioned previously. You will be able to simulate double clicks and many more experiments with a single click.

And now, that's how I see you. You're ready to go back to your companies and break everything. But before I leave you, I want to ask you to listen to the words of Tissaia de Vries. "Magic is organizing chaos. And while oceans of mystery remain, we have deduced that this requires two things. Balance and control. Without them, chaos will kill you." And as Tissaia de Vries says very well, magic is organizing chaos. This requires two things, balance and control.

First of all, control. Don't forget that chaos engineering is a method of experimentation. We need to set up boundaries for this experiment on a given system and for a given time, respecting the four steps, nominal state, hypothesis, perturbation, comparison. And finally, balance. Chaos must be sufficiently present to test the system resilience, but not too present as it may disturb users.
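One possible way to encode control and balance is a small guard that every perturbation consults before firing. This is only a sketch under assumptions: `ChaosBoundaries` and `isChaosEnabled` are hypothetical names, the environment labels are placeholders, and expressing balance as a ratio of perturbed sessions is one interpretation among others.

```typescript
// Hypothetical configuration describing the limits of one experiment.
interface ChaosBoundaries {
  allowedEnvironments: string[]; // control: which systems may be perturbed
  startsAtMs: number; // control: when the experiment begins (epoch ms)
  endsAtMs: number; // control: when the experiment ends (epoch ms)
  sessionRatio: number; // balance: share of sessions that get perturbed, 0 to 1
}

function isChaosEnabled(
  boundaries: ChaosBoundaries,
  environment: string,
  nowMs: number,
  random: () => number = Math.random,
): boolean {
  const inScope = boundaries.allowedEnvironments.indexOf(environment) !== -1;
  const inWindow = nowMs >= boundaries.startsAtMs && nowMs < boundaries.endsAtMs;
  // The random draw is only consulted when scope and time window allow it.
  return inScope && inWindow && random() < boundaries.sessionRatio;
}
```

Calling the guard once per session, rather than once per perturbation, would keep a given user's experience consistent for the duration of the experiment.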

And here, we come to what I think is a limit of chaos engineering applied to the frontend. In the chaos engineering experiments carried out by Netflix in production, disruptions are almost invisible to the users because everything happens in the backend. But on the frontend, the user will inevitably be affected by the disruptions that I mentioned earlier. So, I think it's impossible to implement it in production. In the best-case scenario, it could be applied to test and staging environments. But given that chaos engineering applied to the frontend has never been done before, why not be a precursor and see what can be done with it? Let's apply the Kintsugi philosophy. Let's treat breakage and repair as part of the story of our application rather than something to be disguised.

This is the end of the presentation. Thank you for listening.
