Evolving the Node HTTP Client with undici

Rate this content
Bookmark
The video discusses the Node HTTP client and introduces the Undici library as a more efficient alternative. Undici addresses issues like missing HTTP pipelining and Keepalive support, offering almost 10 times better performance. The speaker highlights the challenges with the current Node HTTP client, such as the response object not being a true Node stream. WebAssembly is used to enhance performance, and Undici supports customizable request settings. The video also mentions the implementation of Fetch on top of Undici, with experimental support in Node 17. Additionally, the talk covers various types of dispatchers in Undici, including client, pool, balanced pool, agent, and proxy agent. Future plans include support for HTTP/2 and HTTP/3, as well as enhancements in DNS lookup and pool scheduling.

From Author:

What are some of the problems of the current Node core HTTP client and how can we move beyond them? A talk about how undici tries to bring the next step in client side HTTP in Node. Includes some practical advice to using undici and common pitfalls with some commentary in regards to API design decisions and future plans with some extra Fetch API bonus.a

This talk has been presented at Node Congress 2022, check out the latest edition of this Tech Conference.

FAQ

Robert Nagy is the lead developer at Next Edition, a frequent contributor to Node.js, and a TSC member.

Bitchy is a library created to address issues with the current Node HTTP client. It aims to provide a more user-friendly API and improve performance.

The current Node HTTP client has issues such as the response object not being a true Node stream, lack of HTTP pipelining support, and intrinsic linkage between the request and response objects.

HTTP pipelining is a feature that allows multiple HTTP requests to be sent out before receiving responses, thereby reducing latency.

Keepalive is a feature that allows a single TCP connection to remain open for multiple HTTP requests, reducing the overhead of opening and closing connections.

Indichy is a new approach to the HTTP client in Node.js, aiming to provide a more user-friendly API, support HTTP pipelining and Keepalive, and improve performance using WebAssembly.

Indichy supports Keepalive natively and handles server timeout hints to avoid unexpected connection closures, unlike Node Core.

Indichy offers almost 10 times better performance compared to the Node Core HTTP client.

The Fetch implementation in Node.js is built on top of Indichy to avoid exposing multiple experimental APIs.

Indichy has built-in support for mocking requests using mock client and mock pool tools, similar to what Nock provides for the Node HTTP client.

Robert Nagy
Robert Nagy
35 min
18 Feb, 2022

Comments

Sign in or register to post your comment.

Video Transcription

1. Introduction to Node HTTP Client and Indichy

Short description:

Hello, everyone. My name is Robert Nagy. I'm the lead developer at Next Edition and a frequent contributor to Node.js and also a TSC member. I'm going to talk about the current state of the Node HTTP client and the problems we see. The response object is not actually a Node stream, it lacks support for HTTP pipelining, and the request and response objects are intrinsically linked. We have tried fixing these issues multiple times, but it's difficult without causing disturbance. A few years ago, Matteo Macalina created the library Indichy, which aims to provide a more user-friendly API for HTTP in Node.

Hello, everyone. My name is Robert Nagy. I'm the lead developer at Next Edition and a frequent contributor to Node.js and also a TSC member. And I'm going to talk a little bit about some work we've been doing on the HTTP client in Node using a library we created called Bitchy.

So, first of all, let's talk a little bit about the current state of the Node HTTP client. I think most people are familiar with the HTTP request API. And people that are using NPM libraries like Got and Axios, etc., these are usually just wrappers around the HTTP request call that Node Core provides. Now, this has a long history and it has worked for a very long time. But those of us that have been working on maintaining this API feel we have kind of reached the end of the line of what we can do with reasonable effort with this API.

So what are some of the problems we see here? So first of all, the response object you receive from the request API is not actually a Node stream. It just pretends to be a Node stream. There are various reasons, both the compatibility and performance reasons for that. But there are slight differences that can cause weird issues if you're unlucky. Especially if you're using APIs that expect streams, and you wanted to use it seamlessly. It doesn't have support for HTTP pipelining. Pipelining is a feature of the HTTP protocol that can provide some significant performance advantages. Also, the request and the response object in this API are intrinsically linked. So if you destroy, for example, the request, even though it has completed, it will also kill the response. And this linkage is very difficult to fix or remove without breaking the entire ecosystem. It's entirely based on streams, which causes some limitation of what we can achieve in terms of performance. And a lot of the internals in the API are publicly accessible, and we have ecosystem modules that depend on internal implementation details. Back then, we didn't have symbols so it wasn't possible to actually hide things from users. And all of these things cause issues that we believe are very difficult to fix without causing undue disturbance in the entire ecosystem.

So a few years ago, Matteo or actually, another point here is we have tried fixing these issues multiple times, and that's caused problems and we had to revert them. I'm actually one of the persons that had spent a lot of time trying to resolve these issues. And it's quite disappointing when you are forced to revert them due to compatibility or performance regressions, et cetera. So here are some pull requests that just revert work that has been done towards this. So a few years ago, Matteo Macalina created this library called Indichy, which is a new take on what HTTP can and could, can be in Node. And I got involved a year or so ago, and I've done a lot of work making this production ready. So what are our aims? What is we are achieving or trying to achieve within Indichy? So we want a bit more user-friendly API so people don't have to go to a third-party library by default.

2. Node HTTP Client and Umidigi

Short description:

We have managed to replace the entire HTTP client outside of Node core by using WebAssembly, which provides performance advantages. Unleashy supports Keepalive and pipelining, addressing issues in Node Core. We have sorted out the problem with Keepalive in Umidigi, and achieved almost 10 times better performance compared to the Node Core client. We are developing Umidigi outside of core for later inclusion and have hidden all the internals behind symbols. We're also looking into implementing fetch on top of Umidigi. It's important to consume or destroy the body to release the connection, and we provide a helper function called dump. We have support for signals.

We don't want to have any native dependencies. So this is actually required for us to, in order to replace the entire HTTP client outside of Node core, we native dependencies. And the HTTP parsing node is HTTP, which is a native library. But we have managed to get around this by using WebAssembly, and it's working very well, and we do see actually some performance advantages with using WebAssembly.

Another important thing is we want to support Keepalive and pipelining, so I'm just quickly going to explain that. Keepalive is actually something that the node core client supports, but it's not enabled by default. And there are some things you have to think about this in Node Core that Unleashy tries to handle for you. So without pipelining, every request you do actually creates a new connection and closes that. With Keepalive, you can reuse the connection for subsequent requests. So you skip the overhead of closing and establishing a new connection. And with pipelining, it's actually possible to send multiple requests before the server has replied. And thereby, you can reduce some of the latency of your requests. And we have spent a lot of time making sure Unleashy supports this natively.

And as I mentioned, there is a problem with Keepalive that Nord Core doesn't handle, is that once a connection is established, there's a timeout that the server will keep that connection alive, waiting for further requests before closing the connection. Now, if you're unlucky, you could actually send a request at the exact moment the server's timeout happens and therefore the server will close the connection while your request is arriving. Now, some servers provide a Keepalive hint in their responses so you can actually find out for how long the server expects to keep the connection open, and therefore, the client can actually do stuff to avoid this unexpected closure happening. So that's something we've sorted out in Umidigi.

We have also looked at performance so with Umidigi we were able to achieve an almost 10 times better performance relative to the Node Core client, which I have to say is a rather good result. We are developing it outside of core at the moment for later inclusion. There are some advantages of this especially in terms of the implementation velocity development velocity and we've hidden all the internals behind symbols so we don't have a repeat of third-party libraries, depending on implementation details. And we're also looking into implementing fetch on top of Umidigi. So Umidigi's API is the most basic API is Umidigi request and you basically do an await Umidigi request where you get a body, a status code, and headers and the body is a node stream but we have also implemented some helper functions inspired by the fetch mixing body specifications so you have body.text.json.array buffer and you can just await those but otherwise the body is a normal node.js stream. So this is quite simple. One important note that I've noticed some people miss is that it's very important, even if you don't care about the body, you should consume it or destroy it to release the connection because the way it works with the keep-alive is unless the previous request has completed you can't actually process the next one so it's important to either destroy the body or dump it or consume it. We provide a helper called dump, which the downside of destroying the body is that that actually will cause the socket to be destroyed. We have a helper function here called dump which will actually try to read the entire body so that the connection can be reused but if the body or the response from the server exceeds a certain threshold then it chooses to eventually close the connection. So you don't have to download a gigabyte of the data before being able to reuse the connection. And yes, if you don't do this then the connection won't be freed for the next request. We have support for signals like any good citizen of the promise world.

QnA

Check out more articles and videos

We constantly think of articles and videos that might spark Git people interest / skill us up or help building a stellar career

It's a Jungle Out There: What's Really Going on Inside Your Node_Modules Folder
Node Congress 2022Node Congress 2022
26 min
It's a Jungle Out There: What's Really Going on Inside Your Node_Modules Folder
Top Content
The talk discusses the importance of supply chain security in the open source ecosystem, highlighting the risks of relying on open source code without proper code review. It explores the trend of supply chain attacks and the need for a new approach to detect and block malicious dependencies. The talk also introduces Socket, a tool that assesses the security of packages and provides automation and analysis to protect against malware and supply chain attacks. It emphasizes the need to prioritize security in software development and offers insights into potential solutions such as realms and Deno's command line flags.
Towards a Standard Library for JavaScript Runtimes
Node Congress 2022Node Congress 2022
34 min
Towards a Standard Library for JavaScript Runtimes
Top Content
There is a need for a standard library of APIs for JavaScript runtimes, as there are currently multiple ways to perform fundamental tasks like base64 encoding. JavaScript runtimes have historically lacked a standard library, causing friction and difficulty for developers. The idea of a small core has both benefits and drawbacks, with some runtimes abusing it to limit innovation. There is a misalignment between Node and web browsers in terms of functionality and API standards. The proposal is to involve browser developers in conversations about API standardization and to create a common standard library for JavaScript runtimes.
ESM Loaders: Enhancing Module Loading in Node.js
JSNation 2023JSNation 2023
22 min
ESM Loaders: Enhancing Module Loading in Node.js
ESM Loaders enhance module loading in Node.js by resolving URLs and reading files from the disk. Module loaders can override modules and change how they are found. Enhancing the loading phase involves loading directly from HTTP and loading TypeScript code without building it. The loader in the module URL handles URL resolution and uses fetch to fetch the source code. Loaders can be chained together to load from different sources, transform source code, and resolve URLs differently. The future of module loading enhancements is promising and simple to use.
Out of the Box Node.js Diagnostics
Node Congress 2022Node Congress 2022
34 min
Out of the Box Node.js Diagnostics
This talk covers various techniques for getting diagnostics information out of Node.js, including debugging with environment variables, handling warnings and deprecations, tracing uncaught exceptions and process exit, using the v8 inspector and dev tools, and generating diagnostic reports. The speaker also mentions areas for improvement in Node.js diagnostics and provides resources for learning and contributing. Additionally, the responsibilities of the Technical Steering Committee in the TS community are discussed.
Node.js Compatibility in Deno
Node Congress 2022Node Congress 2022
34 min
Node.js Compatibility in Deno
Deno aims to provide Node.js compatibility to make migration smoother and easier. While Deno can run apps and libraries offered for Node.js, not all are supported yet. There are trade-offs to consider, such as incompatible APIs and a less ideal developer experience. Deno is working on improving compatibility and the transition process. Efforts include porting Node.js modules, exploring a superset approach, and transparent package installation from npm.
Multithreaded Logging with Pino
JSNation Live 2021JSNation Live 2021
19 min
Multithreaded Logging with Pino
Top Content
Today's Talk is about logging with Pino, one of the fastest loggers for Node.js. Pino's speed and performance are achieved by avoiding expensive logging and optimizing event loop processing. It offers advanced features like async mode and distributed logging. The use of Worker Threads and Threadstream allows for efficient data processing. Pino.Transport enables log processing in a worker thread with various options for log destinations. The Talk concludes with a demonstration of logging output and an invitation to reach out for job opportunities.

Workshops on related topic

Node.js Masterclass
Node Congress 2023Node Congress 2023
109 min
Node.js Masterclass
Top Content
Workshop
Matteo Collina
Matteo Collina
Have you ever struggled with designing and structuring your Node.js applications? Building applications that are well organised, testable and extendable is not always easy. It can often turn out to be a lot more complicated than you expect it to be. In this live event Matteo will show you how he builds Node.js applications from scratch. You’ll learn how he approaches application design, and the philosophies that he applies to create modular, maintainable and effective applications.

Level: intermediate
Build and Deploy a Backend With Fastify & Platformatic
JSNation 2023JSNation 2023
104 min
Build and Deploy a Backend With Fastify & Platformatic
WorkshopFree
Matteo Collina
Matteo Collina
Platformatic allows you to rapidly develop GraphQL and REST APIs with minimal effort. The best part is that it also allows you to unleash the full potential of Node.js and Fastify whenever you need to. You can fully customise a Platformatic application by writing your own additional features and plugins. In the workshop, we’ll cover both our Open Source modules and our Cloud offering:- Platformatic OSS (open-source software) — Tools and libraries for rapidly building robust applications with Node.js (https://oss.platformatic.dev/).- Platformatic Cloud (currently in beta) — Our hosting platform that includes features such as preview apps, built-in metrics and integration with your Git flow (https://platformatic.dev/). 
In this workshop you'll learn how to develop APIs with Fastify and deploy them to the Platformatic Cloud.
Building a Hyper Fast Web Server with Deno
JSNation Live 2021JSNation Live 2021
156 min
Building a Hyper Fast Web Server with Deno
WorkshopFree
Matt Landers
Will Johnston
2 authors
Deno 1.9 introduced a new web server API that takes advantage of Hyper, a fast and correct HTTP implementation for Rust. Using this API instead of the std/http implementation increases performance and provides support for HTTP2. In this workshop, learn how to create a web server utilizing Hyper under the hood and boost the performance for your web apps.
0 to Auth in an Hour Using NodeJS SDK
Node Congress 2023Node Congress 2023
63 min
0 to Auth in an Hour Using NodeJS SDK
WorkshopFree
Asaf Shen
Asaf Shen
Passwordless authentication may seem complex, but it is simple to add it to any app using the right tool.
We will enhance a full-stack JS application (Node.JS backend + React frontend) to authenticate users with OAuth (social login) and One Time Passwords (email), including:- User authentication - Managing user interactions, returning session / refresh JWTs- Session management and validation - Storing the session for subsequent client requests, validating / refreshing sessions
At the end of the workshop, we will also touch on another approach to code authentication using frontend Descope Flows (drag-and-drop workflows), while keeping only session validation in the backend. With this, we will also show how easy it is to enable biometrics and other passwordless authentication methods.
Table of contents- A quick intro to core authentication concepts- Coding- Why passwordless matters
Prerequisites- IDE for your choice- Node 18 or higher
GraphQL - From Zero to Hero in 3 hours
React Summit 2022React Summit 2022
164 min
GraphQL - From Zero to Hero in 3 hours
Workshop
Pawel Sawicki
Pawel Sawicki
How to build a fullstack GraphQL application (Postgres + NestJs + React) in the shortest time possible.
All beginnings are hard. Even harder than choosing the technology is often developing a suitable architecture. Especially when it comes to GraphQL.
In this workshop, you will get a variety of best practices that you would normally have to work through over a number of projects - all in just three hours.
If you've always wanted to participate in a hackathon to get something up and running in the shortest amount of time - then take an active part in this workshop, and participate in the thought processes of the trainer.
Mastering Node.js Test Runner
TestJS Summit 2023TestJS Summit 2023
78 min
Mastering Node.js Test Runner
Workshop
Marco Ippolito
Marco Ippolito
Node.js test runner is modern, fast, and doesn't require additional libraries, but understanding and using it well can be tricky. You will learn how to use Node.js test runner to its full potential. We'll show you how it compares to other tools, how to set it up, and how to run your tests effectively. During the workshop, we'll do exercises to help you get comfortable with filtering, using native assertions, running tests in parallel, using CLI, and more. We'll also talk about working with TypeScript, making custom reports, and code coverage.