Video Summary and Transcription
In this talk, the speaker demonstrates how to create an AI chatbot that can answer questions based on information the LLM was never trained on. They build a basic RAG pipeline in about five minutes of live coding. The speaker shows how to create embeddings and store them in a vector database, set up a vector search index and endpoint, and modify the chat route to augment the chatbot's answers. The program is run and tested, and the talk concludes with an invitation to join a workshop for more information.
1. Introduction
I'm going to speed run creating an AI chatbot that can answer questions based on information that the LLM was never trained on. We're going to create a basic RAG pipeline and build it in about five minutes, live coding.
I'm going to pack as much information into this talk as I can, and if I go too fast and lose you, or if you have any questions, come find me afterwards and I'm happy to talk. So what I'm going to do is speed run creating an AI chatbot, and not just any AI chatbot: a chatbot that can answer questions based on information that the LLM was never trained on. So we're going to create a basic RAG pipeline, retrieval augmented generation. On top of that, I'm going to build this in about five minutes, and I am going to live code, so nothing can go wrong, right?
2. Creating Embeddings and Vector Database
I ran npx create-next-app using the Next.js LangChain example, then installed MongoDB, React Markdown, and dotenv. Checked the app and added dark mode. Tested with a question about MongoDB. Created fake documentation for a JavaScript library. Transformed the markdown files into vectors and saved them in a MongoDB vector database, using vector search to enhance the LLM's capabilities. Set up file system promises, OpenAI embeddings, the MongoClient, and MongoDB Atlas Vector Search. Created embeddings for each document and stored them in MongoDB. Fixed a typo.
So I did already run npx create-next-app using the Next.js LangChain example. I installed MongoDB, the LangChain MongoDB integration, React Markdown for some styling, and dotenv, because we're going to use a Node script in order to run our ingest.
I also have an OpenAI API key and my MongoDB Atlas connection string in my environment variables. So let's go ahead and check out this app. This is the example straight out of the box, without any alteration. Well, I added dark mode so I wouldn't blind everybody, so you're welcome for that. Let's just test to make sure it works, so let's say "what is MongoDB" and hopefully the Wi-Fi works, and okay, there we go. OpenAI responds to us with a pretty good answer. So it's working out of the box. Great.
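For reference, the environment file for those two values might look something like this; the variable names here are assumptions, since the talk doesn't show them, so match whatever names your code actually reads:

```
# .env — hypothetical variable names
OPENAI_API_KEY=sk-...
MONGODB_ATLAS_URI=mongodb+srv://<user>:<password>@<cluster>.mongodb.net
```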
Let's check out the code. So I've got this fake documents directory here, and I used ChatGPT to help me create some fake documentation for a fake JavaScript library called FancyWidget.js. We have a readme, usage, license, installation, contributing, changelog, API reference: all the documentation you'd expect from a JavaScript library. So what we're going to do is take these markdown files, transform them into vector embeddings, and then save those in our vector database. We're going to use MongoDB for the vector database. Then, during vector search, we can use this to augment the LLM's capabilities so it can answer questions based on this information.
All right, so let's go ahead and get started doing that. In the root here, I'm going to create a new file named create-embeddings.mjs, and then we are going to do some typing here. We're going to import our file system promises, the RecursiveCharacterTextSplitter and OpenAIEmbeddings from LangChain, the MongoClient from MongoDB, and MongoDBAtlasVectorSearch from the LangChain MongoDB integration. Then we'll set up our MongoClient, getting our connection string from our environment variables.
Our database name is going to be documents, and the collection name embeddings. We'll set up our collection, get our documents directory with those fake documents, get the files in it, and console log the file names. Then we loop through those file names and read each document. After we read each document, we console log that we're vectorizing it, and our splitter, the RecursiveCharacterTextSplitter from LangChain, chunks it into pieces. We then store that output in MongoDB using MongoDB Atlas Vector Search: we create the embeddings, telling it which collection, index name, text key, and embedding key to use. Then we console log that we're done and close the connection to MongoDB.
And there is a bit of a typo here. Of course that didn't happen in practice. And of course I wasn't typing, because that was a VS Code extension. This is supposed to be import RecursiveCharacterTextSplitter.
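Here's a minimal sketch of what that ingest script might look like. The exact code isn't shown in the talk, so the package import paths, chunk sizes, directory name, environment variable name, and index name are all assumptions based on the LangChain MongoDB integration:

```js
// create-embeddings.mjs — a sketch of the ingest script described above
import fs from "fs/promises";
import path from "path";
import { RecursiveCharacterTextSplitter } from "langchain/text_splitter";
import { OpenAIEmbeddings } from "@langchain/openai";
import { MongoClient } from "mongodb";
import { MongoDBAtlasVectorSearch } from "@langchain/mongodb";
import "dotenv/config";

const client = new MongoClient(process.env.MONGODB_ATLAS_URI); // hypothetical variable name
const collection = client.db("documents").collection("embeddings");

const docsDir = "fake-documents"; // hypothetical directory name
const fileNames = await fs.readdir(docsDir);
console.log(fileNames);

for (const fileName of fileNames) {
  const document = await fs.readFile(path.join(docsDir, fileName), "utf8");
  console.log(`Vectorizing ${fileName}`);

  // Chunk the markdown into smaller pieces before embedding.
  const splitter = new RecursiveCharacterTextSplitter({
    chunkSize: 500,   // assumed chunk size
    chunkOverlap: 50, // assumed overlap
  });
  const output = await splitter.createDocuments([document]);

  // Embed each chunk and store it in the Atlas collection.
  await MongoDBAtlasVectorSearch.fromDocuments(output, new OpenAIEmbeddings(), {
    collection,
    indexName: "vector_index", // must match the search index created in Atlas
    textKey: "text",
    embeddingKey: "embedding",
  });
}

console.log("Done");
await client.close();
```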
3. Creating Vector Search Index and Endpoint
In under 40 lines of code, we created the ingest for our documents. Ran node create-embeddings.mjs and successfully created embeddings and saved them in MongoDB. Checked MongoDB and found the documents with text and embedding fields. Created a vector search index using the JSON editor and defined the index path, dimensions, and similarity. Created a new endpoint called vector-search with a route.ts file.
So let me grab that. And we should be good, I hope. Live coding, nothing ever goes wrong, right? So in under 40 lines of code, we have created our ingest for our documents. Let's go ahead and run that. Open up the console, kill this process, and run node create-embeddings.mjs. And yeah. The LangChain import, yes. Thank you. Anything else that I'm missing? Yeah. That. Thank you so much. Okay, let's go back into our console and run node create-embeddings.mjs again, and there we go, now it's working. Thank you. It's looping through each document, creating those embeddings, vectorizing them, and saving them in MongoDB. So let's go ahead and check that out. Right now there's nothing in MongoDB. Let's refresh it. And now we should have some documents in MongoDB, which we do. Amazing. We have a text field, which is the original text chunk, and then we have the embedding field, which is an array of numbers, the vectors, and we have some extra metadata as well. The next thing we need to do in order to use this is create a search index, a vector search index. So let's go ahead and create a new index. We're going to use the JSON editor, and let's define this index here. We're going to say the type is vector, the path is going to be the embedding field in our documents, and we define our dimensions and our similarity. Then we select the embeddings collection, hit next, and create. Good. It takes just a few seconds to do that, but while it's doing that, we need to go over to our API endpoints and create a new endpoint called vector-search, with a route.ts file.
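For reference, the index definition entered in the JSON editor would look roughly like this. The talk doesn't show the exact values, so the dimensions (1536, for OpenAI's default embedding model) and cosine similarity are assumptions:

```json
{
  "fields": [
    {
      "type": "vector",
      "path": "embedding",
      "numDimensions": 1536,
      "similarity": "cosine"
    }
  ]
}
```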
4. Setting Up Vector Search and Modifying Chat Route
We import OpenAIEmbeddings, the MongoClient, and MongoDBAtlasVectorSearch. Set up the database and collection. Configure the vector store with the collection, index name, text key, and embedding key. Create embeddings for the incoming question using MongoDB Atlas Vector Search. Retrieve the top five similar chunks from the vector store and return the results. Then modify the existing chat route to send the user's message to the vector search API and receive additional context to pass to the LLM. Add the extra context and the user's question back onto the message queue. Everything then continues as before to OpenAI's GPT-3.5.
Okay. So we're going to do some more speed typing here. This time, we're going to import our OpenAIEmbeddings from LangChain's OpenAI integration, our MongoClient, and then MongoDBAtlasVectorSearch from the LangChain MongoDB integration. Then we'll create our POST route, define the database name and the collection name, and set up the actual collection. After that, we will set up our database config: we define our collection there, the index name, our text key, and our embedding key, and then we set up our vector store using MongoDBAtlasVectorSearch. We create embeddings there for the question or prompt that the user is sending in. Then we set up our retriever on that vector store, so we're going to retrieve the top five similar chunks from our vector store, get those results, and return them. So this endpoint is using vector search in MongoDB to return the most relevant results for the question.
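A minimal sketch of what that vector-search route might look like follows; the file path, request body shape, and index name are assumptions:

```ts
// app/api/vector-search/route.ts — a sketch of the retrieval endpoint
import { NextRequest, NextResponse } from "next/server";
import { OpenAIEmbeddings } from "@langchain/openai";
import { MongoClient } from "mongodb";
import { MongoDBAtlasVectorSearch } from "@langchain/mongodb";

const client = new MongoClient(process.env.MONGODB_ATLAS_URI!); // hypothetical variable name
const collection = client.db("documents").collection("embeddings");

export async function POST(req: NextRequest) {
  const { question } = await req.json(); // assumed request body shape

  // Point the vector store at the same collection and index used for ingest.
  const vectorStore = new MongoDBAtlasVectorSearch(new OpenAIEmbeddings(), {
    collection,
    indexName: "vector_index",
    textKey: "text",
    embeddingKey: "embedding",
  });

  // Embed the question and return the five most similar chunks.
  const results = await vectorStore.similaritySearch(question, 5);
  return NextResponse.json(results);
}
```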
So the last thing we're going to do is go to our existing chat route. This is the route that came with the example project, and we're going to change it a little bit: we're going to intercept the user's message and do something with it. So let's do some more typing here. We're going to pop off the current message in the message queue and send it to that vector-search API route that we just created. That vector-search API route is then going to return additional context that we can use to then send to the LLM. So we're going to create this template. We're going to say: you are a very enthusiastic FancyWidget.js representative who loves to help people. Given the following sections from the FancyWidget.js documentation, answer the question using only that information, outputted in markdown format. If you are unsure and the answer is not explicitly written in the documentation, say, sorry, I don't know how to help with that. We're going to add that extra context that we got from vector search, add the user's question back in, and push that back onto the message queue. After that, everything will continue as is, going to OpenAI's GPT-3.5, which is what we're using here.
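A sketch of that interception step inside the existing chat route's POST handler might look like this; the fetch URL and variable names are assumptions:

```ts
// app/api/chat/route.ts (excerpt) — a sketch of the RAG interception step,
// inside the existing POST handler
const body = await req.json();
const messages = body.messages ?? [];

// Pop the user's latest message off the queue.
const currentMessage = messages.pop();

// Ask the vector-search endpoint for the most relevant documentation chunks.
const res = await fetch("http://localhost:3000/api/vector-search", { // hypothetical URL
  method: "POST",
  body: JSON.stringify({ question: currentMessage.content }),
});
const context = await res.json();

// Wrap the retrieved context and the original question in a grounding prompt.
const template = `You are a very enthusiastic FancyWidget.js representative who loves to help people!
Given the following sections from the FancyWidget.js documentation, answer the question using only that information, outputted in markdown format.
If you are unsure and the answer is not explicitly written in the documentation, say "Sorry, I don't know how to help with that."

Context sections:
${context.map((doc: { pageContent: string }) => doc.pageContent).join("\n---\n")}

Question:
${currentMessage.content}`;

// Push the augmented prompt back onto the queue; the rest of the route
// continues unchanged, streaming the response from OpenAI's GPT-3.5.
messages.push({ role: "user", content: template });
```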
5. Running the Program and Conclusion
Run the program again and ask it questions. It will answer based on the information we provided. If the question is not within the given context, it will say it doesn't know. End of the talk. Scan the QR code for more info and join the workshop.
Okay. So now if we go back into the console and run this again, let's go back and refresh this. Now if I ask it, what is FancyWidget.js? It should... Oh, no. Live coding. Vector search. Come on. You all saw that way before, you didn't say anything. Come on. Yeah, I know. I know. Huh? Let's kill this and run it again. All right. What is... Let me just make sure it's running. Yes, okay. What is FancyWidget.js? And it should answer us from our information. Yes, it did. It worked. Now, what if I say, what is MongoDB? It should say, I don't know how to help with that, because we didn't give it that context. So it's only answering questions about the information that we gave it. Okay. So that is the end of my talk, all the time that I have. Scan the QR code for more info, attend my free workshop, and come see us at the MongoDB booth.