Introduction to Retrieval Augmented Generation
Retrieval Augmented Generation (RAG) is a technique that enhances large language models by supplying them with additional context at query time. It addresses two key limitations: models cannot access information created after their training cutoff, and they have no knowledge of private data. By integrating RAG, developers can give models the context they need to generate more accurate and relevant responses.
RAG involves storing relevant data that can be retrieved and used as context in response to user queries. This process allows models to generate responses using information they did not have access to during training. The key challenge lies in effectively retrieving the right data based on natural language queries.
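The retrieve-then-generate flow described above can be sketched in a few lines. Note that the corpus, the word-overlap scoring in `retrieve()`, and the prompt template are all illustrative stand-ins, not any particular library's API; a real system would use vector similarity, as discussed below.

```python
def retrieve(query: str, corpus: list[str], k: int = 2) -> list[str]:
    """Rank documents by naive word overlap with the query (a simple
    stand-in for the vector similarity search discussed later)."""
    q_words = set(query.lower().split())
    scored = sorted(
        corpus,
        key=lambda doc: len(q_words & set(doc.lower().split())),
        reverse=True,
    )
    return scored[:k]

def build_prompt(query: str, corpus: list[str]) -> str:
    """Combine retrieved context with the user's question before
    sending everything to the language model."""
    context = "\n".join(retrieve(query, corpus))
    return f"Answer using this context:\n{context}\n\nQuestion: {query}"

corpus = [
    "The conference keynote starts at 9am in Hall A.",
    "Lunch is served at noon in the main foyer.",
    "The RAG workshop covers vector databases.",
]
print(build_prompt("When does the keynote start?", corpus))
```

The model never needs to have seen the schedule during training; the relevant facts arrive in the prompt itself.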
Understanding Vector Embeddings
Vector embeddings are central to the RAG process. They represent the meaning of text as a list of numbers, allowing for comparisons based on similarity. Unlike traditional keyword search, vector embeddings enable searches based on the similarity of meaning, making them ideal for natural language processing tasks.
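"Similarity of meaning" is typically measured as cosine similarity: the cosine of the angle between two vectors, where 1.0 means the vectors point in the same direction and values near 0 mean they are unrelated. A minimal sketch, using tiny made-up three-dimensional vectors (real embeddings have hundreds or thousands of dimensions):

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors: 1.0 for identical
    direction, near 0.0 for unrelated vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy "embeddings" for illustration only.
king = [0.9, 0.8, 0.1]
queen = [0.85, 0.82, 0.12]
banana = [0.1, 0.05, 0.95]

print(cosine_similarity(king, queen))   # high: similar meaning
print(cosine_similarity(king, banana))  # low: unrelated meaning
```

A keyword search would find no overlap between "king" and "queen", but their embedding vectors sit close together, which is exactly what makes this representation useful for natural language queries.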
Embedding models, available from various AI companies, facilitate this process by converting text into high-dimensional vectors. These models can capture the nuanced meanings of words and entire texts. For instance, OpenAI offers models that provide vector representations with thousands of dimensions, ensuring a rich capture of textual meaning.
Building a Basic Vector Model
Creating a vector model from scratch involves several steps. Initially, all words from a corpus are collected, excluding common words that contribute little to meaning. These words form the basis for creating vectors for each text item in the corpus.
Each text is then converted into a vector, with each element representing the count of specific words. This approach, while simple, has limitations due to its reliance on word presence and frequency, which may not fully capture the complexity of language.
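The two steps above can be implemented directly: collect the vocabulary (minus a stop-word list), then count each vocabulary word's occurrences in every text. The stop-word list and corpus here are illustrative:

```python
# Common words that contribute little to meaning are excluded.
STOP_WORDS = {"the", "a", "an", "is", "of", "and", "to", "in"}

def build_vocabulary(corpus: list[str]) -> list[str]:
    """Collect every non-stop word across the corpus."""
    words = set()
    for text in corpus:
        words.update(w for w in text.lower().split() if w not in STOP_WORDS)
    return sorted(words)

def vectorize(text: str, vocabulary: list[str]) -> list[int]:
    """One element per vocabulary word, holding its count in the text."""
    tokens = text.lower().split()
    return [tokens.count(word) for word in vocabulary]

corpus = ["the cat sat on the mat", "the dog sat on the log"]
vocab = build_vocabulary(corpus)
print(vocab)                                # ['cat', 'dog', 'log', 'mat', 'on', 'sat']
print([vectorize(t, vocab) for t in corpus])
```

Because every text maps onto the same vocabulary, the resulting vectors all have the same length and can be compared directly, but two sentences using entirely different words for the same idea would still score as dissimilar, which is the shortcoming discussed next.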
Improving Embedding Models
While basic vector models provide a starting point, they have inherent shortcomings. They are sparse and sensitive to vocabulary, and they struggle to capture the subtleties of language, such as word order and context-dependent meanings. Embedding models, in contrast, offer a more sophisticated solution by capturing these nuances.
These models, often part of larger AI frameworks, allow for more accurate and scalable solutions. They enable the efficient processing of large datasets, making them suitable for real-world applications where data volume can be substantial.
Utilizing Vector Databases
As data grows, managing it efficiently becomes crucial. Vector databases such as Astra DB address this by providing indexing and search capabilities optimized for vector data: rather than comparing a query vector against every stored vector, they use specialized indexes to run cosine-similarity searches quickly, even over very large collections.
Vector databases can also automate the vectorization process, simplifying the workflow for developers. This automation reduces the need for separate calls to embedding models and streamlines the integration of RAG systems.
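To make concrete what a vector database does, here is a deliberately naive in-memory version of the same operation: a linear scan that scores every stored vector against the query. The document set is invented for illustration; a real vector database replaces this scan with an index so search stays fast at millions of vectors.

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def search(query_vec: list[float], documents: list[dict], k: int = 1) -> list[dict]:
    """Linear scan: score every stored vector and return the k best
    matches. This is what a vector database's index makes fast."""
    scored = sorted(
        documents,
        key=lambda d: cosine_similarity(query_vec, d["vector"]),
        reverse=True,
    )
    return scored[:k]

documents = [
    {"text": "Intro to vector databases", "vector": [0.9, 0.1, 0.2]},
    {"text": "Cooking with cast iron",    "vector": [0.1, 0.9, 0.3]},
]
print(search([0.88, 0.15, 0.25], documents))
```

With automatic vectorization enabled, a database can also accept raw text on insert and on query, computing the embeddings itself and removing the separate embedding-model call from the application code.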
Practical Application and Demonstration
To illustrate the effectiveness of RAG, consider a conference bot tasked with identifying talks based on user queries. By vectorizing both the talks and user queries, the system can identify similar topics and provide relevant recommendations.
In the demonstration, the talks were vectorized and stored in Astra DB, allowing user queries to be answered with real-time similarity searches. The system successfully identified talks related to specific topics, showcasing the practical benefits of RAG in enhancing information retrieval.
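An end-to-end sketch of the conference-bot idea ties the pieces together: talks and the user's query are vectorized with the simple count-based model from earlier, then ranked by cosine similarity. The talk titles are invented for illustration; the actual demo used an embedding model and Astra DB rather than this toy vectorizer.

```python
import math

talks = [
    "Scaling vector search in production",
    "Tasty sourdough baking techniques",
    "Retrieval augmented generation for chatbots",
]

def vocab(texts: list[str]) -> list[str]:
    return sorted({w for t in texts for w in t.lower().split()})

def vectorize(text: str, vocabulary: list[str]) -> list[int]:
    tokens = text.lower().split()
    return [tokens.count(w) for w in vocabulary]

def cosine(a: list[int], b: list[int]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def recommend(query: str, talks: list[str]) -> str:
    """Return the talk whose vector is most similar to the query's."""
    vocabulary = vocab(talks + [query])
    query_vec = vectorize(query, vocabulary)
    return max(talks, key=lambda t: cosine(query_vec, vectorize(t, vocabulary)))

print(recommend("talks about vector search", talks))
```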
Conclusion
Retrieval Augmented Generation is a valuable tool for developers seeking to enhance the capabilities of large language models. By leveraging vector embeddings and databases, RAG systems can provide more accurate and contextually relevant responses. As AI technology continues to evolve, RAG represents a significant step forward in making these models more useful and adaptable to real-world applications.
Developers are encouraged to explore further possibilities within RAG, such as alternative embedding techniques and database solutions, to continue improving the efficiency and effectiveness of their AI systems.