AI Engineer End-to-End Workflow: Productivity Challenges and Their Solutions

This ad is not shown to multipass and full ticket holders
React Summit
React Summit 2025
June 13 - 17, 2025
Amsterdam & Online
The biggest React conference worldwide
Learn More
In partnership with Focus Reactive
Upcoming event
React Summit 2025
React Summit 2025
June 13 - 17, 2025. Amsterdam & Online
Learn more
Bookmark
Rate this content

Building generative AI applications requires a GenAIOps mindset that takes you on a journey from model selection to prompt ideation, prototype evaluation, and app deployment. In this talk, we’ll explore the GenAIOps journey from an AI Engineer perspective and identify productivity challenges, and the tooling solutions that can mitigate these issues and help streamline their developer experience.

This talk has been presented at Productivity Conf for Devs and Tech Leaders, check out the latest edition of this Tech Conference.

FAQ

Nithya Narasimhan is an AI advocate at Microsoft.

The focus of the talk is on productivity challenges and solutions for AI engineers, particularly in the journey from catalog to code to cloud.

An AI engineer is someone who can pick a model, customize it through various techniques, evaluate it for quality, and deploy it for real-world applications.

'Exposure to AI' refers to a task being exposed to AI if AI-powered software can reduce the time needed to complete the task by at least 50%.

Contoso Chat is a generic retail application used to demonstrate the integration of a chatbot into a retail website to improve customer interaction and sales.

The desired features are conversational interaction, grounding responses in catalog data, context awareness, and safety from malicious activities.

The three stages are ideation, augmentation, and operationalization.

Infrastructure as code allows defining infrastructure needs through code, ensuring consistency across team members and enabling use of tools like Azure Developer CLI for provisioning.

The Azure AI Inference API provides a unified abstraction to interact with various models, allowing developers to swap models with minimal effort and enhance productivity during the ideation phase.

AI-assisted evaluation involves using another AI to grade the responses of the first AI, often referred to as 'LLM as a judge,' to scale evaluation against large datasets.

Nitya Narasimhan
Nitya Narasimhan
22 min
27 Mar, 2025

Comments

Sign in or register to post your comment.
  • Va Da
    Va Da
    P4
    AI is taking over the world.
Video Summary and Transcription
Today's Talk explores productivity challenges and solutions for AI engineers, focusing on an application called Contoso Chat. The end-to-end workflow is examined from a productivity perspective, introducing the concept of Exposure to AI. Building an AI application involves ideation, augmentation, and operationalization stages. Provisioning and setup are crucial steps, with infrastructure as code being a productivity tool. GitHub Codespaces and development containers provide consistent development environments. Azure AI Inference API allows easy iteration and prototyping. AI-assisted evaluation involves training AI models and using evaluators for grading responses. Custom evaluators can be created. Overall, the Talk emphasizes the importance of productivity throughout the AI engineer's journey.

1. Introduction

Short description:

Today, I want to talk about productivity challenges and solutions for the AI engineers journey. I will distinguish between apps and ops and explore the use of AI-based efficiencies. We will focus on an application called Contoso Chat to discover how AI can improve productivity throughout the workflow. An AI engineer has the skills to pick and customize a large-language model, evaluate its performance, and deploy it for real-world applications.

Hello, my name is Nithya Narasimhan and I am an AI advocate at Microsoft. Today, I want to talk to you about productivity challenges and solutions for the AI engineers journey from catalog to code to cloud. But to set the stage, I actually want to start by asking, what does productivity mean and who is this AI engineer? And I want to think of it in two steps.

First, I want to distinguish between apps and ops. And second, I want to use this really nice question, is my task exposed to AI, as a way to ask, is there room for AI-based efficiencies in this step of my workflow? For the bulk of this talk, we'll really focus on setting the stage with an application called Contoso Chat that I've been working on for a year or more and use that to kind of walk through this end-to-end workflow and say, in every stage of this, is there room for efficiencies and productivity with AI?

So to set the stage, we all, when we think about AI and productivity, we're really thinking of ourselves as app developers. And we're using AI for efficiency in our coding workflows. That means we might be scaffolding out a new app. We might be writing tests. We might be writing documentation, debugging, getting explainers, et cetera, using tools like Copilot. But when I think about an AI engineer, an AI engineer is somebody who has the skills to go all the way from the very first step of picking a large-language model and then customizing it through prompt engineering or fine-tuning or retrieval-augmented generation, and then evaluating it to make sure that it's providing good quality responses for a lot of diverse inputs, and then deploying it so it can actually be integrated with real-world applications to deliver that service.

2. Productivity Perspective

Short description:

Let's look at the end-to-end workflow from a productivity perspective. Chip Huyan's AI Engineering book introduces the concept of Exposure to AI, where tasks can be considered exposed to AI if it reduces the completion time by 50% or more.

Now, that's a lot. But how can I now look at that end-to-end workflow from a productivity perspective? And I want to recommend all of you read this book if you haven't. This is Chip Huyan's AI Engineering. And within the first chapter or so, she actually has this term that caught my attention called Exposure to AI. And it comes from this paper called Elundo et al. But what I really liked is it kind of says exposure to AI. It defines a task as being exposed to AI if AI and AI-powered software can reduce the time needed to complete that task by at least 50%. And I thought this is something I can measure.

Check out more articles and videos

We constantly think of articles and videos that might spark Git people interest / skill us up or help building a stellar career

Building a Voice-Enabled AI Assistant With Javascript
JSNation 2023JSNation 2023
21 min
Building a Voice-Enabled AI Assistant With Javascript
Top Content
This Talk discusses building a voice-activated AI assistant using web APIs and JavaScript. It covers using the Web Speech API for speech recognition and the speech synthesis API for text to speech. The speaker demonstrates how to communicate with the Open AI API and handle the response. The Talk also explores enabling speech recognition and addressing the user. The speaker concludes by mentioning the possibility of creating a product out of the project and using Tauri for native desktop-like experiences.
AI and Web Development: Hype or Reality
JSNation 2023JSNation 2023
24 min
AI and Web Development: Hype or Reality
Top Content
This talk explores the use of AI in web development, including tools like GitHub Copilot and Fig for CLI commands. AI can generate boilerplate code, provide context-aware solutions, and generate dummy data. It can also assist with CSS selectors and regexes, and be integrated into applications. AI is used to enhance the podcast experience by transcribing episodes and providing JSON data. The talk also discusses formatting AI output, crafting requests, and analyzing embeddings for similarity.
The Rise of the AI Engineer
React Summit US 2023React Summit US 2023
30 min
The Rise of the AI Engineer
Top Content
Watch video: The Rise of the AI Engineer
The rise of AI engineers is driven by the demand for AI and the emergence of ML research and engineering organizations. Start-ups are leveraging AI through APIs, resulting in a time-to-market advantage. The future of AI engineering holds promising results, with a focus on AI UX and the role of AI agents. Equity in AI and the central problems of AI engineering require collective efforts to address. The day-to-day life of an AI engineer involves working on products or infrastructure and dealing with specialties and tools specific to the field.
The Ai-Assisted Developer Workflow: Build Faster and Smarter Today
JSNation US 2024JSNation US 2024
31 min
The Ai-Assisted Developer Workflow: Build Faster and Smarter Today
AI is transforming software engineering by using agents to help with coding. Agents can autonomously complete tasks and make decisions based on data. Collaborative AI and automation are opening new possibilities in code generation. Bolt is a powerful tool for troubleshooting, bug fixing, and authentication. Code generation tools like Copilot and Cursor provide support for selecting models and codebase awareness. Cline is a useful extension for website inspection and testing. Guidelines for coding with agents include defining requirements, choosing the right model, and frequent testing. Clear and concise instructions are crucial in AI-generated code. Experienced engineers are still necessary in understanding architecture and problem-solving. Energy consumption insights and sustainability are discussed in the Talk.
Web Apps of the Future With Web AI
JSNation 2024JSNation 2024
32 min
Web Apps of the Future With Web AI
Web AI in JavaScript allows for running machine learning models client-side in a web browser, offering advantages such as privacy, offline capabilities, low latency, and cost savings. Various AI models can be used for tasks like background blur, text toxicity detection, 3D data extraction, face mesh recognition, hand tracking, pose detection, and body segmentation. JavaScript libraries like MediaPipe LLM inference API and Visual Blocks facilitate the use of AI models. Web AI is in its early stages but has the potential to revolutionize web experiences and improve accessibility.
Code coverage with AI
TestJS Summit 2023TestJS Summit 2023
8 min
Code coverage with AI
Codium is a generative AI assistant for software development that offers code explanation, test generation, and collaboration features. It can generate tests for a GraphQL API in VS Code, improve code coverage, and even document tests. Codium allows analyzing specific code lines, generating tests based on existing ones, and answering code-related questions. It can also provide suggestions for code improvement, help with code refactoring, and assist with writing commit messages.

Workshops on related topic

AI on Demand: Serverless AI
DevOps.js Conf 2024DevOps.js Conf 2024
163 min
AI on Demand: Serverless AI
Top Content
Featured WorkshopFree
Nathan Disidore
Nathan Disidore
In this workshop, we discuss the merits of serverless architecture and how it can be applied to the AI space. We'll explore options around building serverless RAG applications for a more lambda-esque approach to AI. Next, we'll get hands on and build a sample CRUD app that allows you to store information and query it using an LLM with Workers AI, Vectorize, D1, and Cloudflare Workers.
AI for React Developers
React Advanced 2024React Advanced 2024
142 min
AI for React Developers
Top Content
Featured Workshop
Eve Porcello
Eve Porcello
Knowledge of AI tooling is critical for future-proofing the careers of React developers, and the Vercel suite of AI tools is an approachable on-ramp. In this course, we’ll take a closer look at the Vercel AI SDK and how this can help React developers build streaming interfaces with JavaScript and Next.js. We’ll also incorporate additional 3rd party APIs to build and deploy a music visualization app.
Topics:- Creating a React Project with Next.js- Choosing a LLM- Customizing Streaming Interfaces- Building Routes- Creating and Generating Components - Using Hooks (useChat, useCompletion, useActions, etc)
Working With OpenAI and Prompt Engineering for React Developers
React Advanced 2023React Advanced 2023
98 min
Working With OpenAI and Prompt Engineering for React Developers
Top Content
Workshop
Richard Moss
Richard Moss
In this workshop we'll take a tour of applied AI from the perspective of front end developers, zooming in on the emerging best practices when it comes to working with LLMs to build great products. This workshop is based on learnings from working with the OpenAI API from its debut last November to build out a working MVP which became PowerModeAI (A customer facing ideation and slide creation tool).
In the workshop they'll be a mix of presentation and hands on exercises to cover topics including:
- GPT fundamentals- Pitfalls of LLMs- Prompt engineering best practices and techniques- Using the playground effectively- Installing and configuring the OpenAI SDK- Approaches to working with the API and prompt management- Implementing the API to build an AI powered customer facing application- Fine tuning and embeddings- Emerging best practice on LLMOps
Free webinar: Building Full Stack Apps With Cursor
Productivity Conf for Devs and Tech LeadersProductivity Conf for Devs and Tech Leaders
71 min
Free webinar: Building Full Stack Apps With Cursor
Top Content
WorkshopFree
Mike Mikula
Mike Mikula
In this webinar I’ll cover a repeatable process on how to spin up full stack apps in Cursor.  Expect to understand techniques such as using GPT to create product requirements, database schemas, roadmaps and using those in notes to generate checklists to guide app development.  We will dive further in on how to fix hallucinations/ errors that occur, useful prompts to make your app look and feel modern, approaches to get every layer wired up and more!  By the end expect to be able to run your own ai generated full stack app on your machine!
Building AI Applications for the Web
React Day Berlin 2023React Day Berlin 2023
98 min
Building AI Applications for the Web
Workshop
Roy Derks
Roy Derks
Today every developer is using LLMs in different forms and shapes. Lots of products have introduced embedded AI capabilities, and in this workshop you’ll learn how to build your own AI application. No experience in building LLMs or machine learning is needed. Instead, we’ll use web technologies such as JavaScript, React and GraphQL which you already know and love.
Building Your Generative AI Application
React Summit 2024React Summit 2024
82 min
Building Your Generative AI Application
WorkshopFree
Dieter Flick
Dieter Flick
Generative AI is exciting tech enthusiasts and businesses with its vast potential. In this session, we will introduce Retrieval Augmented Generation (RAG), a framework that provides context to Large Language Models (LLMs) without retraining them. We will guide you step-by-step in building your own RAG app, culminating in a fully functional chatbot.
Key Concepts: Generative AI, Retrieval Augmented Generation
Technologies: OpenAI, LangChain, AstraDB Vector Store, Streamlit, Langflow