And that gave operations agents an opportunity to compare what the chatbot was saying with their own answers and give us feedback. And of course, we needed a high level of confidence at that stage gate before we went to customer contact. This is where the problem started to emerge. If you ask the chatbot a question that's not in its knowledge base, most of the time it will escalate to a human, but occasionally it will just make something up, which I'm sure comes as no surprise to anyone who has worked with LLMs. It was pretty good, but 90 to 95% accuracy is not good enough for something that's going to talk to customers thousands of times a week.

And this was where the real problem emerged: complexity. We kept papering over the cracks of one small LLM problem after another. It's lying to you; okay, how do we fix that? We put in a validation step after it generates its answer to check: is this factually correct? Maybe that will work pretty well, but ultimately we have to build the validation and test it before we know. So not only have you got the complexity of workaround stacked upon workaround, it's also very hard to estimate, because we just don't know how well each fix is going to work.
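To make that concrete, the answer-then-validate pattern looks roughly like this. This is an illustrative sketch, not Merlin's actual code: the `CompleteFn` wrapper, the prompts, and the escalation message are all assumptions.

```python
from typing import Callable

# Stand-in for a real LLM call: any prompt -> text function
# (e.g. a thin wrapper around whichever LLM API you use).
CompleteFn = Callable[[str], str]

ESCALATE = "Let me pass you to one of our agents who can help with that."

def validate_answer(complete: CompleteFn, question: str, answer: str, kb: str) -> bool:
    """Second pass: is the draft answer actually supported by the knowledge base?"""
    verdict = complete(
        "Knowledge base:\n" + kb + "\n\n"
        "Question: " + question + "\n"
        "Draft answer: " + answer + "\n\n"
        "Is the draft answer fully supported by the knowledge base above? "
        "Reply with exactly YES or NO."
    )
    return verdict.strip().upper().startswith("YES")

def respond(complete: CompleteFn, question: str, kb: str) -> str:
    draft = complete(
        "Using ONLY the knowledge base below, answer the question. "
        "If the answer is not in the knowledge base, reply UNKNOWN.\n\n"
        "Knowledge base:\n" + kb + "\n\nQuestion: " + question
    )
    # Escalate instead of guessing: either the model admits it doesn't know,
    # or the validation pass can't confirm the answer.
    if draft.strip().upper() == "UNKNOWN" or not validate_answer(complete, question, draft, kb):
        return ESCALATE
    return draft
```

The catch is visible right in the sketch: the validator is itself an LLM call, so you can't know how reliable the workaround is until you've built and tested it too.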
To give a counter-example, we have built other AI projects within Capital on Tap. We have one called Blaze, which is essentially AI transcription and summarization of customer calls. It's a bit more involved than that, but it's pretty straightforward. And it was built by the operations engineering team, so it started off in the correct team, with a strong product focus, and with operational metrics tracked from the start. After a phone call, an operations agent does the wrap-up, and we've cut that time from three minutes to two. Across 600 calls a day, that minute per call is roughly ten hours of agent time saved daily.
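At a high level, the core of a pipeline like that is two steps, sketched below. This is not the actual Blaze implementation; the transcription and completion functions are stand-ins for whatever speech-to-text and LLM services are in use, and the assumption that agents edit a drafted summary rather than writing notes from scratch is mine.

```python
from typing import Callable

# Stand-ins for real services: a speech-to-text API and an LLM completion API.
TranscribeFn = Callable[[bytes], str]  # call audio -> transcript
CompleteFn = Callable[[str], str]      # prompt -> generated text

def draft_wrap_up(transcribe: TranscribeFn, complete: CompleteFn, call_audio: bytes) -> str:
    """Transcribe a customer call and draft the agent's wrap-up notes.

    The agent reviews and corrects the draft instead of writing notes
    from scratch, which is where the time saving would come from.
    """
    transcript = transcribe(call_audio)
    return complete(
        "Summarize this customer call as wrap-up notes for the agent. "
        "Cover the customer's issue, the resolution, and any follow-up actions.\n\n"
        + transcript
    )
```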
Then decision time came for Merlin, our chatbot: we had to decide what to do with it. We'd spent at least 200K in the salaries of those involved and in LLM costs, and we had to say to the business, look, this thing you thought was weeks away is actually going to be paused and is now months away. But the CEO understands the sunk cost fallacy, so we killed it and decided to look at third-party vendors.

Ultimately, we decided that a project this complex was not our core competency. We should stick to the fintech side of things, own the simpler AI projects within product teams, and go with third parties for something as complicated as this. We had two vendors that we liked, so we found other businesses that had integrated with them and went to chat with those deployed bots to see how they performed. One was much faster, and one of them lied to us about a balance amount, which was very interesting to see live in the real world. And as we were killing the main AI project, we decided to reset how we do AI at Capital on Tap: we no longer have a separate AI team. Instead, we have AI engineers who embed in an existing product team, help them deliver their project, and then step away.