Back to Blog
ragai-architecturehealthtechreliabilitymulti-agent

RAG Retrieval Reliability: Why I Stopped Trusting It (Simply Explained)

A plain-language guide to rag retrieval reliability. No jargon, no tech speak, just what it means for your business.

By Mike Hodgen

Want the full technical deep dive? Read the detailed version

The Day My AI Lied to Me With a Straight Face

I built a private health AI for a family member. Think of it like a small medical team, except the team members are AI assistants. Each one has a specialty: one watches vital signs, one tracks medications, one knows the medical history. You ask a question, the right specialist answers, and a head assistant pulls it all together.

It worked great. Until it didn't, in a way that changed how I build these systems.

Here was the problem. I'd ask a general question like "how am I doing overall?" and one of the AI assistants would answer, with total confidence, "I don't see a BMI in your chart."

The BMI was right there. Current. Sitting in the exact file the AI had access to.

This is the dangerous part. The AI wasn't wrong about the answer. It was wrong about whether it even had the answer. And those are two very different failures.

The second one is worse, because it sounds honest. "I don't see that" feels careful. It feels trustworthy. It sounds like the AI checked and came up empty.

In a health setting, that's a wrong decision waiting to happen. A weight problem reported as missing. A medication that doesn't show up when it should. The whole reason you put AI on top of your data is so it uses all of it, especially the parts that matter most.

So here's the real question for anyone thinking about this: can you actually trust AI to use every important fact in your data? Or will it confidently skip the ones that matter and swear they were never there?

Why AI Skips the Most Important Stuff

To fix it, I had to understand why it happens. And it's not a glitch. It's just how these systems are built to search.

Here's the simple version. When you ask the AI a question, it doesn't read your entire file. It grabs the chunks of information that sound most similar to your question. Then it answers based only on what it grabbed.

That works fine when your question and the fact use similar words. Ask "what medications am I on?" and the medication info lights up. Easy.

The trouble is that "sounds similar" is not the same as "actually important."

Now ask a broad question like "how am I doing?" Nothing in those words sounds like a specific BMI number or a drug allergy. So the search skips right past them. The single most important fact, the one a doctor would mention first, never gets pulled in. The AI never sees it. So it honestly tells you it's missing.

Here's the pattern that makes this scary: the more general your question, the more likely the system misses the critical stuff. The exact questions where you most want the important facts to surface are the questions where the search is least likely to find them. And it fails quietly. No alarm. No error message.

Some Facts Are Too Important to Leave to Chance

Once I understood the cause, the fix was obvious. There's a small group of facts where "usually found" is nowhere near good enough.

In health, that group is short and clear: current weight and height, BMI, blood pressure, heart rate, active medications, known allergies. These are the facts that matter to almost any question, and the cost of the AI not seeing one is severe.

You don't want a coin flip deciding whether the AI sees a drug allergy. You want it there, every single time, no exceptions.

The mistake most systems make is treating every piece of information as equally findable. They dump everything into one big searchable pile and hope the right stuff comes up. That's fine for the random background details. It's reckless for the handful of facts that should never go missing.

This isn't just a health thing. Every business has its own version:

  • An account balance in a financial assistant
  • A contract expiration date in a legal assistant
  • An order status in a customer service bot
  • An allergy, anywhere it shows up

These are facts where being wrong about whether you even have them leads to a wrong decision. They can't depend on whether the search happened to find them.

My Fix: Stop Searching, Start Guaranteeing

So I stopped trying to make the search reliable for these facts. I took the search out of the equation entirely.

Instead, I built a simple rule: before any AI assistant answers anything, the critical facts get placed at the very top of what it reads. Every question. Every time. No matter what the search turned up.

The current weight, BMI, blood pressure, heart rate, medications, allergies. All of it sits right in front of the AI before it thinks about anything else.

The "I don't see a BMI" failure became impossible. The BMI is always there.

I could have spent weeks fine-tuning the search to be smarter. But here's the thing: you can improve a search forever and still hit an oddly worded question that slips through. With searching, you're always managing a failure rate. For facts where a single miss could cause a wrong health decision, the only acceptable failure rate is zero. And you can't get to zero with a system that's basically guessing.

Putting the facts in front of the AI by default costs me a tiny bit more each time, since I'm including them whether the question needs them or not. We're talking pennies. For facts where one miss could matter, that's not a cost. It's insurance.

I added two more safety nets on top. First, a rule baked into every assistant: you are not allowed to say "I don't see this" without first double-checking the actual data. Second, that head assistant I mentioned reviews all the specialists before anything reaches the person. If one says "no BMI on file" while the data clearly has one, the head catches the mistake and fixes it. A false answer has to slip past two separate checks to ever reach you.

When Searching Is Still the Right Tool

I want to be clear: I'm not against searching. I use it constantly, including in this same system.

For the giant pile of background information, searching is exactly right. Old medical notes, past conversations, big stacks of documents, anything where you genuinely can't know in advance what'll matter. You can't shove all of that in front of the AI every time. It wouldn't fit, and most of it would be noise.

So here's the rule I follow, and the test I give every client:

If a fact being missed would cause a wrong decision, put it in front of the AI by default. If it's just nice-to-have background, let the search find it.

A drug allergy being missed causes a wrong decision. Guarantee it. A note from a routine checkup a year and a half ago is helpful if it's relevant, harmless if it's not. Let the search handle it.

That one question sorts almost everything. And it works everywhere. In a finance assistant, the account balance is guaranteed and the transaction history is searched. In a legal one, the expiration date is guaranteed and the supporting details are searched.

What This Means for Trusting AI With Your Data

So back to the question I opened with. Can you trust AI to use all your data?

Yes. But only if someone built the system to guarantee it, not just hope for it.

Out of the box, these systems will confidently miss things. They'll tell you a fact isn't there when it is, in a tone that sounds careful and trustworthy. Most vendors won't mention this, partly because their demo uses clean, easy questions where the search happens to work, and partly because the failure is invisible until it costs you.

That's the difference between a slick demo and a system you can actually rely on. Knowing which facts can't be left to chance. Building the backstops. Catching the mistakes before they reach you. None of that shows up in a flashy demo. All of it shows up the first time the easy default quietly fails.

That's the work I do. I find the places where AI breaks in ways nobody noticed, and I engineer around them before they cost you a wrong decision.

If you're putting AI against data where being wrong about what you have is not an option, that's the conversation worth having.

Want to explore what AI could do for your business?

Book a free 30-minute strategy call. No pitch deck, no sales team, just a real conversation about your operations and where AI fits.

Book a Discovery Call

Get AI insights for business leaders

Practical AI strategy from someone who built the systems — not just studied them. No spam, no fluff.

Ready to automate your growth?

Book a free 30-minute strategy call with Hodgen.AI.

Book a Strategy Call