Insight · complete medical records

Complete Medical Records: The Missing Input for Healthcare AI

Patient portals show a fraction of the record. Why healthcare AI needs the complete record — every note, image, and bill — and what completeness takes.

Featuring Aleida Lanza on The Signal Room

Each patient is tasked with recalling their entire medical history whenever they step foot into an exam room. This includes their medical histories, surgical procedures, and medication histories, including the time and date. Unfortunately, most individuals struggle with this task because the full medical history is not easily accessible to the patient. It is, however, locked away in multiple locations and exists in multiple, secure, and controlled versions.

Aleida Lanza talks about this in a recorded Signal Room conversation at the Put Data First conference. Aleida spent 35 years as a medical malpractice paralegal before founding CaseDok. Her thoughts on this subject are direct and unsettling. Truthfully, when we say we are giving patients their data, and when we harness AI to interpret those data, we are giving them a very small piece of the whole. The only data patients have access to is their patient portal, and the entire dataset of their medical care is largely locked away and inaccessible to them. At Hutchins Data Strategy Consultants, this is what people are most frustrated with when it comes to healthcare AI: the framework was never given the data it needed to be right.

Not Everything Is Recorded in the Patient Portal

Lanza identified a line that is intentionally glossed over in most patient access conversations. She noted that a patient portal only contains a small amount of what a provider has. Therefore, if you are not viewing what your doctor is seeing, then you are viewing an excerpt of your record, not your actual record.

Her definition of completeness encompassed more than clinical notes. When she speaks of the medical record, she is referring to everything, the entirety of records, images, and detailed bills. These categories in particular are the ones that typically disappear from the patient-facing view. A scan exists as a report, and a procedure exists as an item that the patient never sees. For someone who has dedicated her career to piecing together the fragmented story of what happened to a patient, that missing material is often the part of the story that is most important.

She made an additional, more focused observation about the term interoperability. Lanza asserted that when the term is used in the context of the industry, it often refers to the interoperability of only the core clinical records, the portions of the record that are structured, coded, and easily transferable between systems. The remainder of the record does not integrate in that manner, and presenting the clinical record as the entire record allows everyone to consider a widely unresolved issue to be resolved.

Antecedents Over Episodes

What a record is for is the line that stays with you. Healthcare, she said, seems to look at what happened at the hospital and misses how the patient got there, the antecedent, the run-up, the history leading to the event. That earlier story is the most often missing part, and it is frequently the part that would change a decision.

This ties into something Chris Hutchins brought up in the episode: medicine evaluates the performance of a healthcare provider using an average patient, which is an abstraction that collapses the moment the focus is on the actual patient in front of you. A record created around the discrete episode, rather than the trajectory that led to it, carries the same flattening. If you point an AI system at that kind of record, the model will learn the episode, not the patient.

Fragmentation Has a True Cost

The fragmentation that Lanza discussed has a price, and it is most often patients who pay this price multiple times. We, in her explanation, pay for the same records repeatedly, every new appointment starts with a blank history form and a patient struggling to remember the history an earlier provider documented. Of the approximately fifteen minutes a visit buys, she said, a large part can be consumed just trying to explain a history that should already be put together.

She framed it using a specific economic lens: it is possible healthcare is the only sector in which you have to pay for a service in order to receive the data related to that service. If an insurance company covers a large expense for a scan, that scan is essentially a data set that the patient usually does not receive. She was arguing that patients should have the right to not only own that data set, but to also download it, store it, and carry the associated images and other line items. Her position was reinforced by real experience: she explained that she left her legal career after witnessing a family member being the victim of a malpractice incident, which she recognized as it was occurring and was powerless to intervene.

Why Completeness Has to Be Prioritized Before the Model

The most important part of Lanza's lecture for anyone designing a complex healthcare AI was Lanza's requirement that the record be assembled before AI is considered, and handled with extreme care. It is also relevant that the record does not have a consistent shape to it. If a source sends data in HL7, it will be HL7, and if it is sent as a PDF, it will remain a PDF. A rural practice may still be relying on paper records. Her methodology was to take the source at its face value and in its native format, rather than forcing everything into a single shape up front.

She was just as adamant regarding the restrictions. You cannot just throw a medical record into an AI system because much of it is protected. The work she envisioned was a pipeline that would enable an AI system to safely access the full medical history and allow the AI to address the antecedent history without further fragmentation or hallucination. She mentioned an unreliability finding of records in emergency settings that she remembered being published in 2025, and estimated it to be on the order of one third. She cited it loosely, and the number should be treated as an alarm and not as a statistic. The underlying point was that when an unreliable history is given to a clinician in a case whereby the patient is unable to provide any information, a complete and curated history is what closes the gap.

Hutchins identified the deeper concern pertaining to the episode. Missing information is the most dangerous type of bias because there is nothing to identify it with. A wrong value can be identified, but a value can be missing and there is no trace. A model can lack completeness and still be confident, and no fairness audit of the model will reveal a fact that was never part of the information.

For context, CaseDok launched a day before the recording. She explained that it currently serves UnitedHealthcare, Aetna, and Florida Blue enrolled members and is conditionally approved on the Medicare.gov Connected Apps Registry. Once the government shutdown ends, CaseDok will be able to extend the services to Medicare members. This is a first rollout, and the diagnosis is what is important. AI is being asked to reason over healthcare records that were never complete to begin with.

How Hutchins Approaches Record Completeness

Our work tends to start where the model conversation has already moved on — at whether the record a use case depends on is actually whole. Completeness is the first lens we apply in any data readiness assessment: not whether an organization has a lot of data, but whether it holds the right data, in usable form, for the decision a model is meant to support. That bleeds into data quality and the governance that decides who can assemble which records, in what format, under which privacy obligations. A pipeline that brings a complete history together also has to answer for every piece of privileged material it touches. These themes run through The Signal Room podcast, where practitioners describe what it takes to give healthcare AI something solid to reason over.

Authoritative sources

Have a data or AI challenge like this?

A 30-minute call is enough to tell whether we're the right fit.

FAQ

Frequently asked questions

What is a complete medical record, and why does it matter for AI?

It is the full corpus of a patient's history — clinical notes, images, and itemized bills across every provider — not the slice exposed in a single patient portal. AI reasoning over a partial record inherits the gaps in that record, and some of those gaps are impossible to see from inside any one system.

Don't patient portals already give patients their records?

Most portals surface only a small fraction of what exists. On the episode, Aleida Lanza argued that a patient who reads a portal is not seeing what the doctor sees, and that the rest of the record — including images and billing detail — usually has to be acquired separately and at cost.

Why can't you just feed the whole record into an AI model?

Records arrive in different native formats, from HL7 messages to PDFs to paper, and much of the content is privileged. Lanza's position was that the corpus has to be assembled and handled safely before AI is brought anywhere near it — privileged material cannot simply be poured into a model.

How does record incompleteness create bias?

When information is missing rather than wrong, there is no error signal to catch. A model can look confident while reasoning over a record that omits the one fact that would have changed the plan, which makes missing-data bias harder to detect than most other failure modes.