Probabilistic Masking, Explained Without the Jargon
The simplest way I can describe the core idea behind our patent is this: we taught the system where to look before we asked it what it saw.
A 3D medical scan is enormous. It is not a single picture, it is a stack of hundreds of slices that together form a volume you can move through. Most of that volume is perfectly healthy tissue. Disease, when it is there, usually lives in a small part of it. If you hand all of that raw data to a model and ask it to find a problem, you are making it search an entire building room by room when you already have a decent idea of which floor matters.
Looking where it counts
Probabilistic masking is our answer to that. Before the heavy analysis happens, the system narrows its attention to the regions most likely to be medically meaningful, based on what similar, expertly annotated scans have shown in the past.
Think about how an experienced radiologist works. They do not stare at every pixel with equal weight. Years of training have taught their eye where trouble tends to hide for a given kind of scan. They look there first. Probabilistic masking is an attempt to give the system a version of that learned instinct, expressed as math rather than intuition.
The "probabilistic" part matters. The system is not throwing away the rest of the image. It is assigning higher priority to the areas that prior evidence says are worth examining closely, and it can be wrong and self-correct. It is a way of focusing, not a way of ignoring.
Why focusing changes everything downstream
Once you have isolated the regions that matter, two good things happen.
First, the analysis gets sharper. The model is spending its effort on the part of the image that actually carries the signal, instead of diluting itself across a huge volume of healthy tissue. Focus improves accuracy in machines for the same reason it does in people.
Second, you can represent those important regions far more efficiently. Instead of carrying around the entire raw scan, the system converts the regions of interest into compact mathematical summaries, which we call embeddings, and works with those. That makes everything faster and, as it happens, easier to protect, which is a thread I pull on in another piece.
The honest limits
I want to be careful here, because medical AI attracts a lot of overpromising. Teaching a system where to look is a meaningful advance, but it is not a doctor, and it is not a guarantee. The point of the approach is to make a capable model more focused, more efficient, and easier to trust, with a clinician still in the loop. It is an assistant to expert judgment, not a replacement for it.
That framing is deliberate. The best medical technology I have seen makes a skilled human faster and sharper. It does not try to push the human out of the room.
Why I find it elegant
What I like about probabilistic masking is that it borrows a very human idea and makes it useful to a machine. We do not solve hard problems by paying equal attention to everything. We solve them by knowing where to look first. Building that instinct into the system, in a way you can measure and improve, is the part of this work I find genuinely satisfying.
FAQ
What is probabilistic masking?
It is a method that narrows an AI system's attention to the regions of a 3D medical scan most likely to be medically meaningful, based on prior distributions from annotated scans, so analysis focuses where disease is most likely to be.
Part of the Patents series →
Privacy-Preserving Diagnostics: AI on Medical Images Without Exposing the Patient
The hardest problem in medical AI is not the AI. It is the data. This is the part of our patent that comes straight out of my security career.
What "Allowed" Actually Means
People hear "patent" and assume there are two states: you have one or you do not. There is an in-between, and it is the moment that actually matters.
Why a Human Stays in the Loop
The most important design decision in our system is not a clever algorithm. It is the choice to keep a person in charge and to let the machine learn from them.