The Monitoring Infrastructure Human Radiology Built

Artificial Intelligence Industry Investigation

The Monitoring Infrastructure Human Radiology Built — and the Veterinary Void It Exposes

In May 2026, the American College of Radiology launched Assess-AI, which it describes as the world’s first AI quality registry for medical imaging, alongside a formal ACR-SIIM Practice Parameter for Imaging AI. The registry monitors the real-world performance of clinical imaging AI after deployment — detecting the performance drift and divergence-from-marketing that the human-side profession has formally acknowledged is routine. The United Kingdom’s Royal College of Radiologists has built parallel monitoring infrastructure. The human-medicine professional bodies recognized the problem and built the cure. The veterinary profession’s own specialty bodies issued a near-identical diagnosis in their 2025 position statement on veterinary AI — and built nothing.

VeterinaryTeleradiology.com Editorial Staff  ·  May 2026  ·  Estimated read: 24 minutes

On May 5, 2026, at its annual meeting in Washington, DC, the American College of Radiology Council approved the ACR-SIIM Practice Parameter for Imaging Artificial Intelligence and announced that the ACR Data Science Institute had published, in the Journal of the American College of Radiology, the technical framework for Assess-AI — a national quality registry that the ACR describes as the world’s first AI quality registry for medical imaging. The registry had first launched operationally on November 18, 2024; the May 2026 announcement and the peer-reviewed publication formalized and documented an infrastructure that had by then been running for eighteen months.

The significance of Assess-AI for veterinary medicine is not that it monitors veterinary AI. It does not. Assess-AI is a human-medicine system, built by the human-medicine radiology profession, monitoring human-medicine imaging AI. The word “veterinary” does not appear in its technical framework. Its use cases are human conditions — intracranial hemorrhage, pulmonary embolism, pneumothorax, large vessel occlusion. Its regulatory context is the U.S. Food and Drug Administration’s authorization framework for AI-enabled medical devices, a framework that does not apply to veterinary AI at all.

The significance is the contrast. Assess-AI is documentary proof of what a radiology profession does when it takes seriously the proposition that clinical AI must be monitored after deployment. It built a national registry. It built a data-ingestion platform. It built analytics dashboards and benchmarking infrastructure and local case-review tools. It published the technical framework in a peer-reviewed journal. It approved a formal practice parameter governing how AI should be selected, evaluated before deployment, and monitored after. It created a recognized-facility designation for institutions that comply. The human-medicine radiology profession, having concluded that deployed AI requires structured oversight, built the structures.

The veterinary radiology profession reached a materially similar conclusion. In 2025, the American College of Veterinary Radiology and the European College of Veterinary Diagnostic Imaging jointly issued a position statement on artificial intelligence in veterinary diagnostic imaging and radiation oncology, published in the Journal of the American Veterinary Medical Association. That statement concluded that commercially available veterinary AI products do not currently meet the transparency and validation standards the profession requires, and it called for rigorous peer-reviewed research, unbiased third-party evaluation, and board-certified veterinary radiologists remaining in the diagnostic loop. The veterinary profession, through its two principal radiology specialty bodies, formally diagnosed the same problem the human side diagnosed: deployed clinical AI cannot be presumed to perform as marketed, and requires rigorous, independent oversight.

Then the veterinary profession built nothing. There is no veterinary Assess-AI. There is no veterinary national AI registry. There is no veterinary equivalent of the ACR-SIIM Practice Parameter, no veterinary recognized-facility designation, no veterinary post-deployment monitoring infrastructure of any kind. The diagnosis was issued and the prescription was written. The prescription was never filled.

This article documents that contrast in detail: what the ACR built and why, what the parallel infrastructure in the United Kingdom looks like, what the veterinary profession’s own position statement said, and what the structural consequences are of the veterinary profession having reached the human side’s conclusion without taking the human side’s action. It is a reference investigation, sourced to the ACR’s peer-reviewed technical framework, the ACR’s own announcement of the practice parameter, the Royal College of Radiologists’ published monitoring guidance, and the ACVR/ECVDI position statement. It names commercial veterinary AI products as examples where this publication’s prior coverage has already documented them, and it cross-references that prior coverage throughout.

Assess-AI: A National Registry for Detecting What Happens to Clinical AI After It Is Deployed

The Problem Assess-AI Was Built to Solve

The ACR’s published technical framework is direct about why the registry exists. The framework states that, although the number of FDA-authorized AI products has grown rapidly and imaging AI is now integrated into routine clinical workflows at a substantial proportion of healthcare facilities, “many institutions report that real-world AI performance often falls short of the metrics demonstrated during the authorization process,” and that this shortfall limits clinician trust and increases downstream clinical workload.

The framework identifies the specific mechanisms. AI models are, in the framework’s words, “notoriously fragile and prone to issues like performance degradation when local data is different from training data, including variations in scanner manufacturers or patient demographics.” Off-label use — applying a model outside its authorized indications — can produce worse-than-expected performance. And FDA-authorized models are generally locked: they cannot be retrained or fine-tuned without additional regulatory authorization, except within the scope of an authorized Predetermined Change Control Plan, which means a model that begins to drift cannot simply be quietly corrected. The framework concludes that detecting and addressing performance drift “is critical to maintaining clinical reliability,” and that “without mechanisms for robust monitoring, the risk of sub-optimal performance by these models remains a pressing concern.”

This is a remarkable set of admissions for a profession’s flagship body to publish in its own journal. It is the human-medicine radiology establishment stating, in peer-reviewed print, that FDA authorization is not a guarantee of real-world performance; that AI models degrade when they encounter data different from their training distribution; that the same model performs differently at different facilities and on different patient populations; and that none of this can be managed without infrastructure built specifically to detect it. Every one of those statements has a veterinary analog, and the veterinary analogs are in several respects more severe — a point this article returns to in detail below.

How the Registry Works

Assess-AI operates within the ACR’s National Radiology Data Registry, the established multi-registry infrastructure through which thousands of U.S. imaging facilities already submit quality data. Any U.S. imaging facility with an imaging AI model deployed in a real-world clinical setting is eligible to participate, and the ACR has noted that the more than 5,000 facilities already participating in at least one National Radiology Data Registry program are eligible to participate in Assess-AI under their existing agreements.

Participating facilities submit three categories of de-identified data through ACR Connect, the ACR’s locally-installed software platform that connects to a facility’s radiology information system, picture archiving and communication system, and electronic health record. The three categories are: the imaging AI outputs produced in the clinical workflow; the de-identified text of the radiology report for the exam; and the imaging study metadata drawn from DICOM headers, including patient age and sex. The framework notes that the registry does not currently collect source image pixel data or patient-specific health information — the de-identification is handled locally, before transmission, and only the study date of service is retained as protected health information.

What the registry does with that data is the analytically important part. It computes concordance — agreement — between the AI model’s output and a surrogate label extracted from the radiology report. Because re-reviewing every study against a fresh expert-annotated reference standard would not scale, the registry instead uses large language model prompting pipelines to extract, from the text of the report the interpreting radiologist already wrote, a structured label indicating whether the target finding was present or absent. The AI output and the report-derived surrogate label are then compared. When they agree, the case is concordant; when they disagree, it is discordant. Monthly concordance rates are computed and plotted over time, with confidence intervals, against registry benchmarks.

The ACR is careful — and this care is itself instructive — about what concordance does and does not mean. The framework states explicitly that the report-derived labeling method produces concordance that “represents alignment with the radiologist’s report (not equivalent to clinical accuracy).” It is not a pathology-confirmed ground truth. It is a measure of whether the AI agrees with the radiologist, not a measure of whether either the AI or the radiologist is correct. The framework acknowledges the limitations of this approach directly and at length: radiology reports contain hedging and diagnostic uncertainty, so surrogate labels extracted from them may themselves be uncertain; the large language models used for extraction are subject to drift and require ongoing validation; and incorporation bias is a concern, because if the radiologist viewed the AI output before writing the report, the report is no longer independent of the AI result.

This combination — a pragmatic, scalable surrogate paired with explicit, published acknowledgment of exactly what the surrogate cannot establish — is a model worth naming clearly. The ACR did not wait for a perfect monitoring methodology before building monitoring infrastructure. It built a good-enough system, documented its limitations honestly, and committed to improving it. The alternative — building nothing until a perfect system is available — is the de facto position of the commercial veterinary AI sector, and the ACR’s example demonstrates that the alternative is a choice, not a necessity.

What the Registry Found — and Why It Matters

The ACR’s published framework includes early results that, although presented as an infrastructure overview rather than a definitive performance evaluation, demonstrate exactly why post-deployment monitoring is necessary. At the time of the analysis, the registry held data from 16 participating facilities, representing 48,371 patients and 62,835 imaging studies.

For the intracranial hemorrhage use case, the ACR fit an exploratory mixed-effects logistic regression to examine what factors were associated with concordance. The result: patient sex, patient age, and scanner manufacturer were all significantly associated with AI concordance, each at a p-value below 0.0001. Increasing patient age was associated with lower concordance. Female patients showed modestly higher concordance than male patients. And the scanner manufacturer — the brand of the equipment producing the image — was significantly associated with whether the AI agreed with the radiologist. Beyond all of those measured factors, the framework reports that “the variability of the facility random intercept indicated meaningful between-facility differences in baseline performance not explained by patient age, sex, or scanner manufacturer.”

Read that finding plainly: the same category of AI model, monitored across real facilities, performed measurably differently depending on the patient’s age and sex, depending on which manufacturer’s scanner produced the image, and depending on the facility itself in ways that could not be fully explained even after accounting for those variables. This is precisely the performance variation the framework warned about — fragility to local data variation, divergence across deployment settings — and it was detectable only because the monitoring infrastructure existed to detect it. Without Assess-AI, every one of those facilities would have continued using its AI model with no way of knowing that performance varied this way.

The same category of AI model performed measurably differently depending on patient age, patient sex, scanner manufacturer, and the facility itself. That variation was detectable only because the infrastructure existed to detect it.

The Governance Loop

Detecting a performance signal is only useful if something can be done about it. The Assess-AI framework describes what the ACR calls a complete governance loop. When a facility’s concordance for a given AI model drops — falling below the registry benchmark, or below a confidence interval, or below the facility’s own historical performance — the facility’s dashboard surfaces the signal. A local administrator can then use the Forensics App, a component of ACR Connect that runs locally at the facility, to securely re-identify the discordant cases and review them: the imaging study, the radiology report, the AI output, and the surrogate label, all together, so a reviewer can determine why the case was discordant. Structured feedback from that local review is returned to the registry.

The framework notes that the corrective actions a facility might take after such a review include updating scanner protocols, modifying data pipelines, re-training personnel, or contacting the AI model’s manufacturer. The loop is closed: monitoring detects the signal, local review investigates the cause, and corrective action addresses it — with the registry capturing the feedback so that the monitoring improves over time.

The ACR-SIIM Practice Parameter for Imaging AI, approved at the same May 2026 meeting, formalizes the surrounding governance expectations. Per the ACR’s announcement, the practice parameter applies to physicians, technologists, medical physicists, informatics teams, data scientists, and administrators who deploy or use AI in imaging workflows, and it covers AI tool selection, pre-deployment evaluation, ongoing performance monitoring, and patient privacy protection. Facilities that implement AI in accordance with the parameter can earn the ACR Recognized Center for Healthcare-AI designation. The registry is the measurement instrument; the practice parameter is the governance standard; the recognized-facility designation is the incentive. Together they constitute an integrated system.

The United Kingdom Built It Too: Post-Deployment AI Monitoring Is Becoming the Human-Medicine Standard

The Royal College of Radiologists’ Parallel Infrastructure

The ACR is not alone, and that fact matters for understanding whether post-deployment AI monitoring is a fringe idea or an emerging standard. In the United Kingdom, the Royal College of Radiologists has published guidance titled “Post-deployment monitoring and safety reporting of AI medical imaging devices in clinical practice,” and has developed an RCR AI registry.

The RCR guidance draws a careful three-part distinction that is worth reproducing because it clarifies the conceptual landscape. Evaluation, in the RCR’s framework, is the assessment of an AI medical device before or at the point of deployment, to confirm expected performance. Auditing is the structured, periodic review of outputs against reference standards, to verify ongoing accuracy and compliance. Monitoring is the continuous or routine observation of operational performance and safety indicators, to detect change over time. The RCR treats all three as complementary but distinct activities that together form a complete oversight practice. The guidance recommends tracking AI-human disagreement rates, liaising with the vendor when AI performance changes so the model can be recalibrated or retrained, and sharing audit findings across National Health Service networks so that performance trends are spotted early across institutions rather than discovered in isolation.

The convergence is the point. Two of the most established radiology professional bodies in the world — one in the United States, one in the United Kingdom — independently reached the same conclusion: that deployed clinical AI requires structured, ongoing, post-deployment oversight, and that the profession itself must build the infrastructure to deliver it. They did not wait for regulators to mandate it. They did not wait for vendors to volunteer it. The professional bodies built it themselves, because the professional bodies concluded it was part of the responsible practice of radiology.

This transatlantic convergence forecloses one possible defense of the veterinary profession’s inaction. It is not possible to argue that post-deployment AI monitoring is an unproven, experimental, or premature idea that the veterinary profession is reasonably waiting to mature. It has matured. It is being implemented at national scale on two continents by the human-medicine radiology establishment. For veterinary medicine, post-deployment AI monitoring is not a frontier that has not yet been reached. It is a gap that has not been filled.

The 2025 ACVR/ECVDI Position Statement: The Veterinary Profession Identified the Problem in Writing

What the Veterinary Radiology Bodies Said in 2025

The veterinary radiology profession cannot claim it was unaware of the problem. In 2025, the American College of Veterinary Radiology and the European College of Veterinary Diagnostic Imaging — the two principal veterinary radiology specialty bodies, the direct counterparts of the ACR and the RCR — jointly issued a position statement on artificial intelligence in veterinary diagnostic imaging and radiation oncology, published in the Journal of the American Veterinary Medical Association.

The position statement’s conclusions were not equivocal. It concluded that commercially available veterinary AI products do not currently meet the transparency, validation, and evidence standards that the profession requires in order to rely on them. It called for rigorous peer-reviewed research validating veterinary AI products. It called for unbiased third-party evaluation — evaluation independent of the vendors selling the products, rather than vendor-funded or vendor-controlled validation. And it called for board-certified veterinary radiologists remaining in the diagnostic loop, rather than being displaced by AI systems operating without specialist oversight.

That is, in substance, the veterinary radiology profession’s two flagship specialty bodies stating formally that the commercial veterinary AI products currently being marketed to and adopted by veterinary practices have not been validated to the standard the profession’s own experts consider necessary — and calling for exactly the kind of rigorous, independent, ongoing evaluation that, on the human side, produced Assess-AI and the RCR registry.

This publication has documented the commercial veterinary AI landscape that the ACVR/ECVDI statement was responding to. The Phantom Radiologists series examined the foundational training-data claims of commercial veterinary AI vendors including SignalPET, Vetology, and Antech RapidRead, and calculated that the “trained on millions of specialist-reviewed cases” marketing claims cannot be reconciled with the documented size of the veterinary radiology specialist workforce. The analysis of AI primary reads and the veterinary regulatory gap documented that veterinary AI operates outside the FDA authorization framework, the state practice act frameworks, and the reimbursement-gatekeeping mechanisms that constrain AI on the human side. The examination of institutional inaction by the ACVR, AVMA, and AAHA documented the pattern of professional bodies issuing statements without building enforcement or oversight mechanisms. The ACVR/ECVDI position statement is the veterinary radiology profession’s formal acknowledgment of the problem. Assess-AI is what the acknowledgment looks like when it is followed by construction.

The Diagnosis Without the Cure

Place the two professions side by side. Both have flagship radiology specialty bodies. Both of those bodies concluded, in formal published statements, that deployed clinical AI cannot be presumed to perform as marketed and requires rigorous, independent, ongoing evaluation. On the human side, that conclusion was followed by the construction of Assess-AI, the ACR Connect data platform, the DART analytics platform, the Forensics App, the ACR-SIIM Practice Parameter, and the ARCH-AI recognized-facility designation — an integrated, national, peer-reviewed-and-documented oversight infrastructure. On the veterinary side, the conclusion was followed by nothing of the kind.

The veterinary profession issued the diagnosis and did not build the cure. And the veterinary diagnosis, properly understood, describes a more dangerous condition than the human one — because the veterinary AI products the ACVR/ECVDI statement was evaluating were never subject to the FDA pre-market authorization process in the first place. On the human side, the monitoring infrastructure is a second layer of assurance on top of a regulatory floor. On the veterinary side, there is no regulatory floor, and now there is no monitoring layer either. The veterinary profession identified an unguarded gap and then left it unguarded.

Every Structural Condition That Justified Assess-AI Is Present in Veterinary AI — Several of Them More Severely

The Conditions Transfer, and They Intensify

The ACR’s framework justified Assess-AI by reference to a specific set of structural conditions. Each of those conditions has a veterinary analog, and in several cases the veterinary version is materially worse. The point of this section is not abstract argument; it is to walk through the ACR’s own stated rationale, condition by condition, and show that the rationale applies to veterinary medicine with at least equal and often greater force.

Condition One: Real-World Performance Diverges From Pre-Market Claims

The ACR built Assess-AI partly because real-world AI performance often falls short of the metrics demonstrated during FDA authorization. On the human side, there is at least a pre-market metric to fall short of — the FDA authorization process produces a documented performance baseline against which real-world divergence can be measured. Veterinary AI has no such process. As this publication documented in its coverage of the veterinary AI regulatory gap, commercial veterinary AI products are not subject to FDA pre-market authorization. There is no required, standardized, independently reviewed pre-market performance baseline for a veterinary AI radiology product. The marketing claim is, very often, the only performance figure that exists. On the human side, monitoring detects divergence from a regulated baseline. On the veterinary side, there is no baseline — which means monitoring is not a second layer of assurance, it is the only possible layer.

Condition Two: Models Are Fragile to Local Data Variation

The ACR cited AI fragility to variation in scanner manufacturers and patient demographics. The veterinary version of this condition is more severe for a reason this publication has documented at length in the Phantom Radiologists infrastructure analysis: breed-specific anatomic variation. Human chest radiograph AI does not have to contend with the difference between a Dachshund’s thorax and a Great Dane’s thorax. Veterinary AI does. The canine species alone spans a range of body conformations more anatomically diverse than the entire human population. A veterinary AI model trained predominantly on one set of breeds and deployed on a practice population skewed toward different breeds is encountering exactly the local-data-variation fragility the ACR warned about — amplified by a degree of anatomic heterogeneity that has no human-medicine equivalent. The ACR’s own data showed that scanner manufacturer alone significantly affected concordance; veterinary AI faces scanner variation plus breed variation plus the species variation between canine and feline patients.

Condition Three: Off-Label Use Produces Worse-Than-Expected Performance

The ACR cited off-label use — applying a model outside its authorized indications — as a performance risk. On the human side, there are authorized indications: the FDA authorization specifies what the model is cleared to do, which makes off-label use at least definable. Veterinary AI products are not authorized for specific indications by any regulator. The concept of “off-label” presupposes a label. Where the boundaries of a veterinary AI product’s validated performance lie — which species, which breeds, which body regions, which presentations, which equipment — is, in the absence of regulatory definition and in the absence of the rigorous third-party evaluation the ACVR/ECVDI statement called for, frequently undefined. A veterinary practice cannot keep a model within its validated boundaries when those boundaries have not been established.

Condition Four: Locked Models Cannot Adapt to Drift

The ACR noted that FDA-authorized models are generally locked and cannot be retrained without further authorization, which makes drift detection critical because drift cannot simply be silently corrected. The veterinary situation is different in a way that cuts both directions and is, on balance, worse. Veterinary AI models are not subject to the FDA’s lock; a vendor can in principle update a veterinary model at any time. But that is not reassurance — it is the opposite. On the human side, a locked model at least cannot change without a documented, authorized process, so the facility knows the model is stable between authorized updates. A veterinary AI model can change underneath the practices using it, with no regulatory documentation, no version control visible to the customer, and — critically — no monitoring registry that would detect the resulting performance change. The human side has locked models plus drift monitoring. The veterinary side has unlocked models and no drift monitoring. The combination of mutability and invisibility is the worst of the available configurations.

Condition Five: The Same Model Performs Differently Across Facilities

The ACR’s exploratory analysis found meaningful between-facility performance differences not explained by patient age, sex, or scanner manufacturer. There is no structural reason to expect veterinary AI would be exempt from comparable between-facility variation — veterinary practices differ in equipment, in technique, in patient population, in case mix, in image quality. But on the veterinary side, that variation is entirely invisible. A veterinary practice using a commercial AI radiology product has no registry benchmark, no peer comparison, no facility-level concordance trend, and no mechanism by which it would ever learn that the product performs differently in its practice than the vendor’s marketing implied. The between-facility variation the ACR can now see and investigate is, in veterinary medicine, simply happening in the dark.

The Core Asymmetry

On the human side, post-deployment AI monitoring is a second layer of assurance built on top of an FDA pre-market regulatory floor. On the veterinary side, there is no regulatory floor — and now no monitoring layer either. Every structural condition the ACR cited to justify building Assess-AI is present in veterinary AI, and the conditions that involve regulatory baselines, defined indications, and model stability are all worse in veterinary medicine because the regulatory scaffolding the human-side monitoring complements does not exist at all.

The Comparison, Summarized

Oversight ElementHuman RadiologyVeterinary Radiology
Pre-market regulatory authorization of AI productsYes — FDA AI/ML device authorizationNo — no FDA pathway applies
National post-deployment AI monitoring registryYes — ACR Assess-AI (and RCR registry in the UK)No — none announced or published
Formal professional-society AI practice parameterYes — ACR-SIIM Practice Parameter for Imaging AINo equivalent practice parameter
Recognized-facility designation for AI governanceYes — ACR ARCH-AI designationNo equivalent designation
Profession formally identified the AI validation problemYesYes — 2025 ACVR/ECVDI position statement
Profession built infrastructure addressing the problemYesNo
Facility-level concordance benchmarking against peersYes — via Assess-AI dashboardsNo mechanism exists
Structured local discordant-case review workflowYes — ACR Forensics AppNo equivalent
Peer-reviewed publication of the monitoring frameworkYes — JACR, 2026No framework to publish

The table’s penultimate row is the one that should give the veterinary profession pause. On the question of whether the profession identified the AI validation problem, both columns read “yes” — the veterinary profession, through the ACVR/ECVDI position statement, did identify it. It is the final row, whether the profession built infrastructure addressing the problem, where the columns diverge. The veterinary profession’s failure is not a failure of awareness. It is a failure of action after awareness.

A Veterinary Assess-AI Is Technically Buildable — The Barrier Is Institutional Will

Nothing About the Architecture Is Human-Medicine-Specific

It would be a mistake to conclude from this article that a veterinary monitoring registry is impossible because veterinary medicine lacks some essential ingredient the human side possesses. The core architecture of Assess-AI is not human-medicine-specific. Consider its components in turn. De-identified data ingestion from practice imaging systems: veterinary practices run picture archiving and communication systems and practice management systems from which de-identified AI outputs, report text, and DICOM metadata could be drawn, exactly as on the human side. Normalization of AI outputs across vendors: the veterinary AI market has a finite, enumerable set of commercial vendors whose outputs could be mapped to standard values. Surrogate label extraction from report text using large language models: veterinary radiology reports are text documents from which a large language model can extract present/absent labels using precisely the prompting methodology the ACR describes. Central concordance computation, benchmarking dashboards, local case-review tools: none of these is biologically specific to humans.

The veterinary profession also has the institutional bodies that could host such a registry. The ACVR is an established specialty organization. The veterinary informatics community has technical capacity. The DICOM standard already accommodates veterinary terminology through a dedicated working group. The data infrastructure question for a veterinary monitoring registry is a question of assembly and resourcing, not of invention.

There are genuine veterinary-specific complications, and an honest account names them. The breed and species heterogeneity that makes veterinary AI fragile also makes veterinary benchmarking harder — a concordance benchmark may need to be stratified by species and potentially by breed group to be meaningful. The veterinary radiology specialist workforce that would adjudicate discordant cases is, as this publication has documented extensively, small relative to clinical demand. Veterinary practices are more fragmented and less likely to participate in existing national quality registries than the 5,000-plus facilities already in the ACR’s National Radiology Data Registry. These are real design challenges. But they are design challenges of the kind the ACR itself navigated — the ACR’s framework is candid that its own surrogate-labeling approach is imperfect and that its benchmarking does not yet adjust for case mix. The ACR did not let imperfection prevent construction. It built the imperfect-but-useful system and committed to improving it.

The barrier, then, is not technical feasibility. It is institutional will and resource commitment. The ACR chose to commit the resources. The Royal College of Radiologists chose to commit the resources. The veterinary professional bodies have not made the equivalent choice — and the 2025 ACVR/ECVDI position statement proves the choice has not been made for lack of awareness, because the awareness is on the record.

The Bottom Line

In 2026 the American College of Radiology launched Assess-AI, which it describes as the world’s first AI quality registry for medical imaging, and approved a formal ACR-SIIM Practice Parameter for Imaging AI. The registry monitors the real-world, post-deployment performance of clinical imaging AI — detecting the performance drift, the divergence from pre-market claims, and the facility-to-facility variation that the ACR’s own peer-reviewed framework acknowledges is routine. The United Kingdom’s Royal College of Radiologists has built parallel monitoring infrastructure. Post-deployment AI monitoring is now an emerging standard of responsible practice in human radiology, implemented at national scale on two continents.

The veterinary radiology profession, through the 2025 ACVR/ECVDI position statement on artificial intelligence in veterinary diagnostic imaging, reached a materially similar conclusion: that commercially available veterinary AI products do not meet the profession’s required transparency and validation standards, and that rigorous, independent, ongoing evaluation is needed. The veterinary profession issued that diagnosis and built no corresponding infrastructure. There is no veterinary Assess-AI, no veterinary AI registry, no veterinary AI practice parameter, no veterinary recognized-facility designation, no veterinary post-deployment monitoring of any kind.

Every structural condition the ACR cited to justify Assess-AI is present in veterinary AI, and the conditions involving regulatory baselines, defined indications, and model stability are worse in veterinary medicine, because veterinary AI is not subject to the FDA pre-market authorization the human-side monitoring complements. The veterinary version of the problem is more severe, and the veterinary response to it is less. The architecture of a veterinary monitoring registry is technically buildable; the components are not human-medicine-specific; the hosting institutions exist. What is missing is institutional will. The human-medicine professional bodies built the monitoring infrastructure their AI position demanded. The veterinary professional bodies, having issued a comparable position, did not — and until they do, every commercial veterinary AI radiology product deployed in practice is operating exactly the way the human-side profession decided was no longer acceptable: unmonitored, unbenchmarked, and presumed to perform as marketed.


Frequently Asked Questions

What is the ACR Assess-AI registry?

Assess-AI is a national quality registry created by the American College of Radiology to monitor the real-world, post-deployment performance of clinical imaging artificial intelligence models. It operates within the ACR National Radiology Data Registry. Participating facilities submit three categories of de-identified data through the ACR Connect platform: imaging AI outputs produced in the clinical workflow, de-identified radiology report text, and DICOM study metadata. The ACR computes concordance between the AI outputs and surrogate labels extracted from the radiology reports using large language model prompting pipelines, and delivers results to facilities through interactive dashboards. Facilities can compare their local AI model performance against registry benchmarks and against peer institutions using the same vendor and model, and can investigate individual discordant cases locally using the Forensics App. The registry first launched on November 18, 2024, and was formally presented alongside the ACR-SIIM Practice Parameter for Imaging AI at the ACR 2026 annual meeting in May 2026, where the ACR described it as the world’s first AI quality registry for medical imaging. The technical framework was published in the Journal of the American College of Radiology in 2026. Assess-AI currently monitors models for intracranial hemorrhage, pulmonary embolism, pneumothorax, large vessel occlusion, bone age, cervical spine fracture, breast density, pneumoperitoneum, tube malposition, pleural effusion, brain mass effect, and obstructive hydrocephalus.

Why did the American College of Radiology build a post-deployment AI monitoring registry?

The ACR built Assess-AI because real-world AI performance routinely diverges from the performance demonstrated during pre-market regulatory authorization, and because AI models drift over time. The published technical framework states directly that many institutions report real-world AI performance often falls short of the metrics demonstrated during the FDA authorization process, that AI models are fragile and prone to performance degradation when local data differs from training data, and that off-label use of models outside their authorized indications can produce worse-than-expected performance. FDA-authorized models are generally locked and cannot be retrained without additional regulatory authorization, which limits the ability to adapt to performance drift. The ACR concluded that without mechanisms for robust post-deployment monitoring, the risk of suboptimal AI performance in clinical settings remains a pressing concern. The registry exists to detect that divergence — to give facilities the ability to see when a deployed AI model is underperforming relative to benchmark, relative to peers, or relative to its own past performance, and to investigate why. The same structural conditions the ACR identified as justifying the registry — performance divergence from pre-market claims, model drift, fragility to local data variation, off-label use risk — are present in veterinary AI as well, and arguably more acute, because veterinary AI is not subject to the FDA pre-market authorization process that produces the baseline performance metrics in the first place.

Does veterinary medicine have anything equivalent to Assess-AI?

No. As of this article’s publication, no veterinary professional body — not the American College of Veterinary Radiology, not the European College of Veterinary Diagnostic Imaging, not the American Veterinary Medical Association, not the American Animal Hospital Association — has announced or published any post-deployment AI performance monitoring registry comparable to Assess-AI. There is no veterinary national registry that collects deployed AI outputs, computes concordance against radiologist interpretation, benchmarks facilities against peers, or provides facilities with the infrastructure to detect AI performance drift in clinical use. This absence is structurally significant because the veterinary profession’s own professional bodies have formally identified the problem that monitoring infrastructure addresses. The 2025 ACVR/ECVDI position statement on artificial intelligence in veterinary diagnostic imaging concluded that no commercially available veterinary AI product meets the required transparency and validation standards, and called for rigorous peer-reviewed research, unbiased third-party evaluation, and board-certified radiologist involvement. The position statement issued the diagnosis. No corresponding infrastructure was built. The human-side professional bodies, having reached the same diagnosis, built Assess-AI and the ACR-SIIM Practice Parameter. The veterinary profession issued the statement and stopped.

What did the 2025 ACVR/ECVDI position statement on veterinary AI actually say?

The 2025 position statement on artificial intelligence in veterinary diagnostic imaging and radiation oncology, issued jointly by the American College of Veterinary Radiology and the European College of Veterinary Diagnostic Imaging and published in the Journal of the American Veterinary Medical Association, concluded that commercially available veterinary AI products do not currently meet the transparency, validation, and evidence standards required for the profession to rely on them. The statement called for rigorous peer-reviewed research validating veterinary AI products, unbiased third-party evaluation rather than vendor-funded or vendor-controlled validation, and board-certified veterinary radiologists remaining in the diagnostic loop rather than being replaced by AI. The position statement is the veterinary profession’s formal acknowledgment, through its two principal radiology specialty bodies, that the commercial veterinary AI products currently being marketed to and adopted by veterinary practices have not been validated to the standard the profession’s own experts consider necessary. The significance of the statement is not only what it says about veterinary AI products; it is what the profession did and did not do after issuing it. The statement identified the need for rigorous validation and monitoring. The infrastructure that would deliver rigorous validation and monitoring was not built. This article documents the contrast with the human-medicine professional bodies, which reached the same conclusion about the need for monitoring infrastructure and then built it.

How does the ACR registry handle the problem of not having a perfect reference standard?

Assess-AI uses what it calls a report-derived labeling method. Rather than re-reviewing every imaging study against an expert-annotated reference standard, the registry extracts surrogate labels from the text of the radiology report that the interpreting radiologist already produced, using large language model prompting pipelines. The registry then computes concordance — agreement — between the AI model’s output and the report-derived surrogate label. The ACR is explicit, in its published framework, that this concordance represents alignment with the radiologist’s report and is not equivalent to clinical accuracy or to a pathology-confirmed ground truth. The published framework acknowledges the limitations directly: surrogate labels extracted from report text may reflect interpretations that are inherently uncertain, because radiologists appropriately hedge in reports; the large language models used for extraction are themselves subject to drift and require ongoing validation; and incorporation bias is a concern, because if the interpreting radiologist saw the AI output before writing the report, the report is no longer independent of the AI result. The ACR registry is, in other words, a pragmatic system that trades the unattainable ideal of universal expert re-review for a scalable surrogate that is good enough to detect meaningful performance signals — while being transparent about exactly what the surrogate can and cannot establish. This pragmatic-but-transparent design is itself a model. It demonstrates that the absence of a perfect reference standard is not a valid reason to build no monitoring infrastructure at all, which is the de facto position of commercial veterinary AI.

Is the United States the only country building AI monitoring infrastructure for radiology?

No. The pattern is transatlantic. In the United Kingdom, the Royal College of Radiologists has published guidance on post-deployment monitoring and safety reporting of AI medical imaging devices in clinical practice, and has developed an RCR AI registry. The RCR guidance distinguishes among evaluation (assessment of an AI device before or at the point of deployment), auditing (structured periodic review of outputs against reference standards), and monitoring (continuous observation of operational performance and safety indicators to detect change over time), and treats all three as necessary complementary activities. The RCR guidance recommends tracking AI-human disagreement rates, liaising with vendors when AI performance changes, and sharing audit findings across the National Health Service networks to spot trends early. The convergence is the point: two of the most established radiology professional bodies in the world, on two continents, independently reached the conclusion that deployed clinical AI requires structured post-deployment monitoring, and independently built or are building the infrastructure to deliver it. This is not a fringe position or an experimental idea. It is becoming the standard of responsible practice in human radiology. The veterinary profession’s continued operation without any equivalent infrastructure is therefore not explained by the idea being new or unproven. It is a gap, not a frontier.

Why does the absence of veterinary AI monitoring matter for pet owners and veterinarians?

It matters because veterinary AI radiology products are being adopted into clinical workflows now, and without monitoring infrastructure, performance drift and underperformance are occurring undetected. The ACR’s published framework documents that real-world AI performance varies meaningfully by facility, by scanner manufacturer, and by patient demographics — variation that is detectable only when the monitoring infrastructure exists to detect it. In the ACR registry’s exploratory analysis of its intracranial hemorrhage use case, patient sex, patient age, and scanner manufacturer were all significantly associated with AI concordance, and meaningful between-facility performance differences remained even after accounting for those factors. If comparable variation exists in veterinary AI — and there is no structural reason to expect it would not, given that veterinary practices use diverse imaging equipment across diverse patient populations spanning dozens of breeds with substantial anatomic variation — that variation is currently invisible. A veterinary practice using a commercial AI radiology product has no benchmark to compare its results against, no peer comparison, no drift detection, and no structured pathway to investigate discordant cases. The veterinarian relying on the AI output, and the pet owner paying for the diagnostic interpretation, have no assurance that the product is performing in their practice the way the vendor’s marketing claims it performs. The human-side infrastructure exists precisely because that assurance cannot be presumed. The veterinary side operates on the presumption anyway.

Could the veterinary profession build something like Assess-AI?

There is no fundamental technical barrier. The core components of the ACR’s Assess-AI architecture are not human-medicine-specific. De-identified data ingestion from practice systems, normalization of AI outputs across vendors, surrogate label extraction from report text using large language models, central concordance computation, benchmarking dashboards, and local case-review tools are all technically transferable to veterinary radiology. The veterinary profession has the institutional bodies that could host such a registry — the ACVR operates within an existing organizational structure, and the broader veterinary informatics community has the technical capacity. What the human-side example demonstrates is that building the infrastructure is a question of institutional will and resource commitment, not technical impossibility. The ACR chose to invest in building Assess-AI, the ACR Connect platform, the DART analytics platform, and the surrounding governance framework. The Royal College of Radiologists chose to invest in its monitoring guidance and registry. The veterinary professional bodies have not made the equivalent choice. The 2025 ACVR/ECVDI position statement demonstrates that the veterinary radiology specialty bodies are aware of the problem and have articulated it formally. The gap between articulating the problem and building the infrastructure to address it is a gap of institutional commitment. This article documents that gap because the first step toward closing it is naming it clearly: the human-medicine professional bodies built the monitoring infrastructure their AI position demanded, and the veterinary professional bodies, having issued a comparable position, did not.


Artificial Intelligence Investigation Industry Post-Deployment Monitoring ACR Assess-AI AI Quality Registry Clinical Imaging AI AI Performance Drift ACR-SIIM Practice Parameter ARCH-AI Designation ACR Data Science Institute National Radiology Data Registry Royal College of Radiologists RCR AI Registry ACVR ECVDI ACVR ECVDI Position Statement Veterinary AI Veterinary Radiology AI Veterinary AI Validation Veterinary AI Regulatory Gap FDA AI Authorization Concordance Monitoring Surrogate Label Extraction SignalPET Vetology Antech RapidRead AI Governance Professional Society Accountability Breed-Specific Anatomic Variation Veterinary Teleradiology Pet Owner Consumer Protection

Editorial & Sourcing Note. VeterinaryTeleradiology.com is an independent industry publication. This article is a reference investigation comparing the post-deployment artificial intelligence monitoring infrastructure built by the human-medicine radiology profession with the absence of equivalent infrastructure in veterinary medicine. The article’s account of the ACR Assess-AI registry is sourced to the peer-reviewed technical framework published in the Journal of the American College of Radiology (Coombs et al., 2026, DOI 10.1016/j.jacr.2026.04.024) and to the American College of Radiology’s own May 5, 2026 announcement of the ACR-SIIM Practice Parameter for Imaging AI. At the time of writing, the JACR article is available as an accepted, in-press journal pre-proof; readers should consult the journal for the final Version of Record, which may contain copyediting changes. The article’s account of the UK Royal College of Radiologists’ monitoring guidance is sourced to the RCR’s published guidance on post-deployment monitoring and safety reporting of AI medical imaging devices. The article’s account of the veterinary radiology profession’s position is sourced to the 2025 ACVR/ECVDI position statement on artificial intelligence in veterinary diagnostic imaging and radiation oncology, published in the Journal of the American Veterinary Medical Association.

The statement that no veterinary equivalent of the Assess-AI registry exists reflects the absence of any such registry announced or published by a veterinary professional body as of this article’s publication date. If a veterinary professional body has established or announced post-deployment AI monitoring infrastructure not reflected here, this publication will update the article upon documentation. Commercial veterinary AI products named in this article are named as examples consistent with this publication’s prior documented coverage, cross-referenced in the related coverage section; the article makes no new factual claims about specific named products beyond what that prior coverage documents.

This publication is not a regulatory body and does not provide regulatory or legal advice. The article’s structural comparison of human-medicine and veterinary-medicine AI oversight is intended to inform reader and professional consideration of the infrastructure gap it documents. The American College of Veterinary Radiology, the European College of Veterinary Diagnostic Imaging, the American Veterinary Medical Association, the American Animal Hospital Association, and any commercial veterinary AI vendor are invited to publish institutional response addressing the article’s documented observations. Any such response supported by documentary evidence will be published in full by this publication.

Scroll to Top