How Do We Know if an AI System Is Safe, Fair, Reliable, and Trustworthy?

One of the most consequential questions in technology governance today is also one of the most underexamined: how do we establish, and sustain, meaningful trust in automated decision-making systems, particularly when those systems affect the most fundamental aspects of people’s lives? In other words, how do we know if an AI System Is safe, fair, reliable, and trustworthy? Trustworthiness, in this context, must be demonstrated through the structures, processes, and accountability mechanisms that govern how a system operates in practice.

Australia’s Robodebt programme offers one of the most instructive public case studies of what happens when those structures are absent. Between 2016 and 2019, the Australian Government’s Department of Human Services used an automated income-averaging algorithm to identify alleged overpayments in welfare benefits and issue debt notices to recipients. The system compared welfare payment records against averaged annual income data held by the Australian Taxation Office. Where a discrepancy was identified, it automatically generated a debt, without the manual verification processes that had previously been required under human-led assessments.

The consequences were severe. Approximately 381,000 individuals received debt notices, many of which were incorrect. The system had, in effect, shifted the burden of proof from the government to the individual: recipients were required to disprove debts the algorithm had generated, often without access to the records or explanations that would allow them to do so. AUD $746 million was wrongfully recovered before the programme was halted. Subsequent legal proceedings resulted in a class action settlement, and the programme became the subject of a formal Royal Commission, which reported in 2023.

The Royal Commission’s findings are worth examining carefully, because they go beyond the question of algorithmic error. The Commission found that the programme was unlawful from its inception, income averaging, as used, had no valid legal basis for establishing a debt. It also found that senior officials were aware of legal concerns and that these concerns were not adequately escalated or acted upon. Critically, the programme lacked effective mechanisms for affected individuals to understand how decisions had been reached, or to access genuine human review of their cases. Contestability was, for many, procedurally available but practically inaccessible.

These findings point to something important for anyone working in AI governance, digital public services, or institutional accountability. The failure was not primarily a technical one. The algorithm functioned as designed. The failure was governance: the absence of adequate human oversight, the suppression of legal scrutiny, the weak validation of assumptions, and the lack of meaningful recourse for affected individuals. When automated systems operate at scale, processing hundreds of thousands of cases in the time it would take human caseworkers to review a fraction of them, governance gaps do not remain contained. They propagate at the same speed and volume as the system itself.

This is the governance lesson that applies directly to AI deployment today. A system may be technically sound and operationally efficient while still producing outcomes that are unjust, unlawful, or harmful, particularly if the conditions under which it operates have not been rigorously validated, and if the people it affects have no effective means of challenge. Efficiency and accountability are not in tension; they are both necessary conditions for a system that can be legitimately trusted.

For organisations and governments deploying automated or AI-powered decision-making systems, the practical implications are clear. Deployment should be accompanied by continuous monitoring of outcomes across affected populations, not simply performance monitoring of the system itself. Assumptions embedded in system design, including the legal and evidential basis for decisions, require ongoing scrutiny, not just initial approval. Individuals affected by automated decisions should have access to meaningful explanations and to genuine human review, not procedural pathways that exist on paper but are difficult to navigate in practice. And internal governance must create the conditions in which concerns, including legal and ethical concerns, can be raised and properly considered, rather than set aside.

Robodebt was a systemic failure that emerged from the interaction of institutional pressures, inadequate oversight, and the pace at which automated systems can generate consequential decisions at scale. They are the central challenge of responsible deployment. Trustworthy AI is built incrementally, through the quality of the governance structures that surround it. That is the standard against which institutions should be measured, and against which they should hold themselves.

#AIGovernance #ResponsibleAI #DigitalGovernance #PublicPolicy #Accountability #AIPolicy

Institute for AI Policy & Governance