OPEN SIGNAL
The Long View ·

The Long View: The Trust Problem at the Heart of Artificial Intelligence

AI's biggest obstacle isn't capability — it's the fact that we have no reliable way to know when to believe what it tells us, and building that infrastructure matters more than building better models.

The Machine That Sounds Like It Knows

Ask a large language model a factual question and you will receive an answer delivered with the syntax of authority. The grammar is precise. The tone is measured. The structure follows the conventions of expert communication — qualifications where appropriate, nuance where expected, confidence where the training data suggests confidence is warranted.

None of this tells you whether the answer is correct.

This is the trust problem at the heart of artificial intelligence, and it is more consequential than any question about model capability, training efficiency, or deployment cost. We have built systems that produce outputs indistinguishable in form from expert human communication, but we have not built the infrastructure — technical, institutional, or cognitive — to help people calibrate their trust in those outputs appropriately.

The result is a technology that is simultaneously undertrusted and overtrusted, often by the same person in the same session. A lawyer uses an AI assistant to draft a brief and fails to catch fabricated case citations. A patient dismisses an AI-generated health summary that happens to be accurate because they distrust the source. A software engineer accepts buggy AI-generated code without review, while the engineer in the next seat refuses to use AI tools at all.

This is not a problem that better models will solve. GPT-5 or Claude 4 or Gemini Ultra may hallucinate less frequently, but the fundamental epistemic challenge remains: how do you know when to believe a system whose reliability varies unpredictably across domains, question types, and phrasings?

Getting this right is arguably more important than any advance in model capability. A moderately capable AI system that people trust appropriately — knowing when to rely on it and when to verify — is more valuable than a highly capable system that people either trust blindly or refuse to use. The history of transformative technologies suggests that trust infrastructure, not raw capability, determines whether a technology achieves its potential or stalls in a cycle of hype and backlash.

Why AI Trust Is Epistemically Different

Every tool humans have ever built has required some form of trust. You trust that your car’s brakes will work. You trust that the bridge will hold your weight. You trust that the medication your doctor prescribes has been tested for safety and efficacy.

But AI systems present an epistemic challenge that is categorically different from previous technologies, for reasons worth examining carefully.

The Opacity Problem

Most technologies that require trust are at least in principle inspectable. A mechanical engineer can examine a bridge’s structural design. A pharmacologist can analyze a drug’s molecular mechanism. Even complex systems like aircraft can be decomposed into subsystems that are individually testable and verifiable.

Modern AI models resist this kind of inspection. A large language model contains billions of parameters whose collective behavior produces outputs through processes that are not fully understood even by the researchers who built the system. Mechanistic interpretability research — the effort to understand what individual neurons and circuits in neural networks are doing — has made progress, but we remain far from being able to explain why a model produces a specific output in a specific context.

This means that trust in AI outputs cannot be grounded in understanding how those outputs were produced. It must be grounded in something else — empirical track records, statistical reliability measures, institutional validation, or some combination of these. Building that grounding is a challenge we have barely begun to address.

The Confidence-Competence Mismatch

Humans calibrate trust partly through confidence signals. When a person says “I’m not sure about this,” we adjust our confidence in their claim. When they speak with certainty, we grant more weight to their statement — particularly if they have demonstrated expertise in the past.

AI systems, as currently designed, do not produce well-calibrated confidence signals. A language model generating a hallucinated citation and a language model stating a well-established fact may use identical syntax and tone. The model does not experience uncertainty in the way humans do, and its outputs do not reliably encode the degree to which the underlying information is well-supported.

Some models now provide probability estimates or hedging language, but these remain crude instruments. A model might say “I believe” or “I’m not certain” as linguistic patterns learned from training data rather than as genuine reflections of epistemic state. The user has no reliable way to distinguish between the model hedging because the information is genuinely uncertain and the model hedging because hedging is a common pattern in its training data for that type of question.

The Domain Variance Problem

A human expert’s reliability is relatively consistent within their domain of expertise. A cardiologist’s medical advice is generally trustworthy for cardiac issues. You know the boundaries — you would not ask them to review your tax return.

AI systems have unpredictable reliability boundaries. A language model might be highly accurate on questions about well-documented historical events and wildly unreliable on questions about recent developments in a niche scientific field — but both answers will be presented with the same surface-level confidence. The user cannot easily determine where the model’s reliability drops off, because those boundaries are not stable or well-mapped.

This domain variance makes the trust problem fundamentally harder than trusting a human expert. With a human, you can assess credentials, check track records in specific domains, and calibrate based on their own expressed uncertainty. With a current AI system, the reliability landscape is complex, shifting, and largely invisible to the end user.

How Trust Was Built Before

The trust problem in AI is unprecedented in its specific form, but it is not the first time society has faced the challenge of building trust in a powerful, opaque, and potentially dangerous technology. The historical record offers instructive parallels.

Aviation: Trust Through Systematic Transparency

In the early decades of powered flight, public fear of aviation was widespread and justified. Planes crashed frequently. The causes were often mysterious. The technology was new enough that no one had a strong prior for how reliable it should be.

The aviation industry’s response, developed over decades, became the gold standard for building institutional trust in a dangerous technology. The framework rested on several pillars.

First, mandatory incident reporting. Every accident and serious incident was investigated by an independent body — in the United States, what is now the National Transportation Safety Board. Findings were made public. The culture of the industry shifted toward radical transparency about failure, driven by the recognition that hiding failures made everyone less safe.

Second, standardized certification. Aircraft designs, components, maintenance procedures, and pilot qualifications were all subject to detailed, public standards. These standards were developed through a process that combined engineering analysis, empirical testing, and operational experience. Compliance was verified through independent inspection.

Third, continuous monitoring. The introduction of the flight data recorder and cockpit voice recorder — the “black boxes” — in the 1960s created a feedback loop that allowed the industry to learn from both accidents and near-misses. This data-driven approach to safety improvement compounded over decades, producing the extraordinarily safe air travel system that exists today.

The lesson for AI is not that we need exact analogs of these mechanisms, but that trust in aviation was not built by making planes safer and then waiting for people to notice. It was built by creating institutions and processes that made safety visible, measurable, and accountable. The infrastructure of trust was deliberately designed and systematically implemented.

Medicine: Trust Through Controlled Validation

The history of pharmaceutical regulation offers another instructive parallel. Before the modern regulatory framework, the market for medicines was essentially unregulated. Vendors sold products with unverified claims about efficacy. Some worked. Many did not. Some were actively harmful.

The trust infrastructure that emerged — culminating in the FDA’s modern drug approval process in the United States and similar agencies worldwide — was built on the principle of controlled validation before deployment. Clinical trials, with their double-blind protocols, control groups, and statistical significance requirements, provided a mechanism for separating claims from evidence.

This process is slow, expensive, and imperfect. It sometimes excludes effective treatments and occasionally approves harmful ones. But it solved the core trust problem: it gave patients, doctors, and institutions a rational basis for believing that an approved medication was more likely to help than to harm, within quantified bounds of confidence.

The pharmaceutical model is particularly relevant to AI because it addresses a similar challenge: how to build appropriate trust in a system whose internal mechanisms are not fully understood by the people who use it. Most patients do not understand pharmacology. They trust their medications because institutions — regulatory agencies, medical schools, professional boards — have created a validation infrastructure that sits between the technology and the user.

The Internet: Trust Through Gradual Social Proof

The early commercial internet faced intense skepticism. The idea of entering a credit card number into a website seemed absurd to most people in the mid-1990s. Online shopping was widely predicted to remain a niche activity because consumers would never trust remote merchants they could not see or visit.

Trust in e-commerce was built through a combination of technical infrastructure (SSL encryption, later TLS), institutional guarantees (credit card chargeback protections, platform buyer-protection policies), and accumulated social proof (millions of successful transactions building a baseline of confidence). The process took roughly a decade from the first commercial browsers to the point where online shopping became routine.

Notably, trust in the internet was not built uniformly. People learned to trust certain types of transactions (buying books from Amazon) before others (sending money to a stranger on a peer-to-peer platform). Trust was domain-specific, context-dependent, and built incrementally through repeated positive experiences.

This graduated, uneven pattern of trust formation is likely the most realistic template for how trust in AI will develop.

The Calibration Challenge

The deepest aspect of the AI trust problem is not whether people trust AI too much or too little — it is that most people have no framework for calibrating their trust appropriately.

Calibration, in the epistemological sense, means having beliefs whose confidence levels match the actual reliability of the underlying evidence. A well-calibrated person who says they are 80 percent confident in a claim is correct about 80 percent of the time when they express that level of confidence.

Humans are notoriously poorly calibrated in general — we tend toward overconfidence in many domains — but we have developed heuristics and institutions that improve calibration in structured settings. Doctors learn, through training and experience, which symptoms are diagnostic and which are misleading. Weather forecasters, who receive immediate and unambiguous feedback on their predictions, are among the best-calibrated experts studied by psychologists.

AI systems create a calibration crisis because they strip away the cues that normally help people calibrate. When reading a human expert’s assessment, you can evaluate their credentials, consider their track record, assess their reasoning, note their expressed uncertainty, and check their claims against other experts. When reading an AI-generated response, most of these calibration mechanisms are absent or unreliable.

The result is a bimodal distribution of trust that serves no one well. Some users — particularly those who interact with AI frequently and have experienced its failures — develop a healthy skepticism but often over-correct, dismissing useful AI outputs along with unreliable ones. Other users — often those less experienced with the technology or those using it in domains where they lack independent expertise — accept AI outputs uncritically, sometimes with serious consequences.

Neither extreme represents appropriate calibration. The goal is not to trust AI more or less, but to trust it correctly — to have a mental model of its reliability that is accurate enough to guide decision-making.

Institutional Trust vs. Individual Trust

The trust challenge in AI has two distinct dimensions that are often conflated: individual trust (a person’s decision about whether to rely on a specific AI output) and institutional trust (society’s collective framework for governing AI deployment and validating AI performance).

The Limits of Individual Trust

Placing the burden of trust calibration on individual users is a strategy with well-understood limitations. Expecting every person who interacts with an AI system to independently assess the reliability of its outputs — effectively to serve as their own AI auditor — is unrealistic for the same reasons that we do not expect patients to independently validate their medications or airline passengers to inspect their aircraft.

Individual trust calibration requires domain expertise (knowing enough about the subject to evaluate the answer), technical literacy (understanding how the AI system works and where its failure modes lie), and continuous attention (checking each output rather than developing routine reliance). This combination is rare even among sophisticated users, and it is an impossible standard for the general population.

The aviation and medical parallels both suggest that trust in complex, consequential technologies ultimately depends on institutional structures — not because individuals cannot make good judgments, but because the cognitive load of constant verification is too high for daily use.

What Institutional Trust Infrastructure Looks Like

A functioning trust infrastructure for AI would likely need several components, none of which exist in mature form today.

Reliability benchmarking by domain. Just as structural engineering has safety factors and pharmaceuticals have efficacy standards, AI systems need domain-specific reliability metrics that are independently measured and publicly reported. Not a single benchmark score, but a detailed reliability profile: this system is highly reliable for standard medical coding queries, moderately reliable for differential diagnosis support, and not validated for treatment planning.

Audit and accountability mechanisms. When an AI system produces a harmful output — a misdiagnosis, a fabricated legal citation, a flawed financial analysis — there needs to be a clear process for investigating the failure, understanding its cause, and implementing systemic corrections. The aviation model of independent investigation and public findings is instructive here.

Graduated deployment standards. Not all AI applications carry the same risk. A chatbot that recommends restaurants and one that assists with clinical decisions should not face the same validation requirements. A risk-tiered regulatory framework — with minimal requirements for low-stakes applications and rigorous validation for high-stakes ones — would balance innovation with safety.

Transparency requirements scaled to stakes. For high-consequence AI applications, users and regulators need access to information about training data, known failure modes, performance metrics by domain, and the limitations of the system. This does not require full model transparency (which may not be technically feasible), but it does require meaningful disclosure beyond marketing materials and cherry-picked benchmarks.

Provenance and attribution infrastructure. As AI-generated content becomes ubiquitous, mechanisms for tracking the origin and reliability of information become critical. This is as much a social challenge as a technical one — it requires norms, standards, and tools for distinguishing AI-generated content from human-generated content, and for attributing AI outputs to specific systems with known reliability profiles.

The Speed Problem

There is a fundamental tension between the pace of AI development and the pace at which trust infrastructure can be built.

New AI models are released on a timeline measured in months. They are deployed to millions of users within weeks. Their capabilities change with each iteration, which means that any reliability assessment has a short shelf life — the system you tested is not the system currently in production.

Trust infrastructure, by contrast, takes years to develop. Regulatory frameworks require legislative action, public comment, and institutional capacity-building. Professional standards require consensus among practitioners. Audit methodologies require development, testing, and validation. Cultural norms around appropriate use require accumulated experience across millions of users.

This speed mismatch creates a persistent gap between AI capability and trust infrastructure. The technology advances faster than our ability to evaluate it, which means that for any given AI system, we are almost always in a period of insufficient institutional guidance about how much to trust it.

The gap is not unique to AI — it is a common feature of rapidly evolving technologies. Automobiles existed for decades before traffic laws, driver licensing, and vehicle safety standards caught up. Social media spread globally before content moderation norms and regulatory frameworks developed. But the speed of AI development compresses the timeline, making the gap between capability and governance wider and more consequential than previous technology transitions.

Why This Matters More Than Capability

The technology industry’s attention and investment are overwhelmingly focused on making AI systems more capable. Billions of dollars flow into training larger models, improving reasoning ability, expanding context windows, and adding new modalities. These are worthy engineering goals, and the progress has been remarkable.

But capability without calibrated trust produces a technology that society cannot use effectively. If people cannot determine when to rely on an AI system, they will either fail to use it when it would help them (the undertrust problem) or rely on it when it will fail them (the overtrust problem). Both outcomes represent a failure to capture the value that AI capability creates.

The history of technology adoption suggests that the binding constraint on a transformative technology’s impact is rarely raw capability. It is the social, institutional, and cognitive infrastructure that allows people to integrate the technology into their lives and work in ways that are reliable and beneficial.

The automobile’s impact on society was not determined primarily by engine horsepower. It was shaped by the roads built to accommodate cars, the traffic laws that made driving predictable, the insurance systems that distributed risk, the licensing requirements that ensured minimum competence, and the cultural norms that evolved around appropriate use. Without these, a more powerful engine would simply have produced more dangerous chaos.

AI is approaching a similar inflection point. The models are impressively capable. But the roads, traffic laws, insurance systems, and licensing requirements — the trust infrastructure — are in their earliest stages.

What Appropriate Trust Looks Like

The goal is not universal trust in AI. It is not universal skepticism. It is a state of calibrated, context-dependent trust that allows individuals and institutions to use AI systems effectively while managing the risks of error.

This state has several characteristics.

Users understand reliability boundaries. A person using an AI coding assistant knows that it is highly reliable for standard implementation patterns, moderately reliable for architectural suggestions, and unreliable for security-critical logic. They verify accordingly, spending their limited attention on the areas where the model is most likely to be wrong.

Institutions provide validation. For high-stakes applications — healthcare, legal, financial — institutional bodies have evaluated AI systems and provided guidance on appropriate use. A doctor using a clinical AI system knows that it has been validated for specific diagnostic categories, with published performance metrics, by an independent body whose methodology they can assess.

Failure is visible and instructive. When AI systems fail, the failures are documented, analyzed, and used to improve both the systems and the trust infrastructure around them. Users and developers have access to failure databases analogous to aviation incident reports, enabling continuous improvement in both technology and governance.

Trust is earned incrementally. New AI capabilities are deployed in low-risk contexts first and gradually extended to higher-stakes applications as evidence of reliability accumulates. This graduated approach allows trust to develop at a pace commensurate with demonstrated performance rather than marketing claims.

The system is honest about what it does not know. AI systems communicate their uncertainty in ways that are meaningful to users — not through vague hedging language, but through structured reliability indicators tied to empirical performance data. When a model is operating at the edge of its validated capability, the user knows it.

The Path Forward

Building trust infrastructure for AI is not a single project or a single policy. It is an ongoing, multi-decade effort that will involve technology companies, governments, academic institutions, professional organizations, and civil society. The work will be slow, contested, and imperfect — much like the development of trust infrastructure for every previous transformative technology.

But several concrete steps are both feasible in the near term and foundational for the long-term effort.

The AI industry needs to move from marketing-oriented benchmarks to reliability engineering. This means publishing detailed performance profiles rather than single aggregate scores, testing systematically for failure modes rather than showcasing best-case performance, and maintaining transparency about known limitations rather than minimizing them.

Governments need to develop regulatory frameworks that are risk-proportionate and technically informed. The EU AI Act represents an early attempt at this, with its risk-tiered approach, but the framework will need significant iteration as the technology evolves. Other jurisdictions will develop their own approaches, and international coordination — while difficult — will be essential.

Professional communities — in medicine, law, engineering, education, and other fields — need to develop domain-specific standards for AI use that reflect the actual reliability of AI systems in their practice areas. These standards should be evidence-based, regularly updated, and developed through the same professional consensus processes that govern other aspects of practice.

And individual users need better tools and education for calibrating their trust in AI outputs. This is the weakest link in the current system, and it will remain so until institutional infrastructure matures. In the interim, the most effective approach is likely to focus on developing “AI literacy” as a core competency — not the ability to build AI systems, but the ability to evaluate and appropriately trust their outputs.

Conclusion

The trust problem in AI is not a bug to be fixed or a PR challenge to be managed. It is a fundamental structural challenge that will shape the trajectory of AI’s impact on society as much as any technical advance.

We have built systems that can produce remarkably useful outputs but that come with no reliable indicators of when those outputs should be believed. We have deployed these systems to hundreds of millions of users without the institutional infrastructure that every previous transformative technology has needed to function safely at scale.

This is not a reason for despair. It is a description of work that needs to be done — work that is as important and as difficult as the engineering work of making AI systems more capable.

The technologies that have most profoundly shaped modern life — aviation, medicine, telecommunications, the internet — did not succeed because they were powerful. They succeeded because societies built the institutional frameworks that allowed their power to be channeled productively. Trust was not an afterthought or a constraint on innovation. It was the foundation on which innovation’s benefits were realized.

AI will be no different. The question is not whether we will build trust infrastructure for artificial intelligence. The question is whether we will build it deliberately and well, or whether we will build it reactively, in response to accumulating failures and crises, at a cost far higher than necessary.

The models are getting better every quarter. The trust infrastructure needs to start catching up.

Get the signal in your inbox

Free. Sourced. AI-written. The AI buildout, daily.

No spam. Unsubscribe anytime.