How Large Language Models Actually Work: A Business Leader's Technical Primer
A concise technical explanation of how large language models function — from training data and transformer architecture to why they produce the outputs they do.
A precise explanation of what generative AI systems actually produce — probability distributions over tokens — and why understanding this changes how leaders should deploy and trust AI outputs.
Generative AI does not retrieve answers — it generates them, token by token, by sampling from a probability distribution. Every output is the model's statistically best guess at what text should follow the input, conditioned on patterns learned from training data. This is not a subtle technical distinction: it is the foundational fact about generative AI that changes how it should be deployed, trusted and evaluated in any organisation serious about using it responsibly.
When a generative AI model produces text, it is executing a sequential prediction process. Given the current context — every token that has appeared so far, including the system prompt, retrieved documents, conversation history and the current query — the model assigns a probability to each token in its vocabulary. It samples from that distribution, appends the selected token to the context, and repeats until a stop condition is reached.
The output is not looked up. It is not retrieved from a database of verified answers. It is constructed, token by token, from statistical patterns the model learned during training. Those patterns encode a great deal of genuine knowledge — but they also encode noise, errors, biases, and the structural tendency to produce fluent, plausible-sounding sequences regardless of factual grounding.
The word "generative" in generative AI is literal: the system generates new text. It is not retrieval with a natural-language interface. The distinction matters enormously for how outputs should be used.
The probability-based generation mechanism is the root cause of the most consequential risk in enterprise AI deployments: hallucination. Because the model is optimising for probable text rather than accurate text, it will produce false information with the same fluency and confidence as true information, whenever false information is statistically more probable given the input.
This risk is not confined to obscure edge cases. It manifests in common business scenarios: an AI summarising a contract may omit a liability clause because similar clauses do not commonly appear in its training distribution. An AI answering policy questions may describe a rule that existed in an earlier version of the regulation. An AI generating a competitive analysis may attribute a capability to a vendor that no longer offers it.
PwC's 2025 Global CEO Survey found that only 30% of CEOs report increased revenue from AI in the last 12 months, and only 12% report both revenue increase and cost reduction. The gap between AI potential and realised value is substantially explained by deployments that did not account for what generative AI actually produces.
The generation mechanism in full:
The critical point: at no stage in this process is there a truth-verification step. The model does not check whether the token it selects corresponds to a fact in the world. It selects the token that is statistically most probable given the context. Truth and probability are correlated but not identical — and that gap is where hallucination lives.
Accepting that generative AI produces probable text rather than verified facts has direct architectural implications for every production deployment:
Grounding reduces the gap between probability and truth: When the model is provided with verified source documents in the context window and instructed to base its answer on those documents, the probability distribution is conditioned on accurate content. Retrieval-augmented generation is grounding at the architecture level.
Output validation creates a truth check outside the model: For structured outputs — classifications, numerical fields, yes/no determinations — post-processing validation can check model outputs against known constraints and business rules, catching errors the model would not self-correct.
Human review closes the remaining gap: For high-stakes outputs, a knowledgeable human reviewer provides the epistemically grounded check the model cannot provide for itself.
Edison AI's AI training programmes teach both technical and business teams to reason about generative AI from first principles — understanding the probability mechanism, designing appropriate verification architectures, and communicating clearly to stakeholders about what AI can and cannot be trusted to produce autonomously.
Edison AI runs practical AI training that turns this understanding into day-to-day team capability.
Generative AI generates probability distributions over possible next tokens and samples from them to produce text. The output is the sequence of tokens the model judged most probable given the input, not a retrieval of verified facts or a lookup from a database. This is why the same prompt can produce different answers, and why outputs can be fluent but factually wrong.
Generative AI models do not have a reliable internal signal indicating when they lack knowledge versus when they have it. The probability mechanism operates over patterns, not epistemically verified knowledge. A model can produce a high-confidence-seeming output based on weak training signal just as easily as based on strong training signal. Some alignment techniques train models to express uncertainty, but this is imperfect and should not be relied upon as a comprehensive safeguard.
It means trust must be calibrated to the use case and the verification architecture around the model. Generative AI is highly useful for drafting, summarising, classifying and reasoning tasks where outputs are reviewed by a knowledgeable human or validated against source documents. It is not appropriate as an autonomous source of factual authority on high-stakes questions without grounding and verification.
Edison AI helps Australian businesses move from AI curiosity to practical implementation, with workflow design, team training and measurable outcomes. Tell us about your setup and we'll come back with a sequenced plan grounded in the same thinking you just read.
Article: What \"Generative AI\" Actually Generates: Probabilities, Not Facts