Back to Blog
AI SEOAEOAEO

Apple Built an Answer Engine. Here Is the Two-Layer Benchmark Before iOS 27.

Hayden BondHayden Bond··9 min read
Apple Built an Answer Engine. Here Is the Two-Layer Benchmark Before iOS 27.
On Monday June 8, 2026 Apple introduced Siri AI, a rebuilt assistant that goes out to the web, draws on broad world knowledge, and returns an answer (Apple Newsroom, June 8 2026). The term is Apple's own. Bloomberg reported last September that the project, codenamed World Knowledge Answers, was described inside Apple as an answer engine (Bloomberg, September 2025). What shipped matches the description.
Apple announced it today. It is not live yet. A developer preview went out that day, a public beta follows this summer, and the full release lands this fall with iOS 27. When it does, it becomes the default answer layer across Apple's device base, the largest in consumer tech.
That base is a new surface where brand visibility gets decided, and it reaches more people by default than any standalone AI app has reached by choice. So we are setting a benchmark now, before the rollout, and we re-pull it when iOS 27 ships. This page is the baseline.
We read every AI answer surface through two layers, because the two have different inputs, different timelines, and different fixes. The parametric layer is what a model already believes from training, fixed in its weights between training runs. The retrieval layer is what the model finds at query time, rented daily against whatever else is published. Siri AI runs on both. Brand visibility inside it divides the same way. Hold that split. The rest of this benchmark sits on it.

The market going into the rollout

Distribution is the reason this matters, so start there. These are the baseline figures as of June 2026. We re-pull each one at the iOS 27 release and publish the delta.
Two of those figures use different units, so read them apart. ChatGPT reports weekly actives, Gemini reports monthly actives, and the distance between weekly and monthly is wide enough that the near-identical 900 million should never be read as a tie. The self-reported company numbers (Apple, OpenAI, Google) and the panel estimate (Counterpoint) also sit in separate buckets, and we keep them separate.
The read across the set: the standalone apps are large and still reach a fraction of the people holding an iPhone who never opened an AI app on purpose. Willingness is broad, around seven in ten adults. The habit is not yet universal. A default-placed answer engine is the thing that closes that gap, because the user never has to choose it. And the 20 billion dollar line is the market price of the default slot Apple just built into its own surface and paid no one to hold.

The two-layer read on Siri AI

Siri AI is the same structure we diagnose on every other surface: a model that answers partly from what it carries and partly from what it fetches. That maps onto the two layers cleanly.
The parametric layer is what the Gemini-based model already believes about a brand from training. If the trained-in picture is missing, outdated, or miscategorized, Siri inherits that error before it retrieves a single page, and a freshly published page may not override it. Parametric signals persist by default, because the weights are fixed between training runs, and a confident wrong belief can outlast the correction. This is the layer most brands never check and the one no dashboard measures.
The retrieval layer is what Siri pulls at query time. This is the addressable surface for content. The answer is assembled from passages a model can lift and have stand on their own, so the unit that gets retrieved is the self-contained passagerather than the whole page. Section-level semantic density and alignment with the sub-queries a model fans a question into decide what comes back. Retrieval visibility is rented, not owned. It holds only as long as your passage stays the best available answer against everything else published that day.
The practitioner takeaway is the part that should change how the work is scoped. The strategy for Siri visibility is the two-layer strategy we already run. Apple did not add a new discipline. It attached its distribution to the one that already exists. Siri is one more answer-engine surface in the set we already work, alongside ChatGPT, Perplexity, and Google's AI answers, and the same retrieval and parametric mechanics decide visibility on each.

Siri visibility is mostly downstream of Gemini

One inference, stated as an inference. Apple built its Foundation Models with Gemini technology, confirmed at the keynote (MacRumors keynote coverage, June 8 2026). If Siri's web answers are produced by Gemini-derived models, then what surfaces in a Siri answer is likely governed by retrieval and ranking close to what governs Google's own AI answers, rather than by a net-new Apple index built from scratch. Apple may layer its own orchestration on top, so hold this loosely. But the direction is firm enough to plan against. The entity signals, structured data, and Knowledge Graph presence that earn visibility in Google's AI answers are already doing part of the work for Siri. The surface is new. The retrieval mechanics underneath are largely ones we already work.

Where the surface is addressable, and where it is not

The new Siri does more than answer from the web. It reads personal context, takes actions across apps, and works from what is on screen. Most of that is answered from a user's own data and cannot be touched by content or entity work. Three channels are addressable, and they belong to different parts of the practice.
The web-answer path is the retrieval channel. When Siri answers a question from the open web, your content can be the source it builds on. This is where passage-level semantic density and sub-query alignment earn the citation, the same work that earns it on any retrieval surface.
The local and action path is its own channel, and it does not run through your website. When Siri brokers a local or commercial choice, the candidate comes from Apple's own data, so presence in Apple Maps and Apple Business Connect becomes a Siri-visibility lever. For hospitality, local service, and tourism clients, that channel decides whether Siri ever says the name, and it is ownable now.
The parametric channel sits under both. It is the slower work of correcting what the model already believes, through the corroborated off-site entity signals that feed training, rather than through anything you publish on your own domain this week.

What we measure when it ships

The market figures above are measurable today. Brand presence inside Siri's answers is not, because Siri is not live. That measurement is the re-pull, and we run it with three terms.
Parametric Presence: whether the model already carries the brand, and carries it correctly, before any retrieval runs. LLM Influence Score: whether our work moves what the answer says. Unprompted Recommendation Rate: whether Siri names or recommends the brand when the user did not ask for it by name.
The methodology note we were holding open is now resolved by the developer beta. Siri's web-answer cards cite their sources and link out to them. The retrieval channel carries a referral, so we score it like any other citation surface, a place to earn both the citation and the visit. The two-layer caution still holds underneath. Being retrieved is not the same as being cited, because most of what a model pulls never makes the answer, and a shown link rarely becomes a guaranteed visit. The recommendation still sits upstream of the click. The retrieval channel is now a traffic channel worth scoring, and the work is to be the source Siri cites and then the brand it already recommends.

The re-pull

This benchmark is dated June 2026, set from the announcement and the developer preview. We re-pull it at the iOS 27 release this fall: the market figures refreshed, the citation behavior measured at scale now that the developer beta confirms the cards link their sources, and the first read on the three brand-presence terms once there is a shipping Siri to measure them against. The point of publishing the baseline now is that the comparison only exists if the before was written down before the after.

The takeaway

A new default answer surface on Apple's base means brand visibility there is decided by the two layers we already work. The recommendation Siri gives, or withholds, sits upstream of every dashboard number, because the model names a brand before the user ever confirms it in a branded search. You earn that the same way on every surface: be what the model already believes at the parametric layer, and be the best retrievable answer at the retrieval layer. Apple just attached the largest distribution in consumer tech to that problem. The baseline is set. We re-pull at iOS 27.
Baseline set June 2026, from Apple's announcement and the developer preview, which shows Siri's web-answer cards citing and linking their sources. Market figures and the three brand-presence terms re-pulled at the iOS 27 release this fall.
Share this article

Ready to appear in AI search?

We work with businesses across every industry. If you have questions about where you stand in modern search, we are easy to reach.

Get in touch
Hayden Bond

Hayden Bond

Hayden Bond has been doing SEO since 2004. He founded Plate Lunch Collective in Aiea, helping brands get cited by AI platforms rather than just ranked by Google.