All Articles
    Data & Research
    Updated June 4, 202612 min read

    Ahrefs Just Published 10 Findings on AI Search Optimization. Here's What Each Means for Brand Teams.

    Ahrefs analysed 1 billion data points across 14 studies and published 10 findings on how AI search actually works in 2026. Several reframe the standard AI visibility playbook. Here are the headline numbers, what each one means in practice, and the two findings that change how brand teams should prioritise.

    Matiss Katanenko

    Matiss Katanenko

    Co-founder, Honeyb

    Ahrefs, the SEO platform that runs one of the largest web indices in the market, spent six months analysing how AI search engines source, cite, and rank content. The output covered roughly 1 billion data points across 14 separate studies. Tim Soulo, the company's CMO, published the 10 headline findings on X in late May, with the underlying research available on the Ahrefs blog and a separate ChatGPT vs Google citation overlap study.

    The findings are unusually rich because the sample size is large enough to draw real conclusions, the methodology is publicly documented, and the authors stand behind the data publicly. Several of the findings reinforce what brand teams running AI visibility programmes have been seeing anecdotally. Two of them are genuinely surprising and change how the work should be prioritised. One contradicts conventional wisdom (including some of what's been written on this blog), and the right response is to update the playbook rather than defend the older take. AI search is moving fast enough that any single study is a snapshot, not a verdict, which is the broader point underneath the schema finding below.

    The 10 findings at a glance

    #FindingHeadline statWhat it changes for brand teams
    1'Best X' listicles dominate citations43.8% of ChatGPT-cited pagesEditor outreach to category roundups is the highest-ROI placement
    2Two thirds of citations are off-limits to marketers67% from Wikipedia, homepages, app storesConcentrate effort on the 32.3% influenceable surface
    3Most AI-cited pages don't rank on Google28.3% have zero Google trafficAI visibility must be measured directly, not inferred from rankings
    4Retrieved is not the same as citedChatGPT cites ~50% of what it fetchesBackground-context influence matters as much as visible citations
    5Broad schema markup didn't move citations-4.6% to +2.4% across surfacesSchema still matters for specific surfaces; the broad lift claim is overstated
    6YouTube has the strongest correlation with AI brand visibility0.737 correlationAdd a YouTube creator outreach track to your AI visibility programme
    7AI Overviews collapse clicks on the #1 result-58% clicks (up from -34.5% in 10 months)Optimise for citation inside AI Overviews, not just ranking above them
    8AI Overviews concentrate on informational queries99.9% informational, 3.2% shoppingRun citation strategy for top-of-funnel, click strategy for bottom-of-funnel
    9AI Mode and AI Overviews cite different sources86% same conclusion, 13.7% citation overlapMeasure and optimise both surfaces separately
    10AI Overviews are highly volatileChanges every 2.15 days, 70% content driftDaily monitoring is the minimum cadence for stable trend data

    Finding 1: 'Best X' listicles are the single most cited content format

    The stat: 'Best X' blog listicles make up 43.8% of all page types cited by ChatGPT specifically.

    What it means: when ChatGPT generates a recommendation answer for a buyer-stage query, the source it pulls from is overwhelmingly a 'best of' listicle. This is consistent with what brand teams have been observing for two years now. Inclusion in category-defining roundups (G2's best-of lists, Forbes Advisor roundups, niche publisher 'best 10' articles) is the single highest-leverage placement for getting cited.

    For brand teams: outreach to the editors writing 'best of' roundups in your category is one of the highest-ROI activities in AI visibility work. Building your own 'best X' content (your own roundups, your own comparison pages) also captures citations directly.

    Finding 2: 67% of ChatGPT's top citations are off-limits to marketers

    The stat: 67% of ChatGPT's top 1,000 citations come from sources marketers can't influence: Wikipedia (29.7%), homepages (23.8%), app stores (6.6%). Only 32.3% are influenceable content like educational pages, reviews, news, and blog posts.

    What it means: the meaningful surface for AI visibility work is a third of what shows up. The other two thirds is structural infrastructure (Wikipedia, official homepages, app stores) that brand teams either already have or genuinely can't move via marketing tactics.

    For brand teams: this is actually freeing rather than discouraging. The actionable surface is smaller, which means the work should be concentrated. Inside the 32.3% influenceable share, the priorities order cleanly: review platforms (G2, Trustpilot, Capterra) first, editorial roundups second, news coverage third, owned blog content fourth. Trying to optimise across all four simultaneously is less effective than running them in sequence with concentrated effort on whichever one your brand is weakest on first.

    Finding 3: 28.3% of ChatGPT's most-cited pages have zero Google organic visibility

    The stat: 28.3% of the pages ChatGPT cites most frequently have zero traditional organic traffic from Google. They get cited repeatedly by ChatGPT despite not ranking in Google search at all.

    What it means: AI search is structurally a separate discovery layer from Google search. The pages winning AI citations are partly the same pages winning Google rankings, but not entirely. Roughly three in ten cited pages are invisible to Google ranking metrics and yet do significant AI-visibility work for their brands.

    For brand teams: this validates the case that AI visibility needs to be measured directly, not inferred from Google rankings. A brand that's invisible on Google for a category-defining keyword can still be cited heavily by ChatGPT for the same query if their content fits the AI engine's citation preference. Measurement tooling that only tracks Google rankings (Ahrefs Rank Tracker, SEMrush Position Tracking, Sistrix Visibility Index) misses about a third of what's actually happening on the AI-citation surface. AI-specific monitoring is the only way to see it.

    Finding 4: ChatGPT cites only about 50% of the URLs it retrieves

    The stat: ChatGPT fetches dozens of pages per query but only cites about half of them. The other half are used as background context without attribution.

    What it means: being retrieved and being cited are different things. A page can be read and used by the model to shape its answer without ever appearing in the visible citation list. This has two implications. First, the visible citation list understates the actual influence of the broader source pool. Second, a brand's content can be influencing AI answers without showing up in any analytics dashboard that only tracks visible citations.

    For brand teams: don't optimise solely for the visible citation surface. The work to make your content extractable, citable, and factually clean pays off even when the model uses it as background context rather than a named citation, because background context shapes how the model describes your category and your brand.

    Finding 5: Schema markup had zero meaningful impact on AI citations

    The stat: across the Ahrefs experiments, AI Overviews dipped 4.6%, AI Mode rose 2.4%, and ChatGPT rose 2.2% after schema markup was added. All three movements are indistinguishable from zero against the baseline noise.

    What it means: the simple causal claim 'add schema markup, get more AI citations' does not hold up to controlled testing. Adding schema didn't move the citation needle in either direction.

    Why this is a good example of needing continuous measurement, not one-time setup: an earlier post on this blog argued that schema markup helps with AI visibility. The Ahrefs run shows the broad-citation claim doesn't hold across their sample. Both takes can be defensible at different points in time, and both will shift again as the engines retrain. Anyone making a 2026 call from a study run in early 2025 is already working from stale data. The targeted impacts still hold: Product, Offer, and AggregateRating schema is what feeds ChatGPT Shopping product cards correctly, FAQ schema is still used for direct-answer extraction, and Organization schema shapes Knowledge Panel content (which influences how AI describes your brand at the entity level). The broad 'schema = more AI citations' claim is the part that doesn't survive the Ahrefs data, and the schema markup post should be read with that correction in mind.

    The general principle: when a finding contradicts something you previously believed about AI search, the answer is to update the belief and keep monitoring, not to pick a side. The engines change their citation behaviour faster than any one study can capture.

    For brand teams: prioritise schema for the specific surface it serves (Product schema for shopping, FAQ for question queries, Organization for brand entity work). Don't expect a broad citation lift from adding it across the site.

    Finding 6: YouTube mentions have the highest correlation (0.737) with AI brand visibility

    The stat: YouTube brand mentions had a 0.737 correlation with AI brand visibility, higher than backlinks, page count, Domain Rating, or any other conventional SEO metric Ahrefs tested. The pattern held across both Google-owned and OpenAI products.

    What it means: this is the most actionable single finding in the entire study. YouTube is dramatically underweighted in most brand teams' AI visibility work, but it produces the strongest correlated signal across the engines. The reason is structural: YouTube transcripts are heavily ingested into AI training corpora, YouTube comments serve as a form of multi-voice corroboration that AI models trust, and YouTube creators talking about a brand on video produce a citable signal at the entity level rather than a single-page level.

    For brand teams: if your AI visibility programme doesn't have a YouTube creator outreach component, this is probably the highest-leverage addition you can make in 2026. Identify the YouTube creators in your category, build relationships, get your product into their hands, and aim for substantive on-screen mentions. The video doesn't have to be a paid placement; an organic mention in a review or category overview video tends to outperform sponsored content for AI citation impact.

    Finding 7: AI Overviews reduce clicks to the #1 result by 58%

    Want to see this in action?

    Check how AI models talk about your brand — free, instant, no signup required.

    Free AI Check

    The stat: AI Overviews reduce clicks to the #1 organic result by 58%. This is up from 34.5% just ten months earlier. The trend is accelerating.

    What it means: the click loss from AI Overviews has nearly doubled in under a year. The traffic that used to flow from Google's top organic position to brand sites is increasingly captured at the SERP itself by the AI summary. This is the single clearest piece of evidence that the click-driven SEO model is structurally compressing.

    For brand teams: the click loss is permanent for the queries that trigger AI Overviews. The work shifts from 'get the click' to 'be cited inside the AI Overview itself'. A brand that ranks #1 organically and is not cited in the AI Overview now captures 42% of what they captured eighteen months ago for the same query. Measuring AI Overview citation share is now as consequential as measuring traditional ranking.

    Finding 8: 99.9% of AI Overviews appear on informational queries

    The stat: 99.9% of AI Overviews appear on informational intent queries. Transactional, navigational, and local searches are almost entirely AI-Overview-free. Shopping triggers AI Overviews just 3.2% of the time.

    What it means: the click loss from finding 7 is concentrated on informational queries. Transactional traffic ('buy X', '[brand] login', 'pricing'), navigational traffic (brand name searches), and local traffic (near-me queries) are still flowing through to brand sites at roughly the pre-AI rates.

    For brand teams: this changes prioritisation. Informational top-of-funnel content has lost most of its click value but retained its citation value (you want to be cited inside the AI Overview, not click through to your page). Transactional and branded content has retained its click value and should still be optimised the old way. The hybrid posture (citation optimisation for top-of-funnel, click optimisation for bottom-of-funnel) is now the right default for most B2B and consumer brands.

    Finding 9: AI Mode and AI Overviews reach the same conclusion 86% of the time but cite different sources

    The stat: for a given search query, Google's AI Mode and AI Overviews reach the same conclusions 86% of the time but cite almost entirely different sources. Only 13.7% citation overlap.

    What it means: even within Google's own AI surfaces, the citation layer is largely independent. The 86% conclusion-level agreement reflects a shared underlying judgement about what's true; the 13.7% citation overlap reflects two largely separate decisions about which sources to credit for that judgement. Both decisions affect brand visibility.

    For brand teams: appearing in AI Overviews is not the same as appearing in AI Mode, even on the same query. The two surfaces need to be measured separately and the citation work needs to address both. Honeyb tracks AI Mode and AI Overviews as distinct engines for exactly this reason.

    Finding 10: AI Overviews change every 2.15 days on average

    The stat: AI Overviews change every 2.15 days on average, with 70% of content differing between consecutive observations. Semantic similarity stays at 0.95, meaning the conclusions barely move even as the words, sources, and entities constantly shuffle.

    What it means: the meaning is stable, the visible composition is volatile. A brand that appears in an AI Overview today may not appear in the same AI Overview tomorrow, even though the AI is reaching essentially the same conclusion about the query. This is the strongest empirical evidence for the case that one-off AI visibility checks are unreliable.

    For brand teams: monitoring AI Overview presence on a weekly cadence misses meaningful share-of-voice movement that happens every two days. Daily monitoring is the minimum cadence that produces stable trend data. The case for continuous monitoring is now empirical rather than just intuitive.

    The two findings that change how to prioritise

    Most of the ten findings reinforce the current AI visibility playbook. Two of them genuinely change how brand teams should be prioritising their work.

    The first is finding 6 (YouTube has the highest correlation with AI brand visibility). This is the single largest underweighted opportunity in most brand teams' programmes. The work is unfamiliar to most SEO and content teams (creator outreach is closer to influencer marketing than to traditional SEO) but the data says it produces the strongest correlated signal. Brands that move first on YouTube creator relationships in their category will be visibly ahead on AI visibility metrics within two quarters.

    The second is finding 3 (28.3% of ChatGPT's most-cited pages have zero Google traffic). This is the strongest empirical case yet that AI search is a structurally separate discovery layer. A brand whose AI visibility measurement infrastructure only tracks Google rankings (most SEO platforms in the market) is blind to roughly a third of the AI citation surface. The fix is not adding another SEO tool; it's adding a dedicated AI visibility platform that measures the AI surface directly.

    How this maps to Honeyb's positioning

    Honeyb is the dedicated AI visibility platform Ahrefs' research implies brands should be running alongside their SEO stack. Across the ten findings, the platform's existing capabilities map cleanly: daily monitoring across all eight major AI engines (which addresses finding 10's 2.15-day volatility), per-engine tracking that treats AI Mode and AI Overviews as distinct surfaces (finding 9), citation tracking across the full surface visible plus inferred (related to finding 4), and share-of-voice measurement for buyer-stage queries that are usually informational (finding 8).

    Two things Honeyb's product roadmap addresses that the findings raise directly: YouTube mention monitoring (finding 6) and AI Overview citation share tracking (finding 7) are both surfaces we're adding visibility for in coming releases. Run a free check to see your current baseline across the eight major engines.

    A note on Ahrefs as both data source and market competitor: Ahrefs launched their own AI visibility checker in early 2026. The research findings stand on their own credibility; the tool comparison is a separate decision. The advantage of Ahrefs' tool is integration with the rest of their SEO platform. The advantage of dedicated AI visibility platforms (Honeyb, Profound, and others) is depth-of-feature on the AI surface specifically, with monitoring across more engines and sentiment classification as a first-class metric rather than an add-on.

    What we're watching change next

    Every finding in the Ahrefs study is true at the moment of measurement and provisional after that. The list below tracks the signals where we expect the strongest movement over the next two to three quarters, and where brand teams should be cautious about treating today's number as permanent.

    SignalWhere it stands nowWhy it's likely to move
    Schema markup impact on broad citationsIndistinguishable from zero (Ahrefs, 2026)Engines are actively re-weighting structured data; surface-specific lift (Product, FAQ, Organization) likely to grow
    Citation share by engineChatGPT dominant, Gemini regaining groundAI Mode rollout and Copilot integration changing the share split monthly
    YouTube correlation strength0.737, highest single signalOther UGC platforms (Reddit, podcast transcripts) are being ingested at higher rates; the relative ranking may compress
    Citation overlap between AI Mode and AI Overviews13.7%Google is signalling unification of the two surfaces; overlap will likely climb
    AI Overview volatility2.15-day average refreshStability typically rises as engines mature; expect the refresh interval to lengthen
    Sentiment as a leading indicatorEmerging discipline, under-instrumentedWill become standard the same way ranking became standard for SEO once the metric was operationalised

    The reason continuous monitoring beats periodic audits is that the underlying numbers move fast enough that a quarterly snapshot is already stale by the time it lands in front of the board.

    Closing

    The Ahrefs research is the most rigorous public dataset on AI search behaviour available in 2026. Most of the findings reinforce work brand teams should already be doing. Two of them (YouTube creator outreach and the AI-vs-Google discovery layer split) are large enough to genuinely reorder priorities for the next two quarters. For the underlying research, the Ahrefs blog post on most-cited ChatGPT pages is the right starting point. For ongoing measurement of your brand's AI citation surface, run a free AI visibility check to baseline your current position across the major engines, and see our board template for how to translate the data into executive conversations.

    Matiss Katanenko

    About the author

    Matiss Katanenko

    Co-founder, Honeyb

    My name is Matiss Katanenko and I co-founded Honeyb, the AI visibility platform that tracks how ChatGPT, Gemini, Claude, Perplexity and the other major AI engines talk about brands. I'm based in Riga, Latvia. Before Honeyb I spent years on the agency side running SEO and content programs for fast-growing brands across the US and Europe. That work is where I watched AI search start to compress the entire discovery channel into a four-brand short list, and decided to build the tool I wished agencies had. In my free time I'm in the sauna, on a padel court, or behind a drum kit.

    Connect on LinkedIn
    Honeyb

    Free, instant, no signup

    See your brand through every major AI model.

    Run a free check in 30 seconds. The picture is usually different than you'd expect.

    ChatGPTChatGPT
    ClaudeClaude
    GeminiGemini
    PerplexityPerplexity