Ask ChatGPT for the best quiet vacuum for a small apartment, or tell Google's AI Mode to find a sofa that ships before the holidays, and you get something that did not exist a year ago: a short, confident list of specific products, often with prices, ratings, and a button to buy. The agent did not scroll your homepage or admire your photography. It pulled structured data, ranked it against narrow sub-queries, checked it against what other sources say, and named a handful of winners. As of 2026 these agents also increasingly transact, so being on that list is no longer a vanity metric. Understanding how AI shopping agents work, and what actually produces AI product recommendations, is now the difference between being in the answer and being invisible. This piece walks through the mechanism, the signals that decide which products get picked, and the concrete moves that raise a brand's odds.
How AI shopping agents actually work The most useful thing to internalise is that an AI shopping agent does not experience your site as a shopper does. It retrieves structured records, ranks them, and corroborates them. The clearest evidence for where those records come from arrived in a March 2026 analysis of more than 43,000 ChatGPT carousel products, reported by [Search Engine Land](https://searchengineland.com/new-finding-chatgpt-sources-83-of-its-carousel-products-from-google-shopping-via-shopping-query-fan-outs-470723), which found that 83% of ChatGPT's product recommendations match Google Shopping's top 40 organic listings, with around 60% drawn from the top 10. ChatGPT reaches them through what the study calls shopping query fan-outs: it decomposes a request into several short, intent-specific sub-queries, averaging about seven words each, and pulls candidate products from Google Shopping's organic index. The practical takeaway is blunt. For ChatGPT, getting picked starts with your Google Shopping feed, not your homepage copy. The same retrieve-rank-corroborate pattern holds across engines, even where the underlying index differs.
How AI agents choose products: the five signals Once candidates are retrieved, a handful of signals decide which ones survive into the answer. The first is semantic query relevance. The agent is matching intent, so a product described only as a vacuum loses to one whose copy names its noise level and dimensions when the query is quiet vacuum for a small apartment. The second is structured product data: Product, Offer, and AggregateRating schema on the page, plus complete feed attributes such as GTIN, MPN, brand, color, size, material, and the correct Google product taxonomy. The third is price and availability accuracy, where current pricing and an in-stock status rank higher and any feed-versus-landing-page mismatch erodes trust. The fourth, and the one brands most underrate, is third-party authority, the roundups, buyer guides, and review platforms an agent uses to verify a claim. The fifth is contextual weighting: budget-led queries are scored more on price, quality-led queries more on ratings. These factors echo the broader pattern in [how AI models choose which brands to recommend](/blog/how-ai-models-choose-which-brands-to-recommend), now applied at the level of individual SKUs.
Structured data and feeds: the new storefront If retrieval runs on structured records, your feed is your storefront in a way your designed pages no longer are. OpenAI's own commerce specification makes the shift explicit. Merchants submit product feeds in CSV, TSV, XML, or JSON to a secure OpenAI endpoint rather than waiting to be crawled, and the spec supports refreshes as often as every 15 minutes so price and stock stay close to real time, as documented in the [OpenAI commerce file-upload spec](https://developers.openai.com/commerce/specs/file-upload/products). Google's side of the equation is larger still: its Shopping Graph now holds more than 50 billion product listings, and Merchant Center has added conversational-era attributes such as answers to common questions and lists of compatible accessories, described in [Google's agentic shopping announcement](https://blog.google/products-and-platforms/products/products/shopping/agentic-checkout-holiday-ai-shopping/). A thin or stale feed does not just rank lower, it gets skipped or misrepresented. This is the same discipline that underpins [AI visibility for e-commerce brands](/blog/ai-visibility-for-ecommerce) and the case for [schema markup as an AI visibility signal](/blog/schema-markup-ai-visibility): give the machine clean, complete, current attributes or accept that it will fill the gaps with someone else's product.
Why third-party signals outweigh your own site Here is the part that frustrates marketing teams. You can ship a flawless feed and still lose the slot, because agents corroborate before they recommend, and most of that corroboration happens off your domain. Vendor analysis from [Alhena](https://alhena.ai/blog/chatgpt-shopping-product-recommendations/) reports that roughly 91% of AI citations trace to off-site sources, the buyer guides, forum threads, and review platforms an engine trusts more than your marketing copy. That figure comes from a vendor blog rather than primary research, so treat it as directional, but the direction is well established elsewhere. Brands with active profiles on the major review platforms are markedly more likely to be cited, a pattern we examined in [how Trustpilot and G2 shape AI recommendations](/blog/review-sites-ai-recommendations), and community discussion carries similar weight, which is [why AI models cite Reddit](/blog/why-ai-models-cite-reddit) more often than the brand's own pages. The lesson for product teams is that off-site presence is not a nice-to-have layered on top of a good feed. It is the corroboration step the agent runs before it will name you at all.
Want to see this in action?
See how every major AI model talks about your brand. Free to start.





