How Large Language Models Source Brand Reputation Across Languages and Markets
Pith reviewed 2026-06-25 19:03 UTC · model grok-4.3
The pith
Large language models cite third-party sites for 85.7 percent of brand answers rather than company-owned pages.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
When large language models answer brand questions they attribute 85.7 percent of citations to third-party domains and only 14.3 percent to domains the brand itself controls. The domain distribution is long-tailed and follows a Zipf law with alpha equal to 0.86. Wikipedia ranks as the single most-cited domain in 11 of the 12 languages studied. In the Polish market YouTube leads and four HR and career portals together supply more than twice as many citations as Polish Wikipedia.
What carries the argument
Classification of each citation URL as owned by the brand or third-party, followed by aggregation by language and market to reveal concentration and dominance.
If this is right
- Brand reputation inside AI answers is shaped mainly by how external sites describe the company.
- Wikipedia supplies the largest single share of AI brand information in almost every language.
- Because citations follow a Zipf distribution, a modest shift in the top 18 percent of domains affects the majority of outputs.
- Market-specific source preferences, such as YouTube dominance for Polish brands, produce different citation mixes even for national companies.
Where Pith is reading between the lines
- Companies could gain more AI visibility by ensuring accurate information appears on Wikipedia and other top-cited domains than by focusing only on their own sites.
- Small edits or content changes on the leading third-party domains could alter AI descriptions of many brands at once.
- The observed patterns may differ for non-brand queries or for models trained after the datasets were collected.
Load-bearing premise
The Rankfor.AI datasets and the owned-versus-third-party labels accurately capture the actual grounding behavior of the models without bias from query wording or model choice.
What would settle it
A new run of the same brand queries on additional models or with rephrased prompts that produces a third-party citation share below 70 percent would falsify the central proportion.
read the original abstract
When a large language model (LLM) answers a question about a company, it grounds the answer in retrieved web sources, and those sources decide what the model says. Most analysis of AI brand visibility looks at the answer text. This study looks one step earlier, at the citations. We merge three Rankfor.AI datasets covering 128 brands across 12 home markets and 13 languages, and analyse 167,551 URL-grounded citations (189,974 total attribution rows). We classify each citation by domain and source type and measure where AI gets its brand information, by language and by market. Four patterns hold. First, AI grounds brand answers overwhelmingly in third-party sources: 85.7% of citations point to sites the brand does not own, against 14.3% owned. Second, the source base is concentrated and long-tailed: 80% of citations come from about 18% of domains, fitting a Zipf law (alpha = 0.86, R^2 = 0.983). Third, one reference site dominates almost everywhere: Wikipedia is the most-cited domain in 11 of 12 languages, the exception being Lithuanian, where the business daily vz.lt edges it (4.38%). Fourth, the source mix is market-specific at the margin: for 46 Polish national brands the most-cited domain is YouTube, and four HR and careers portals supply 637 citations against 297 for Polish Wikipedia, about twice as many.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript merges three Rankfor.AI datasets covering 128 brands across 12 markets and 13 languages and examines 167551 URL-grounded citations (189974 attribution rows). It reports that 85.7% of citations come from third-party domains versus 14.3% from brand-owned domains, that the domain distribution is long-tailed and fits a Zipf law (alpha=0.86, R^2=0.983), that Wikipedia is the most-cited domain in 11 of 12 languages, and that market-specific patterns exist (e.g., YouTube leads for Polish brands while HR portals outrank Wikipedia).
Significance. If the domain classifications are reliable, the work supplies a large-scale descriptive baseline on LLM grounding behavior for brands. The explicit counts, concentration statistic, and cross-language/market comparisons are concrete and could inform studies of AI-mediated brand visibility. The sample size and reported R^2 value are strengths of the descriptive analysis.
major comments (2)
- [Abstract] Abstract: the central 85.7% third-party claim requires every one of the 167551 citations to be correctly labeled owned versus third-party. The text states only that citations were classified by domain and source type; no decision rules, brand-domain lists, handling of subsidiaries/subdomains, or validation (inter-rater reliability, spot-checks) are described. Systematic misclassification of even a few high-frequency domains would move the reported split by several points.
- [Abstract] Abstract: no justification is offered for the representativeness of the 12 markets and 13 languages, and no analysis addresses possible biases arising from query phrasing or the particular models underlying the Rankfor.AI datasets.
minor comments (2)
- The reported percentages and Zipf parameters are given without error bars, confidence intervals, or other uncertainty measures.
- The exact breakdown of brands per market/language and any filtering steps applied to the 189974 attribution rows are not stated.
Simulated Author's Rebuttal
Thank you for the constructive feedback. We address each major comment below and will revise the manuscript to improve transparency and limitations discussion.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central 85.7% third-party claim requires every one of the 167551 citations to be correctly labeled owned versus third-party. The text states only that citations were classified by domain and source type; no decision rules, brand-domain lists, handling of subsidiaries/subdomains, or validation (inter-rater reliability, spot-checks) are described. Systematic misclassification of even a few high-frequency domains would move the reported split by several points.
Authors: We agree that the classification methodology must be described in detail to support the 85.7% figure. In the revised manuscript we will add a dedicated subsection in Methods that specifies the decision rules for owned vs. third-party domains, the brand-domain lists employed, handling of subsidiaries and subdomains, and any validation performed (including spot-checks). revision: yes
-
Referee: [Abstract] Abstract: no justification is offered for the representativeness of the 12 markets and 13 languages, and no analysis addresses possible biases arising from query phrasing or the particular models underlying the Rankfor.AI datasets.
Authors: The 12 markets and 13 languages are those present in the merged Rankfor.AI datasets; we will add a short justification paragraph in Methods or Limitations explaining the data-driven selection and its diversity. We will also add an explicit limitations statement acknowledging possible biases from query phrasing and model-specific retrieval behavior, while clarifying that the work is descriptive rather than a generalizability study. A full bias analysis would require new experiments outside the current scope. revision: partial
Circularity Check
No circularity: purely descriptive empirical counts and one data fit
full rationale
The paper merges existing citation datasets, classifies URLs by domain ownership, reports raw percentages (85.7% third-party), identifies the most-cited domain per language/market, and fits a Zipf exponent to the observed frequency distribution. None of these steps derive a new quantity from prior outputs of the same analysis; the Zipf fit is a post-hoc statistical summary of the collected data rather than a prediction that reduces to the fit by construction. No self-citations, uniqueness theorems, or ansatzes are invoked to support the central claims. The work is self-contained against external citation benchmarks.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
In: Advances in Neural Information Processing Systems 33 (NeurIPS 2020)
Lewis P, Perez E, Piktus A, Petroni F, Karpukhin V, Goyal N, et al (2020) Retrieval-augmented generation for knowledge-intensive NLP tasks. In: Advances in Neural Information Processing Systems 33 (NeurIPS 2020). arXiv:2005.11401
Pith/arXiv arXiv 2020
-
[2]
Computational Linguistics 49(4):777–840
Rashkin H, Nikolaev V, Lamm M, Aroyo L, Collins M, Das D, Petrov S, Singh Tomar G, Turc I, Reitter D (2023) Measuring attribution in natural language generation models. Computational Linguistics 49(4):777–840. arXiv:2112.12870
arXiv 2023
-
[3]
In: Findings of the Association for Computational Linguistics: EMNLP
Liu NF, Zhang T, Liang P (2023) Evaluating verifiability in generative search engines. In: Findings of the Association for Computational Linguistics: EMNLP
2023
-
[4]
In: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD ’24)
Aggarwal P, Murahari V, Rajpurohit T, Kalyan A, Narasimhan K, Deshpande A (2024) GEO: generative engine optimization. In: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD ’24). arXiv:2311.09735
arXiv 2024
-
[5]
Chen M, Wang X, Chen K, Koudas N (2025) Generative engine optimization: how to dominate AI search. arXiv:2509.08919
arXiv 2025
-
[6]
Kumar P (2026) Generative engine optimization at scale: measuring brand visibility across AI search engines. arXiv:2606.20065
Pith/arXiv arXiv 2026
-
[7]
Yang KC (2025) News source citing patterns in AI search systems. arXiv:2507.05301
arXiv 2025
-
[8]
Aral S, Li H, Zuo R (2026) The rise of AI search: implications for information markets and human judgement at scale. arXiv:2602.13415 11
arXiv 2026
-
[9]
Kirsten E, Grosse Perdekamp J, Wu Q, Upadhyay M, Gummadi KP, Zafar MB (2025) Characterizing web search in the age of generative AI. arXiv:2510.11560
Pith/arXiv arXiv 2025
-
[10]
Proceedings of the Association for Information Science and Technology 61(1):205–217
Li A, Sinnamon L (2024) Generative AI search engines as arbiters of pub- lic knowledge: an audit of bias and authority. Proceedings of the Association for Information Science and Technology 61(1):205–217. https://doi.org/10.1002/ pra2.1021 (also arXiv:2405.14034)
arXiv 2024
-
[11]
Journal of Brand Management 7(4):241–255
Fombrun CJ, Gardberg NA, Sever JM (2000) The Reputation Quotient: a multi- stakeholder measure of corporate reputation. Journal of Brand Management 7(4):241–255. https://doi.org/10.1057/bm.2000.10
-
[12]
Strategic Management Journal 23(12):1077–1093
Roberts PW, Dowling GR (2002) Corporate reputation and sustained superior financial performance. Strategic Management Journal 23(12):1077–1093. https: //doi.org/10.1002/smj.274
-
[13]
˙Zatuchin D (2026) The language blind spot: how query language and brand recognition tier shape AI-constructed brand reputation across twelve European languages. arXiv:2606.23165
arXiv 2026
-
[14]
˙Zatuchin D (2026) Who owns the AI recommendation? A multi-industry empirical map of brand category ownership across large language models. arXiv:2606.23057
Pith/arXiv arXiv 2026
-
[15]
Peec AI, 19 June 2026
Peec AI (2026) Top domains cited by AI search: analysis based on 30M sources. Peec AI, 19 June 2026. https://peec.ai/blog/ top-domains-cited-by-ai-search-analysis-based-on-30m-sources
2026
-
[16]
Omniscient Digital, 8 January 2026
Dombrowski C (2026) How LLMs source brand information: an analysis of 23,000+ AI citations. Omniscient Digital, 8 January 2026. https://beomniscient. com/blog/how-llms-source-brand-information/ 12
2026
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.