How LLMs Form Opinions About Businesses
During training, large language models process text from across the internet: news articles, forum discussions, business directories, academic papers, social media, review platforms, and much more. They learn to associate certain entities — businesses, people, brands — with certain topics, qualities, and trust signals. When prompted to make a recommendation, the model surfaces the entities with the strongest relevant associations for that specific query context.
Think of it as a weighted reputation score embedded inside the model's parameters. You cannot directly influence those weights once the model is trained, but you can influence the training data the model learned from — and for search-enabled AI like Perplexity and ChatGPT with web browsing, you can influence what it retrieves right now.
This means AEO is fundamentally a long-term evidence-building strategy, not a quick technical fix. The businesses appearing most often in AI recommendations today aren't gaming a system — they built genuine authority signals over months and years that the model learned to trust.
The 6 Ranking Factors AI Engines Actually Favor
Research from practitioners at iPullRank and Growth Memo, combined with systematic testing across AI platforms, points to six consistent patterns. These aren't official signals published by OpenAI or Google — they're empirically observed patterns in what the models consistently reward.
Factor 1: Entity Recognition and Distinctness
Before an AI can recommend your business, it needs to recognize your business as a distinct, real-world entity. This means your brand name must appear consistently across enough authoritative sources that the model can confidently associate it with a specific type of business, a specific location, and a specific set of services.
Entity recognition is binary in a way the other factors aren't. Either the model knows you exist as a distinct entity, or it doesn't. A business that has never been mentioned outside its own website may as well not exist to the model. The fix is to establish what practitioners call an entity anchor: a homepage with correct Organization schema, a fully-optimized Google Business Profile, and presence on at least five authoritative third-party platforms (LinkedIn, Crunchbase, industry directories), all using the same name and description.
For a full breakdown of how to build entity signals, the entity-first framework covers the five-layer approach Magna uses with every client.
Factor 2: Topical Authority and Depth
LLMs associate brands with topics based on the volume and depth of content connecting the brand to that topic. A law firm with 40 detailed articles about employment law will be more strongly associated with "employment lawyer" queries than a firm with one generic practice area page — even if the latter has higher domain authority by traditional SEO measures.
This is the mechanism behind topical authority clusters. When an AI model encounters dozens of interconnected, authoritative documents from the same source all discussing variations of the same topic, it builds strong associative patterns between that source (your brand) and that topic. Isolated individual pages barely move the needle; a coherent content cluster shifts it significantly.
The practical implication: 20 deep, interconnected articles about your core topic will outperform 100 thin, disconnected pages on various topics. Depth and coherence beat breadth.
Factor 3: Third-Party Corroboration
AI models treat information from independent sources as more reliable than information from the source itself. A business that describes itself as "the leading X in industry Y" gets less credit from the model than a business that has been described that way by three respected external publications.
This factor is why traditional PR — press coverage, industry award listings, expert roundups, analyst mentions — has a direct and measurable effect on AI visibility. The model needs external corroboration to feel confident enough to make a recommendation. Research from Profound consistently shows that a single mention in a Tier 1 industry publication can shift AI recommendation frequency more than dozens of lower-authority citations.
Factor 4: Structured and Extractable Content
AI engines — especially search-enabled ones like Perplexity — strongly prefer content that is structured for extraction. This means headers that directly answer questions, clear factual statements that can be quoted verbatim, tables that summarize comparisons, and FAQ sections with explicit question-answer pairs.
This factor is distinct from traditional on-page SEO. It's not about keyword density or meta descriptions — it's about making your content easy for the model to parse and cite. A 2,000-word article with no structure, no headers, and no extractable facts will be passed over in favor of a 600-word page with clear H2s, a comparison table, and a bulleted takeaway list.
Schema markup reinforces this. Organization, Article, FAQPage, and HowTo schema all make the content more parseable for AI crawlers and training pipelines.
Factor 5: Sentiment and Reputation Signals
AI systems trained on review platforms (Google Reviews, Trustpilot, G2) and forum discussions (Reddit, Quora) incorporate sentiment into their recommendation patterns. Brands with consistently positive reviews, no major public controversies, and constructive community engagement are recommended more frequently than those with mixed or negative sentiment profiles, even when the technical authority signals are similar.
This factor is slow to build and slow to change, which makes it both the most durable competitive advantage and the most damaging liability. Businesses with strong review sentiment across multiple platforms carry it through every model retraining cycle.
Factor 6: Recency and Freshness (Search-Enabled AI)
For AI systems with real-time web search — ChatGPT Search, Perplexity, Google AI Mode — recent content matters in a way it doesn't for purely training-data-based recommendations. A business that published a relevant, authoritative piece last week may surface ahead of a competitor with a two-year-old post, even if the competitor has historically higher authority.
This factor only applies to search-enabled AI, not to knowledge-cutoff models responding from training data alone. But given that Perplexity, ChatGPT with search, and Google AI Overviews all use real-time retrieval, recency is increasingly relevant for the majority of AI-assisted queries. A consistent publishing cadence — even one article per week — creates a freshness signal that pure training-data models can't compete with on live queries.
| Factor | Applies To | Timeline to Build | Primary Lever |
|---|---|---|---|
| Entity recognition | All AI platforms | 2–6 weeks | Schema + directory presence |
| Topical authority | All AI platforms | 8–16 weeks | Content clusters |
| Third-party corroboration | All AI platforms | 8–24 weeks | PR + digital citations |
| Structured content | All AI platforms | 1–2 weeks | Content structure + schema |
| Reputation/sentiment | All AI platforms | 12–52 weeks | Reviews + community presence |
| Recency / freshness | Search-enabled AI only | Immediate | Publishing cadence |
What Changes When AI Has Web Search
The distinction between AI models operating from training data and AI models with real-time web search is critical for strategy. It determines which tactics move the needle now versus which ones compound over months.
Training-data AI (ChatGPT without browsing, Claude without retrieval) reflects the web as it existed during the model's training cutoff. Improving your visibility there requires building signals that get picked up in the next training run — a process that takes months. These improvements are durable: once you're in the training data, you benefit from every conversation the model has on that topic.
Search-enabled AI (Perplexity, ChatGPT Search, Google AI Mode) retrieves live web content before responding. Your freshness signals, structured content, and recent press coverage all matter immediately. A new piece of high-authority content can shift your Perplexity visibility within days. This is the fast lane of AI search optimization.
The most effective AEO strategy works both channels simultaneously: build entity signals and topical authority for the training-data pipeline, while maintaining a publishing cadence and PR program that feeds the real-time retrieval layer.
The 3 Misconceptions That Waste AEO Budgets
Misconception 1: "More Content = More Visibility"
Publishing 50 thin, unfocused articles about unrelated topics will not build AI visibility. LLMs reward topical coherence. Ten deeply interconnected articles on a single core topic will outperform 50 scattered pieces every time. Before you publish anything, ask whether it deepens your topical authority in one area or dilutes it across many.
Misconception 2: "Schema Markup Is the Main Signal"
Schema markup matters — it's the most direct communication channel between your website and AI parsers — but it's a foundation, not a destination. A perfectly structured Organization schema on a website with no third-party mentions and no topical depth will still produce minimal AI visibility. Schema accelerates the other signals; it doesn't replace them.
Misconception 3: "AEO Is Separate from Traditional SEO"
Most of the activities that build AI visibility — authoritative content, review generation, digital PR, structured data — also improve traditional search rankings. AEO and SEO share the same foundation. The difference is in emphasis: SEO prioritizes keyword targeting and backlink acquisition; AEO prioritizes entity clarity, topical depth, and third-party corroboration. Running both programs simultaneously creates compound returns that neither achieves alone.
Building Your Evidence Base: A 90-Day Plan
The following plan assumes you are starting from minimal AI visibility. Adjust based on your AI visibility audit findings.
Days 1–15: Entity foundation
Implement Organization schema with sameAs links on your homepage. Claim and standardize all directory profiles (Google Business, LinkedIn, Crunchbase, three industry directories). Ensure your business name and description are identical across every platform. This is the prerequisite for everything else.
Days 16–45: Topical depth
Identify your core expertise topic and publish 6–8 interconnected articles that each answer a different question within that topic. Use internal linking to connect them into a coherent cluster. Each article should be structured with clear H2s, an answer-first introduction, and at least one extractable table or list.
Days 46–60: Review velocity
Launch a systematic review request program targeting your best recent customers. Aim for 20+ new reviews across Google and one industry-specific platform (G2, Trustpilot, or equivalent). Volume matters here — a steady stream of recent reviews outperforms a historical pile that hasn't grown in six months.
Days 61–90: Citation acquisition
Execute at least three PR placements in trade publications or industry roundups. Focus on editorial mentions where your business is cited by name in connection with your core expertise topic. Podcast appearances and conference speaking engagements also generate citable mentions. Track visibility at 90 days using the same query set you tested on Day 1.
Tracking Progress: How to Know It's Working
Unlike traditional SEO where rank trackers update daily, AI visibility measurement is manual and imprecise. The most reliable approach: run a fixed set of 20–30 queries across ChatGPT, Perplexity, Gemini, and Claude on the same day each month. Track whether your business is mentioned, in what context, and how prominently. Over time, the trend line tells you whether your evidence-building is working.
Perplexity will show improvements first because it retrieves live web content. ChatGPT will lag because it depends on training data updates. If Perplexity improves but ChatGPT doesn't, you're building current web presence but not yet durable training-data authority — typically a sign you need more third-party corroboration and longer-form content.
For a detailed methodology, the ChatGPT visibility testing framework covers a 30-prompt testing protocol with a standardized scoring rubric. Running it quarterly gives you a meaningful performance baseline that correlates with real-world lead flow from AI referrals.