What Is Schema Markup?
Schema markup is structured code added to a webpage using the Schema.org vocabulary to describe its content in a way search engines and AI systems can parse without ambiguity. Where structured data is the broader concept, Schema markup specifically refers to the dominant vocabulary backed by Google, Microsoft, Yahoo and Yandex.
Why schema markup matters
HTML tells a browser how to render text. Schema tells a machine what the text is. A page can present a recipe, a course, a software product or an event in visually similar ways; Schema markup makes the difference explicit. For AI systems, that explicitness can be the difference between being confidently cited and being skipped.
JSON-LD vs. Microdata vs. RDFa
| Format | Where it lives | Recommendation |
|---|---|---|
| JSON-LD | A <script type="application/ld+json"> block, usually in <head> | Preferred by Google; cleanest to maintain |
| Microdata | Attributes on existing HTML tags | Legacy; harder to maintain |
| RDFa | Attributes on existing HTML tags | Niche; mostly publishing-platform use |
JSON-LD is the right default for almost every modern site.
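As a minimal sketch of what a JSON-LD block looks like in practice, the Python below builds an Article object and wraps it in the script tag described in the table. The headline, author name and date are placeholder values, and `render_json_ld` is a hypothetical helper name, not part of any library.

```python
import json


def render_json_ld(data: dict) -> str:
    """Serialize a Schema.org object into a JSON-LD <script> tag."""
    # Escape "</" so a string value cannot close the script element early.
    # "\/" is a valid JSON escape for "/", so the payload stays parseable.
    payload = json.dumps(data, indent=2).replace("</", "<\\/")
    return f'<script type="application/ld+json">\n{payload}\n</script>'


# Placeholder values for illustration only.
article = {
    "@context": "https://schema.org",
    "@type": "Article",
    "headline": "What Is Schema Markup?",
    "author": {"@type": "Person", "name": "Jane Doe"},
    "datePublished": "2024-01-15",
}

print(render_json_ld(article))
```

Because the markup lives in one self-contained block rather than in attributes scattered across the page, a template can generate it from the same data that renders the visible content.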
The most useful schema types for AI search
- Article / BlogPosting: author, publish date, headline, image. Foundational for editorial content.
- FAQPage: explicit question/answer pairs that AI systems can lift wholesale. Powerful for AEO.
- HowTo: stepwise instructions. Useful for tutorial and support content.
- Product + Offer: pricing, availability, brand. Critical for ecommerce and SaaS pricing pages.
- SoftwareApplication: for SaaS landing pages and tool directories.
- Organization: your canonical brand entity, with logo, social profiles and contact info.
- BreadcrumbList: site hierarchy for hub pages and category pages.
- Person: author bios with credentials, strengthening E-E-A-T signals.
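To make the FAQPage type concrete, here is a small sketch that turns question/answer pairs into the nested structure Schema.org defines (FAQPage, then Question, then Answer). The function name `build_faq_page` and the sample answer text are illustrative assumptions.

```python
import json


def build_faq_page(pairs: list[tuple[str, str]]) -> dict:
    """Build a Schema.org FAQPage object from (question, answer) pairs."""
    return {
        "@context": "https://schema.org",
        "@type": "FAQPage",
        "mainEntity": [
            {
                "@type": "Question",
                "name": question,
                "acceptedAnswer": {"@type": "Answer", "text": answer},
            }
            for question, answer in pairs
        ],
    }


# Sample pair for illustration only.
faq = build_faq_page([
    ("Does schema markup directly improve rankings?",
     "Not directly, but it improves how pages appear in rich results."),
])
print(json.dumps(faq, indent=2))
```

The pairs passed in should be the same questions and answers a visitor sees on the page.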
Common implementation mistakes
- Marking up content that isn't visible to users. Google's structured data guidelines prohibit this, and it can lead to a manual action.
- Inflated review counts. Fake AggregateRating values trigger manual actions.
- Stale data. A pricing schema that disagrees with the visible price erodes trust signals.
- Multiple conflicting Organization blocks. Consolidate to one canonical block per domain.
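Two of the mistakes above, stale prices and conflicting Organization blocks, lend themselves to an automated check. The sketch below is one rough way to flag them, assuming the page's JSON-LD has already been parsed into dictionaries and the visible text is available as a string. `find_schema_issues` is a hypothetical name, and the substring price match is deliberately crude: real pages format prices in many ways.

```python
def find_schema_issues(blocks: list[dict], visible_text: str) -> list[str]:
    """Return human-readable warnings for two common schema mistakes."""
    issues = []

    # Mistake: more than one Organization block, with differing content.
    orgs = [b for b in blocks if b.get("@type") == "Organization"]
    if len(orgs) > 1 and any(o != orgs[0] for o in orgs[1:]):
        issues.append("conflicting Organization blocks")

    # Mistake: an Offer price that does not appear in the visible page text.
    for block in blocks:
        offer = block.get("offers") if block.get("@type") == "Product" else None
        if isinstance(offer, dict) and str(offer.get("price")) not in visible_text:
            issues.append(f"price {offer.get('price')} not found in visible text")

    return issues


# Sample data for illustration only.
blocks = [
    {"@type": "Organization", "name": "Example Co"},
    {"@type": "Organization", "name": "Example Company Ltd"},
    {"@type": "Product", "name": "Widget",
     "offers": {"@type": "Offer", "price": "19.00", "priceCurrency": "USD"}},
]
for issue in find_schema_issues(blocks, "Widget, now $24.00"):
    print(issue)
```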
How to validate your schema
Use Google's Rich Results Test and the Schema.org validator on every template you publish. Both flag missing required fields, type mismatches and syntax errors. Run validation as part of your deploy pipeline for any page that ships JSON-LD.
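A pipeline check does not have to wait on an external tool. As a rough sketch, the script below pulls JSON-LD blocks out of rendered HTML with Python's standard-library parser, then reports syntax errors and missing fields. The `REQUIRED_FIELDS` map is an assumption made for illustration, not Google's official requirements, and this complements rather than replaces the validators named above.

```python
import json
from html.parser import HTMLParser

# Assumed minimal field lists for illustration; not an official requirement set.
REQUIRED_FIELDS = {
    "Article": ["headline", "author", "datePublished"],
    "Product": ["name", "offers"],
}


class JsonLdExtractor(HTMLParser):
    """Collect the text of every <script type="application/ld+json"> block."""

    def __init__(self) -> None:
        super().__init__()
        self.blocks: list[str] = []
        self._in_json_ld = False

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_json_ld = True
            self.blocks.append("")

    def handle_data(self, data):
        if self._in_json_ld:
            self.blocks[-1] += data

    def handle_endtag(self, tag):
        if tag == "script":
            self._in_json_ld = False


def check_page(html: str) -> list[str]:
    """Return a list of problems found in the page's JSON-LD blocks."""
    extractor = JsonLdExtractor()
    extractor.feed(html)
    problems = []
    for raw in extractor.blocks:
        try:
            data = json.loads(raw)
        except json.JSONDecodeError as exc:
            problems.append(f"syntax error: {exc.msg}")
            continue
        schema_type = data.get("@type") if isinstance(data, dict) else None
        if not isinstance(schema_type, str):
            continue  # arrays and multi-typed nodes are out of scope here
        for field in REQUIRED_FIELDS.get(schema_type, []):
            if field not in data:
                problems.append(f"{schema_type} is missing '{field}'")
    return problems


# Sample page for illustration only.
page = ('<html><head><script type="application/ld+json">'
        '{"@type": "Article", "headline": "Hi"}</script></head></html>')
for problem in check_page(page):
    print(problem)
```

In a deploy pipeline, a non-empty result from `check_page` could fail the build for that template.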
Frequently asked questions
Does schema markup directly improve rankings?
Schema is not a direct ranking factor in classical SEO, but it improves how your pages are represented in rich results, AI Overviews and Knowledge Graph entries — which indirectly drives clicks and citations.
Can I have multiple schema blocks on one page?
Yes. A typical article page might include Article, BreadcrumbList, Organization and Person blocks. Keep them in separate script tags or combine them into a single @graph.
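The @graph option can be sketched as follows: each standalone block loses its own @context and becomes a node in one combined document. This assumes every block uses the Schema.org context; `combine_into_graph` and the sample values are hypothetical.

```python
import json


def combine_into_graph(blocks: list[dict]) -> dict:
    """Merge standalone JSON-LD objects into one document with a shared @context."""
    nodes = []
    for block in blocks:
        # Drop each block's own @context; the combined document declares it once.
        nodes.append({k: v for k, v in block.items() if k != "@context"})
    return {"@context": "https://schema.org", "@graph": nodes}


# Sample blocks for illustration only.
blocks = [
    {"@context": "https://schema.org", "@type": "Article",
     "headline": "What Is Schema Markup?"},
    {"@context": "https://schema.org", "@type": "Organization",
     "name": "Example Co"},
]
print(json.dumps(combine_into_graph(blocks), indent=2))
```

Either layout is valid; a single @graph mainly helps when one template owns all of a page's markup.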
Is schema markup useful for LLMs that don't render JavaScript?
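Yes. JSON-LD is plain text in the HTML response, so any crawler that fetches the raw HTML sees it without executing JavaScript. The caveat is markup injected client-side, for example by a tag manager: it only exists after scripts run, so ship it in the server-rendered HTML.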
Related Terms
Google AI Overviews (AIO)
Google AI Overviews are AI-generated summaries that sit above the blue links, stitched together from multiple sources directly inside Google Search.
llms.txt
llms.txt is a proposed plain-text file at the root of a site. It gives large language models a curated, machine-readable map of the pages that matter most.
What Is Structured Data?
Structured data is information marked up in a defined format so machines can read it without guessing. On the web, that usually means Schema.org JSON-LD.
Entity Recognition
Entity recognition is how AI systems pick out people, brands, products, and places in a piece of text and link them to a known identity.
What Is a Knowledge Graph?
A knowledge graph stores facts as entities and relationships, so machines can reason about people, places, brands, and how they connect.
Answer Engine Optimization (AEO)
Answer Engine Optimization is the work of becoming the cited source inside AI answers from ChatGPT, Gemini, Claude, and Perplexity, not just a blue link on Google.
