XML Sitemaps

Learn what XML sitemaps are, why they matter for SEO, how to create them, and best practices for submitting them to Google Search Console.

Beginner7 min readUpdated 04 Mar 2026Bukhosi Moyo

An XML sitemap is a file that lists all the important pages on your website, helping search engines discover, crawl, and index them efficiently. Think of it as a table of contents for Google — it tells the crawler exactly which pages exist and which ones matter most.

Quick Answer
  • An XML sitemap is a machine-readable file listing URLs you want search engines to crawl and index.
  • It helps Google discover pages faster, especially new pages and pages with few internal links.
  • Sitemaps are not a ranking factor — they do not boost rankings, but they ensure pages are found and indexed.
  • Submit your sitemap through Google Search Console for monitoring and error detection.
  • Most CMSs and frameworks generate sitemaps automatically — but you should verify and optimise them.

If you want the full breakdown, continue below.

What Is an XML Sitemap?

An XML sitemap is an XML file (usually at yourdomain.com/sitemap.xml) that lists URLs along with optional metadata:

<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>https://symaxx.co.za/web-design/pretoria</loc>
    <lastmod>2026-03-01</lastmod>
    <changefreq>monthly</changefreq>
    <priority>0.8</priority>
  </url>
</urlset>

XML Sitemap Fields

Field Required Purpose
<loc> Yes The full URL of the page
<lastmod> Recommended Date the page was last modified
<changefreq> Optional How often the page changes (Google largely ignores this)
<priority> Optional Relative importance within your site (Google largely ignores this)

In practice, <loc> and <lastmod> are the only fields that matter. Google has confirmed it ignores <changefreq> and <priority>.

When Sitemaps Matter Most

Sitemaps are most valuable for:

  • Large websites (1,000+ pages) — ensures all pages are discovered
  • New websites — helps Google find pages before you have backlinks
  • Sites with poor internal linking — provides an alternative discovery path
  • Sites with deep page hierarchies — ensures deeply nested pages are found
  • News websites — Google News sitemaps enable rapid indexation
  • Sites undergoing migration — helps Google discover the new URL structure

For small, well-linked websites (under 100 pages), Google can usually discover all pages through internal links and external backlinks. The sitemap is still a good practice but less critical.

XML Sitemap Best Practices

Include Only Indexable Pages

Your sitemap should only contain pages you want indexed:

Include:

  • All public, indexable pages
  • Pages you want Google to crawl regularly

Exclude:

  • Pages with noindex tags
  • Paginated pages (unless they are truly unique content)
  • Login/admin pages
  • Thank you / confirmation pages
  • Parameter-based duplicate URLs
  • Pages returning 4xx or 5xx errors

Keep the Sitemap Updated

An outdated sitemap with URLs that return 404 errors or redirect wastes Google's time and sends negative signals:

  • Automatically regenerate sitemaps when content changes
  • Remove deleted pages and redirected URLs
  • Update <lastmod> dates when content is genuinely modified (not artificially)

Use Sitemap Index Files for Large Sites

Sites with more than 50,000 URLs (the maximum per sitemap) need a sitemap index:

<?xml version="1.0" encoding="UTF-8"?>
<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <sitemap>
    <loc>https://symaxx.co.za/sitemap-pages.xml</loc>
    <lastmod>2026-03-01</lastmod>
  </sitemap>
  <sitemap>
    <loc>https://symaxx.co.za/sitemap-blog.xml</loc>
    <lastmod>2026-03-04</lastmod>
  </sitemap>
</sitemapindex>

Even for smaller sites, splitting sitemaps by content type (pages, blog posts, documentation) makes monitoring easier.

Reference the Sitemap in Robots.txt

Add a sitemap directive to your robots.txt file:

Sitemap: https://symaxx.co.za/sitemap.xml

This ensures all search engines can find your sitemap, even before you manually submit it.

Submitting Your Sitemap

Google Search Console

  1. Navigate to Google Search Console → Sitemaps
  2. Enter your sitemap URL
  3. Click Submit
  4. Monitor the status — Google will report any errors

Bing Webmaster Tools

  1. Navigate to Bing Webmaster Tools → Sitemaps
  2. Submit your sitemap URL

Automatic Discovery

Search engines can also discover sitemaps through:

  • robots.txt reference (as shown above)
  • Ping endpoints (automatic submission via URL)

Common Sitemap Mistakes

Including noindex pages. Sending Google to pages you have told it not to index sends mixed signals.

Stale sitemaps. Sitemaps with old URLs, 404s, or redirects waste crawl budget.

Fake lastmod dates. Setting today's date on all URLs when content has not changed. Google may stop trusting your lastmod data entirely.

Missing sitemap entirely. While not strictly required, there is no reason not to have one.

Non-canonical URLs. Only include the canonical version of each page. Do not include parameter variants or non-www/www duplicates.

Key Takeaways

  • XML sitemaps help Google discover, crawl, and index your pages efficiently.
  • They are not a ranking factor but prevent indexation issues.
  • Include only indexable, canonical URLs — exclude noindex pages, error pages, and redirects.
  • Submit through Google Search Console for monitoring and error detection.
  • Keep sitemaps updated automatically as content changes.

Quick Sitemap Checklist

  • Sitemap exists at /sitemap.xml
  • Referenced in robots.txt
  • Submitted to Google Search Console
  • Contains only indexable, canonical URLs
  • No 404 or redirect URLs included
  • lastmod dates reflect genuine content changes
  • Sitemap automatically regenerated when content changes
  • Sitemap index used if over 50,000 URLs
  • Monitored quarterly for errors in Search Console

Tools & Resources (Coming Soon)

  • Sitemap Validator (Coming soon)
  • SEO Audit Tool (Coming soon)
  • Crawlability Checker (Coming soon)

Related SEO Documentation

Was this helpful?