Technical SEO
How to fix canonical and sitemap host mismatch
Host mismatch happens when your sitemap, canonical tags, and live URLs disagree about the preferred domain.
Problem
Search engines may crawl duplicates, distrust sitemap URLs, or split signals across hosts.
Symptoms
- Sitemap URLs use www while canonical URLs do not.
- HTTP and HTTPS variants appear together.
- Pages are crawled but canonicalized elsewhere.
How to diagnose
- Compare homepage final URL, sitemap host, and canonical host.
- Sample sitemap URLs and inspect their canonical tags.
- Check redirects from alternate host variants.
How to fix
- Pick one canonical host.
- Update sitemap generation to use that host.
- Redirect all alternate host variants to the canonical URL.
How Search Lighthouse helps
Search Lighthouse checks homepage redirects, sitemap discovery, canonical host consistency, and sampled sitemap page metadata.
Related guides
Why Google crawled but did not index your pages
Crawled but not indexed usually means discovery worked, but page quality, duplication, or signals did not justify indexing.
How to check robots.txt, sitemap, and canonical tags
Robots, sitemap, and canonical tags tell search engines what they can crawl and which URLs matter.
How to improve indexability for AI-built websites
AI-built websites often ship fast, but search engines still need stable templates, links, and unique page value.