What it does
This algorithm checks URLs in a sitemap XML or a sitemap directory that you submit to it. You can check for up to 100 URLs for free. A sitemap can normally be found at
http://your.site/sitemap.xml path, but they can also be stored under custom locations and have custom extensions.
How it works
Insert your sitemap's url and you'll be on your way. After process completes you will be able to view the data or download it.
Best practice is to have your sitemap URLs respond with 200 status code, avoiding redirects or broken links and duplication. This scanner works with most sitemaps, including:
- Sitemap directories
However it does not work with
- Video sitemaps
- Image sitemaps
These kinds of sitemaps belong to a different standard and require a different approach. Leave a comment if you are looking for such an algorithm.
Why use it?
Sitemaps are a great tool when it comes to helping with speeding up your site's indexation in different search engines. However, webmasters often plainly skip the creation of sitemaps, or, what happens even more often, they set automatic scripts to create a sitemap and then rarely check on it. This lack of attention towards sitemaps often results in annoying errors which are happening in the background and prevent the normal indexation processes. Here are some of the problems that you can avoid by checking your sitemaps regularly:
- Unindexed pages
- Dead pages getting indexed
- Sitemap is clogged with irrelevant links
- Duplicate pages getting indexed