How to find and fix problems in the website's structure and optimize a sitemap both for Googlebot and for users
About the e-commerce website:
- E-commerce website with 22+ years of experience
- 1 mln pages
- 1.3 mln monthly visits
The main challenge:
To find and fix problems in the website's structure and optimize a sitemap both for Googlebot and for users.
What was done?
- The website was crawled with JetOctopus to find all technical bugs in the website’s structure.
- A log analyzer was used to understand how Googlebot crawls the URLs listed in the sitemap.
What problems were detected?
- 1 mln pages that aren't in the website's structure but are regularly visited by Googlebot. Crawl budget is wasted on this unknown content.
- 320K pages that are valuable for the website aren't indexed by Google. Only 180K pages are effectively crawled by Googlebot.
- There are around 320K links to out-of-stock products in the sitemap.
- Incorrect logic for adding new products to the sitemap: new products used to take up to 2 weeks to be indexed.
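The first two findings come down to comparing two sets of URLs: those found by crawling the site and those Googlebot requests according to the server logs. Below is a minimal sketch of that comparison, not the JetOctopus implementation. It assumes access logs in the Combined Log Format; the function names and sample data are hypothetical.

```python
import re

# Assumed input: Combined Log Format lines, e.g.
# ip - - [time] "GET /path HTTP/1.1" 200 512 "referer" "user-agent"
LOG_LINE = re.compile(
    r'"(?:GET|HEAD) (?P<path>\S+) HTTP/[^"]*" \d{3} \S+ "[^"]*" "(?P<agent>[^"]*)"'
)

def googlebot_paths(log_lines):
    """Return the set of paths requested by a user agent that claims to be Googlebot."""
    paths = set()
    for line in log_lines:
        match = LOG_LINE.search(line)
        # The user agent can be spoofed; a real check would also verify the IP.
        if match and "Googlebot" in match.group("agent"):
            paths.add(match.group("path"))
    return paths

def compare_crawl_with_logs(structure_paths, log_lines):
    """Compare pages found by a site crawl with pages Googlebot actually requested."""
    visited = googlebot_paths(log_lines)
    structure = set(structure_paths)
    return {
        # Visited by Googlebot but not reachable in the site structure.
        "visited_outside_structure": visited - structure,
        # Reachable in the site structure but never requested by Googlebot.
        "in_structure_not_visited": structure - visited,
    }

# Hypothetical sample data for illustration only.
sample_logs = [
    '66.249.66.1 - - [10/Oct/2019:13:55:36 +0000] "GET /old-promo HTTP/1.1" 200 512 "-" '
    '"Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
    '66.249.66.1 - - [10/Oct/2019:13:56:01 +0000] "GET /category/shoes HTTP/1.1" 200 2048 "-" '
    '"Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
    '203.0.113.7 - - [10/Oct/2019:13:57:12 +0000] "GET /product/42 HTTP/1.1" 200 1024 "-" '
    '"Mozilla/5.0 (Windows NT 10.0)"',
]
crawled_structure = ["/category/shoes", "/product/42"]
report = compare_crawl_with_logs(crawled_structure, sample_logs)
```

In this sample, `/old-promo` ends up in `visited_outside_structure` and `/product/42` in `in_structure_not_visited`. On a site with 1 mln pages the same set arithmetic applies, though the inputs would more likely be streamed from files than held in lists.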
What recommendations did we give?
- Change the logic of sitemap generation. For e-commerce websites it’s crucial to show new products in search results as soon as they are added to the website. Googlebot regularly checks the website’s sitemap to find fresh content to index. That’s why the sitemap has to be regenerated regularly so that it always reflects the current state of the website (we recommend doing it every week).
- Resubmit new sitemaps through Google Search Console. Decide which pages on the website should be crawled by Google, and determine the canonical version of each page. You can create your sitemap manually or choose from a number of third-party tools to generate your sitemap for you.
- Remove out-of-stock products from the sitemap.
Software engineer Matt Cutts said that e-commerce sites with hundreds of thousands of pages should set the date on which each page expires using the unavailable_after meta tag. This way, when a product is added, you can immediately set when that product page will expire, based on an auction date or a go-stale date.
- Add a new block with the most popular and highest-priority products in each product category. Proper internal linking makes it easy for Googlebot to crawl your webpages.
The JetOctopus team recommends reviewing the Google Search Console guide that explains how to submit a new sitemap for crawling. It also contains useful information on how to solve common problems with sitemaps.
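The sitemap recommendations above can be sketched in a few lines. This is an illustration under assumptions, not the logic used on the client's website: the product records (`url`, `in_stock`, `updated`) and both function names are hypothetical, and the `unavailable_after` value is written as an ISO 8601 date.

```python
import xml.etree.ElementTree as ET
from datetime import date

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"

def build_sitemap(products):
    """Build sitemap XML that lists only in-stock products, with their last update date."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for product in products:
        if not product["in_stock"]:
            continue  # out-of-stock products are left out of the sitemap
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = product["url"]
        ET.SubElement(url, "lastmod").text = product["updated"].isoformat()
    return ET.tostring(urlset, encoding="unicode")

def unavailable_after_meta(expires):
    """Return a robots meta tag that tells Google when the page stops being available."""
    return f'<meta name="robots" content="unavailable_after: {expires.isoformat()}">'

# Hypothetical product records for illustration only.
sample_products = [
    {"url": "https://example.com/product/1", "in_stock": True, "updated": date(2019, 10, 10)},
    {"url": "https://example.com/product/2", "in_stock": False, "updated": date(2019, 9, 1)},
]
sitemap_xml = build_sitemap(sample_products)
expiry_tag = unavailable_after_meta(date(2019, 12, 31))
```

A job like this would run on the chosen schedule (weekly, per the recommendation above) and the result would then be resubmitted. The sitemap protocol limits a single file to 50,000 URLs, so a website of this size would need several sitemap files plus a sitemap index.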
- For e-commerce sites, correct sitemap generation is one of the most crucial things: new products should be added immediately, and out-of-stock products should be removed from the current sitemap. It is a real opportunity to increase sales.
- Accurate handling of canonical and non-canonical pages is your silver bullet.
- Internal linking structure is your rocket fuel. Don’t underestimate it.
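To make the interlinking point concrete, here is a minimal sketch that counts inbound internal links per page and lists the pages with too few of them. It assumes the crawl results are available as (source, target) link pairs; the function name, the threshold, and the sample data are hypothetical.

```python
from collections import Counter

def weakly_linked_pages(pages, links, min_inlinks=2):
    """Return pages that receive fewer than min_inlinks distinct internal links."""
    # Count each (source, target) pair once and ignore links from a page to itself.
    inlinks = Counter(target for source, target in set(links) if source != target)
    return sorted(page for page in pages if inlinks[page] < min_inlinks)

# Hypothetical crawl data for illustration only.
sample_pages = ["/", "/category/shoes", "/product/1", "/product/2"]
sample_links = [
    ("/", "/category/shoes"),
    ("/", "/product/1"),
    ("/category/shoes", "/"),
    ("/category/shoes", "/product/1"),
    ("/category/shoes", "/product/2"),
    ("/product/1", "/"),
]
weak_pages = weakly_linked_pages(sample_pages, sample_links)
```

Pages returned by such a check are candidates for a block of popular or high-priority products in the relevant category, as recommended above.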
Get more useful info: 2 Different Realities: Your Site Structure & How Google Perceives It
About the author
Serge Bezborodov is the CTO of JetOctopus. He is a professional programmer with 9+ years of experience. He has worked with aggregators (job vacancies, real estate, cars) for 5 years. He also has experience in database architecture design and query optimization. With JetOctopus, Serge has crawled more than 160 mln pages (40 TB of data).