The frequency of crawling of different subdomains by search engines may differ. This may depend on the indexing settings, the robots.txt file (for each subdomain there must be a...
Ajax URLs may not be displayed correctly in search results because they are an asynchronously processed part of the page that depends on user action. Although the Ajax URLs have...
5xx response codes indicate a broken web server. These status codes are usually temporary. However, a large number of 5xx and regular repeats can affect the crawling of your...
You may have noticed that the number of crawled URLs in the Google Search Console “Crawl stats” report may differ from the number of URLs visited by search engines that you see...
Using JetOctopus, you can find pages containing mixed types of URLs – with HTTP and HTTPS protocol. Google and other search engines recommend using the HTTPS protocol because it...
Custom extraction is needed to find certain elements in the HTML code of the page. Our web scraper will search selected elements while crawling your website. With custom...
Pages with HTML full duplication are URLs that contain identical content, including full duplicated headings, titles, and metadata, HTML-elements. If these pages are open for...
In the work of SEO, it is important to track the pages that just appeared in SERP. These can be both newly created pages and already existing pages that have dropped out of the...
Merging data from different tables is much more convenient if you use the “Join Dataset” tool. You can use this option in all data tables: logs, Google Search Console and crawl....
If your website is perfectly on-page optimized, but the search engines get a lot of non-200 response codes, this can decrease the crawling budget. Search engines will crawl your...