You may have noticed that the number of crawled URLs in the Google Search Console “Crawl stats” report may differ from the number of URLs visited by search engines that you see...
Using JetOctopus, you can find pages containing mixed types of URLs – with HTTP and HTTPS protocol. Google and other search engines recommend using the HTTPS protocol because it...
Custom extraction is needed to find certain elements in the HTML code of the page. Our web scraper will search selected elements while crawling your website. With custom...
Pages with HTML full duplication are URLs that contain identical content, including full duplicated headings, titles, and metadata, HTML-elements. If these pages are open for...
In the work of SEO, it is important to track the pages that just appeared in SERP. These can be both newly created pages and already existing pages that have dropped out of the...
Merging data from different tables is much more convenient if you use the “Join Dataset” tool. You can use this option in all data tables: logs, Google Search Console and crawl....
If your website is perfectly on-page optimized, but the search engines get a lot of non-200 response codes, this can decrease the crawling budget. Search engines will crawl your...
We offer several options if you want to share a JetOctopus report with your colleagues. Choose the most convenient way depending on what your needs are.
1. Share the PDF file...
If search engines receive a lot of 404 response codes, they can reduce the scanning frequency of your website. Also, 404 in the logs can indicate the following problems...
You may meet a situation when you check a page that returned a 4xx or 5xx response code to the search engine (you found this data in the logs). And when checking manually, the...