How to find URLs blocked by robots.txt
We want the code on our website to be clean and accessible. We also want search engines to crawl only the pages that belong in search results, because crawl budget is limited. Sometimes, however, URLs in the code are blocked by the robots.txt file. This means that search engines cannot access the […]
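As a rough illustration of what "blocked by robots.txt" means in practice, the sketch below checks a list of URLs against a set of robots.txt rules using Python's standard `urllib.robotparser`. The rules, the URLs, and the helper name `find_blocked_urls` are hypothetical examples, not JetOctopus functionality, and the standard-library parser may not interpret every directive exactly as a given search engine does.

```python
from urllib.robotparser import RobotFileParser


def find_blocked_urls(robots_txt, urls, user_agent="*"):
    """Return the URLs that the given robots.txt text disallows for user_agent."""
    parser = RobotFileParser()
    # parse() accepts the robots.txt content as an iterable of lines,
    # so no network request is needed for this check.
    parser.parse(robots_txt.splitlines())
    return [url for url in urls if not parser.can_fetch(user_agent, url)]


# Hypothetical robots.txt rules and URLs, for illustration only.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
"""

SAMPLE_URLS = [
    "https://example.com/",
    "https://example.com/private/report.html",
    "https://example.com/blog/post",
]

blocked = find_blocked_urls(SAMPLE_ROBOTS, SAMPLE_URLS)
print(blocked)
```

In this sample only the URL under `/private/` is reported, because it is the only one that matches a `Disallow` rule.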
How to find and export all images without alt attributes
The alt attribute (or alternative text) contains a description of the image. The alt text is displayed in place of the image if the user has a slow internet connection. Search engines also “read” alt attributes. Thanks to this HTML attribute, search engines understand what is depicted and show images in search […]
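To make the idea concrete, here is a minimal sketch that scans a piece of HTML for `<img>` tags with no alt attribute, using Python's standard `html.parser`. The class name `MissingAltFinder` and the sample markup are hypothetical, and the sketch treats an empty `alt=""` as present (it is commonly used for decorative images); a real audit might want to report those separately.

```python
from html.parser import HTMLParser


class MissingAltFinder(HTMLParser):
    """Collect the src of every <img> tag that has no alt attribute."""

    def __init__(self):
        super().__init__()
        self.missing = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        attr_map = dict(attrs)
        # Only a completely absent alt attribute is flagged;
        # alt="" counts as present in this sketch.
        if "alt" not in attr_map:
            self.missing.append(attr_map.get("src") or "")


# Hypothetical markup, for illustration only.
SAMPLE_HTML = """
<img src="/a.png" alt="Company logo">
<img src="/b.png">
<img src="/c.png" alt="">
"""

finder = MissingAltFinder()
finder.feed(SAMPLE_HTML)
print(finder.missing)
```

Here only `/b.png` is reported, since it is the one image tag without any alt attribute.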
How to bulk export all image alt attributes
The alt attribute (alternative text) describes the appearance, function, and/or content of an image on a web page. Alt attributes are part of the HTML code. Users usually don’t see alt text if they have a normal internet connection. But if the connection is slow or images are blocked by the browser, then users will […]
How to set up image crawling
If you need to crawl all the images on your website, you can use JetOctopus. By default, we do not crawl images, so you need to configure the crawl specifically for image analysis. Image crawl settings: go to the desired project and click the “New crawl” button. In the basic settings, enter the domain […]
How to export all inlinks to 5xx, 4xx and 3xx
Using JetOctopus, you can bulk export all inlinks to 5xx, 4xx, and 3xx pages. Why should you pay special attention to these response codes? Relevant links with 200 response codes are important for your customers. Pages that return 404, 502, 503, and similar codes are not very appealing to users. They just want to get the information they […]
How to calculate logs volume?
JetOctopus has switched the logs quota calculation from pages to regular log lines. What does this mean? We now calculate user quota limits by the number of requests made by verified Googlebot, Bing, and Baidu bots to regular HTML pages only. Bot requests to images, CSS/JS objects, and other media files are not counted as usage of […]
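The counting rule described above can be approximated with a short sketch. Everything here is an assumption made for illustration: the user-agent tokens, the list of static file extensions, and the function name `count_quota_lines` are hypothetical, and matching on the user-agent string is only a stand-in for real bot verification. The actual rules JetOctopus applies may differ.

```python
from urllib.parse import urlsplit

# Assumed user-agent tokens for the three bots named in the text.
BOT_TOKENS = ("googlebot", "bingbot", "baiduspider")

# Assumed set of non-HTML resources that do not count toward the quota.
STATIC_EXTENSIONS = (
    ".png", ".jpg", ".jpeg", ".gif", ".svg", ".webp", ".ico",
    ".css", ".js", ".mp4", ".woff", ".woff2",
)


def count_quota_lines(requests):
    """Count (user_agent, url) pairs that are bot requests to HTML-like pages."""
    counted = 0
    for user_agent, url in requests:
        ua = user_agent.lower()
        if not any(token in ua for token in BOT_TOKENS):
            continue  # not one of the counted bots
        path = urlsplit(url).path.lower()  # drops any query string
        if path.endswith(STATIC_EXTENSIONS):
            continue  # image, CSS/JS, or other media file
        counted += 1
    return counted


# Hypothetical log entries, for illustration only.
SAMPLE_REQUESTS = [
    ("Mozilla/5.0 (compatible; Googlebot/2.1)", "/catalog/shoes"),
    ("Mozilla/5.0 (compatible; Googlebot/2.1)", "/static/app.js?v=3"),
    ("Mozilla/5.0 (compatible; bingbot/2.0)", "/img/logo.png"),
    ("Mozilla/5.0 (compatible; Baiduspider/2.0)", "/"),
    ("Mozilla/5.0 (Windows NT 10.0) Chrome/120.0", "/catalog/shoes"),
]

quota_lines = count_quota_lines(SAMPLE_REQUESTS)
print(quota_lines)
```

With these sample entries, two lines count toward the quota: the Googlebot request to `/catalog/shoes` and the Baiduspider request to `/`.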
How to check pages with low content words
Search engines state that content is one of the most important ranking factors. Pages with high-quality, complete content have a better chance of reaching the top positions in the SERP. Using JetOctopus, you can easily find pages that don’t have a lot of content. Below you will find a step-by-step guide to checking pages […]
How to find and check 404 Errors?
In a JetOctopus crawl, you can find the pages that return a 404 response code. This response code means that the page was not found and the content is not available to your users. To find all 404 pages on your website, use the JetOctopus crawler. Among other things, you can use the free 7-day trial after demo […]
How to analyze external domains in your website code
External links can be both a plus and a minus for optimizing your website. If you link to other domains on your website, this signals to search engines that you trust these external resources. Therefore, place only up-to-date, trusted links to high-authority sites on your website. Analyzing external domains en masse […]