webzone.it
robots.txt

Robots Exclusion Standard data for webzone.it

Resource Scan

Scan Details

Site Domain webzone.it
Base Domain webzone.it
Scan Status Ok
Last Scan 2024-10-30T06:13:35+00:00
Next Scan 2024-11-29T06:13:35+00:00

Last Scan

Scanned 2024-10-30T06:13:35+00:00
URL https://webzone.it/robots.txt
Domain IPs 75.119.146.111
Response IP 75.119.146.111
Found Yes
Hash 5265b9481d521d4e9bb3d4173e35cfed19df4ea00deec43ea57b7198444ed065
SimHash 19504d306b81

Groups

*

Rule Path
Allow /wp-content/uploads/
Allow /wp-admin/admin-ajax.php
Disallow /wp-admin/
Disallow /readme.html
Disallow /refer/
Disallow /*.pdf$
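
The way these rules interact can be illustrated with a minimal sketch. It assumes the longest-match semantics described in RFC 9309 (the most specific matching pattern wins, Allow wins a tie, `*` matches any run of characters, and a trailing `$` anchors the end of the path). The function names `pattern_to_regex` and `is_allowed` are hypothetical helpers for illustration, not part of any scanner or crawler.

```python
import re

# Rules copied from the "*" group listed above.
RULES = [
    ("allow", "/wp-content/uploads/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/readme.html"),
    ("disallow", "/refer/"),
    ("disallow", "/*.pdf$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex (hypothetical helper)."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" matches any sequence of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be fetched, assuming longest-match precedence."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longer pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


print(is_allowed("/wp-admin/admin-ajax.php"))      # Allow is longer than Disallow /wp-admin/
print(is_allowed("/wp-admin/options.php"))         # only Disallow /wp-admin/ matches
print(is_allowed("/docs/file.pdf"))                # matches Disallow /*.pdf$
print(is_allowed("/wp-content/uploads/file.pdf"))  # longer Allow outranks /*.pdf$
```

Under these assumptions, a PDF stored under /wp-content/uploads/ stays crawlable, because the Allow pattern is longer than `/*.pdf$`. Crawlers that do not follow longest-match precedence may behave differently.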

ioncrawl
ccbot
semrushbot
semrushbot-sa
semrushbot-ba
semrushbot-si
semrushbot-swa
semrushbot-ct
semrushbot-bm
semrushbot-seoab
semrushbot/7~bl
siteauditbot
splitsignalbot
semrushbot-coub
mj12bot
seznambot
blexbot
zoominfobot
aranhabot
yandex
yandexbot
yandexbot/3.0
baidu
baiduspider
baiduspider/2.0
baiduspider+
baiduspider-video
baiduspider-image
twengabot
twengabot-2.0
dotbot
mauibot (crawler.feedback+dc@gmail.com)
ahrefsbot
serpstatbot
zoombot
coccocbot-web
dataforseobot
seokicks
barkrowler
adsbot
mail.ru_bot
ltx71 - (http://ltx71.com/)
barkrowler
dnbcrawler-analytics
velenpublicwebcrawler
orbbot

Rule Path
Disallow /
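
As a rough sketch of how a crawler might decide which of the two groups applies to it, the snippet below matches a product token case-insensitively against a subset of the user agents listed above and falls back to the `*` group otherwise. The set `BLOCKED_AGENTS` is deliberately only a sample of the list, and `select_group` and `may_crawl_site` are hypothetical names. Real parsers may use prefix or substring matching instead of the exact comparison assumed here.

```python
# A sample of the user agents from the second group above (not the full list).
BLOCKED_AGENTS = {
    "ccbot",
    "semrushbot",
    "mj12bot",
    "ahrefsbot",
    "dotbot",
    "yandexbot",
    "baiduspider",
}


def select_group(product_token):
    """Return "blocked" for a listed agent, otherwise the catch-all "*" group."""
    token = product_token.strip().lower()
    return "blocked" if token in BLOCKED_AGENTS else "*"


def may_crawl_site(product_token):
    """The blocked group carries a single rule, Disallow /, so nothing is crawlable."""
    return select_group(product_token) != "blocked"


print(select_group("AhrefsBot"))    # listed agent, matched case-insensitively
print(select_group("Googlebot"))    # not listed, falls back to "*"
print(may_crawl_site("MJ12bot"))
```

A crawler that falls back to the `*` group is still subject to that group's Allow and Disallow rules.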

Other Records

Field Value
sitemap https://www.webzone.it/sitemap_index.xml
sitemap https://www.webzone.it/post-sitemap.xml
sitemap https://www.webzone.it/page-sitemap.xml
sitemap https://www.webzone.it/category-sitemap.xml
sitemap https://www.webzone.it/post_tag-sitemap.xml