webzone.it
robots.txt
Robots Exclusion Standard data for webzone.it
Resource Scan
Scan Details
Site Domain | webzone.it |
Base Domain | webzone.it |
Scan Status | Ok |
Last Scan | 2024-10-30T06:13:35+00:00 |
Next Scan | 2024-11-29T06:13:35+00:00 |
Last Scan
Scanned | 2024-10-30T06:13:35+00:00 |
URL | https://webzone.it/robots.txt |
Domain IPs | 75.119.146.111 |
Response IP | 75.119.146.111 |
Found | Yes |
Hash | 5265b9481d521d4e9bb3d4173e35cfed19df4ea00deec43ea57b7198444ed065 |
SimHash | 19504d306b81 |
Groups
*
Rule | Path |
---|---|
Allow | /wp-content/uploads/ |
Allow | /wp-admin/admin-ajax.php |
Disallow | /wp-admin/ |
Disallow | /readme.html |
Disallow | /refer/ |
Disallow | /*.pdf$ |
ioncrawl
ccbot
semrushbot
semrushbot-sa
semrushbot-ba
semrushbot-si
semrushbot-swa
semrushbot-ct
semrushbot-bm
semrushbot-seoab
semrushbot/7~bl
siteauditbot
splitsignalbot
semrushbot-coub
mj12bot
seznambot
blexbot
zoominfobot
aranhabot
yandex
yandexbot
yandexbot/3.0
baidu
baiduspider
baiduspider/2.0
baiduspider+
baiduspider-video
baiduspider-image
twengabot
twengabot-2.0
dotbot
mauibot (crawler.feedback+dc@gmail.com)
ahrefsbot
serpstatbot
zoombot
coccocbot-web
dataforseobot
seokicks
barkrowler
adsbot
mail.ru_bot
ltx71 - (http://ltx71.com/)
barkrowler
dnbcrawler-analytics
velenpublicwebcrawler
orbbot
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.webzone.it/sitemap_index.xml |
sitemap | https://www.webzone.it/post-sitemap.xml |
sitemap | https://www.webzone.it/page-sitemap.xml |
sitemap | https://www.webzone.it/category-sitemap.xml |
sitemap | https://www.webzone.it/post_tag-sitemap.xml |