icei.it
robots.txt

Robots Exclusion Standard data for icei.it

Resource Scan

Scan Details

Site Domain icei.it
Base Domain icei.it
Scan Status Ok
Last Scan2025-11-22T16:56:36+00:00
Next Scan 2025-12-22T16:56:36+00:00

Last Scan

Scanned2025-11-22T16:56:36+00:00
URL https://icei.it/robots.txt
Domain IPs 35.214.208.13
Response IP 35.214.208.13
Found Yes
Hash a5cf5fef69c96b1c94b4c5fc1cc556103e546a38541b839488b9dab17ddf3de4
SimHash 05711a1346d1

Groups

*

Rule Path
Allow /
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content
Disallow /e/
Disallow /show-error-*
Disallow /xmlrpc.php
Disallow /trackback/
Disallow /comment-page-
Allow /wp-content/uploads/

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

googlebot

Rule Path
Allow /

Other Records

Field Value
sitemap https://icei.it/sitemap_index.xml