cartellimax.it
robots.txt

Robots Exclusion Standard data for cartellimax.it

Resource Scan

Scan Details

Site Domain cartellimax.it
Base Domain cartellimax.it
Scan Status Ok
Last Scan2024-06-12T10:01:32+00:00
Next Scan 2024-07-12T10:01:32+00:00

Last Scan

Scanned2024-06-12T10:01:32+00:00
URL https://cartellimax.it/robots.txt
Redirect https://www.cartellimax.it:443/robots.txt
Redirect Domain www.cartellimax.it
Redirect Base cartellimax.it
Domain IPs 3.122.154.234, 3.76.50.210, 52.29.96.232
Redirect IPs 13.33.30.122, 13.33.30.53, 13.33.30.83, 13.33.30.98, 2600:9000:229f:200:d:20c6:d000:93a1, 2600:9000:229f:5a00:d:20c6:d000:93a1, 2600:9000:229f:6000:d:20c6:d000:93a1, 2600:9000:229f:8e00:d:20c6:d000:93a1, 2600:9000:229f:9a00:d:20c6:d000:93a1, 2600:9000:229f:b400:d:20c6:d000:93a1, 2600:9000:229f:de00:d:20c6:d000:93a1, 2600:9000:229f:e800:d:20c6:d000:93a1
Response IP 13.33.30.122
Found Yes
Hash 9f7af8cb757c16319b49b40e9d205480e2f819fb9ff3c20957e1302a2f64941f
SimHash 254d9b4975d0

Groups

adsbot-google

Rule Path
Disallow /image
Disallow /shared_signs
Disallow /s3

*

Rule Path
Disallow /image
Disallow /shared_signs
Disallow /s3

addsearchbot

Rule Path
Disallow /generatore-di-targhette

Other Records

Field Value
sitemap https://www.cartellimax.it/sitemap.xml

Comments

  • Google adsbot ignores robots.txt unless specifically named!