caravansonnet.com
robots.txt

Robots Exclusion Standard data for caravansonnet.com

Resource Scan

Scan Details

Site Domain caravansonnet.com
Base Domain caravansonnet.com
Scan Status Ok
Last Scan2025-10-21T02:17:54+00:00
Next Scan 2025-10-28T02:17:54+00:00

Last Scan

Scanned2025-10-21T02:17:54+00:00
URL http://caravansonnet.com/robots.txt
Redirect http://www.caravansonnet.com/robots.txt
Redirect Domain www.caravansonnet.com
Redirect Base caravansonnet.com
Domain IPs 216.239.32.21, 216.239.34.21, 216.239.36.21, 216.239.38.21
Redirect IPs 2404:6800:4003:c00::79, 74.125.200.121
Response IP 74.125.24.121
Found Yes
Hash cb8cf5ac246f730df46d26bde10d0b6c2314a457f368c702d722c25da1001a08
SimHash 4d4498504f53

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /search
Disallow /share-widget
Allow /

Other Records

Field Value
sitemap http://www.caravansonnet.com/sitemap.xml