indian-ocean.com
robots.txt

Robots Exclusion Standard data for indian-ocean.com

Resource Scan

Scan Details

Site Domain indian-ocean.com
Base Domain indian-ocean.com
Scan Status Ok
Last Scan2026-01-04T22:13:45+00:00
Next Scan 2026-02-03T22:13:45+00:00

Last Scan

Scanned2026-01-04T22:13:45+00:00
URL https://indian-ocean.com/robots.txt
Redirect https://www.indian-ocean.com/robots.txt
Redirect Domain www.indian-ocean.com
Redirect Base indian-ocean.com
Domain IPs 104.21.44.15, 172.67.192.206, 2606:4700:3030::ac43:c0ce, 2606:4700:3031::6815:2c0f
Redirect IPs 104.21.44.15, 172.67.192.206, 2606:4700:3030::ac43:c0ce, 2606:4700:3031::6815:2c0f
Response IP 104.21.44.15
Found Yes
Hash af243ef85bf4d3a8d626872f34b7a172c8eec93c5cc2d43496b0f5a87e08d856
SimHash 6849d8d0c311

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /httpdocs/drupal
Disallow /httpdocs/old
Disallow /wp-content/plugins/
Disallow /wp-admin/
Disallow /readme.html
Disallow /refer/
Disallow /

Other Records

Field Value
crawl-delay 86400

googlebot

Rule Path
Allow /

slurp

Rule Path
Allow /