protectedplanet.net
robots.txt

Robots Exclusion Standard data for protectedplanet.net

Resource Scan

Scan Details

Site Domain protectedplanet.net
Base Domain protectedplanet.net
Scan Status Ok
Last Scan 2025-09-03T16:27:04+00:00
Next Scan 2025-10-03T16:27:04+00:00

Last Scan

Scanned 2025-09-03T16:27:04+00:00
URL https://protectedplanet.net/robots.txt
Redirect https://www.protectedplanet.net/robots.txt
Redirect Domain www.protectedplanet.net
Redirect Base protectedplanet.net
Domain IPs 178.79.132.24
Redirect IPs 104.26.10.60, 104.26.11.60, 172.67.74.96, 2606:4700:20::681a:a3c, 2606:4700:20::681a:b3c, 2606:4700:20::ac43:4a60
Response IP 104.26.10.60
Found Yes
Hash 4a99db68b6a376f84bcce47dec2493b5277f80a0520628b4236c52c22658114f
SimHash 6ac5cd857150
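
The Hash value above is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The report does not name the algorithm, so SHA-256 is an assumption here, and the sample body below is hypothetical rather than the real file. A minimal sketch of fingerprinting a robots.txt body so that changes between scans can be detected:

```python
import hashlib


def robots_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The report itself does not state which algorithm it uses.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the actual protectedplanet.net file.
sample = b"User-agent: *\nDisallow: /search\n"
digest = robots_fingerprint(sample)

# A SHA-256 hex digest always has 64 characters, like the Hash field.
print(len(digest), digest)
```

Comparing the digest of a fresh download with the stored value is enough to tell whether the file changed at all. It says nothing about how much it changed, which is presumably what the separate SimHash field is for.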

Groups

*

Rule Path
Disallow /search

ahrefsbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

googlebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

*

Rule Path
Disallow /*.pdf$
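
The rule paths in the groups above mix plain prefixes (/search, /) with a wildcard pattern (/*.pdf$). The sketch below shows one way such paths could be matched, assuming the wildcard semantics described in RFC 9309: "*" matches any run of characters and a trailing "$" anchors the end of the URL path. The function name rule_matches and the sample paths are hypothetical, and not every crawler is guaranteed to interpret wildcards this way.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Check a robots.txt rule path against a URL path (plus query).

    '*' matches any run of characters. A trailing '$' requires the
    match to reach the end of the path. Without '$' the pattern only
    has to match a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    matcher = re.fullmatch if anchored else re.match
    return matcher(regex, path) is not None


# Hypothetical paths, used only to illustrate the two rule styles.
for path in ("/docs/report.pdf", "/docs/report.pdf?page=2", "/search/sites"):
    print(path, rule_matches("/*.pdf$", path), rule_matches("/search", path))
```

Under these assumptions, /*.pdf$ blocks only URLs that end in .pdf, so a PDF URL carrying a query string would not be covered, while /search blocks every path that begins with /search.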

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-agent: *
  • Disallow: /
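
The commented-out lines quoted above would, if enabled, form a catch-all group that blocks every crawler. A minimal sketch using Python's standard urllib.robotparser compares that case with the per-agent groups listed in this report. The robots.txt text is reconstructed from the report, so its exact wording and group order are assumptions, and the page path /example-page is hypothetical. The /*.pdf$ group is left out because urllib.robotparser does plain prefix matching and does not interpret wildcards.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups in this report. The real file's exact
# text and ordering are assumptions.
REPORTED = """\
User-agent: *
Disallow: /search

User-agent: ahrefsbot
Disallow: /

User-agent: yandex
Disallow: /

User-agent: googlebot
Crawl-delay: 10

User-agent: slurp
Crawl-delay: 30
"""

# What the file would contain with the two commented lines enabled.
BAN_ALL = """\
User-agent: *
Disallow: /
"""


def load(text: str) -> RobotFileParser:
    """Parse robots.txt text without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


current = load(REPORTED)
banned = load(BAN_ALL)
base = "https://www.protectedplanet.net"

# "ExampleBot" is a hypothetical agent that falls under the "*" group.
for agent in ("ExampleBot", "AhrefsBot", "Googlebot"):
    print(agent,
          current.can_fetch(agent, base + "/example-page"),
          current.can_fetch(agent, base + "/search"),
          banned.can_fetch(agent, base + "/example-page"))

# Crawl-delay values come from the agent's own group.
print(current.crawl_delay("Googlebot"), current.crawl_delay("Slurp"))
```

In this sketch a crawler that has its own group, such as googlebot, is governed by that group alone, so the /search restriction from the "*" group does not apply to it.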