Robots Exclusion Standard data for pca.st

Resource Scan

Scan Details

Site Domain pca.st
Base Domain pca.st
Scan Status Ok
Last Scan 2024-05-08T16:46:47+00:00
Next Scan 2024-05-22T16:46:47+00:00

Last Scan

Scanned 2024-05-08T16:46:47+00:00
URL https://pca.st/robots.txt
Domain IPs 13.35.18.33, 13.35.18.51, 13.35.18.82, 13.35.18.91, 2600:9000:20c7:1000:2:7f27:3800:93a1, 2600:9000:20c7:2800:2:7f27:3800:93a1, 2600:9000:20c7:4c00:2:7f27:3800:93a1, 2600:9000:20c7:6a00:2:7f27:3800:93a1, 2600:9000:20c7:ac00:2:7f27:3800:93a1, 2600:9000:20c7:b200:2:7f27:3800:93a1, 2600:9000:20c7:b800:2:7f27:3800:93a1, 2600:9000:20c7:dc00:2:7f27:3800:93a1
Response IP 13.35.18.33
Found Yes
Hash 8fde77b163f392adc8b596c7499e5354ecf6292051b6e656f378b43db90652c1
SimHash b28d6d8d65f0

Groups

*

Rule Path
Disallow /itunes/
Disallow /private/
Disallow /polling/
Disallow /feed/
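
To illustrate how a crawler might interpret the group above, here is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is reconstructed from the listed rules and is an assumption, not the verbatim file served at https://pca.st/robots.txt; the helper `is_allowed` and the agent name `ExampleBot` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: robots.txt text rebuilt from the "*" group in this report,
# not copied from the live file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /itunes/
Disallow: /private/
Disallow: /polling/
Disallow: /feed/
"""

# Parse the rules from memory instead of fetching them over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the hypothetical agent may fetch the given path."""
    return parser.can_fetch(agent, "https://pca.st" + path)


if __name__ == "__main__":
    for path in ("/", "/feed/rss", "/private/x", "/feedback"):
        print(path, is_allowed(path))
```

Disallow rules are prefix matches, so under this sketch `/feed/rss` is blocked while `/feedback` is not, because it does not start with `/feed/`.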

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines: