naszademokracja.pl
robots.txt

Robots Exclusion Standard data for naszademokracja.pl

Resource Scan

Scan Details

Site Domain naszademokracja.pl
Base Domain naszademokracja.pl
Scan Status Ok
Last Scan 2025-05-20T01:39:18+00:00
Next Scan 2025-06-19T01:39:18+00:00

Last Scan

Scanned 2025-05-20T01:39:18+00:00
URL https://naszademokracja.pl/robots.txt
Redirect https://www.naszademokracja.pl/robots.txt
Redirect Domain www.naszademokracja.pl
Redirect Base naszademokracja.pl
Domain IPs 104.21.85.3, 172.67.200.106, 2606:4700:3031::ac43:c86a, 2606:4700:3037::6815:5503
Redirect IPs 104.22.38.97, 104.22.39.97, 172.67.29.53, 2606:4700:10::6816:2661, 2606:4700:10::6816:2761, 2606:4700:10::ac43:1d35
Response IP 172.67.29.53
Found Yes
Hash 1d1b1b1b7fd9d4156723a618b38c512f42a6c0ddbacd3fe74e4ace769f2eb9d2
SimHash 26491d09d566

Groups

yahoo! slurp

Rule Path
Disallow /petitions/*/comments

Comments

  • See https://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • Tell Yahoo! Slurp to stop trying to call the AJAX endpoint for the next page of comments
  • Other crawlers seem to be smart enough to not need this.
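
As a rough illustration of how the wildcard in the Disallow rule above could be evaluated, here is a minimal Python sketch. It assumes the common convention that `*` matches any run of characters and that a rule matches as a prefix of the URL path. The helper names (`rule_to_regex`, `is_disallowed`) and the sample paths are hypothetical and not taken from the scan. It ignores Allow rules and rule precedence, so it is not a full robots.txt parser.

```python
import re


def rule_to_regex(path_pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the end of the path; everything else is literal.
    """
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in path_pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(url_path: str, disallow_patterns: list[str]) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(rule_to_regex(p).match(url_path) for p in disallow_patterns)


# The single rule listed for the "yahoo! slurp" group.
SLURP_DISALLOW = ["/petitions/*/comments"]

# Hypothetical paths, chosen only to exercise the rule.
for path in (
    "/petitions/example-petition/comments",
    "/petitions/example-petition/comments?page=2",
    "/petitions/example-petition",
):
    print(path, is_disallowed(path, SLURP_DISALLOW))
```

Under these assumptions the two comment paths are blocked for this user agent, while the petition page itself stays crawlable, which is consistent with the stated intent of keeping the crawler away from the comments AJAX endpoint only.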