action.womensmarch.com
robots.txt

Robots Exclusion Standard data for action.womensmarch.com

Resource Scan

Scan Details

Site Domain action.womensmarch.com
Base Domain womensmarch.com
Scan Status Ok
Last Scan2026-01-13T16:39:49+00:00
Next Scan 2026-02-12T16:39:49+00:00

Last Scan

Scanned2026-01-13T16:39:49+00:00
URL https://action.womensmarch.com/robots.txt
Domain IPs 104.20.33.57, 172.66.162.202, 2606:4700:10::6814:2139, 2606:4700:10::ac42:a2ca
Response IP 104.20.33.57
Found Yes
Hash 1d1b1b1b7fd9d4156723a618b38c512f42a6c0ddbacd3fe74e4ace769f2eb9d2
SimHash 26491d09d566

Groups

yahoo! slurp

Rule Path
Disallow /petitions/*/comments

Comments

  • See https://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • Tell Yahoo! Slurp to stop trying to call the AJAX endpoint for the next page of comments
  • Other crawlers seem to be smart enough to not need this.