Robots Exclusion Standard data for whatdotheyknow.com

Resource Scan

Scan Details

Site Domain whatdotheyknow.com
Base Domain whatdotheyknow.com
Scan Status Ok
Last Scan 2025-10-22T11:03:38+00:00
Next Scan 2025-11-21T11:03:38+00:00

Last Scan

Scanned 2025-10-22T11:03:38+00:00
URL https://whatdotheyknow.com/robots.txt
Redirect https://www.whatdotheyknow.com/robots.txt
Redirect Domain www.whatdotheyknow.com
Redirect Base whatdotheyknow.com
Domain IPs 104.20.23.150, 172.66.151.93, 2606:4700:10::6814:1796, 2606:4700:10::ac42:975d
Redirect IPs 104.20.23.150, 172.66.151.93, 2606:4700:10::6814:1796, 2606:4700:10::ac42:975d
Response IP 104.20.23.150
Found Yes
Hash b756b9491a95cf63b1a6b0f49b99128c24d612ce86f0d52e33ee21622ce842ea
SimHash 839027ae3f61

Groups

*

Rule Path
Disallow */annotate/*
Disallow */new/*
Disallow */search/*
Disallow */similar/*
Disallow */track/*
Disallow */upload/*
Disallow */user/contact/*
Disallow */feed/*
Disallow */profile/*
Disallow */signin*
Disallow */tor*
Allow */request/*/response/*/attach/*
Disallow */request/*/response/*
Disallow */request/*/download*
Disallow */change_request/*
Disallow */outgoing_messages/*/mail_server_logs*
Disallow */outgoing_messages/*/delivery_status*
Disallow *?*update_status=1*
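
The Allow rule for attachments sits next to a broader Disallow rule for responses, so the outcome for a given path depends on how a crawler resolves overlapping matches. The sketch below is one way to reason about that, not the scanner's or any crawler's actual implementation. It assumes the precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie) and treats `*` as "any run of characters". The patterns here begin with `*` rather than `/`, and it is an assumption that a crawler matches them against the path plus query string in this way. The helper names `pattern_to_regex` and `is_allowed`, and the sample paths, are hypothetical.

```python
import re

# Rules copied from the "*" group above, in file order: (directive, pattern).
RULES = [
    ("Disallow", "*/annotate/*"),
    ("Disallow", "*/new/*"),
    ("Disallow", "*/search/*"),
    ("Disallow", "*/similar/*"),
    ("Disallow", "*/track/*"),
    ("Disallow", "*/upload/*"),
    ("Disallow", "*/user/contact/*"),
    ("Disallow", "*/feed/*"),
    ("Disallow", "*/profile/*"),
    ("Disallow", "*/signin*"),
    ("Disallow", "*/tor*"),
    ("Allow", "*/request/*/response/*/attach/*"),
    ("Disallow", "*/request/*/response/*"),
    ("Disallow", "*/request/*/download*"),
    ("Disallow", "*/change_request/*"),
    ("Disallow", "*/outgoing_messages/*/mail_server_logs*"),
    ("Disallow", "*/outgoing_messages/*/delivery_status*"),
    ("Disallow", "*?*update_status=1*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally from the start of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the assumed precedence."""
    best = None  # (pattern length, is_allow) of the strongest match so far
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), directive == "Allow")
            # Tuples compare by length first; on a tie, Allow (True) wins.
            if best is None or candidate > best:
                best = candidate
    # A path that matches no rule is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    for sample in (
        "/request/example/response/123/attach/2/file.pdf",
        "/request/example/response/123",
        "/request/example",
        "/search/example",
    ):
        print(sample, is_allowed(sample))
```

Under these assumptions the attachment path is allowed because the Allow pattern is longer than the competing Disallow pattern, while the bare response path is blocked. Python's standard `urllib.robotparser` is not used here because it does not interpret `*` wildcards in paths.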

Comments

  • See https://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file