greenpeace.community
robots.txt

Robots Exclusion Standard data for greenpeace.community

Resource Scan

Scan Details

Site Domain greenpeace.community
Base Domain greenpeace.community
Scan Status Ok
Last Scan2025-09-20T22:37:14+00:00
Next Scan 2025-10-04T22:37:14+00:00

Last Scan

Scanned2025-09-20T22:37:14+00:00
URL https://greenpeace.community/robots.txt
Domain IPs 104.21.21.121, 172.67.198.154, 2606:4700:3031::ac43:c69a, 2606:4700:3037::6815:1579
Response IP 104.21.21.121
Found Yes
Hash 9c7407414d27302aa1934ef85df96d3b3f5fbc393599c6b782c7fc4bd17ebc09
SimHash 0d44980169d3

Groups

*

Rule Path
Disallow /pick-institution
Disallow /terms
Disallow /privacy-policy
Disallow /legal
Disallow /backoffice
Disallow /networks/*/recruiter/jobs

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ut-dorkbot

Rule Path
Disallow /

ut-dorkbot/1.0

Rule Path
Disallow /

Other Records

Field Value
sitemap https://greenpeace.community/sitemap.xml