greendiary.com
robots.txt
Robots Exclusion Standard data for greendiary.com
Resource Scan
Scan Details
| Site Domain | greendiary.com |
| Base Domain | greendiary.com |
| Scan Status | Ok |
| Last Scan | 2025-11-12T19:23:39+00:00 |
| Next Scan | 2025-11-19T19:23:39+00:00 |
Last Scan
| Scanned | 2025-11-12T19:23:39+00:00 |
| URL | https://greendiary.com/robots.txt |
| Domain IPs | 104.21.39.16, 172.67.142.31, 2606:4700:3031::6815:2710, 2606:4700:3037::ac43:8e1f |
| Response IP | 104.21.39.16 |
| Found | Yes |
| Hash | f2569474020e9b34ece15f4aa1bac3340a186f4e1cfd0756f679deecc74b0116 |
| SimHash | 4119c4c2e7b5 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /calendar/action* |
| Disallow | /events/action* |
| Allow | /*.css |
| Allow | /*.js |
Other Records
| Field | Value |
|---|---|
| crawl-delay | 3 |