cambridgeblog.org
robots.txt
Robots Exclusion Standard data for cambridgeblog.org
Resource Scan
Scan Details
| Site Domain | cambridgeblog.org |
| Base Domain | cambridgeblog.org |
| Scan Status | Ok |
| Last Scan | 2025-11-14T20:00:08+00:00 |
| Next Scan | 2025-12-14T20:00:08+00:00 |
Last Scan
| Scanned | 2025-11-14T20:00:08+00:00 |
| URL | https://cambridgeblog.org/robots.txt |
| Domain IPs | 104.21.22.252, 172.67.208.31, 2606:4700:3032::ac43:d01f, 2606:4700:3036::6815:16fc |
| Response IP | 104.21.22.252 |
| Found | Yes |
| Hash | 74ebeea8c549cb0db57c972a9eb6604c8c5bcac93292d9df2154dd69a20378a1 |
| SimHash | 6b20dc628bb2 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /wp-admin/ |
| Allow | /wp-admin/admin-ajax.php |
Other Records
| Field | Value |
|---|---|
| crawl-delay | 10 |
Other Records
| Field | Value |
|---|---|
| sitemap | https://cambridgeblog.org/sitemap.xml |
| sitemap | https://cambridgeblog.org/sitemap.rss |
| sitemap | https://cambridgeblog.org/sitemap.xml |