Robots Exclusion Standard data for thepilcrowpub.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | thepilcrowpub.com |
| Base Domain | thepilcrowpub.com |
| Scan Status | Ok |
| Last Scan | 2025-12-13T19:58:31+00:00 |
| Next Scan | 2026-01-12T19:58:31+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-12-13T19:58:31+00:00 |
| URL | https://thepilcrowpub.com/robots.txt |
| Redirect | https://planetbasedfoods.com/robots.txt |
| Redirect Domain | planetbasedfoods.com |
| Redirect Base | planetbasedfoods.com |
| Domain IPs | 104.21.92.141, 172.67.194.140, 2606:4700:3032::ac43:c28c, 2606:4700:3035::6815:5c8d |
| Redirect IPs | 104.18.34.87, 172.64.153.169, 2606:4700:4400::6812:2257, 2a06:98c1:3102::ac40:99a9 |
| Response IP | 104.18.34.87 |
| Found | Yes |
| Hash | 4a3b0f84e7a60e3d71408b4b510a9420fba31d1526152a933b98ab0c9989d69f |
| SimHash | 6928d8008b82 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /wp-admin/ |
| Allow | /wp-admin/admin-ajax.php |
| Disallow | /author/ |
| Disallow | /*/trackback |
| Disallow | /img/ |
| Disallow | /tag/ |
| Disallow | /feed |
| Disallow | /*/feed |
| Disallow | /comments/feed |
| Disallow | /?s=* |
| Disallow | /attachment/ |
| Disallow | /*?utm_source |
| Disallow | /*%26utm_source |
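As a rough illustration of how a crawler might evaluate the rules in this group, the sketch below applies the longest-match convention described in RFC 9309: every rule is a prefix pattern, `*` matches any run of characters, a trailing `$` anchors the end, the longest matching pattern wins, and `Allow` wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and are not part of the scan data. The sketch does not normalize percent-encoding, and real crawlers may differ in detail.

```python
import re

# Rules copied from the "*" group in the table above.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/author/"),
    ("disallow", "/*/trackback"),
    ("disallow", "/img/"),
    ("disallow", "/tag/"),
    ("disallow", "/feed"),
    ("disallow", "/*/feed"),
    ("disallow", "/comments/feed"),
    ("disallow", "/?s=*"),
    ("disallow", "/attachment/"),
    ("disallow", "/*?utm_source"),
    ("disallow", "/*%26utm_source"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the path;
    everything else is matched literally as a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: longest matching pattern wins, Allow wins ties,
    and a path with no matching rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        # re.match anchors at the start, giving prefix semantics.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (
                len(pattern) == best_len and allow and not best_allow
            ):
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under these assumptions, `is_allowed("/wp-admin/admin-ajax.php")` is true because the `Allow` pattern is longer than the `/wp-admin/` `Disallow`, while `is_allowed("/wp-admin/")` is false. Because matching is by prefix, `Disallow: /feed` would also cover a path such as `/feedback`.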
Other Records
| Field | Value |
|---|---|
| sitemap | https://planetbasedfoods.com/sitemap.xml |