mattcrummy.com
robots.txt
Robots Exclusion Standard data for mattcrummy.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | mattcrummy.com |
| Base Domain | mattcrummy.com |
| Scan Status | Ok |
| Last Scan | 2025-11-11T07:23:11+00:00 |
| Next Scan | 2025-11-12T07:23:11+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-11-11T07:23:11+00:00 |
| URL | https://www.mattcrummy.com/robots.txt |
| Domain IPs | 151.101.131.7, 151.101.195.7, 151.101.3.7, 151.101.67.7, 2a04:4e42:200::775, 2a04:4e42:400::775, 2a04:4e42:600::775, 2a04:4e42::775 |
| Response IP | 146.75.47.7 |
| Found | Yes |
| Hash | 720035107bfb8cfe5ed6920a8a7d7d0df408456911d49c4294ea9452977f8ae7 |
| SimHash | e0145504fd53 |
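The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. A minimal sketch, assuming SHA-256 over the raw response body, of how a fetched copy could be compared against the recorded value (the function names `sha256_hex` and `matches_recorded` are hypothetical):

```python
import hashlib

# Hash value as recorded in the scan table above.
RECORDED_HASH = "720035107bfb8cfe5ed6920a8a7d7d0df408456911d49c4294ea9452977f8ae7"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Compare a robots.txt body against the recorded hash.

    Assumption: the scanner hashed the raw body with SHA-256. If it
    normalized the text first, or used another algorithm, this will not match.
    """
    return sha256_hex(body) == recorded.lower()
```

A mismatch does not necessarily mean the file changed; it may only mean the assumption about the hashing method is wrong.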
Groups
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /ghost/ |
| Disallow | /email/ |
| Disallow | /members/api/comments/counts/ |
| Disallow | /r/ |
| Disallow | /webmentions/receive/ |
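The rules in this group can be checked against concrete paths with Python's standard `urllib.robotparser`. The sketch below rebuilds the group from the table rather than fetching the live file, so the `ROBOTS_TXT` text is a reconstruction and may differ from the original in ordering, comments, or whitespace. The helper `is_allowed` and the agent name `ExampleBot` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the table above; not a verbatim copy of the live file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /ghost/
Disallow: /email/
Disallow: /members/api/comments/counts/
Disallow: /r/
Disallow: /webmentions/receive/
"""

BASE_URL = "https://www.mattcrummy.com"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the reconstructed rules permit `agent` to fetch `path`."""
    return parser.can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    for path in ("/", "/ghost/api/", "/r/abc", "/random/"):
        print(path, is_allowed(path))
```

Matching is by path prefix, so `/r/abc` falls under `Disallow: /r/` while `/random/` does not.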
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.mattcrummy.com/sitemap.xml |
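The sitemap record sits outside any user-agent group, which is why it is listed separately. As a rough sketch, `urllib.robotparser` (Python 3.8 or later) exposes such records through `site_maps()`. The robots.txt text below is a partial reconstruction from this report, not the live file, and the variable names are illustrative.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction: one rule from the group plus the sitemap record.
ROBOTS_TXT = """\
User-agent: *
Disallow: /ghost/

Sitemap: https://www.mattcrummy.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns a list of sitemap URLs, or None if there are none.
sitemaps = parser.site_maps() or []

if __name__ == "__main__":
    for url in sitemaps:
        print(url)
```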