ericsimmerman.com
robots.txt
Robots Exclusion Standard data for ericsimmerman.com
Resource Scan
Scan Details
Site Domain | ericsimmerman.com |
Base Domain | ericsimmerman.com |
Scan Status | Ok |
Last Scan | 2025-09-25T19:22:09+00:00 |
Next Scan | 2025-10-25T19:22:09+00:00 |
Last Scan
Scanned | 2025-09-25T19:22:09+00:00 |
URL | https://ericsimmerman.com/robots.txt |
Domain IPs | 104.21.5.241, 172.67.134.9, 2606:4700:3033::ac43:8609, 2606:4700:3035::6815:5f1 |
Response IP | 104.21.5.241 |
Found | Yes |
Hash | 57fa1d3f51137807aed0d5c9b8d936e3cf9ea5e99825f3f243bc7ad105d99d98 |
SimHash | d0489c402113 |
Groups
*
Rule | Path |
---|---|
Disallow | /images/ |
Disallow | /js/ |
Disallow | /css/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.ericsimmerman.com/sitemap.xml |
Warnings
- 4 invalid lines.
Comments