marshu.com
robots.txt
Robots Exclusion Standard data for marshu.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | marshu.com |
Base Domain | marshu.com |
Scan Status | Ok |
Last Scan | 2025-05-05T17:29:56+00:00 |
Next Scan | 2025-05-12T17:29:56+00:00 |
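The two timestamps above are exactly one week apart, which can be checked with a short sketch. The variable names are illustrative, and the report does not state that a 7-day interval is a fixed policy; only these two values are known.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table (ISO 8601, UTC offset +00:00).
last_scan = datetime.fromisoformat("2025-05-05T17:29:56+00:00")
next_scan = datetime.fromisoformat("2025-05-12T17:29:56+00:00")

# Difference between the two scans as a timedelta.
interval = next_scan - last_scan
print(interval.days)
```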
Last Scan
Field | Value |
---|---|
Scanned | 2025-05-05T17:29:56+00:00 |
URL | https://marshu.com/robots.txt |
Domain IPs | 160.153.42.201 |
Response IP | 160.153.42.201 |
Found | Yes |
Hash | 882eee2b39fb21ef3902092287d2c7bdd4e4abbf5032d0ae14a215c3c2034f25 |
SimHash | ab046d2cc713 |
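The Hash value is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report names neither the algorithm nor the exact bytes that were hashed. Assuming SHA-256 over the raw response body, a later fetch could be compared against the recorded value roughly like this (`content_hash` and `matches_report` are hypothetical helper names):

```python
import hashlib

# Value copied from the Hash row above.
REPORTED_HASH = "882eee2b39fb21ef3902092287d2c7bdd4e4abbf5032d0ae14a215c3c2034f25"

def content_hash(body: bytes) -> str:
    """Hex SHA-256 digest of the body (assumed algorithm, not confirmed by the report)."""
    return hashlib.sha256(body).hexdigest()

def matches_report(body: bytes) -> bool:
    """True if the body hashes to the value recorded in the scan."""
    return content_hash(body) == REPORTED_HASH
```

A mismatch would only mean the bytes differ under this assumption; it would not by itself show that the rules changed.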
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /_db_backups/ |
Disallow | /cgi/ |
Disallow | /css/ |
Disallow | /gallery/ |
Disallow | /img/ |
Disallow | /images/ |
Disallow | /hostroot-findcalculator/ |
Disallow | /hostroot-njsunsets/ |
Disallow | /hostroot-shoepass/ |
Disallow | /mobile/ |
Disallow | /old_stuff/ |
Disallow | /reworked-old-files/ |
Disallow | /securimage/ |
Disallow | /stats/ |
Disallow | /suspects/ |
Disallow | /statshistory/ |
Disallow | /picture_library/ |
Disallow | /photos/ |
Disallow | /test/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.marshu.com/sitemap.txt |
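The group and sitemap record listed above can be turned back into robots.txt text and queried with Python's standard `urllib.robotparser`. This is a sketch: the reconstructed file is an approximation (the original's comments, blank lines, and field casing are not shown in the report), and `build_robots_txt`, `is_allowed`, and the `ExampleBot` agent name are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Paths copied from the Groups table for the "*" user agent.
DISALLOWED = [
    "/_db_backups/", "/cgi/", "/css/", "/gallery/", "/img/", "/images/",
    "/hostroot-findcalculator/", "/hostroot-njsunsets/", "/hostroot-shoepass/",
    "/mobile/", "/old_stuff/", "/reworked-old-files/", "/securimage/",
    "/stats/", "/suspects/", "/statshistory/", "/picture_library/",
    "/photos/", "/test/",
]
SITEMAP = "https://www.marshu.com/sitemap.txt"

def build_robots_txt(paths, sitemap):
    """Rebuild an approximate robots.txt from the scanned records."""
    lines = ["User-agent: *"]
    lines += [f"Disallow: {path}" for path in paths]
    lines += ["", f"Sitemap: {sitemap}"]
    return "\n".join(lines) + "\n"

# Parse the reconstructed text instead of fetching the live file.
parser = RobotFileParser()
parser.parse(build_robots_txt(DISALLOWED, SITEMAP).splitlines())

def is_allowed(url, agent="ExampleBot"):
    """Check a URL against the parsed rules for the given (hypothetical) agent."""
    return parser.can_fetch(agent, url)

print(is_allowed("https://marshu.com/"))
print(is_allowed("https://marshu.com/stats/index.html"))
print(parser.site_maps())
```

Matching is by path prefix, so a rule such as `/img/` blocks everything under that directory but not a sibling path like `/imgs/`.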