marshu.com
robots.txt

Robots Exclusion Standard data for marshu.com

Resource Scan

Scan Details

Site Domain marshu.com
Base Domain marshu.com
Scan Status Ok
Last Scan2025-05-05T17:29:56+00:00
Next Scan 2025-05-12T17:29:56+00:00

Last Scan

Scanned2025-05-05T17:29:56+00:00
URL https://marshu.com/robots.txt
Domain IPs 160.153.42.201
Response IP 160.153.42.201
Found Yes
Hash 882eee2b39fb21ef3902092287d2c7bdd4e4abbf5032d0ae14a215c3c2034f25
SimHash ab046d2cc713

Groups

*

Rule Path
Disallow /_db_backups/
Disallow /cgi/
Disallow /css/
Disallow /gallery/
Disallow /img/
Disallow /images/
Disallow /hostroot-findcalculator/
Disallow /hostroot-njsunsets/
Disallow /hostroot-shoepass/
Disallow /mobile/
Disallow /old_stuff/
Disallow /reworked-old-files/
Disallow /securimage/
Disallow /stats/
Disallow /suspects/
Disallow /statshistory/
Disallow /picture_library/
Disallow /photos/
Disallow /test/

Other Records

Field Value
sitemap https://www.marshu.com/sitemap.txt