mishawaka.claz.org
robots.txt

Robots Exclusion Standard data for mishawaka.claz.org

Resource Scan

Scan Details

Site Domain mishawaka.claz.org
Base Domain claz.org
Scan Status Ok
Last Scan2024-09-20T08:24:24+00:00
Next Scan 2024-09-27T08:24:24+00:00

Last Scan

Scanned2024-09-20T08:24:24+00:00
URL https://mishawaka.claz.org/robots.txt
Domain IPs 69.162.68.146, 69.162.83.22, 74.63.201.106
Response IP 69.162.68.146
Found Yes
Hash 771412222a76e189ebdbcd8676ffcdc3e1b9bfc5d775de1a627e4bf59721941a
SimHash 7f004804e893

Groups

*

Rule Path
Disallow /user/
Disallow /guest/
Disallow /go/
Disallow /partner/
Disallow /*?*save=search
Disallow /*/flag$
Disallow /classifieds/*/analytics.svg
Disallow /classifieds/*/contact

Other Records

Field Value
sitemap https://mishawaka.claz.org/sitemap.xml