claz.org
robots.txt

Robots Exclusion Standard data for claz.org

Resource Scan

Scan Details

Site Domain claz.org
Base Domain claz.org
Scan Status Ok
Last Scan2024-11-10T21:33:20+00:00
Next Scan 2024-11-17T21:33:20+00:00

Last Scan

Scanned2024-11-10T21:33:20+00:00
URL https://claz.org/robots.txt
Domain IPs 69.162.68.146, 69.162.83.22, 74.63.201.106
Response IP 69.162.83.22
Found Yes
Hash dea4d853ec4aeb36faa71f54351e3254882e7ca97bbe3082aefdd760a0e2cb19
SimHash 7d00d104cd93

Groups

*

Rule Path
Disallow /user/
Disallow /guest/
Disallow /go/
Disallow /partner/
Disallow /*?*save=search
Disallow /*/flag$
Disallow /classifieds/*/analytics.svg
Disallow /classifieds/*/contact

Other Records

Field Value
sitemap https://claz.org/sitemap.xml
sitemap https://claz.org/locations.xml
sitemap https://claz.org/listings1.xml
sitemap https://claz.org/listings2.xml
sitemap https://claz.org/listings3.xml
sitemap https://claz.org/listings4.xml