claz.org
robots.txt

Robots Exclusion Standard data for claz.org

Resource Scan

Scan Details

Site Domain claz.org
Base Domain claz.org
Scan Status Ok
Last Scan 2024-09-14T14:11:52+00:00
Next Scan 2024-09-21T14:11:52+00:00
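
The two timestamps above imply a fixed rescan interval. A minimal sketch that derives it from the reported values; the variable names are illustrative and not part of the scan data:

```python
from datetime import datetime

# Timestamps copied from the Scan Details rows (ISO 8601 with UTC offset).
last_scan = datetime.fromisoformat("2024-09-14T14:11:52+00:00")
next_scan = datetime.fromisoformat("2024-09-21T14:11:52+00:00")

# Subtracting two aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 7
```

For this record the gap is exactly seven days. Whether the scanner always uses a weekly schedule is not stated in the report.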

Last Scan

Scanned 2024-09-14T14:11:52+00:00
URL https://claz.org/robots.txt
Domain IPs 69.162.68.146, 69.162.83.22, 74.63.201.106
Response IP 74.63.201.106
Found Yes
Hash 55238db50c29d438747a46086a17b667406df64d5ad270a5e1e979f9fc52286f
SimHash 6f008104ed93
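
The Hash value is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the following is only a sketch of how such a fingerprint could be used to detect changes between scans, assuming SHA-256 over the raw response body. The function name `fingerprint` is hypothetical.

```python
import hashlib

def fingerprint(body: bytes) -> str:
    """Return a hex digest of the response body (assumption: SHA-256)."""
    return hashlib.sha256(body).hexdigest()

# Illustrative bodies only; the real robots.txt bytes are not shown in the report.
previous = fingerprint(b"User-agent: *\nDisallow: /user/\n")
current = fingerprint(b"User-agent: *\nDisallow: /user/\nDisallow: /go/\n")

# Any byte-level change produces a different digest.
print(previous != current)
```

An exact hash like this flags any change at all, while a similarity hash such as the SimHash row is typically meant to stay close when the content changes only slightly.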

Groups

*

Rule Path
Disallow /user/
Disallow /guest/
Disallow /go/
Disallow /partner/
Disallow /*?*save=search
Disallow /*/flag$
Disallow /classifieds/*/analytics.svg
Disallow /classifieds/*/contact
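
Several of the rules above use the `*` wildcard and the `$` end anchor. A minimal sketch of how a crawler might evaluate them, assuming the usual interpretation (`*` matches any run of characters, a trailing `$` anchors the end of the path, and everything else is a prefix match). The helper names `pattern_to_regex` and `is_disallowed` are hypothetical, and the example paths are made up for illustration.

```python
import re

# Disallow paths copied from the "*" group above.
DISALLOW = [
    "/user/",
    "/guest/",
    "/go/",
    "/partner/",
    "/*?*save=search",
    "/*/flag$",
    "/classifieds/*/analytics.svg",
    "/classifieds/*/contact",
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

RULES = [pattern_to_regex(p) for p in DISALLOW]

def is_disallowed(path):
    """True if any Disallow rule matches from the start of the path."""
    return any(rule.match(path) for rule in RULES)

for path in ["/user/settings", "/classifieds/123/flag",
             "/classifieds/123/flagged", "/search?q=car&save=search",
             "/classifieds/123"]:
    print(path, is_disallowed(path))
```

Because the group contains only Disallow rules, no Allow/Disallow precedence logic is needed here. A file that mixes both would also need a longest-match comparison.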

Other Records

Field Value
sitemap https://claz.org/sitemap.xml
sitemap https://claz.org/locations.xml
sitemap https://claz.org/listings1.xml
sitemap https://claz.org/listings2.xml
sitemap https://claz.org/listings3.xml
sitemap https://claz.org/listings4.xml
sitemap https://claz.org/listings5.xml
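
The groups and sitemap records listed in this report can be recovered from a robots.txt file with a small line-based parser. The sketch below is not the scanner's actual implementation. The raw file is not shown in the report, so `ROBOTS_TXT` is a reconstruction from the records above and may differ from the served file in ordering, comments, and whitespace. The function name `parse_robots` is hypothetical.

```python
# Reconstruction (assumption): rebuilt from the Groups and Other Records tables.
ROBOTS_TXT = """\
User-agent: *
Disallow: /user/
Disallow: /guest/
Disallow: /go/
Disallow: /partner/
Disallow: /*?*save=search
Disallow: /*/flag$
Disallow: /classifieds/*/analytics.svg
Disallow: /classifieds/*/contact

Sitemap: https://claz.org/sitemap.xml
Sitemap: https://claz.org/locations.xml
Sitemap: https://claz.org/listings1.xml
Sitemap: https://claz.org/listings2.xml
Sitemap: https://claz.org/listings3.xml
Sitemap: https://claz.org/listings4.xml
Sitemap: https://claz.org/listings5.xml
"""

def parse_robots(text):
    """Split robots.txt text into per-agent rule lists and sitemap URLs."""
    groups, sitemaps = {}, []
    agents, seen_rule = [], False
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        field, value = line.split(":", 1)    # split on the first colon only
        field, value = field.strip().lower(), value.strip()
        if field == "user-agent":
            if seen_rule:                    # a new group starts after rules
                agents, seen_rule = [], False
            agents.append(value)
            groups.setdefault(value, [])
        elif field in ("allow", "disallow"):
            seen_rule = True
            for agent in agents:
                groups[agent].append((field, value))
        elif field == "sitemap":             # sitemaps are not tied to a group
            sitemaps.append(value)
    return groups, sitemaps

groups, sitemaps = parse_robots(ROBOTS_TXT)
print(len(groups["*"]), len(sitemaps))
```

Splitting on the first colon only matters for the sitemap lines, whose values contain `https://`.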