galesburg.claz.org
robots.txt

Robots Exclusion Standard data for galesburg.claz.org

Resource Scan

Scan Details

Site Domain galesburg.claz.org
Base Domain claz.org
Scan Status Ok
Last Scan2024-06-27T02:14:36+00:00
Next Scan 2024-07-04T02:14:36+00:00

Last Scan

Scanned2024-06-27T02:14:36+00:00
URL https://galesburg.claz.org/robots.txt
Domain IPs 69.162.68.146, 69.162.83.22, 74.63.201.106
Response IP 74.63.201.106
Found Yes
Hash 27c0de1b8b120c0b6b3f3cefa06803a9713575091c4d8848793f55dd1bfe17a9
SimHash 7f005904c893

Groups

*

Rule Path
Disallow /user/
Disallow /guest/
Disallow /go/
Disallow /partner/
Disallow /*?*save=search
Disallow /*/flag$
Disallow /classifieds/*/analytics.svg
Disallow /classifieds/*/contact

Other Records

Field Value
sitemap https://galesburg.claz.org/sitemap.xml