cliti.com
robots.txt

Robots Exclusion Standard data for cliti.com

Resource Scan

Scan Details

Site Domain cliti.com
Base Domain cliti.com
Scan Status Ok
Last Scan2025-12-15T20:25:01+00:00
Next Scan 2026-01-14T20:25:01+00:00

Last Scan

Scanned2025-12-15T20:25:01+00:00
URL https://cliti.com/robots.txt
Redirect https://www.cliti.com/robots.txt
Redirect Domain www.cliti.com
Redirect Base cliti.com
Domain IPs 104.18.6.73, 104.18.7.73, 2606:4700::6812:649, 2606:4700::6812:749
Redirect IPs 104.18.6.73, 104.18.7.73, 2606:4700::6812:649, 2606:4700::6812:749
Response IP 104.18.7.73
Found Yes
Hash 148a9d9fb6516d1aee0f8da384b63bf5dee744fade63fb2cad83167dd17fa4c9
SimHash 831440228631

Groups

*

Rule Path
Allow *arch/a/*
Disallow /out/
Disallow /*?page=**
Disallow */search*
Disallow /*queryString%3D*
Disallow /*orientation%3D*
Disallow /*all%3D*
Disallow */set-locale*
Disallow /*?filter*
Disallow /*?pricing=*
Disallow /thumb/
Disallow /3thumbs/
Disallow *US_CENSUS_NAME*

ia_archiver

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.cliti.com/sitemap.xml

Comments

  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449