clclt.com
robots.txt

Robots Exclusion Standard data for clclt.com

Resource Scan

Scan Details

Site Domain clclt.com
Base Domain clclt.com
Scan Status Ok
Last Scan2024-11-12T21:58:03+00:00
Next Scan 2024-11-19T21:58:03+00:00

Last Scan

Scanned2024-11-12T21:58:03+00:00
URL https://clclt.com/robots.txt
Domain IPs 209.104.5.201
Response IP 209.104.5.201
Found Yes
Hash a40c8829c39038f212326c559aed8e3772a89773ee0b9d823f41da25931c3e31
SimHash a96c32842ada

Groups

*

Rule Path
Disallow /gyrobase/ArticleArchives
Disallow /gyrobase/EventSearch
Disallow /gyrobase/FilmSearch
Disallow /gyrobase/LocationSearch
Disallow /gyrobase/MovieTimes
Disallow /gyrobase/Search
Disallow /charlotte/ArticleArchives
Disallow /charlotte/EventSearch
Disallow /charlotte/FilmSearch
Disallow /charlotte/LocationSearch
Disallow /charlotte/MovieTimes
Disallow /charlotte/Search

Other Records

Field Value
sitemap https://clclt.com/charlotte/Sitemap.xml