thencata.org
robots.txt
Robots Exclusion Standard data for thencata.org
Resource Scan
Scan Details
Site Domain | thencata.org |
Base Domain | thencata.org |
Scan Status | Ok |
Last Scan | 2024-11-08T14:04:00+00:00 |
Next Scan | 2024-11-15T14:04:00+00:00 |
Last Scan
Scanned | 2024-11-08T14:04:00+00:00 |
URL | https://thencata.org/robots.txt |
Domain IPs | 72.32.79.251 |
Response IP | 72.32.79.251 |
Found | Yes |
Hash | 324d10aae733e69c82046e3aa7a518083d54e34806699a2709272c8c2127f4f4 |
SimHash | 6a14dae2c2b1 |
Groups
rogerbot
Rule | Path |
---|---|
Disallow | /admin/ |
Disallow | /services/ |
Disallow | /*.axd |
Allow | / |
Other Records
Field | Value |
---|---|
crawl-delay | 2 |
mozilla/5.0 (compatible; butterfly/1.0; +http://labs.topsy.com/butterfly/) gecko/2009032608 firefox/3.0.8
Rule | Path |
---|---|
Disallow | / |
siteimprove.com
Rule | Path |
---|---|
Disallow | /admin/ |
Disallow | /services/ |
Disallow | /*.axd |
Disallow | /*print%3Dtrue* |
Allow | / |
powermapper
Rule | Path |
---|---|
Disallow | /admin/ |
Disallow | /services/ |
Disallow | /*.axd |
Disallow | /*print%3Dtrue* |
Allow | / |
googlebot
Rule | Path |
---|---|
Disallow | /admin/ |
Disallow | /services/ |
Disallow | /*.axd |
Disallow | /*print%3Dtrue* |
Allow | /services/podcast_rss.ashx |
Allow | / |
bingbot
Rule | Path |
---|---|
Disallow | /admin/ |
Disallow | /images/ |
Disallow | /admin/ |
Disallow | /common/ |
Disallow | /editor/ |
Disallow | /services/ |
Disallow | /site/ |
Disallow | /*.js$ |
Disallow | /*.css$ |
Disallow | /*.jpg$ |
Disallow | /*.gif$ |
Disallow | /*.axd |
Allow | /documents/ |
Allow | / |
Other Records
Field | Value |
---|---|
crawl-delay | 30 |
msnbot
Rule | Path |
---|---|
Disallow | /admin/ |
Disallow | /images/ |
Disallow | /admin/ |
Disallow | /common/ |
Disallow | /editor/ |
Disallow | /services/ |
Disallow | /site/ |
Disallow | /*.js$ |
Disallow | /*.css$ |
Disallow | /*.jpg$ |
Disallow | /*.gif$ |
Disallow | /*.axd |
Allow | /documents/ |
Allow | / |
Other Records
Field | Value |
---|---|
crawl-delay | 30 |
*
Rule | Path |
---|---|
Disallow | /images/ |
Disallow | /documents/ |
Disallow | /admin/ |
Disallow | /services/ |
Disallow | /site/ |
Disallow | /*.js$ |
Disallow | /*.css$ |
Disallow | /*.jpg$ |
Disallow | /*.gif$ |
Disallow | /*.axd |
Disallow | /*print%3Dtrue* |
Allow | / |
Other Records
Field | Value |
---|---|
crawl-delay | 5 |
heritrix
Rule | Path |
---|---|
Disallow | /admin/ |
Disallow | /images/ |
Disallow | /documents/ |
Disallow | /admin/ |
Disallow | /common/ |
Disallow | /editor/ |
Disallow | /services/ |
Disallow | /site/ |
Disallow | /*.axd |
Allow | / |
Other Records
Field | Value |
---|---|
crawl-delay | 5 |
mozilla/4.0+(compatible;+t-h-u-n-d-e-r-s-t-o-n-e)
Rule | Path |
---|---|
Disallow | /admin/ |
Disallow | /services/ |
Disallow | /site/ |
Disallow | /*.axd |
Allow | / |
Other Records
Field | Value |
---|---|
crawl-delay | 2 |
Warnings
- 2 invalid lines.
- `visit-time` is not a known field.