ccsd.org.uk
robots.txt
Robots Exclusion Standard data for ccsd.org.uk
Resource Scan
Scan Details
Site Domain | ccsd.org.uk |
Base Domain | ccsd.org.uk |
Scan Status | Ok |
Last Scan | 2024-10-25T13:22:26+00:00 |
Next Scan | 2024-11-24T13:22:26+00:00 |
Last Scan
Scanned | 2024-10-25T13:22:26+00:00 |
URL | https://ccsd.org.uk/robots.txt |
Redirect | https://www.ccsd.org.uk/robots.txt |
Redirect Domain | www.ccsd.org.uk |
Redirect Base | ccsd.org.uk |
Domain IPs | 52.213.121.163 |
Redirect IPs | 52.213.121.163 |
Response IP | 52.213.121.163 |
Found | Yes |
Hash | 4c8213a552c27c57848658af31a7c3a8f36a0f33e6c23281d8430a09f552d7e5 |
SimHash | 02f80400a07a |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /account/* |
Disallow | /gsearch.ashx |
Disallow | /emailvcard.asp |
Disallow | /news/news-archive/*/amp/ |
Disallow | /members-area/* |
Disallow | /assets/*.pdf$ |
Disallow | /report/report.asp* |
Disallow | /report.asp* |
Disallow | /banner* |
*
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
Warnings
- 2 invalid lines.
Comments