cla-net.org
robots.txt
Robots Exclusion Standard data for cla-net.org
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | cla-net.org |
Base Domain | cla-net.org |
Scan Status | Ok |
Last Scan | 2025-07-07T07:32:14+00:00 |
Next Scan | 2025-08-06T07:32:14+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-07-07T07:32:14+00:00 |
URL | https://www.cla-net.org/robots.txt |
Domain IPs | 35.169.50.49, 35.173.82.140, 35.174.132.21 |
Response IP | 35.174.132.21 |
Found | Yes |
Hash | ecfeebd8adabb0fb89c136eddfcb5c52c9b18ffd18cf0fdc88a64a67f4b5409d |
SimHash | ec949d42c3d9 |
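The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. As a minimal sketch under that assumption, the following Python shows how a locally fetched copy of the file could be compared with the reported value. The function names `body_digest` and `matches_report` are hypothetical and are not part of the scanner.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "ecfeebd8adabb0fb89c136eddfcb5c52c9b18ffd18cf0fdc88a64a67f4b5409d"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of the raw robots.txt bytes.

    Assumption: the report hashes the unmodified response body with
    SHA-256. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes) -> bool:
    """True if the body hashes to the value shown in the report."""
    return body_digest(body) == REPORTED_HASH
```

A mismatch would not necessarily mean the file changed. It could also mean the assumption about the algorithm or the hashed bytes is wrong.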
Groups
*
Rule | Path |
---|---|
Disallow | /global_inc/ |
Allow | /global_inc/*.css |
Allow | /global_inc/*.js |
*
Rule | Path |
---|---|
Disallow | /global_engine/ajax/ |
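The rules above combine a broad Disallow with narrower Allow patterns that use the `*` wildcard. The sketch below shows how a crawler following RFC 9309 would be expected to resolve them: both `*` groups are merged, the longest matching pattern wins, and Allow wins a tie. It is an illustration written for this report, not the scanner's own code, and the names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical. Python's standard `urllib.robotparser` is not used here because it does not expand `*` wildcards in paths.

```python
import re

# Rules from both "*" groups in the report, merged into one list.
RULES = [
    ("disallow", "/global_inc/"),
    ("allow", "/global_inc/*.css"),
    ("allow", "/global_inc/*.js"),
    ("disallow", "/global_engine/ajax/"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern only has to match a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Apply longest-match precedence; Allow wins when lengths tie."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under these assumptions, `is_allowed("/global_inc/site.css")` is true because the 17-character Allow pattern outranks the 12-character Disallow, while `is_allowed("/global_inc/config.php")` is false because only the Disallow matches.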
Other Records
Field | Value |
---|---|
sitemap | https://www.cla-net.org/autositemapindex.xml |
Warnings
- 18 invalid lines.