Robots Exclusion Standard data for pcaaca.org
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | pcaaca.org |
Base Domain | pcaaca.org |
Scan Status | Ok |
Last Scan | 2025-07-11T13:05:12+00:00 |
Next Scan | 2025-08-10T13:05:12+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-07-11T13:05:12+00:00 |
URL | https://pcaaca.org/robots.txt |
Domain IPs | 35.169.50.49, 35.173.82.140 |
Response IP | 35.169.50.49 |
Found | Yes |
Hash | 0eea9d25f543a0b82b0e4505cdaefdb82d589f328f35b5f577f3c002006268ea |
SimHash | ec949d42c3d8 |
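The Hash value can serve as a change detector between scans. The sketch below assumes the hash is a SHA-256 digest of the raw response body; the 64 hexadecimal digits are consistent with that, but the scan does not state the algorithm, so treat this as an assumption. The helper names `sha256_hex` and `has_changed` are hypothetical and are not part of the scanner.

```python
import hashlib

# Hash value recorded in the Last Scan table above.
RECORDED_HASH = "0eea9d25f543a0b82b0e4505cdaefdb82d589f328f35b5f577f3c002006268ea"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Report whether a freshly fetched body differs from the recorded hash."""
    return sha256_hex(body) != recorded.lower()
```

A caller would fetch https://pcaaca.org/robots.txt, pass the raw bytes to `has_changed`, and re-parse the file only when it returns `True`. If the scanner normalizes the body before hashing, the digests will not line up even for an unchanged file.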
Groups
*
Rule | Path |
---|---|
Disallow | /global_inc/ |
Allow | /global_inc/*.css |
Allow | /global_inc/*.js |
*
Rule | Path |
---|---|
Disallow | /global_engine/ajax/ |
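The groups above can be read as one combined rule set, since both apply to the `*` user agent. The following is a minimal sketch of how a crawler might evaluate them, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The function names and the sample paths in the usage note are hypothetical.

```python
import re

# Rules transcribed from the two "*" groups above, treated as one merged group.
RULES = [
    ("disallow", "/global_inc/"),
    ("allow", "/global_inc/*.css"),
    ("allow", "/global_inc/*.js"),
    ("disallow", "/global_engine/ajax/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under the given rules.

    The longest matching pattern decides; Allow wins when lengths tie.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed
```

Under these assumptions, `is_allowed("/global_inc/site.css")` is `True` because the Allow pattern is longer than the Disallow prefix, while `is_allowed("/global_inc/config.php")` and `is_allowed("/global_engine/ajax/search")` are both `False`. Crawlers that do not implement longest-match precedence may resolve the Allow and Disallow overlap differently.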
Other Records
Field | Value |
---|---|
sitemap | https://pcaaca.org/autositemapindex.xml |
Warnings
- 18 invalid lines.
Comments