ctgal.com
robots.txt
Robots Exclusion Standard data for ctgal.com
Resource Scan
Scan Details
Site Domain | ctgal.com |
Base Domain | ctgal.com |
Scan Status | Ok |
Last Scan | 2024-05-25T07:00:28+00:00 |
Next Scan | 2024-06-24T07:00:28+00:00 |
Last Scan
Scanned | 2024-05-25T07:00:28+00:00 |
URL | https://ctgal.com/robots.txt |
Domain IPs | 104.21.33.242, 172.67.193.251, 2606:4700:3033::6815:21f2, 2606:4700:3037::ac43:c1fb |
Response IP | 104.21.33.242 |
Found | Yes |
Hash | c79c7c6dfec15cd0466f02ec9f3d942f877e08d7b377717a3b91f70fb205e1f7 |
SimHash | 6b9f63f7c8b3 |
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /admin/ |
Disallow | /search/ |
Disallow | /cache/ |
Disallow | /report.php?pid= |
Other Records
Field | Value |
---|---|
sitemap | https://ctgal.com/sitemap.xml |
Warnings
- 2 invalid lines.