cgc.edu.in
robots.txt
Robots Exclusion Standard data for cgc.edu.in
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | cgc.edu.in |
Base Domain | cgc.edu.in |
Scan Status | Ok |
Last Scan | 2025-04-21T05:50:28+00:00 |
Next Scan | 2025-05-21T05:50:28+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-04-21T05:50:28+00:00 |
URL | https://www.cgc.edu.in/robots.txt |
Domain IPs | 192.124.249.54 |
Response IP | 192.124.249.54 |
Found | Yes |
Hash | febdb76a5dee5fb4cfba241bceb3e8d6fb1ecc143be7251b5c5f0c5d5b16455f |
SimHash | 691494d6d395 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /admin/ |
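
The two rules in the `*` group overlap, since `/` is a prefix of every path. The sketch below shows one way a crawler might resolve that overlap, assuming the longest-match rule from RFC 9309 (the most specific matching path wins, and Allow wins a tie). The names `RULES` and `is_allowed` are hypothetical, and the sketch ignores the `*` and `$` wildcards, which this group does not use.

```python
# Rules copied from the "*" group in the table above.
RULES = [("allow", "/"), ("disallow", "/admin/")]


def is_allowed(rules, path):
    """Return True if `path` may be fetched under longest-match semantics.

    `rules` is a list of (kind, prefix) pairs, where kind is "allow" or
    "disallow". Plain prefix matching only; wildcards are not handled.
    """
    best_len = -1
    allowed = True  # a path that matches no rule is allowed by default
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        n = len(prefix)
        # Prefer the longer prefix; on equal length, prefer "allow".
        if n > best_len or (n == best_len and kind == "allow"):
            best_len = n
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ("/", "/courses", "/admin", "/admin/", "/admin/login"):
        print(p, is_allowed(RULES, p))
```

Under these assumptions, everything below `/admin/` is blocked while the rest of the site, including the bare path `/admin` without a trailing slash, stays crawlable.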
Other Records
Field | Value |
---|---|
sitemap | https://www.cgc.edu.in/sitemap.xml |
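
As a rough illustration of how a client could read the sitemap record, the sketch below feeds a reconstruction of the file to Python's standard `urllib.robotparser`. The `RECONSTRUCTED` text is an assumption rebuilt from the tables in this report, not the file as served; the real file also contains the 2 invalid lines noted under Warnings, which are not shown here. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction from the Groups and Other Records tables;
# the actual robots.txt may differ in wording, order, and comments.
RECONSTRUCTED = """\
User-agent: *
Allow: /
Disallow: /admin/
Sitemap: https://www.cgc.edu.in/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(RECONSTRUCTED.splitlines())

# site_maps() returns a list of Sitemap URLs, or None if there are none.
sitemaps = parser.site_maps()

if __name__ == "__main__":
    print(sitemaps)
```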
Warnings
- 2 invalid lines.