
Robots Exclusion Standard data for glcc.ac.in

Resource Scan

Scan Details

Site Domain glcc.ac.in
Base Domain glcc.ac.in
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-10-30T16:20:17+00:00
Next Scan 2025-11-06T16:20:17+00:00

Last Successful Scan

Scanned 2025-09-29T06:35:38+00:00
URL https://glcc.ac.in/robots.txt
Domain IPs 45.64.105.11
Response IP 45.64.105.11
Found Yes
Hash ea23f30c613beb442cff00f3f09887b420a77670a3c11f7200d2bd72c292e56f
SimHash 6100cc004f93

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://glcc.ac.in/wp-sitemap.xml
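The records above come from the last successful scan (2025-09-29), since the most recent fetch failed. As a rough illustration of how the two rules in the `*` group interact, the sketch below rebuilds an approximate robots.txt from those records and evaluates paths with longest-match precedence, the rule RFC 9309 describes. The reconstructed file text is an assumption (the original bytes, and therefore the hash shown above, cannot be reproduced from this report), and the helper names `parse_rules` and `is_allowed` are hypothetical. Wildcards (`*`, `$`) are not handled because none appear in the recorded rules.

```python
# Approximate reconstruction from the parsed records in this report.
# Assumption: the real file's exact whitespace, comments, and line order are unknown.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php

Sitemap: https://glcc.ac.in/wp-sitemap.xml
"""


def parse_rules(text):
    """Collect (directive, path) pairs that belong to the '*' group."""
    rules = []
    in_star_group = False
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            in_star_group = (value == "*")
        elif field in ("allow", "disallow") and in_star_group and value:
            rules.append((field, value))
    return rules


def is_allowed(path, rules):
    """Longest matching prefix wins; on a tie, allow wins; no match means allowed."""
    best_len, allowed = -1, True
    for directive, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_prefers_allow = len(prefix) == best_len and directive == "allow"
        if longer or tie_prefers_allow:
            best_len = len(prefix)
            allowed = (directive == "allow")
    return allowed


RULES = parse_rules(ROBOTS_TXT)

for candidate in ("/", "/wp-admin/", "/wp-admin/options.php", "/wp-admin/admin-ajax.php"):
    print(candidate, "allowed" if is_allowed(candidate, RULES) else "disallowed")
```

Under these assumptions, everything beneath `/wp-admin/` is blocked except `/wp-admin/admin-ajax.php`, because the Allow path is the longer match. Parsers that apply rules in file order rather than by longest match may reach a different answer for that path.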