gcghumarwin.ac.in
robots.txt

Robots Exclusion Standard data for gcghumarwin.ac.in

Resource Scan

Scan Details

Site Domain gcghumarwin.ac.in
Base Domain gcghumarwin.ac.in
Scan Status Ok
Last Scan2025-09-10T08:21:44+00:00
Next Scan 2025-10-10T08:21:44+00:00

Last Scan

Scanned2025-09-10T08:21:44+00:00
URL https://gcghumarwin.ac.in/robots.txt
Domain IPs 108.178.4.234
Response IP 108.178.4.234
Found Yes
Hash 7b8da0096dd96abb88e79fc848e3941360f9cfd435b6ae8bcb9edc87518b1231
SimHash 49159d504792

Groups

mediapartners-google

Rule Path
Disallow

googlebot

Rule Path
Allow /
Disallow /search

*

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://gcghumarwin.ac.in/sitemap.xml