www.eicta.iitk.ac.in
robots.txt

Robots Exclusion Standard data for www.eicta.iitk.ac.in

Resource Scan

Scan Details

Site Domain www.eicta.iitk.ac.in
Base Domain iitk.ac.in
Scan Status Ok
Last Scan2025-12-08T23:26:00+00:00
Next Scan 2026-01-07T23:26:00+00:00

Last Scan

Scanned2025-12-08T23:26:00+00:00
URL https://www.eicta.iitk.ac.in/robots.txt
Domain IPs 3.165.75.117, 3.165.75.57, 3.165.75.85, 3.165.75.97
Response IP 3.165.75.97
Found Yes
Hash 575012d08e223642dd4e651b1b62001f9ec4104b0c0eac900792072f31b9bf97
SimHash 390c8862c4b1

Groups

*

Rule Path
Allow /llms.txt
Disallow /404/
Disallow /private/
Disallow /tmp/

ccbot

Rule Path
Allow /

gptbot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

claudebot

Rule Path
Allow /

google-extended

Rule Path
Allow /

linkedinbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.eicta.iitk.ac.in/sitemap.xml