in.edugain.com
robots.txt

Robots Exclusion Standard data for in.edugain.com

Resource Scan

Scan Details

Site Domain in.edugain.com
Base Domain edugain.com
Scan Status Ok
Last Scan2025-09-08T03:30:56+00:00
Next Scan 2025-09-22T03:30:56+00:00

Last Scan

Scanned2025-09-08T03:30:56+00:00
URL https://in.edugain.com/robots.txt
Domain IPs 13.35.202.4, 13.35.202.75, 13.35.202.80, 13.35.202.90, 2600:9000:2078:1000:c:2754:3740:93a1, 2600:9000:2078:3200:c:2754:3740:93a1, 2600:9000:2078:800:c:2754:3740:93a1, 2600:9000:2078:a000:c:2754:3740:93a1, 2600:9000:2078:a00:c:2754:3740:93a1, 2600:9000:2078:c200:c:2754:3740:93a1, 2600:9000:2078:ee00:c:2754:3740:93a1, 2600:9000:2078:fa00:c:2754:3740:93a1
Response IP 13.35.202.80
Found Yes
Hash f83335a2e5e680e0492f45e35dc396d40ea7b359955172d4fbd220effd6d84ba
SimHash 483564416590

Groups

*

Rule Path
Disallow /generate_paper/

ia_archiver

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

*

Rule Path
Disallow *replay*

*

Rule Path
Disallow *qdata*

*

Rule Path
Disallow *contest*

*

Rule Path
Disallow *sub%3D0*

*

Rule Path
Disallow *level%3D0*

*

Rule Path
Disallow *grade%3D0*

*

Rule Path
Disallow *paper/Grade*

*

Rule Path
Disallow *paper/Math*

*

Rule Path
Disallow *.php*

Other Records

Field Value
sitemap https://www.edugain.com/sitemap_index.xml