siit.org.in
robots.txt

Robots Exclusion Standard data for siit.org.in

Resource Scan

Scan Details

Site Domain siit.org.in
Base Domain siit.org.in
Scan Status Ok
Last Scan2026-01-22T00:59:40+00:00
Next Scan 2026-01-29T00:59:40+00:00

Last Scan

Scanned2026-01-22T00:59:40+00:00
URL https://siit.org.in/robots.txt
Domain IPs 142.132.213.119
Response IP 142.132.213.119
Found Yes
Hash ec37eb558fc85ebc593ccb73a1475038bebadc0b6a0f8a4c51a7ed53475a18f9
SimHash 2d4dfe02ccc3

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /tmp/
Disallow /admin/
Disallow /hidden/
Disallow /old/
Disallow /*.zip$
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.xls$
Disallow /*.xlsx$
Disallow /*?date=
Disallow /*?

Other Records

Field Value
sitemap https://siit.org.in/sitemap.xml