siit.org.in
robots.txt
Robots Exclusion Standard data for siit.org.in
Resource Scan
Scan Details
| Site Domain | siit.org.in |
| Base Domain | siit.org.in |
| Scan Status | Ok |
| Last Scan | 2026-01-22T00:59:40+00:00 |
| Next Scan | 2026-01-29T00:59:40+00:00 |
Last Scan
| Scanned | 2026-01-22T00:59:40+00:00 |
| URL | https://siit.org.in/robots.txt |
| Domain IPs | 142.132.213.119 |
| Response IP | 142.132.213.119 |
| Found | Yes |
| Hash | ec37eb558fc85ebc593ccb73a1475038bebadc0b6a0f8a4c51a7ed53475a18f9 |
| SimHash | 2d4dfe02ccc3 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /cgi-bin/ |
| Disallow | /tmp/ |
| Disallow | /admin/ |
| Disallow | /hidden/ |
| Disallow | /old/ |
| Disallow | /*.zip$ |
| Disallow | /*.pdf$ |
| Disallow | /*.doc$ |
| Disallow | /*.xls$ |
| Disallow | /*.xlsx$ |
| Disallow | /*?date= |
| Disallow | /*? |
Other Records
| Field | Value |
|---|---|
| sitemap | https://siit.org.in/sitemap.xml |