ism.edu
robots.txt
Robots Exclusion Standard data for ism.edu
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | ism.edu |
| Base Domain | ism.edu |
| Scan Status | Ok |
| Last Scan | 2025-11-12T21:36:31+00:00 |
| Next Scan | 2025-12-12T21:36:31+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-11-12T21:36:31+00:00 |
| URL | https://ism.edu/robots.txt |
| Domain IPs | 104.21.51.6, 172.67.216.38, 2606:4700:3031::6815:3306, 2606:4700:3033::ac43:d826 |
| Response IP | 172.67.216.38 |
| Found | Yes |
| Hash | 625c9d3eef3d0ca48ef1b335b5fc65343e0d2bede6f798fb61d008bd89306664 |
| SimHash | e31c155bc3f5 |
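The Hash value above is 64 hexadecimal digits, which is consistent with SHA-256, although the scan does not name the algorithm or say whether the body was normalized before hashing. Under the assumption that it is a SHA-256 digest of the raw response body, a fetched copy of the file could be compared against the report roughly like this (`body_matches_report` is a hypothetical helper, not part of any scanner):

```python
import hashlib

# Hash value copied from the "Last Scan" table above.
REPORTED_HASH = "625c9d3eef3d0ca48ef1b335b5fc65343e0d2bede6f798fb61d008bd89306664"


def body_matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare the SHA-256 hex digest of `body` with a reported hash.

    Assumption: the report hashes the raw, unmodified response bytes
    with SHA-256. The scan output does not confirm this.
    """
    return hashlib.sha256(body).hexdigest() == reported.lower()
```

A mismatch would not necessarily mean the file changed; it could also mean the assumption about the algorithm or the hashed bytes is wrong.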
Groups
User-agent: *
| Rule | Path |
|---|---|
| Allow | /*.js* |
| Allow | /*.css* |
| Allow | /*.png* |
| Allow | /*.jpg* |
| Allow | /*.gif* |
Other Records
| Field | Value |
|---|---|
| crawl-delay | 5 |
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /administrator/ |
| Disallow | /bin/ |
| Disallow | /cache/ |
| Disallow | /cli/ |
| Disallow | /includes/ |
| Disallow | /installation/ |
| Disallow | /language/ |
| Disallow | /layouts/ |
| Disallow | /libraries/ |
| Disallow | /logs/ |
| Disallow | /tmp/ |
| Disallow | /archive/ |
| Disallow | /search/ |
| Disallow | /*?pop=* |
| Disallow | /*?msi=* |
| Disallow | /*?sid=* |
| Disallow | /*?componentId=* |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.ism.edu/sitemap.xml |
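To see how the Allow and Disallow rules listed above interact, the following is a minimal sketch of a path check. It assumes the matching behavior described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end, the longest matching pattern wins, Allow wins a tie, and groups for the same user agent are combined. Individual crawlers may behave differently, and the function names here are illustrative only. Python's built-in `urllib.robotparser` is not used because it does not expand `*` wildcards.

```python
import re

# Rules transcribed from the two "*" groups in the scan.
ALLOW = ["/*.js*", "/*.css*", "/*.png*", "/*.jpg*", "/*.gif*"]
DISALLOW = [
    "/administrator/", "/bin/", "/cache/", "/cli/", "/includes/",
    "/installation/", "/language/", "/layouts/", "/libraries/", "/logs/",
    "/tmp/", "/archive/", "/search/",
    "/*?pop=*", "/*?msi=*", "/*?sid=*", "/*?componentId=*",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex matched from the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path):
    """Return True if `path` may be fetched under the combined rules."""
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for rules, verdict in ((DISALLOW, False), (ALLOW, True)):
        for pattern in rules:
            if pattern_to_regex(pattern).match(path):
                longer = len(pattern) > best_len
                tie_for_allow = len(pattern) == best_len and verdict
                if longer or tie_for_allow:
                    best_len, allowed = len(pattern), verdict
    return allowed
```

Under these assumptions, `is_allowed("/administrator/app.js")` is false because `/administrator/` is longer than `/*.js*`, while `is_allowed("/tmp/x.js")` is true because `/*.js*` is longer than `/tmp/`. The `crawl-delay` of 5 is a non-standard field that this sketch does not model.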