insercorp.net
robots.txt
Robots Exclusion Standard data for insercorp.net
Resource Scan
Scan Details
Site Domain | insercorp.net |
Base Domain | insercorp.net |
Scan Status | Ok |
Last Scan | 2025-09-26T13:41:48+00:00 |
Next Scan | 2025-10-26T13:41:48+00:00 |
Last Scan
Scanned | 2025-09-26T13:41:48+00:00 |
URL | https://insercorp.net/robots.txt |
Redirect | https://www.insercorp.com/robots.txt |
Redirect Domain | www.insercorp.com |
Redirect Base | insercorp.com |
Domain IPs | 107.180.77.84 |
Redirect IPs | 104.21.38.94, 172.67.221.124, 2606:4700:3033::ac43:dd7c, 2606:4700:3036::6815:265e |
Response IP | 172.67.221.124 |
Found | Yes |
Hash | be0de47a93ca7eb78507abbdf238157f096fa325e9d0b845db5e44942e51da23 |
SimHash | 901508a26d13 |
Groups
*
Rule | Path |
---|---|
Disallow | /admin |
Disallow | /content/print/ |
Disallow | /index/print/ |
Disallow | /news/index/print/ |
Disallow | /events/index/print/ |
Disallow | /blog/index/print/ |
Disallow | /staff/index/print/ |
Disallow | /departments/index/print/ |
Disallow | /events/index/add-to-cal/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.insercorp.com/sitemap/xml |