corpnet.com
robots.txt

Robots Exclusion Standard data for corpnet.com

Resource Scan

Scan Details

Site Domain corpnet.com
Base Domain corpnet.com
Scan Status Ok
Last Scan2024-04-24T01:52:51+00:00
Next Scan 2024-05-24T01:52:51+00:00

Last Scan

Scanned2024-04-24T01:52:51+00:00
URL https://corpnet.com/robots.txt
Redirect https://www.corpnet.com/robots.txt
Redirect Domain www.corpnet.com
Redirect Base corpnet.com
Domain IPs 35.84.116.10
Redirect IPs 13.225.4.106, 13.225.4.23, 13.225.4.57, 13.225.4.91, 2600:9000:21b4:2a00:14:8209:d1c0:93a1, 2600:9000:21b4:4800:14:8209:d1c0:93a1, 2600:9000:21b4:4a00:14:8209:d1c0:93a1, 2600:9000:21b4:6a00:14:8209:d1c0:93a1, 2600:9000:21b4:6e00:14:8209:d1c0:93a1, 2600:9000:21b4:800:14:8209:d1c0:93a1, 2600:9000:21b4:c800:14:8209:d1c0:93a1, 2600:9000:21b4:e400:14:8209:d1c0:93a1
Response IP 13.225.4.23
Found Yes
Hash f23bca2aa3c9e6dc25312ef200e0112ff99c79652822d42989e14ec8c3d37dd5
SimHash 79208c408db0

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /*?states=
Disallow /*?STATES=
Disallow /*?pid=
Disallow /*?PID=
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://www.corpnet.com/sitemap_index.xml