tarantula.net
robots.txt
Robots Exclusion Standard data for tarantula.net
Resource Scan
Scan Details
Site Domain | tarantula.net |
Base Domain | tarantula.net |
Scan Status | Ok |
Last Scan | 2024-06-26T02:18:19+00:00 |
Next Scan | 2024-07-26T02:18:19+00:00 |
Last Scan
Scanned | 2024-06-26T02:18:19+00:00 |
URL | https://tarantula.net/robots.txt |
Redirect | https://www.tarantula.net/robots.txt |
Redirect Domain | www.tarantula.net |
Redirect Base | tarantula.net |
Domain IPs | 199.60.103.100 |
Redirect IPs | 199.60.103.228, 199.60.103.28, 2606:2c40::c73c:671c, 2606:2c40::c73c:67e4 |
Response IP | 199.60.103.28 |
Found | Yes |
Hash | 20a33a8d55a561fe8748e9c975f618fadf8a9d5f6669855fc1693836a8d45212 |
SimHash | 7e44c6388cb3 |
Groups
*
Rule | Path |
---|---|
Disallow | /sample-* |
Disallow | /blog/sample-* |
Disallow | /driving-growth-for-infraco |
Disallow | /_hcms/preview/ |
Disallow | /hs/manage-preferences/ |
Disallow | /hs/preferences-center/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.tarantula.net/sitemap.xml |