tarantula.net
robots.txt

Robots Exclusion Standard data for tarantula.net

Resource Scan

Scan Details

Site Domain tarantula.net
Base Domain tarantula.net
Scan Status Ok
Last Scan 2024-06-26T02:18:19+00:00
Next Scan 2024-07-26T02:18:19+00:00

Last Scan

Scanned 2024-06-26T02:18:19+00:00
URL https://tarantula.net/robots.txt
Redirect https://www.tarantula.net/robots.txt
Redirect Domain www.tarantula.net
Redirect Base tarantula.net
Domain IPs 199.60.103.100
Redirect IPs 199.60.103.228, 199.60.103.28, 2606:2c40::c73c:671c, 2606:2c40::c73c:67e4
Response IP 199.60.103.28
Found Yes
Hash 20a33a8d55a561fe8748e9c975f618fadf8a9d5f6669855fc1693836a8d45212
SimHash 7e44c6388cb3
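
The report does not say which algorithm produced the Hash field. Its 64 hexadecimal digits are consistent with a 256-bit digest such as SHA-256, so the sketch below assumes SHA-256 over the raw response body. The function name `content_digest` is hypothetical, and the result is not guaranteed to reproduce the value listed above.

```python
import hashlib


def content_digest(body: bytes) -> str:
    """Return a 64-character hex digest of a robots.txt body.

    Assumption: SHA-256 over the raw bytes. The scan report does not
    name its hash algorithm, so this is illustrative only.
    """
    return hashlib.sha256(body).hexdigest()


# Comparing digests from two scans shows whether the file changed.
previous = content_digest(b"User-agent: *\nDisallow: /sample-*\n")
current = content_digest(b"User-agent: *\nDisallow: /sample-*\n")
changed = previous != current
```

A byte-exact digest changes on any edit, including whitespace, which is presumably why the report lists a SimHash alongside it for approximate comparison.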

Groups

*

Rule Path
Disallow /sample-*
Disallow /blog/sample-*
Disallow /driving-growth-for-infraco
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
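
How a crawler might evaluate the rules above can be sketched as follows. This is a minimal illustration and not the scanner's own logic. It assumes the common matching convention in which a rule matches from the start of the URL path, `*` stands for any run of characters, and a trailing `$` anchors the end. The names `DISALLOW`, `rule_to_regex` and `is_disallowed` are hypothetical. Because this group contains no Allow rules, the sketch skips longest-match precedence.

```python
import re

# Disallow paths for the "*" group, copied from the listing above.
DISALLOW = [
    "/sample-*",
    "/blog/sample-*",
    "/driving-growth-for-infraco",
    "/_hcms/preview/",
    "/hs/manage-preferences/",
    "/hs/preferences-center/",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape literal text and turn each "*" into ".*".
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """True if any Disallow rule matches from the start of the path."""
    # re.match anchors at the beginning, giving prefix semantics.
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW)


for candidate in ["/sample-report", "/blog/sample-post", "/blog/", "/_hcms/preview/x"]:
    print(candidate, is_disallowed(candidate))
```

Under prefix matching the trailing `*` in `/sample-*` is redundant, since `/sample-` alone would block the same paths. Likewise, `/driving-growth-for-infraco` also blocks any longer path that begins with that string.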

Other Records

Field Value
sitemap https://www.tarantula.net/sitemap.xml