shl.com
robots.txt

Robots Exclusion Standard data for shl.com

Resource Scan

Scan Details

Site Domain shl.com
Base Domain shl.com
Scan Status Ok
Last Scan2024-05-13T18:48:15+00:00
Next Scan 2024-06-12T18:48:15+00:00

Last Scan

Scanned2024-05-13T18:48:15+00:00
URL https://shl.com/robots.txt
Redirect https://www.shl.com:443/robots.txt
Redirect Domain www.shl.com
Redirect Base shl.com
Domain IPs 18.159.144.0, 18.184.141.195
Redirect IPs 18.155.68.53, 18.155.68.61, 18.155.68.73, 18.155.68.86, 2600:9000:23d2:7e00:18:efed:aa40:93a1, 2600:9000:23d2:9000:18:efed:aa40:93a1, 2600:9000:23d2:9400:18:efed:aa40:93a1, 2600:9000:23d2:c200:18:efed:aa40:93a1, 2600:9000:23d2:ce00:18:efed:aa40:93a1, 2600:9000:23d2:e400:18:efed:aa40:93a1, 2600:9000:23d2:f400:18:efed:aa40:93a1, 2600:9000:23d2:fe00:18:efed:aa40:93a1
Response IP 18.155.68.73
Found Yes
Hash 4991d74d3a4817c5235058f8ede32f7d1ae6595b3326dc1e166f533c599ba5fd
SimHash 0440da436133

Groups

*

Rule Path
Allow /
Disallow /admin
Disallow /dev

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://www.shl.com/sitemap.xml

Comments

  • Robots.txt
  • Enables robots.txt rules for all crawlers
  • Rate limit Yahoo!, Bing and Yandex (Google ignores this)
  • SilverStripe sitemap: