worley.com
robots.txt

Robots Exclusion Standard data for worley.com

Resource Scan

Scan Details

Site Domain worley.com
Base Domain worley.com
Scan Status Ok
Last Scan2025-12-22T03:39:44+00:00
Next Scan 2026-01-21T03:39:44+00:00

Last Scan

Scanned2025-12-22T03:39:44+00:00
URL https://worley.com/robots.txt
Redirect https://www.worley.com/robots.txt
Redirect Domain www.worley.com
Redirect Base worley.com
Domain IPs 40.119.12.70
Redirect IPs 104.18.37.45, 172.64.150.211, 2606:4700:440a::ac40:96d3, 2606:4700:440d::6812:252d
Response IP 104.18.37.45
Found Yes
Hash 92a4e9852339d5db261cdf3deef29c1b6415c6949a81d73df073ef5e2a67d185
SimHash b9407a671f50

Groups

*

Rule Path
Disallow /data/
Disallow /App_Config/
Disallow /App_Data/
Disallow /assets/
Disallow /bin/
Disallow /global/
Disallow /indexes/
Disallow /layouts/
Disallow /pdf/
Disallow /media/files/worley/
Disallow /sitecore/
Disallow /temp/
Disallow /upload/
Disallow /xsl/
Disallow /Global.asax/
Disallow /App_Config/
Disallow /Version.aspx/
Disallow /Web.config/

Warnings

  • 1 invalid line.