careers.woodplc.com
robots.txt
Robots Exclusion Standard data for careers.woodplc.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | careers.woodplc.com |
Base Domain | woodplc.com |
Scan Status | Ok |
Last Scan | 2024-11-03T10:14:27+00:00 |
Next Scan | 2024-12-03T10:14:27+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-03T10:14:27+00:00 |
URL | https://careers.woodplc.com/robots.txt |
Domain IPs | 43.245.41.174 |
Response IP | 43.245.41.174 |
Found | Yes |
Hash | 3378e6b83d51ec6ec79801f0e47b305f04e5365a2343f98e0d68ef19386a1c81 |
SimHash | 6804c894beb2 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /_designs/ |
Disallow | /*?sq_content_src= |
Disallow | /*_recache |
Disallow | /*_edit |
Disallow | /*_admin |
Disallow | /*_login |
Disallow | /*_performance |
Disallow | /*_design |
Disallow | /*_web_services |
Disallow | /*?result_184856_result_page=* |
Disallow | /_resources/ |
Disallow | /resources/ |
Disallow | /sandbox/ |
Disallow | /redirects/ |
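Several of the paths above use the `*` wildcard, which simple prefix matching does not handle. The following is a minimal sketch of how a crawler might test a URL path against these rules, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the path. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical and are not part of the scan tool.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW = [
    "/_designs/",
    "/*?sq_content_src=",
    "/*_recache",
    "/*_edit",
    "/*_admin",
    "/*_login",
    "/*_performance",
    "/*_design",
    "/*_web_services",
    "/*?result_184856_result_page=*",
    "/_resources/",
    "/resources/",
    "/sandbox/",
    "/redirects/",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the match at the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if the path (including any query string) matches a rule.

    The group has no Allow rules, so any matching Disallow blocks the path.
    """
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)
```

For example, `is_disallowed("/jobs/_admin")` is true because of the `/*_admin` rule, while `is_disallowed("/jobs/engineer")` is false because no rule matches.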
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
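A `crawl-delay` of 10 is conventionally read as a request to wait 10 seconds between successive fetches, although the field is not standardized and not every crawler honors it. As a rough sketch under that assumption, a polite client could compute how long to pause before its next request. The function name `seconds_to_wait` is hypothetical.

```python
import time
from typing import Optional

# Value taken from the crawl-delay record above; interpreting it as
# seconds between requests is an assumption based on common practice.
CRAWL_DELAY_SECONDS = 10.0


def seconds_to_wait(
    last_request: Optional[float],
    now: float,
    delay: float = CRAWL_DELAY_SECONDS,
) -> float:
    """Return how many seconds remain before the next request is allowed.

    last_request and now are timestamps in seconds, for example from
    time.monotonic(). A last_request of None means nothing was fetched yet.
    """
    if last_request is None:
        return 0.0
    elapsed = now - last_request
    return max(0.0, delay - elapsed)


def wait_for_slot(last_request: Optional[float]) -> float:
    """Sleep until the delay has passed, then return the new timestamp."""
    time.sleep(seconds_to_wait(last_request, time.monotonic()))
    return time.monotonic()
```

A fetch loop would call `wait_for_slot` before each request and keep the returned timestamp for the next iteration.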