woodplc.com
robots.txt

Robots Exclusion Standard data for woodplc.com

Resource Scan

Scan Details

Site Domain woodplc.com
Base Domain woodplc.com
Scan Status Ok
Last Scan2024-10-31T00:14:59+00:00
Next Scan 2024-11-30T00:14:59+00:00

Last Scan

Scanned2024-10-31T00:14:59+00:00
URL https://woodplc.com/robots.txt
Redirect https://www.woodplc.com/robots.txt
Redirect Domain www.woodplc.com
Redirect Base woodplc.com
Domain IPs 43.245.41.174
Redirect IPs 43.245.41.174
Response IP 43.245.41.174
Found Yes
Hash c430ba166359e752720274a76ee27dba15c4a09cd4f17eb90814e64fd32726ea
SimHash 7846a8549f21

Groups

orbbot
zoominfobot
mj12bot
semrushbot
semrushbot-bm
ahrefsbot
dotbot

Rule Path
Disallow /

*

Rule Path
Disallow /_designs/
Disallow /*?sq_content_src=
Disallow /*_recache
Disallow /*_edit
Disallow /*_admin
Disallow /*_login
Disallow /*_performance
Disallow /*_design
Disallow /*_web_services
Disallow /*?result_184856_result_page=*
Disallow /_resources/
Disallow /resources/
Disallow /sandbox/
Disallow /key-account-info-hubs/

*
googlebot
bingbot

Rule Path
Allow /vdn
Allow /fired-heaters
Allow /nexus
Allow /noise
Allow /gotech
Allow /ece
Disallow /company/where-we-operate/global-locations/
Disallow /__data/

funnelback

Rule Path
Disallow /resources/profiles/

Other Records

Field Value
crawl-delay 10

Comments

  • Block harmful bots
  • Disallow some matrix defaults
  • Disallow: /redirects/
  • Override above blocks for redirects we want to be indexed to pass on link equity
  • Disallow some data paths
  • Try to prevent Funnelback indexing profile information