motorheads.com
robots.txt

Robots Exclusion Standard data for motorheads.com

Resource Scan

Scan Details

Site Domain motorheads.com
Base Domain motorheads.com
Scan Status Ok
Last Scan2024-11-04T11:36:09+00:00
Next Scan 2024-11-11T11:36:09+00:00

Last Scan

Scanned2024-11-04T11:36:09+00:00
URL https://motorheads.com/robots.txt
Redirect https://www.motorheads.com/robots.txt
Redirect Domain www.motorheads.com
Redirect Base motorheads.com
Domain IPs 104.26.2.16, 104.26.3.16, 172.67.75.40, 2606:4700:20::681a:210, 2606:4700:20::681a:310, 2606:4700:20::ac43:4b28
Redirect IPs 151.101.1.91, 151.101.129.91, 151.101.193.91, 151.101.65.91
Response IP 199.232.45.91
Found Yes
Hash 2f49fa8feb6448d80a2db63be54b13150c45cd661f19a4934d41c8cb5c28637c
SimHash 5eaceec2e48b

Groups

*

Rule Path
Allow /sitemap.xml

*

Rule Path
Disallow /cgi-bin/
Disallow /login.html
Disallow /page.html
Disallow /register.html
Disallow /company/contact-us
Disallow /company/privacy-policy
Disallow /company/terms-conditions
Disallow /company/team
Disallow /search/
Disallow /searchtags.html
Disallow /*/searchtags-*.html
Disallow /preview/*.html
Disallow /*.php
Disallow /*.inc
Disallow /*.txt
Allow /ads.txt
Disallow /*.pdf
Disallow /admin/
Disallow /wp-admin/
Disallow /trackback/
Disallow /wp-content/plugins/
Disallow */1006504/
Disallow /by/authors/$
Disallow /tellafriend
Disallow /community
Disallow /company/myaccount
Disallow /*?action=
Disallow /*%
Disallow /*//

Comments

  • robots.txt cdn
  • EXCEPTIONS
  • ADSENSE
  • User-agent: Mediapartners-Google
  • Disallow:
  • ALL AGENTS
  • disallow all files in these directories
  • disallow specific pages
  • disallow multi-kw systems
  • disallow preview pages
  • disallow all files ending with these extensions
  • Disallow: /*.js
  • Disallow: /*.css
  • Disallow: /*.xml
  • disallow admin
  • disallow specific
  • Disallow: /*?var_
  • Disallow: /*&var_
  • Disallow: /*?id_
  • Disallow: /*&id_
  • Disallow: /*?
  • Disallow: /*&