marzanoresources.com
robots.txt

Robots Exclusion Standard data for marzanoresources.com

Resource Scan

Scan Details

Site Domain marzanoresources.com
Base Domain marzanoresources.com
Scan Status Ok
Last Scan2024-11-10T17:49:43+00:00
Next Scan 2024-12-10T17:49:43+00:00

Last Scan

Scanned2024-11-10T17:49:43+00:00
URL https://marzanoresources.com/robots.txt
Domain IPs 104.26.14.166, 104.26.15.166, 172.67.72.33, 2606:4700:20::681a:ea6, 2606:4700:20::681a:fa6, 2606:4700:20::ac43:4821
Response IP 172.67.72.33
Found Yes
Hash f4d0ef29a00037f629bbe0f5ac040f43be19764e6d1e132351cbdf0aab1962e3
SimHash fb26f82bc3b1

Groups

*

Rule Path
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /customer/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D
Disallow /blog/wp-login.php
Disallow /blog/wp-admin
Disallow /custom/
Disallow /misc/
Disallow /plc-navigator/
Disallow /scripts/
Disallow /catalog/product_compare/
Disallow /catalog/category/view/
Disallow /catalog/product/view/
Disallow /catalogsearch/
Disallow /checkout/
Disallow /control/
Disallow /contacts/
Disallow /customer/
Disallow /customize/
Disallow /newsletter/
Disallow /poll/
Disallow /review/
Disallow /sendfriend/
Disallow /tag/
Disallow /wishlist/
Disallow /search/
Disallow /search/?*
Disallow /customer/
Disallow /search/
Disallow /searchresults/

deepcrawl

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /
Disallow /
Disallow /
Disallow /

Comments

  • Default Instructions
  • Directories
  • Paths (clean URLs)
  • disabling indexing
  • disabling indexing
  • disable deepcrawl....
  • disable ahrefs bot
  • disable petalbot
  • disable AI bot traffic

Warnings

  • `‍user-agent` is not a known field.