proteinhouseeg.com
robots.txt

Robots Exclusion Standard data for proteinhouseeg.com

Resource Scan

Scan Details

Site Domain proteinhouseeg.com
Base Domain proteinhouseeg.com
Scan Status Ok
Last Scan2026-02-07T00:05:48+00:00
Next Scan 2026-03-09T00:05:48+00:00

Last Scan

Scanned2026-02-07T00:05:48+00:00
URL https://proteinhouseeg.com/robots.txt
Domain IPs 104.21.78.103, 172.67.220.56, 2606:4700:3031::ac43:dc38, 2606:4700:3033::6815:4e67
Response IP 104.21.78.103
Found Yes
Hash e28045a1ee3ed53d1d01145f90057982bd31a7ff9bb5c81de869baa79ecafa6f
SimHash e900a822efb2

Groups

*

Rule Path
Disallow /wp-content/uploads/wc-logs/
Disallow /wp-content/uploads/woocommerce_transient_files/
Disallow /wp-content/uploads/woocommerce_uploads/
Disallow /*?add-to-cart=
Disallow /*?*add-to-cart=
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://proteinhouseeg.com/wp-sitemap.xml