mattadlard.com
robots.txt

Robots Exclusion Standard data for mattadlard.com

Resource Scan

Scan Details

Site Domain mattadlard.com
Base Domain mattadlard.com
Scan Status Ok
Last Scan 2026-03-13T17:48:36+00:00
Next Scan 2026-03-20T17:48:36+00:00

Last Scan

Scanned 2026-03-13T17:48:36+00:00
URL https://mattadlard.com/robots.txt
Domain IPs 104.18.37.69, 172.64.150.187, 2606:4700:4408::ac40:96bb, 2a06:98c1:3100::6812:2545
Response IP 104.18.37.69
Found Yes
Hash 81196598d9b7d1ad97689538207f47aa4ebd39402cea36cf9e06178bd4e58859
SimHash 254f0a00c410
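The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The scan does not name the algorithm, so the sketch below assumes SHA-256 over the raw response body; the helper names sha256_hex and matches_scan are hypothetical and only illustrate how a re-fetched copy could be compared against the recorded value.

```python
import hashlib

# Hash value as recorded in the scan details.
# ASSUMPTION: it is a SHA-256 digest of the raw robots.txt bytes;
# the scan itself does not state which algorithm was used.
EXPECTED = "81196598d9b7d1ad97689538207f47aa4ebd39402cea36cf9e06178bd4e58859"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_scan(body: bytes, expected: str = EXPECTED) -> bool:
    """True if body hashes to the expected digest (case-insensitive)."""
    return sha256_hex(body) == expected.lower()
```

To try it, pass in the bytes of a freshly fetched https://mattadlard.com/robots.txt. A mismatch may mean either that the file has changed since the scan or that the assumption about the algorithm is wrong.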

Groups

*

Rule Path
Disallow /cdn-cgi/
Disallow /*add-to-cart%3D*

*

Rule Path
Disallow /cdn-cgi/
Disallow /*add-to-cart%3D*

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /bakeitbetter/mp-files/
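How these groups combine can be sketched in code. Under RFC 9309, groups that name the same user agent are merged into one rule set, and the most specific (longest) matching rule wins, with Allow preferred on a tie. The sketch below is a minimal illustration of that logic, not the scanner's own implementation: the names RULES, pattern_to_regex and is_allowed are hypothetical, and it compares paths literally, without the percent-encoding normalization a full parser would apply.

```python
import re

# Rules from the three "*" groups above, merged into one set
# (RFC 9309 combines groups that apply to the same user agent).
RULES = [
    ("disallow", "/cdn-cgi/"),
    ("disallow", "/*add-to-cart%3D*"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/bakeitbetter/mp-files/"),
]


def pattern_to_regex(pattern):
    """Compile a robots.txt path pattern: '*' matches any run of
    characters and a trailing '$' anchors the end of the path."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Apply longest-match precedence; a path with no matching rule is allowed."""
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        # re.match anchors at the start, so patterns act as path prefixes.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # The longer pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed
```

With these rules, is_allowed("/wp-admin/admin-ajax.php") is True because the Allow pattern is longer than the Disallow on /wp-admin/, while is_allowed("/wp-admin/options.php") is False. Python's standard urllib.robotparser is not used here because it applies rules in file order and does not expand the * wildcard.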

Other Records

Field Value
sitemap https://mattadlard.com/wp-sitemap.xml

Comments

  • Prevent Crawling Unnecessary Endpoints - Dynamically added by BigScoots
  • Prevent Crawling Unnecessary Endpoints - Dynamically added by BigScoots
  • Custom rule to block this folder from being indexed