peterattiamd.com
robots.txt

Robots Exclusion Standard data for peterattiamd.com

Resource Scan

Scan Details

Site Domain peterattiamd.com
Base Domain peterattiamd.com
Scan Status Ok
Last Scan 2025-09-20T20:15:02+00:00
Next Scan 2025-10-20T20:15:02+00:00

Last Scan

Scanned 2025-09-20T20:15:02+00:00
URL https://peterattiamd.com/robots.txt
Domain IPs 104.18.37.69, 172.64.150.187, 2606:4700:4408::ac40:96bb, 2a06:98c1:3100::6812:2545
Response IP 104.18.37.69
Found Yes
Hash ece2349bda8d80307e20474aca43c5dd87527f2b0e7a8f5b6c7c439a5adc286d
SimHash a0e08a20ee10

Groups

*

Rule Path
Disallow /cdn-cgi/
Disallow /*add-to-cart%3D*

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow *.mp3
Disallow /category/podcast/qualys/feed$
Disallow /category/podcast/feed$
Disallow /category/podcast/qualys/feed/$
Disallow /category/podcast/feed/$
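
The rules in the two `*` groups above use the `*` wildcard and the `$` end-of-path anchor. As a rough illustration of how a crawler might evaluate them, here is a minimal sketch that assumes the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie) and assumes both `*` groups are merged into one rule set. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and the sketch compares paths literally with no percent-encoding normalization, so real crawlers may behave differently.

```python
import re

# Rules copied from the two "*" groups listed above, treated as one merged set.
RULES = [
    ("disallow", "/cdn-cgi/"),
    ("disallow", "/*add-to-cart%3D*"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "*.mp3"),
    ("disallow", "/category/podcast/qualys/feed$"),
    ("disallow", "/category/podcast/feed$"),
    ("disallow", "/category/podcast/qualys/feed/$"),
    ("disallow", "/category/podcast/feed/$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the
    pattern to the end of the path. Everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    The longest matching pattern decides; on equal length, Allow wins.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, like a prefix match.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ["/wp-admin/", "/wp-admin/admin-ajax.php",
              "/category/podcast/feed", "/category/podcast/feed/atom"]:
        print(p, "allowed" if is_allowed(p) else "blocked")
```

Under these assumptions, the `$` anchor means only the exact feed URLs (with or without a trailing slash) are blocked, while deeper paths under `/category/podcast/feed/` stay crawlable, and the longer Allow rule for `admin-ajax.php` overrides the shorter `/wp-admin/` Disallow.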

Comments

  • Prevent Crawling Unnecessary Endpoints - Dynamically added by BigScoots