theplainsman.com
robots.txt

Robots Exclusion Standard data for theplainsman.com

Resource Scan

Scan Details

Site Domain theplainsman.com
Base Domain theplainsman.com
Scan Status Ok
Last Scan2025-12-13T23:54:30+00:00
Next Scan 2026-01-12T23:54:30+00:00

Last Scan

Scanned2025-12-13T23:54:30+00:00
URL https://www.theplainsman.com/robots.txt
Domain IPs 34.193.93.95, 35.168.142.214
Response IP 35.168.142.214
Found Yes
Hash ed9ad42189db75801556c5c5792982a94c0373b8e68b3d1d7f69df0d94e82111
SimHash 4804dc304711

Groups

*

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

semrushbot

Rule Path
Disallow /