stsheet.com
robots.txt

Robots Exclusion Standard data for stsheet.com

Resource Scan

Scan Details

Site Domain stsheet.com
Base Domain stsheet.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-06-04T10:06:55+00:00
Next Scan 2025-09-02T10:06:55+00:00

Last Successful Scan

Scanned2024-07-17T10:05:29+00:00
URL https://www.stsheet.com/robots.txt
Domain IPs 104.21.44.30, 172.67.194.85, 2606:4700:3030::6815:2c1e, 2606:4700:3030::ac43:c255
Response IP 104.21.44.30
Found Yes
Hash 5f91784519d5f679fdbe76f1ed6e844e3f1e8ac2246baf1000b33d5dd9e5b6b7
SimHash 6b1ddcf0eb10

Groups

amazonbot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /nogooglebot/

*

Rule Path
Disallow /login

adsbot-google

Rule Path
Disallow /login

nutch

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

ahrefssiteaudit

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

mj12bot

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.stsheet.com/sitemap.xml