sands555c.com
robots.txt

Robots Exclusion Standard data for sands555c.com

Resource Scan

Scan Details

Site Domain sands555c.com
Base Domain sands555c.com
Scan Status Ok
Last Scan 2025-11-14T11:36:23+00:00
Next Scan 2025-12-14T11:36:23+00:00

Last Scan

Scanned 2025-11-14T11:36:23+00:00
URL https://sands555c.com/robots.txt
Redirect https://www.sands555c.com/robots.txt
Redirect Domain www.sands555c.com
Redirect Base sands555c.com
Domain IPs 104.21.58.100, 172.67.158.253, 2606:4700:3030::6815:3a64, 2606:4700:3037::ac43:9efd
Redirect IPs 104.21.58.100, 172.67.158.253, 2606:4700:3030::6815:3a64, 2606:4700:3037::ac43:9efd
Response IP 172.67.158.253
Found Yes
Hash 537b6ce5857c6387799cfb45b7f9936199c2ea3f27aa45a8507e5813deb32bc4
SimHash 741cd950e2a0

Groups

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler

Rule Path
Disallow /

peer39_crawler/1.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

*

Rule Path
Disallow
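
The groups above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`, assuming a robots.txt body reconstructed from three of the listed groups plus the `*` group. The `ROBOTS_TXT` text and the `is_allowed` helper are illustrative assumptions, not the site's actual file; the live file may differ in ordering, casing, and comments.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction from a subset of the groups in the scan.
# The final group has an empty Disallow path, which places no restriction
# on agents that are not named in an earlier group.
ROBOTS_TXT = """\
User-agent: gptbot
Disallow: /

User-agent: claudebot
Disallow: /

User-agent: ccbot
Disallow: /

User-agent: *
Disallow:
"""


def is_allowed(user_agent: str, url: str = "https://www.sands555c.com/") -> bool:
    """Return True if the reconstructed rules permit user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("GPTBot", "ClaudeBot", "Googlebot"):
        print(agent, is_allowed(agent))
```

Under these assumed rules, agents named in a `Disallow /` group are refused every path, while any other agent falls through to the `*` group and is allowed.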