spreadshirtmedia.net
robots.txt

Robots Exclusion Standard data for spreadshirtmedia.net

Resource Scan

Scan Details

Site Domain spreadshirtmedia.net
Base Domain spreadshirtmedia.net
Scan Status Ok
Last Scan2024-09-15T08:04:05+00:00
Next Scan 2024-10-15T08:04:05+00:00

Last Scan

Scanned2024-09-15T08:04:05+00:00
URL https://spreadshirtmedia.net/robots.txt
Domain IPs 151.101.130.137, 151.101.194.137, 151.101.2.137, 151.101.66.137, 2a04:4e42:200::649, 2a04:4e42:400::649, 2a04:4e42:600::649, 2a04:4e42::649
Response IP 151.101.66.137
Found Yes
Hash 90b1179d4c9b85c6dd91687690ee7e71f1bf67cb22589fa8a3becf3810cd2789
SimHash c75ccbf0861b

Groups

ahrefsbot

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

coccocbot-image

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

sogouspider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

geedobot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

*

Rule Path
Disallow /bims/