marnesggz.nl
robots.txt
Robots Exclusion Standard data for marnesggz.nl
Resource Scan
Scan Details
Site Domain | marnesggz.nl |
Base Domain | marnesggz.nl |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-06-18T08:33:00+00:00 |
Next Scan | 2024-07-02T08:33:00+00:00 |
Last Successful Scan
Scanned | 2024-05-11T07:34:22+00:00 |
URL | https://marnesggz.nl/robots.txt |
Domain IPs | 77.241.85.221 |
Response IP | 77.241.85.221 |
Found | Yes |
Hash | 1a2460e447777016df4f8d24605ae1a42b97370380d047908e01b69c106ff618 |
SimHash | 5a5749f5c400 |
Groups
*
Rule | Path |
---|---|
Disallow | /*blackhole |
Disallow | /?blackhole |
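
The two paths in this group behave differently under the wildcard rules of the Robots Exclusion Protocol: `*` stands for any run of characters, while `?` has no special meaning and is matched literally. The sketch below illustrates this. The helper name `rule_matches` is hypothetical, and the matcher is a simplified reading of the protocol (it ignores percent-encoding normalisation), not the scanner's own logic.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches a URL path.

    Simplified assumption: '*' matches any sequence of characters,
    a trailing '$' anchors the end, everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start of the path, as robots.txt rules do.
    return re.match(regex, path) is not None


if __name__ == "__main__":
    for pattern in ("/*blackhole", "/?blackhole"):
        for path in ("/wp-content/blackhole/", "/?blackhole=1", "/about"):
            print(pattern, path, rule_matches(pattern, path))
```

Under these assumptions, `/*blackhole` covers any path that contains `blackhole` anywhere after the leading slash, and `/?blackhole` covers only URLs whose path and query begin with exactly `/?blackhole`.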
yandex
ahrefsbot
aspiegelbot
blexbot
mail.ru_bot
megaindex.ru
mj12bot
openlinkprofiler.org
velenpublicwebcrawler
linguee
obot
dotbot
rogerbot
mauibot
buck
ccbot
borneobot
komodiabot
gofeed
seznambot
elisabot
spiderling
sogou web spider
seekport
baiduspider
grapeshotcrawler
mojeekbot
olbicobot
ltx71
proximic
vagabondo
seokicks
bananabot
woorankreview
cispa
adsbot
seekport
trendictionbot
semrushbot
scrapy
lcc
sidetrade
foregenix
nuclei
crawlson
netestate
serpstatbot
xpymep.exe
mtrobot
amazonbot
colly
turnitin
barkrowler
intelx.io_bot
smtbot
dataforseobot
twingly
Rule | Path |
---|---|
Disallow | / |
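
To see how the groups above act on a crawler, the following sketch rebuilds a small excerpt of the file and queries it with Python's standard `urllib.robotparser`. The reconstructed text is an assumption: only a handful of the listed user agents are included, and the original file's exact layout is not shown in this report. Note that `urllib.robotparser` does plain prefix matching and does not expand `*` wildcards, so only the blanket `Disallow: /` rule is exercised reliably here.

```python
from urllib.robotparser import RobotFileParser

# Excerpt of the named agents that share the "Disallow: /" group
# (subset chosen for brevity; the report lists many more).
BLOCKED_AGENTS = ["yandex", "ahrefsbot", "ccbot", "semrushbot", "amazonbot"]

# Assumed reconstruction of the file from the groups in the report.
lines = [
    "User-agent: *",
    "Disallow: /*blackhole",
    "Disallow: /?blackhole",
    "",
]
lines += [f"User-agent: {agent}" for agent in BLOCKED_AGENTS]
lines += ["Disallow: /"]

parser = RobotFileParser()
parser.parse(lines)  # parse from memory; no network request is made

if __name__ == "__main__":
    # A listed agent is refused everywhere on the site.
    print(parser.can_fetch("ahrefsbot", "https://marnesggz.nl/"))
    # An unlisted agent falls back to the "*" group.
    print(parser.can_fetch("Googlebot", "https://marnesggz.nl/about"))
```

Parsing from a list of lines rather than calling `read()` keeps the example offline, which suits a host whose last scan failed with a connection error.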