apolloduck.be
robots.txt

Robots Exclusion Standard data for apolloduck.be

Resource Scan

Scan Details

Site Domain apolloduck.be
Base Domain apolloduck.be
Scan Status Ok
Last Scan 2024-10-06T07:46:31+00:00
Next Scan 2024-10-13T07:46:31+00:00

Last Scan

Scanned 2024-10-06T07:46:31+00:00
URL https://apolloduck.be/robots.txt
Redirect https://www.apolloduck.be/robots.txt
Redirect Domain www.apolloduck.be
Redirect Base apolloduck.be
Domain IPs 149.11.45.181
Redirect IPs 149.11.45.181
Response IP 149.11.45.181
Found Yes
Hash 43b54da74617fae78ac4413664e3d7c3c31ba42a4a771a0c303ec993d0d6808c
SimHash 2305f462a481

Groups

mediapartners-google

Rule Path
Disallow (empty path: nothing is disallowed)

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

*

Rule Path
Disallow /graphics/
Disallow /logos/
Disallow /video_bin/
Disallow /legal/
Disallow /enquire.phtml
Disallow /new/contact.phtml
Disallow /js/

ia_archiver

Rule Path
Disallow /

lcc

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

yandex

Rule Path
Disallow /
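
The groups above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`, not a copy of the site's file: the `ROBOTS_TXT` text is an assumed reconstruction of a few of the listed groups (the report shows agent names in lowercase and does not preserve the original wording or order), and the agent name `ExampleBot` and the paths `/js/app.js` and `/some/listing.phtml` are hypothetical, chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of part of the file, based on the groups listed in
# the report. The real robots.txt may differ in casing, order, and comments.
ROBOTS_TXT = """\
User-agent: Mediapartners-Google
Disallow:

User-agent: bingbot
Crawl-delay: 30

User-agent: *
Disallow: /graphics/
Disallow: /logos/
Disallow: /video_bin/
Disallow: /legal/
Disallow: /enquire.phtml
Disallow: /new/contact.phtml
Disallow: /js/

User-agent: GPTBot
Disallow: /
"""

# Host taken from the Redirect field of the report.
BASE = "https://www.apolloduck.be"

parser = RobotFileParser()
# parse() accepts an iterable of lines, so no network request is made.
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # "ExampleBot" and the sample paths are hypothetical.
    for agent, path in [
        ("GPTBot", "/"),
        ("ExampleBot", "/js/app.js"),
        ("ExampleBot", "/some/listing.phtml"),
        ("bingbot", "/js/app.js"),
        ("Mediapartners-Google", "/legal/"),
    ]:
        print(agent, path, allowed(agent, path))
    print("bingbot crawl-delay:", parser.crawl_delay("bingbot"))
```

An agent with its own group (such as bingbot) is evaluated against that group only, so the `*` rules do not apply to it; this matches the "No rules defined. All paths allowed." entries above. Note that `urllib.robotparser` matches agent names by substring and applies the first matching rule in file order, which may differ from other parsers on more complex files.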