arcadeoligist.com
robots.txt

Robots Exclusion Standard data for arcadeoligist.com

Resource Scan

Scan Details

Site Domain arcadeoligist.com
Base Domain arcadeoligist.com
Scan Status Ok
Last Scan2025-08-30T03:54:00+00:00
Next Scan 2025-09-06T03:54:00+00:00

Last Scan

Scanned2025-08-30T03:54:00+00:00
URL http://arcadeoligist.com/robots.txt
Domain IPs 198.55.101.16
Response IP 198.55.101.16
Found Yes
Hash 9cce839888ad0988f791796099998461c8999489d19e7067ba3d3ab857d22836
SimHash 0440d3934411

Groups

architextspider

Rule Path
Disallow

baiduspider

Rule Path
Disallow

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

msnbot

Rule Path
Disallow

msnbot-media

Rule Path
Disallow

msnbot-news

Rule Path
Disallow

msnbot-products

Rule Path
Disallow

msnptc

Rule Path
Disallow

naverbot

Rule Path
Disallow

robozilla

Rule Path
Disallow

scooter

Rule Path
Disallow

slurp

Rule Path
Disallow

teoma

Rule Path
Disallow

turnitinbot

Rule Path
Disallow

yandex

Rule Path
Disallow

yahoo-mmcrawler

Rule Path
Disallow

yahooysmcm

Rule Path
Disallow

ia_archiver

Rule Path
Disallow /

*

Rule Path
Disallow /

Comments

  • robots.txt for arcadeoligist.com !