akinia.com.cy
robots.txt

Robots Exclusion Standard data for akinia.com.cy

Resource Scan

Scan Details

Site Domain akinia.com.cy
Base Domain akinia.com.cy
Scan Status Ok
Last Scan2024-10-11T09:36:00+00:00
Next Scan 2024-10-18T09:36:00+00:00

Last Scan

Scanned2024-10-11T09:36:00+00:00
URL https://akinia.com.cy/robots.txt
Domain IPs 104.21.96.111, 172.67.177.25
Response IP 104.21.96.111
Found Yes
Hash 4c2a31f513c9438d5b21752be1830089a5821791daf07f2705a2c93148677075
SimHash 597849116e40

Groups

*

Rule Path
Disallow /index.php
Disallow /go/
Disallow /plugins/
Disallow /libs/
Disallow /includes/
Disallow /print*
Disallow /*?sort_by=
Disallow /*%26sort_by%3D
Disallow /*?sort_type=
Disallow /*%26sort_type%3D

screaming frog seo spider

Rule Path
Allow
Disallow /libs/kcaptcha/getImage.php
Disallow /files/
Disallow /*%26cf-
Disallow /index.php?page=print

spbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

openstat.ru/bot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

synthesio crawler release monalisa

Rule Path
Disallow /

sogou

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

haosouspider

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

domaincrawler

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

deusu

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

getintent crawler

Rule Path
Disallow /

proximic

Rule Path
Disallow /

adbeat_bot

Rule Path
Disallow /

addthis.com

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

extlinksbot

Rule Path
Disallow /

getintent crawler

Rule Path
Disallow /

ltx71 - (http://ltx71.com/)

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

grouphigh

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

criteobot/0.1

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

Comments

  • http://www.trendiction.de/de/publisher/bot

Warnings

  • 2 invalid lines.