webagentsolutions.com
robots.txt

Robots Exclusion Standard data for webagentsolutions.com

Resource Scan

Scan Details

Site Domain webagentsolutions.com
Base Domain webagentsolutions.com
Scan Status Ok
Last Scan2024-10-10T13:43:53+00:00
Next Scan 2024-11-09T13:43:53+00:00

Last Scan

Scanned2024-10-10T13:43:53+00:00
URL https://webagentsolutions.com/robots.txt
Domain IPs 52.170.197.133
Response IP 52.170.197.133
Found Yes
Hash 489ce20b304253d7ffc08d29277cddd599c9660e91d97f3ea2ec39ff293da485
SimHash c87e55516780

Groups

*

Rule Path
Disallow /account/*
Disallow /default/ajax/*
Disallow /resource/*
Disallow /style/*
Disallow /tracker/*
Disallow /widget/*

Other Records

Field Value
crawl-delay 10

serpstatbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

geedobot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

linguee

Rule Path
Disallow /

ltx71 - (http://ltx71.com/)

Rule Path
Disallow /

daum

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

memorybot

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

nerdybot

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

plukkie

Rule Path
Disallow /

alphaseobot

Rule Path
Disallow /

alphaseobot-sa

Rule Path
Disallow /

extlinksbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

Warnings

  • 2 invalid lines.