alva.org.uk
robots.txt

Robots Exclusion Standard data for alva.org.uk

Resource Scan

Scan Details

Site Domain alva.org.uk
Base Domain alva.org.uk
Scan Status Ok
Last Scan2024-10-30T08:31:18+00:00
Next Scan 2024-11-29T08:31:18+00:00

Last Scan

Scanned2024-10-30T08:31:18+00:00
URL https://alva.org.uk/robots.txt
Redirect https://www.alva.org.uk/robots.txt
Redirect Domain www.alva.org.uk
Redirect Base alva.org.uk
Domain IPs 104.21.41.116, 172.67.146.230, 2606:4700:3031::6815:2974, 2606:4700:3031::ac43:92e6
Redirect IPs 104.21.41.116, 172.67.146.230, 2606:4700:3031::6815:2974, 2606:4700:3031::ac43:92e6
Response IP 172.67.146.230
Found Yes
Hash 9a5476c7c86809314973216c33fd761d1eb6ee5b8c59131e2349449c53d59764
SimHash 521ec472e8b3

Groups

ahrefsbot

Rule Path
Disallow /

blp_bbot/0.1

Rule Path
Disallow /

flamingo_searchengine

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

cyberalert

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

webcrawler

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

cityreview

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

yandex

Rule Path
Disallow /

discobot

Rule Path
Disallow /

birubot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /
Disallow /

twitterbot

Rule Path
Disallow /

gosospider

Rule Path
Disallow /

steeler

Rule Path
Disallow /

summify

Rule Path
Disallow /

accelobot

Rule Path
Disallow /

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Warnings

  • 2 invalid lines.
  • `user agent` is not a known field.