leisuremedia.com
robots.txt

Robots Exclusion Standard data for leisuremedia.com

Resource Scan

Scan Details

Site Domain leisuremedia.com
Base Domain leisuremedia.com
Scan Status Ok
Last Scan 2024-10-08T03:44:29+00:00
Next Scan 2024-11-07T03:44:29+00:00

Last Scan

Scanned 2024-10-08T03:44:29+00:00
URL https://www.leisuremedia.com/robots.txt
Domain IPs 104.21.40.77, 172.67.181.58, 2606:4700:3035::ac43:b53a, 2606:4700:3037::6815:284d
Response IP 104.21.40.77
Found Yes
Hash 20b8ea7b46bb6d35fa03080c982a616640b51e09f55678ce60f9ed3915e826bc
SimHash 521ec472e8b3
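
The Hash value is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The report does not name the algorithm or say whether the body is normalized before hashing, so the following is only a minimal sketch under the assumption that the raw response body is hashed with SHA-256. The function name `body_digest` and the sample body are hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt response body."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file contents.
sample = b"User-agent: ahrefsbot\nDisallow: /\n"
digest = body_digest(sample)
print(len(digest))  # 64 hex characters, the same length as the Hash field
```

Comparing digests from two scans is a cheap way to detect that the file changed, without storing or diffing the full body.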

Groups

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

blp_bbot/0.1

Rule Path
Disallow /

flamingo_searchengine

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

cyberalert

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

webcrawler

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

cityreview

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

yandex

Rule Path
Disallow /

discobot

Rule Path
Disallow /

birubot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /
Disallow /

twitterbot

Rule Path
Disallow /

gosospider

Rule Path
Disallow /

steeler

Rule Path
Disallow /

summify

Rule Path
Disallow /

accelobot

Rule Path
Disallow /

*

No rules defined. All paths allowed.
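
To illustrate how groups like these are evaluated, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt text is reconstructed from two of the listed groups plus the empty `*` group; it is an assumption, not the actual file. `OtherBot` is a hypothetical crawler name, and `is_allowed` is a helper introduced only for this example.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt (assumption): two listed groups and the empty "*" group.
ROBOTS_EXCERPT = """\
User-agent: ahrefsbot
Disallow: /

User-agent: semrushbot
Disallow: /

User-agent: *
"""


def is_allowed(user_agent: str, url: str) -> bool:
    """Check whether user_agent may fetch url under the excerpt above."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_EXCERPT.splitlines())
    return parser.can_fetch(user_agent, url)


# A named crawler with "Disallow: /" is blocked from every path.
print(is_allowed("AhrefsBot", "https://www.leisuremedia.com/"))  # False
# A crawler with no matching group falls through to "*", which has no rules.
print(is_allowed("OtherBot", "https://www.leisuremedia.com/"))  # True
```

Matching on the user-agent token is case-insensitive in this parser, which is why `AhrefsBot` matches the lowercase `ahrefsbot` group.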

Other Records

Field Value
crawl-delay 10

Warnings

  • 2 invalid lines.
  • `user agent` is not a known field.
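
The second warning suggests that the file contains at least one line whose field name is written `user agent` (with a space) rather than `User-agent`. The scanner's actual validation rules are not shown, so the following is only a sketch of how such lines could be detected. The `KNOWN_FIELDS` set, the function name, and the sample text are assumptions made for illustration.

```python
# Field names accepted by this sketch (assumption, not the scanner's list).
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def find_unknown_fields(text: str) -> list[tuple[int, str]]:
    """Return (line number, field name) for each unrecognized field."""
    problems = []
    for number, raw in enumerate(text.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS:
            problems.append((number, field))
    return problems


# Hypothetical sample: line 1 uses a space instead of a hyphen.
sample = "User agent: examplebot\nDisallow: /\nUser-agent: dotbot\nDisallow: /\n"
print(find_unknown_fields(sample))  # [(1, 'user agent')]
```

A parser that rejects the misspelled line will not open a new group there, so the rules that follow it may be attached to the preceding group or ignored.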