diffordsguide.com
robots.txt

Robots Exclusion Standard data for diffordsguide.com

Resource Scan

Scan Details

Site Domain diffordsguide.com
Base Domain diffordsguide.com
Scan Status Ok
Last Scan2024-06-09T12:07:47+00:00
Next Scan 2024-07-09T12:07:47+00:00

Last Scan

Scanned2024-06-09T12:07:47+00:00
URL https://diffordsguide.com/robots.txt
Redirect https://www.diffordsguide.com/robots.txt
Redirect Domain www.diffordsguide.com
Redirect Base diffordsguide.com
Domain IPs 18.135.36.150, 18.171.107.198, 2a05:d01c:f44:a801:f25e:4215:2819:c1da, 2a05:d01c:f44:a804:1de8:58e0:68cc:afc, 2a05:d01c:f44:a806:f7e2:d9:ccf7:ef07, 3.10.93.244
Redirect IPs 18.135.36.150, 18.171.107.198, 2a05:d01c:f44:a801:f25e:4215:2819:c1da, 2a05:d01c:f44:a804:1de8:58e0:68cc:afc, 2a05:d01c:f44:a806:f7e2:d9:ccf7:ef07, 3.10.93.244
Response IP 18.135.36.150
Found Yes
Hash e47acd721392a52836405fb8fba9978345b3bf8a7214647cb70249d16eba3230
SimHash 71488160f2d0

Groups

*

Rule Path
Disallow /.well-known/*
Disallow /test/*
Disallow /bug
Disallow /encyclopedia/sample/*
Disallow /competition/preview/*
Disallow /event/preview/*
Disallow /announcement/preview/*
Disallow /g/preview/*
Disallow /pay/*
Disallow /profile/cocktail-finder
Disallow /city-guide/preview/*
Disallow /producer/preview/*
Disallow /enter/*
Allow /

semrushbot

Rule Path
Disallow /

titan

Rule Path
Disallow /

companybook-crawler

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

linguee

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

seekbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /login
Disallow /profile
Disallow /account
Disallow /your-account
Disallow /el-gr
Disallow /pt-br
Disallow /en-au

Other Records

Field Value
sitemap https://www.diffordsguide.com/sitemap/gb.xml
sitemap https://www.diffordsguide.com/sitemap/gr.xml
sitemap https://www.diffordsguide.com/sitemap/au.xml
sitemap https://www.diffordsguide.com/sitemap/br.xml
sitemap https://www.diffordsguide.com/sitemap/bar.xml

Comments

  • sitemaps
  • bot/crawler block
  • limit amazonbot to just main content,