sandwiche.me
robots.txt

Robots Exclusion Standard data for sandwiche.me

Resource Scan

Scan Details

Site Domain sandwiche.me
Base Domain sandwiche.me
Scan Status Ok
Last Scan2024-05-21T22:52:50+00:00
Next Scan 2024-06-20T22:52:50+00:00

Last Scan

Scanned2024-05-21T22:52:50+00:00
URL https://sandwiche.me/robots.txt
Domain IPs 13.227.254.124, 13.227.254.30, 13.227.254.40, 13.227.254.50
Response IP 13.227.254.40
Found Yes
Hash 21e7b49d69da4096d5f3da05dbb9be20fec718b52dd6c39133c9c57b9f5ec9e0
SimHash a0555301c5b1

Groups

applebot

Rule Path
Disallow /admin

baiduspider

Rule Path
Disallow /admin

bingbot

Rule Path
Disallow /admin

discordbot

Rule Path
Disallow /admin

facebookexternalhit

Rule Path
Disallow /admin

googlebot

Rule Path
Disallow /admin

googlebot-image

Rule Path
Disallow /admin

google-inspectiontool

Rule Path
Disallow /admin

ia_archiver

Rule Path
Disallow /admin

linkedinbot

Rule Path
Disallow /admin

msnbot

Rule Path
Disallow /admin

naverbot

Rule Path
Disallow /admin

pinterestbot

Rule Path
Disallow /admin

screaming frog seo spider

Rule Path
Disallow /admin

seznambot

Rule Path
Disallow /admin

slurp

Rule Path
Disallow /admin

teoma

Rule Path
Disallow /admin

telegrambot

Rule Path
Disallow /admin

twitterbot

Rule Path
Disallow /admin

yandex

Rule Path
Disallow /admin

yeti

Rule Path
Disallow /admin

snapchatadsbot

Rule Path
Disallow /admin

semrushbot

Rule Path
Disallow /admin

pinterestbot

Rule Path
Disallow /

*

Rule Path
Disallow /

Other Records

Field Value
sitemap https://sandwiche.me/sitemap.xml
sitemap https://sandwiche.me/sitemap-links-afiliados.xml

Comments

  • We allow everything from the root
  • and then set noindex on our backends for paths that we want to Disallow
  • to prevent this list from being too complex and long