menshealth.pt
robots.txt

Robots Exclusion Standard data for menshealth.pt

Resource Scan

Scan Details

Site Domain menshealth.pt
Base Domain menshealth.pt
Scan Status Ok
Last Scan2024-06-24T23:38:38+00:00
Next Scan 2024-07-01T23:38:38+00:00

Last Scan

Scanned2024-06-24T23:38:38+00:00
URL https://menshealth.pt/robots.txt
Domain IPs 104.26.8.220, 104.26.9.220, 172.67.73.154, 2606:4700:20::681a:8dc, 2606:4700:20::681a:9dc, 2606:4700:20::ac43:499a
Response IP 104.26.9.220
Found Yes
Hash b0e260a177751bcf26cc225fa14bb55768b1a1fba67c5c47351aca7d85299cb5
SimHash acd948a2c6a1

Groups

*

Rule Path
Disallow

googlebot
googlebot-video
bingbot
baiduspider
baiduspider-mobile
baiduspider-video
baiduspider-image
naverbot
yeti
yandex
yandexbot
yandexmobilebot
yandexvideo
yandexwebmaster
yandexsitelinks
seznambot

Rule Path
Allow /

adsbot-google
twitterbot
adidxbot

Rule Path
Allow /

yahoo pipes 1.0
facebot
externalfacebookhit
semrushbot
semrushbot-sa
mj12bot
ahrefsbot

Rule Path
Disallow /
Disallow /*?*
Disallow /topico/*
Disallow /t/*
Disallow /categoria/*
Disallow /author/*
Disallow /blog/page/*
Disallow /page/*

ia_archiver

Rule Path
Allow /$
Disallow /*

ia_archiver-web.archive.org

Rule Path
Allow /$
Disallow /*

meltawer
digimind
knowings
sindup
talkwater
turnitinbot
converacrawler
jetbot
newsnow
kbcrawl
amisoftware
newzbin
ask n read
qwam content intelligence
zite
flipboard
youmag
synthesio
trendybuzz
spotter
scoop.it
linkfluence
augure
corporama
grub-client
k2spider
libwww
wget
adequat
adequat-systems
auramundi
coexel
ellisphere
leadbox
mention
moreover
mytwip
newsnow
newzbin
opinion-tracker
proxem
score3
trendeo
vecteurplus
verticalsearch
vsw
winello
fetch
infoseek
msiecrawler
offline explorer
sitecheck.internetseer.com
teleport
teleportpro
webcopier
webstripper
zealbot
asknread.com
ellisphere
spotter
omgilibot
omgili

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Comments

  • É proibido o uso de web crawlers ou outros métodos automáticos de navegação neste site.
  • Proibimos o rastreamento de nosso site usando um agente que não corresponda à sua identidade conforme número 2 alínea w) do artigo 75o do Decreto Lei n.o 63/85, de 14 de Março.
  • Convidamos-vos a entrar em contato connosco para subscrever uma licença de utilizador. Apenas os nossos parceiros têm o direito de utilizar o nosso conteúdo para uma finalidade que não seja estritamente individual.
  • Robots excluidos .
  • Disable ChatGPT crawler
  • Disable CommonCrawl
  • Disable BARD and Vortex AI crawler

Warnings

  • 2 invalid lines.