menshealth.pt
robots.txt
Robots Exclusion Standard data for menshealth.pt
Resource Scan
Scan Details
Site Domain | menshealth.pt |
Base Domain | menshealth.pt |
Scan Status | Ok |
Last Scan | 2024-06-24T23:38:38+00:00 |
Next Scan | 2024-07-01T23:38:38+00:00 |
Last Scan
Scanned | 2024-06-24T23:38:38+00:00 |
URL | https://menshealth.pt/robots.txt |
Domain IPs | 104.26.8.220, 104.26.9.220, 172.67.73.154, 2606:4700:20::681a:8dc, 2606:4700:20::681a:9dc, 2606:4700:20::ac43:499a |
Response IP | 104.26.9.220 |
Found | Yes |
Hash | b0e260a177751bcf26cc225fa14bb55768b1a1fba67c5c47351aca7d85299cb5 |
SimHash | acd948a2c6a1 |
Groups
*
Rule | Path |
---|---|
Disallow |
googlebot
googlebot-video
bingbot
baiduspider
baiduspider-mobile
baiduspider-video
baiduspider-image
naverbot
yeti
yandex
yandexbot
yandexmobilebot
yandexvideo
yandexwebmaster
yandexsitelinks
seznambot
Rule | Path |
---|---|
Allow | / |
yahoo pipes 1.0
facebot
externalfacebookhit
semrushbot
semrushbot-sa
mj12bot
ahrefsbot
Rule | Path |
---|---|
Disallow | / |
Disallow | /*?* |
Disallow | /topico/* |
Disallow | /t/* |
Disallow | /categoria/* |
Disallow | /author/* |
Disallow | /blog/page/* |
Disallow | /page/* |
meltawer
digimind
knowings
sindup
talkwater
turnitinbot
converacrawler
jetbot
newsnow
kbcrawl
amisoftware
newzbin
ask n read
qwam content intelligence
zite
flipboard
youmag
synthesio
trendybuzz
spotter
scoop.it
linkfluence
augure
corporama
grub-client
k2spider
libwww
wget
adequat
adequat-systems
auramundi
coexel
ellisphere
leadbox
mention
moreover
mytwip
newsnow
newzbin
opinion-tracker
proxem
score3
trendeo
vecteurplus
verticalsearch
vsw
winello
fetch
infoseek
msiecrawler
offline explorer
sitecheck.internetseer.com
teleport
teleportpro
webcopier
webstripper
zealbot
asknread.com
ellisphere
spotter
omgilibot
omgili
Rule | Path |
---|---|
Disallow | / |
Warnings
- 2 invalid lines.
Comments