jn.pt
robots.txt
Robots Exclusion Standard data for jn.pt
Resource Scan
Scan Details
Site Domain | jn.pt |
Base Domain | jn.pt |
Scan Status | Ok |
Last Scan | 2024-11-13T18:22:25+00:00 |
Next Scan | 2024-11-20T18:22:25+00:00 |
Last Scan
Scanned | 2024-11-13T18:22:25+00:00 |
URL | https://jn.pt/robots.txt |
Redirect | https://www.jn.pt/robots.txt |
Redirect Domain | www.jn.pt |
Redirect Base | jn.pt |
Domain IPs | 104.26.2.211, 104.26.3.211, 172.67.71.226, 2606:4700:20::681a:2d3, 2606:4700:20::681a:3d3, 2606:4700:20::ac43:47e2 |
Redirect IPs | 104.26.2.211, 104.26.3.211, 172.67.71.226, 2606:4700:20::681a:2d3, 2606:4700:20::681a:3d3, 2606:4700:20::ac43:47e2 |
Response IP | 104.26.3.211 |
Found | Yes |
Hash | 6f23915a0f1574010fe65790dc74372278626f7d70fe672c3552ebc38187f93a |
SimHash | acd908b2c6a1 |
Groups
*
Rule | Path |
---|---|
Disallow |
googlebot
googlebot-video
bingbot
baiduspider
baiduspider-mobile
baiduspider-video
baiduspider-image
naverbot
yeti
yandex
yandexbot
yandexmobilebot
yandexvideo
yandexwebmaster
yandexsitelinks
seznambot
Rule | Path |
---|---|
Allow | / |
yahoo pipes 1.0
facebot
externalfacebookhit
semrushbot
semrushbot-sa
mj12bot
ahrefsbot
Rule | Path |
---|---|
Disallow | / |
Disallow | /*?* |
Disallow | /newsgen/* |
Disallow | /page/* |
meltawer
digimind
knowings
sindup
talkwater
turnitinbot
converacrawler
jetbot
newsnow
kbcrawl
amisoftware
newzbin
ask n read
qwam content intelligence
zite
flipboard
youmag
synthesio
trendybuzz
spotter
scoop.it
linkfluence
augure
corporama
grub-client
k2spider
libwww
wget
adequat
adequat-systems
auramundi
coexel
ellisphere
leadbox
mention
moreover
mytwip
newsnow
newzbin
opinion-tracker
proxem
score3
trendeo
vecteurplus
verticalsearch
vsw
winello
fetch
infoseek
msiecrawler
offline explorer
sitecheck.internetseer.com
teleport
teleportpro
webcopier
webstripper
zealbot
asknread.com
ellisphere
spotter
omgilibot
omgili
Rule | Path |
---|---|
Disallow | / |
Warnings
- 2 invalid lines.
Comments