marianne.net
robots.txt
Robots Exclusion Standard data for marianne.net
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | marianne.net |
Base Domain | marianne.net |
Scan Status | Ok |
Last Scan | 2024-11-16T08:32:24+00:00 |
Next Scan | 2024-11-23T08:32:24+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-16T08:32:24+00:00 |
URL | https://marianne.net/robots.txt |
Redirect | https://www.marianne.net/robots.txt |
Redirect Domain | www.marianne.net |
Redirect Base | marianne.net |
Domain IPs | 3.164.85.14, 3.164.85.20, 3.164.85.30, 3.164.85.97 |
Redirect IPs | 18.161.111.102, 18.161.111.72, 18.161.111.77, 18.161.111.97 |
Response IP | 65.9.112.51 |
Found | Yes |
Hash | 1e9332c8980915c6173d8ec9b8aeb80aa15a4c9c4d5aea354e7aef76fb73801b |
SimHash | 284dc8314d25 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | *?ref=* |
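
The two rules in this group interact through wildcard matching. The following is a minimal sketch of how a crawler that follows the longest-match convention of RFC 9309 might evaluate them. The helper names (`pattern_to_regex`, `is_allowed`) and the sample paths are hypothetical, and the report does not say how any particular crawler treats a pattern that starts with `*` rather than `/`.

```python
import re

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

# Rules of the first `*` group, as listed in the table above.
RULES = [("allow", "/"), ("disallow", "*?ref=*")]

def is_allowed(path_and_query: str) -> bool:
    """Longest matching pattern wins; Allow wins ties; no match means allowed."""
    best_len, allowed = -1, True
    for kind, pattern in RULES:
        if pattern_to_regex(pattern).match(path_and_query):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed

# Hypothetical paths, chosen only to exercise both rules.
plain = is_allowed("/politique/article")
tracked = is_allowed("/politique/article?ref=home")
```

Under these assumptions a plain article path stays crawlable, while the same path with a `?ref=` query string is excluded, because the seven-character `*?ref=*` pattern is more specific than `/`.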
*
Rule | Path |
---|---|
Disallow | / |
meltawer
digimind
knowings
sindup
cision
talkwater
turnitinbot
converacrawler
jetbot
newsnow
kbcrawl
amisoftware
newzbin
ask n read
qwam content intelligence
zite
youmag
synthesio
trendybuzz
spotter
scoop.it
linkfluence
augure
corporama
grub-client
k2spider
libwww
wget
adequat
adequat-systems
auramundi
coexel
ellisphere
leadbox
mention
moreover
mytwip
opinion-tracker
proxem
score3
trendeo
vecteurplus
verticalsearch
vsw
winello
fetch
infoseek
msiecrawler
offline explorer
sitecheck.internetseer.com
teleport
teleportpro
webcopier
webstripper
zealbot
asknread.com
omgilibot
omgili
xenu link sleuth/1.3.8
chatgpt-user
ccbot
gptbot
google-extended
Rule | Path |
---|---|
Disallow | / |
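
Group selection by user-agent can be sketched with Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text below is a hypothetical reconstruction from the tables above, not the site's actual file: it keeps only four of the listed agents and omits the second `*` group, and `ExampleBot` is a made-up agent name. The standard-library parser is not expected to interpret the `*?ref=*` wildcard, so only the per-agent blocking is exercised here.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction: a subset of the listed agents only; the
# original file's exact layout is not shown in the report.
ROBOTS_TXT = """\
User-agent: *
Allow: /
Disallow: *?ref=*

User-agent: chatgpt-user
User-agent: ccbot
User-agent: gptbot
User-agent: google-extended
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

URL = "https://www.marianne.net/"
# A named agent falls into the blocking group.
gptbot_allowed = parser.can_fetch("GPTBot", URL)
# An agent that is not listed falls back to the `*` group.
other_allowed = parser.can_fetch("ExampleBot", URL)

print(f"GPTBot allowed: {gptbot_allowed}")
print(f"ExampleBot allowed: {other_allowed}")
```

In this sketch the named agent is refused the site root while the unlisted agent is permitted, which mirrors the intent of the `Disallow | /` rule for the listed agents.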
Other Records
Field | Value |
---|---|
sitemap | https://www.marianne.net/sitemap_news.xml |
Warnings
- 2 invalid lines.
- `host` is not a known field.
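
A scanner could arrive at warnings like these by classifying each line of the file. The sketch below is one possible approach, not the scanner's documented logic: the `KNOWN_FIELDS` set, the `classify` helper, and the sample lines are all assumptions made for illustration.

```python
# Assumed set of recognised fields; the scanner's real list is not documented here.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}

def classify(line: str) -> str:
    """Return 'blank', 'comment', 'known', 'unknown-field', or 'invalid'."""
    if not line.strip():
        return "blank"
    text = line.split("#", 1)[0].strip()  # drop any trailing comment
    if not text:
        return "comment"
    field, sep, _value = text.partition(":")
    if not sep or not field.strip():
        return "invalid"  # no "field: value" shape
    return "known" if field.strip().lower() in KNOWN_FIELDS else "unknown-field"

# Hypothetical sample lines, not taken from the scanned file.
samples = [
    "Sitemap: https://www.marianne.net/sitemap_news.xml",
    "Host: www.marianne.net",
    "Disallow /private",
    "# crawler policy",
]
results = [classify(s) for s in samples]
```

With this classification, a `Host:` line is reported as an unknown field, and a line without a `field: value` shape is counted as invalid.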