marianne.net
robots.txt

Robots Exclusion Standard data for marianne.net

Resource Scan

Scan Details

Site Domain marianne.net
Base Domain marianne.net
Scan Status Ok
Last Scan 2024-11-16T08:32:24+00:00
Next Scan 2024-11-23T08:32:24+00:00

Last Scan

Scanned 2024-11-16T08:32:24+00:00
URL https://marianne.net/robots.txt
Redirect https://www.marianne.net/robots.txt
Redirect Domain www.marianne.net
Redirect Base marianne.net
Domain IPs 3.164.85.14, 3.164.85.20, 3.164.85.30, 3.164.85.97
Redirect IPs 18.161.111.102, 18.161.111.72, 18.161.111.77, 18.161.111.97
Response IP 65.9.112.51
Found Yes
Hash 1e9332c8980915c6173d8ec9b8aeb80aa15a4c9c4d5aea354e7aef76fb73801b
SimHash 284dc8314d25

Groups

*

Rule Path
Allow /
Disallow *?ref=*
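
To illustrate how the two rules in this group might interact, here is a minimal sketch in Python. It assumes the matching conventions described in RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end, the longest matching pattern wins, and Allow wins a tie). The function names `pattern_to_regex` and `is_allowed` are hypothetical helpers, not part of any scanner or crawler named in this report, and real crawlers may differ in detail.

```python
import re

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' is a wildcard and a trailing '$' anchors the end,
    as described in RFC 9309.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces, then rejoin them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

# The two rules reported for the first '*' group.
RULES = [("allow", "/"), ("disallow", "*?ref=*")]

def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if the path may be fetched under the given rules."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            length = len(pattern)
            # Longest pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and allow):
                best_len, best_allow = length, allow
    return best_allow

print(is_allowed("/politique/article"))           # plain path
print(is_allowed("/politique/article?ref=home"))  # path with a ref parameter
```

Under these assumptions, ordinary paths stay crawlable while any URL carrying a `?ref=` query string is excluded, because `*?ref=*` is a longer (more specific) pattern than `/`. The example paths are invented for illustration.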

*

Rule Path
Disallow /

meltawer
digimind
knowings
sindup
cision
talkwater
turnitinbot
converacrawler
jetbot
newsnow
kbcrawl
amisoftware
newzbin
ask n read
qwam content intelligence
zite
youmag
synthesio
trendybuzz
spotter
scoop.it
linkfluence
augure
corporama
grub-client
k2spider
libwww
wget
adequat
adequat-systems
auramundi
coexel
ellisphere
leadbox
mention
moreover
mytwip
opinion-tracker
proxem
score3
trendeo
vecteurplus
verticalsearch
vsw
winello
fetch
infoseek
msiecrawler
offline explorer
sitecheck.internetseer.com
teleport
teleportpro
webcopier
webstripper
zealbot
asknread.com
omgilibot
omgili
xenu link sleuth/1.3.8
chatgpt-user
ccbot
gptbot
google-extended

Rule Path
Disallow /
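
As a rough sketch of how a crawler might pick its group from the data above, the Python below looks up a product token case-insensitively and falls back to the first `*` group when the token is not listed. This is an assumption about typical parser behavior (in the spirit of RFC 9309), not a description of how this scanner works. `BLOCKED_AGENTS` holds only a small sample of the listed agents, and `rules_for` and `examplebot` are hypothetical names.

```python
# Rules of the first '*' group, used here as the fallback.
FALLBACK_RULES = [("allow", "/"), ("disallow", "*?ref=*")]

# A small sample of the user agents listed in the blocked group.
BLOCKED_AGENTS = {"gptbot", "ccbot", "chatgpt-user", "google-extended", "wget"}
BLOCKED_RULES = [("disallow", "/")]

def rules_for(product_token: str):
    """Return the rule list that applies to a crawler's product token.

    Assumption: tokens are compared case-insensitively and an unlisted
    crawler falls back to the '*' group.
    """
    token = product_token.strip().lower()
    if token in BLOCKED_AGENTS:
        return BLOCKED_RULES
    return FALLBACK_RULES

print(rules_for("GPTBot"))      # listed agent: whole site disallowed
print(rules_for("examplebot"))  # unlisted agent: fallback group
```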

Other Records

Field Value
sitemap https://www.marianne.net/sitemap_news.xml

Comments

  • Sitemaps
  • Excluded robots

Warnings

  • 2 invalid lines.
  • `host` is not a known field.
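
Warnings of this kind could be produced by a simple line-level check. The sketch below is one possible approach in Python, not the scanner's actual logic. The set `KNOWN_FIELDS` is an assumption about which fields a checker accepts, `lint_robots` is a hypothetical helper, and the sample input is invented rather than taken from the scanned file.

```python
# Assumed set of recognized field names; a real checker may accept more.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}

def lint_robots(text: str) -> list:
    """Return warnings for lines that are malformed or use unknown fields."""
    warnings = []
    for number, raw in enumerate(text.splitlines(), start=1):
        # Drop comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        if ":" not in line:
            warnings.append(f"line {number}: invalid line")
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS:
            warnings.append(f"line {number}: `{field}` is not a known field")
    return warnings

# Invented sample input for illustration only.
SAMPLE = """\
# Sitemaps
User-agent: *
Allow: /
Host: www.example.com
this line has no field separator
"""

for warning in lint_robots(SAMPLE):
    print(warning)
```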