amp.public.fr
robots.txt

Robots Exclusion Standard data for amp.public.fr

Resource Scan

Scan Details

Site Domain amp.public.fr
Base Domain public.fr
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-05-17T01:55:45+00:00
Next Scan 2024-06-16T01:55:45+00:00

Last Successful Scan

Scanned2024-04-18T00:44:13+00:00
URL https://amp.public.fr/robots.txt
Domain IPs 52.222.169.57, 52.222.169.74, 52.222.169.84, 52.222.169.96
Response IP 52.222.169.74
Found Yes
Hash 9cf8896245c6c21049eb22e353e5fb711dc580513ab58b77490ba9964ffca243
SimHash a05f48904c25

Groups

*

Rule Path
Disallow /var/cache
Disallow /esi
Disallow /recherche/simple

meltawer
digimind
knowings
sindup
cision
talkwater
turnitinbot
converacrawler
jetbot
newsnow
kbcrawl
amisoftware
newzbin
ask n read
qwam content intelligence
zite
youmag
synthesio
trendybuzz
spotter
scoop.it
linkfluence
augure
corporama
grub-client
k2spider
libwww
wget
adequat
adequat-systems
auramundi
coexel
ellisphere
leadbox
mention
moreover
mytwip
opinion-tracker
proxem
score3
trendeo
vecteurplus
verticalsearch
vsw
winello
fetch
infoseek
msiecrawler
offline explorer
sitecheck.internetseer.com
teleport
teleportpro
webcopier
webstripper
zealbot
asknread.com
omgilibot
omgili
xenu link sleuth/1.3.8
chatgpt-user
ccbot
google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.public.fr/var/storage/exports/sitemaps/public/news/sitemap-news-1.xml
sitemap https://www.public.fr/var/storage/exports/sitemaps/public/images/sitemap-images-index.xml

Comments

  • Robots exclus

Warnings

  • 2 invalid lines.