amp.public.fr
robots.txt
Robots Exclusion Standard data for amp.public.fr
Resource Scan
Scan Details
Site Domain | amp.public.fr |
Base Domain | public.fr |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-05-17T01:55:45+00:00 |
Next Scan | 2024-06-16T01:55:45+00:00 |
Last Successful Scan
Scanned | 2024-04-18T00:44:13+00:00 |
URL | https://amp.public.fr/robots.txt |
Domain IPs | 52.222.169.57, 52.222.169.74, 52.222.169.84, 52.222.169.96 |
Response IP | 52.222.169.74 |
Found | Yes |
Hash | 9cf8896245c6c21049eb22e353e5fb711dc580513ab58b77490ba9964ffca243 |
SimHash | a05f48904c25 |
Groups
*
Rule | Path |
---|---|
Disallow | /var/cache |
Disallow | /esi |
Disallow | /recherche/simple |
meltawer
digimind
knowings
sindup
cision
talkwater
turnitinbot
converacrawler
jetbot
newsnow
kbcrawl
amisoftware
newzbin
ask n read
qwam content intelligence
zite
youmag
synthesio
trendybuzz
spotter
scoop.it
linkfluence
augure
corporama
grub-client
k2spider
libwww
wget
adequat
adequat-systems
auramundi
coexel
ellisphere
leadbox
mention
moreover
mytwip
opinion-tracker
proxem
score3
trendeo
vecteurplus
verticalsearch
vsw
winello
fetch
infoseek
msiecrawler
offline explorer
sitecheck.internetseer.com
teleport
teleportpro
webcopier
webstripper
zealbot
asknread.com
omgilibot
omgili
xenu link sleuth/1.3.8
chatgpt-user
ccbot
google-extended
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.public.fr/var/storage/exports/sitemaps/public/news/sitemap-news-1.xml |
sitemap | https://www.public.fr/var/storage/exports/sitemaps/public/images/sitemap-images-index.xml |
Warnings
- 2 invalid lines.
Comments