Robots Exclusion Standard data for newho.prod.sudouest.fr
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | newho.prod.sudouest.fr |
Base Domain | sudouest.fr |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-11-16T19:50:41+00:00 |
Next Scan | 2024-11-30T19:50:41+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-10-25T16:15:29+00:00 |
URL | https://newho.prod.sudouest.fr/robots.txt |
Domain IPs | 45.223.107.231 |
Response IP | 45.223.107.231 |
Found | Yes |
Hash | 1787486e8b115a641b73c8367f614ab204d41319d6890a16ce6cbd2a3e3e74c0 |
SimHash | 24bf53f1c435 |
Groups
adsbot-google
adsbot-google-mobile
apis-google
applebot
bingbot
bnf.fr_bot
exabot
googlebot
googlebot-image
googlebot-mobile
googlebot-news
googlebot-video
googlebot_nauxeo
ia_archiver
mediapartners-google
proximic
slurp
storebot-google
twitterbot
upday
sistrix
beopbot
weborama-fetcher
flipboard
flipboardproxy
Rule | Path |
---|---|
Disallow | /idalgo/ |
Disallow | /carte/ |
Disallow | /campub/ |
Disallow | /recherche/ |
Disallow | /legacy/ |
Disallow | /_profiler/ |
Disallow | /sar_topics/ |
Disallow | /sar_als/ |
Disallow | /reagir/ |
Disallow | /blocks/article/ |
Disallow | /block/article/ |
Disallow | /styles-2018/ |
Disallow | /www/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.sudouest.fr/sitemap-news.xml |
sitemap | https://www.sudouest.fr/sitemap.xml |
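
To illustrate how the group, rules, and sitemap records above might be applied by a crawler, here is a minimal sketch using Python's standard `urllib.robotparser`. It rebuilds an approximate robots.txt from the scan data, so the reconstructed text is an assumption and not the original file: the real file also contained 4 invalid lines and may include groups that are not shown here. The test paths `/recherche/resultats` and `/sport/` and the agent name `somecrawler` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# User agents listed under "Groups" in the scan.
USER_AGENTS = [
    "adsbot-google", "adsbot-google-mobile", "apis-google", "applebot",
    "bingbot", "bnf.fr_bot", "exabot", "googlebot", "googlebot-image",
    "googlebot-mobile", "googlebot-news", "googlebot-video",
    "googlebot_nauxeo", "ia_archiver", "mediapartners-google", "proximic",
    "slurp", "storebot-google", "twitterbot", "upday", "sistrix", "beopbot",
    "weborama-fetcher", "flipboard", "flipboardproxy",
]

# Disallow paths from the rule table.
DISALLOWED = [
    "/idalgo/", "/carte/", "/campub/", "/recherche/", "/legacy/",
    "/_profiler/", "/sar_topics/", "/sar_als/", "/reagir/",
    "/blocks/article/", "/block/article/", "/styles-2018/", "/www/",
]

# Sitemap records from "Other Records".
SITEMAPS = [
    "https://www.sudouest.fr/sitemap-news.xml",
    "https://www.sudouest.fr/sitemap.xml",
]


def build_robots_txt() -> str:
    """Rebuild an approximate robots.txt (assumed layout, not the original)."""
    lines = [f"User-agent: {ua}" for ua in USER_AGENTS]
    lines += [f"Disallow: {path}" for path in DISALLOWED]
    lines.append("")  # a blank line closes the group
    lines += [f"Sitemap: {url}" for url in SITEMAPS]
    return "\n".join(lines) + "\n"


parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())

BASE = "https://newho.prod.sudouest.fr"

# A listed agent is blocked from a disallowed prefix (hypothetical URL).
print(parser.can_fetch("googlebot", BASE + "/recherche/resultats"))

# The same agent may fetch a path no rule covers (hypothetical URL).
print(parser.can_fetch("googlebot", BASE + "/sport/"))

# An agent outside the group is unrestricted in this reconstruction only.
print(parser.can_fetch("somecrawler", BASE + "/recherche/resultats"))

# Sitemap URLs as parsed (site_maps() requires Python 3.8+).
print(parser.site_maps())
```

Note that `urllib.robotparser` matches rules by simple path prefix and matches user agents by substring, so its answers can differ from those of crawlers that implement longest-match or wildcard semantics.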
Warnings
- 4 invalid lines.