lentreprise.fr
robots.txt

Robots Exclusion Standard data for lentreprise.fr

Resource Scan

Scan Details

Site Domain lentreprise.fr
Base Domain lentreprise.fr
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-09-29T04:49:10+00:00
Next Scan 2024-12-28T04:49:10+00:00

Last Successful Scan

Scanned2023-02-14T20:21:46+00:00
URL http://lentreprise.fr/robots.txt
Redirect https://www.lexpress.fr/robots.txt
Redirect Domain www.lexpress.fr
Redirect Base lexpress.fr
Domain IPs 95.131.136.80
Redirect IPs 23.59.168.43, 23.59.168.65, 2600:1413:b000:13::b857:c195, 2600:1413:b000:13::b857:c19c
Response IP 42.99.140.155
Found Yes
Hash 23bea9a4ef0d54d8b72be992ba3039d422dcff45140ace0a0886b5aa3f84e9bf
SimHash ca4dda040366

Groups

*

Rule Path
Disallow /tiny/
Disallow /includes
Disallow /imgs
Disallow /imgstat

tunitinbot

Rule Path
Disallow /

converacrawler

Rule Path
Disallow /

quepasacreep

Rule Path
Disallow /

jetbot

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

digimind

Rule Path
Disallow /

knowings d

Rule Path
Disallow /

kbcrawl

Rule Path
Disallow /

wget

Rule Path
Disallow /

amisoftware

Rule Path
Disallow /

newzbin

Rule Path
Disallow /

ask n read

Rule Path
Disallow /

qwam content intelligence

Rule Path
Disallow /

zite

Rule Path
Disallow /

flipboard

Rule Path
Disallow /

flipboardproxy

Rule Path
Disallow /

youmag

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.lexpress.fr/arc/outboundfeeds/sitemap-news.xml
sitemap https://www.lexpress.fr/arc/outboundfeeds/sitemap.xml