lentreprise.com
robots.txt

Robots Exclusion Standard data for lentreprise.com

Resource Scan

Scan Details

Site Domain lentreprise.com
Base Domain lentreprise.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-03-31T22:51:31+00:00
Next Scan 2024-06-29T22:51:31+00:00

Last Successful Scan

Scanned2023-02-13T15:35:10+00:00
URL http://lentreprise.com/robots.txt
Redirect https://www.lexpress.fr/robots.txt
Redirect Domain www.lexpress.fr
Redirect Base lexpress.fr
Domain IPs 95.131.136.80
Redirect IPs 184.87.193.70, 184.87.193.71, 2600:1413:b000:13::b857:c195, 2600:1413:b000:13::b857:c19c
Response IP 42.99.140.219
Found Yes
Hash 23bea9a4ef0d54d8b72be992ba3039d422dcff45140ace0a0886b5aa3f84e9bf
SimHash ca4dda040366

Groups

*

Rule Path
Disallow /tiny/
Disallow /includes
Disallow /imgs
Disallow /imgstat

tunitinbot

Rule Path
Disallow /

converacrawler

Rule Path
Disallow /

quepasacreep

Rule Path
Disallow /

jetbot

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

digimind

Rule Path
Disallow /

knowings d

Rule Path
Disallow /

kbcrawl

Rule Path
Disallow /

wget

Rule Path
Disallow /

amisoftware

Rule Path
Disallow /

newzbin

Rule Path
Disallow /

ask n read

Rule Path
Disallow /

qwam content intelligence

Rule Path
Disallow /

zite

Rule Path
Disallow /

flipboard

Rule Path
Disallow /

flipboardproxy

Rule Path
Disallow /

youmag

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.lexpress.fr/arc/outboundfeeds/sitemap-news.xml
sitemap https://www.lexpress.fr/arc/outboundfeeds/sitemap.xml