larousse.com
robots.txt

Robots Exclusion Standard data for larousse.com

Resource Scan

Scan Details

Site Domain larousse.com
Base Domain larousse.com
Scan Status Ok
Last Scan 2024-05-29T16:54:11+00:00
Next Scan 2024-06-05T16:54:11+00:00

Last Scan

Scanned 2024-05-29T16:54:11+00:00
URL https://larousse.com/robots.txt
Redirect https://www.larousse.com/robots.txt
Redirect Domain www.larousse.com
Redirect Base larousse.com
Domain IPs 51.144.190.143
Redirect IPs 51.144.190.143
Response IP 51.144.190.143
Found Yes
Hash 4f4d64b0bf5d550f5d602c80c5744b67e37b81d0d6ab21411f09842517f1b65c
SimHash 69144d45e511

Groups

*

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://www.larousse.fr/crawler/sitemapIndex_bilingue_larousse_com.xml
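
The scan result above (a single `*` group with no rules, plus one sitemap record) can be checked with a standard robots.txt parser. The following is a minimal sketch using Python's `urllib.robotparser`. The file text is an assumed reconstruction consistent with the scan, not the actual contents of the file, and the user agent name and test path are hypothetical. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction, consistent with the scan report above.
# The real robots.txt body is not reproduced in this report.
ASSUMED_ROBOTS_TXT = """\
User-agent: *

Sitemap: https://www.larousse.fr/crawler/sitemapIndex_bilingue_larousse_com.xml
"""


def load_rules(text: str) -> RobotFileParser:
    """Parse robots.txt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(ASSUMED_ROBOTS_TXT)

# "ExampleBot" and "/example/path" are hypothetical, for illustration only.
allowed = rules.can_fetch("ExampleBot", "https://www.larousse.com/example/path")

# Sitemap records are independent of user-agent groups.
sitemaps = rules.site_maps()

print(allowed)
print(sitemaps)
```

Because the `*` group defines no Allow or Disallow rules, every path should be reported as fetchable, and the sitemap record is returned separately from the group rules.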