lejournaldevalergues.com
robots.txt

Robots Exclusion Standard data for lejournaldevalergues.com

Resource Scan

Scan Details

Site Domain lejournaldevalergues.com
Base Domain lejournaldevalergues.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-09-23T12:22:17+00:00
Next Scan 2024-11-22T12:22:17+00:00

Last Successful Scan

Scanned2024-07-26T12:21:03+00:00
URL https://lejournaldevalergues.com/robots.txt
Domain IPs 185.128.239.52
Response IP 185.128.239.52
Found Yes
Hash 4dbb4c85b9a6eafe5488c078d7d4ca3f4b68032528f557fd80d3ab2d19542f17
SimHash 6a08d0574731

Groups

*

Rule Path
Allow /
Disallow /contact
Disallow /mail/subscribe
Disallow /mail/valid-*
Disallow /api/*

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

spbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://lejournaldevalergues.com/sitemap-news.xml
sitemap https://lejournaldevalergues.com/sitemap.xml