gregoirethorel.com
robots.txt

Robots Exclusion Standard data for gregoirethorel.com

Resource Scan

Scan Details

Site Domain gregoirethorel.com
Base Domain gregoirethorel.com
Scan Status Ok
Last Scan2025-09-09T15:27:18+00:00
Next Scan 2025-10-09T15:27:18+00:00

Last Scan

Scanned2025-09-09T15:27:18+00:00
URL https://gregoirethorel.com/robots.txt
Redirect https://www.gregoirethorel.com/robots.txt
Redirect Domain www.gregoirethorel.com
Redirect Base gregoirethorel.com
Domain IPs 104.16.185.173
Redirect IPs 104.16.185.173, 104.16.186.173, 104.16.187.173, 104.16.188.173, 104.16.189.173, 2606:4700::6810:b9ad, 2606:4700::6810:baad, 2606:4700::6810:bbad, 2606:4700::6810:bcad, 2606:4700::6810:bdad
Response IP 104.16.187.173
Found Yes
Hash 5eb4eca32df41c26ca4b389232ef02f4885c81925d5df2686de0185ca7b365de
SimHash 6a9c9403fe90

Groups

ahrefsbot
bingbot
blexbot
bubing
dotbot
msnbot
mj12bot
petalbot
semrushbot
semrushbot-ba
semrushbot-bm
semrushbot-sa
semrushbot-si
siteauditbot
smtbot
yandex

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.gregoirethorel.com/sitemap.xml