planetecourrier.com
robots.txt

Robots Exclusion Standard data for planetecourrier.com

Resource Scan

Scan Details

Site Domain planetecourrier.com
Base Domain planetecourrier.com
Scan Status Ok
Last Scan2025-09-19T17:55:50+00:00
Next Scan 2025-10-03T17:55:50+00:00

Last Scan

Scanned2025-09-19T17:55:50+00:00
URL https://planetecourrier.com/robots.txt
Redirect https://www.planetecourrier.com/robots.txt
Redirect Domain www.planetecourrier.com
Redirect Base planetecourrier.com
Domain IPs 104.21.1.213, 172.67.152.88, 2606:4700:3035::6815:1d5, 2606:4700:3035::ac43:9858
Redirect IPs 104.21.1.213, 172.67.152.88, 2606:4700:3035::6815:1d5, 2606:4700:3035::ac43:9858
Response IP 172.67.152.88
Found Yes
Hash 14db9df224d32dc7b8d9a510d4eb4723d58c24c519fff37aa7689055ea82cc92
SimHash 6d09f851ef93

Groups

*

Rule Path
Disallow /auto/
Disallow /*?r=*
Disallow /*?l=*&p=*&version=*&r=*&pr=*
Disallow /*.pdf

gptbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

claude-web

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

youbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.planetecourrier.com/sitemap.cfm

Comments

  • robots.txt pour le site
  • Sitemap

Warnings

  • `https` is not a known field.