planetealpha.com
robots.txt

Robots Exclusion Standard data for planetealpha.com

Resource Scan

Scan Details

Site Domain planetealpha.com
Base Domain planetealpha.com
Scan Status Ok
Last Scan 2025-09-30T11:26:23+00:00
Next Scan 2025-10-07T11:26:23+00:00

Last Scan

Scanned 2025-09-30T11:26:23+00:00
URL https://planetealpha.com/robots.txt
Domain IPs 46.4.24.98
Response IP 46.4.24.98
Found Yes
Hash 813ad70f6cc1f035f209aee8470d01aad7198e98d2481371e0ab6f6ffba6d2bb
SimHash 29150a70c6e1
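
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. Under that assumption, a minimal sketch of how a later fetch could be compared against the recorded value looks like this. The helper names `sha256_hex` and `body_changed` are hypothetical, and the exact bytes the scanner hashed (raw body, normalized text, and so on) are unknown.

```python
import hashlib

# Value copied from the "Hash" row of the report.
RECORDED_HASH = "813ad70f6cc1f035f209aee8470d01aad7198e98d2481371e0ab6f6ffba6d2bb"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def body_changed(body: bytes, recorded_hash: str) -> bool:
    """True when the digest of `body` differs from the recorded hash.

    Assumes the recorded hash is a SHA-256 of the raw response body,
    which the report does not confirm.
    """
    return sha256_hex(body) != recorded_hash.lower()
```

Calling `body_changed(fetched_bytes, RECORDED_HASH)` with the bytes of a fresh download would then indicate whether the file differs from the scanned copy, provided the assumption about the algorithm holds.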

Groups

*

Rule Path
Allow /
Disallow /cgi-bin/
Disallow /tmp/
Disallow /admin/
Disallow /*?
Disallow /*%26
Disallow /*.js$
Disallow /*.css$

Other Records

Field Value
crawl-delay 5

googlebot

Rule Path
Allow /
Allow /*.css$
Allow /*.js$

Other Records

Field Value
crawl-delay 2
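
The `*` and googlebot groups above rely on the `*` wildcard and the `$` end anchor. The sketch below illustrates how such rules are commonly evaluated: each pattern is matched from the start of the path, the longest matching pattern wins, and an allow rule wins a tie. This follows the matching behaviour described in RFC 9309; the function names are hypothetical, percent-encoding normalization is ignored, and individual crawlers may behave differently.

```python
import re

# Rules copied from the "*" and "googlebot" groups of the report.
STAR_RULES = [
    ("allow", "/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/tmp/"),
    ("disallow", "/admin/"),
    ("disallow", "/*?"),
    ("disallow", "/*%26"),
    ("disallow", "/*.js$"),
    ("disallow", "/*.css$"),
]
GOOGLEBOT_RULES = [
    ("allow", "/"),
    ("allow", "/*.css$"),
    ("allow", "/*.js$"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path: str, rules) -> bool:
    """Apply longest-match precedence; allow wins ties; default is allow."""
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under these assumptions, `is_allowed("/static/app.js", STAR_RULES)` is false because `/*.js$` is longer than `/`, while the same path evaluated against `GOOGLEBOT_RULES` is allowed.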

googlebot-image

Rule Path
Allow /images/

googlebot-image

Rule Path
Allow /images/
Disallow /Logos/

bingbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://planetealpha.com/sitemap.xml
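
The crawl-delay and sitemap records listed in this report can be read programmatically with Python's standard `urllib.robotparser`. The sketch below parses a partial reconstruction of the file; the reconstruction (field casing, group order, omitted rules) is an assumption based on the report, not the original file. Note that `urllib.robotparser` matches paths by simple prefix and does not interpret `*` or `$`, so it is used here only for the non-path records.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction from the report; the layout is an assumption.
ROBOTS_TXT = """\
User-agent: *
Allow: /
Disallow: /cgi-bin/
Crawl-delay: 5

User-agent: googlebot
Allow: /
Crawl-delay: 2

User-agent: bingbot
Allow: /
Crawl-delay: 5

Sitemap: https://planetealpha.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Crawl-delay for named groups, and for an agent that falls back to "*".
google_delay = parser.crawl_delay("googlebot")
bing_delay = parser.crawl_delay("bingbot")
default_delay = parser.crawl_delay("examplebot")  # hypothetical agent name

# Sitemap URLs declared in the file (list, or None when absent).
sitemaps = parser.site_maps()
```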

Comments

  • Directories off-limits to robots
  • Crawl optimization: prevent crawling of duplicate content
  • Crawl-delay to limit server load
  • Specific instructions for Googlebot
  • Instructions for Google Images
  • Instructions for Googlebot-Image to optimize image indexing
  • Instructions for Bingbot