pressesante.com
robots.txt

Robots Exclusion Standard data for pressesante.com

Resource Scan

Scan Details

Site Domain pressesante.com
Base Domain pressesante.com
Scan Status Ok
Last Scan2024-11-02T09:15:06+00:00
Next Scan 2024-12-02T09:15:06+00:00

Last Scan

Scanned2024-11-02T09:15:06+00:00
URL https://pressesante.com/robots.txt
Redirect https://www.pressesante.com/robots.txt
Redirect Domain www.pressesante.com
Redirect Base pressesante.com
Domain IPs 104.21.58.67, 172.67.201.130, 2606:4700:3033::6815:3a43, 2606:4700:3035::ac43:c982
Redirect IPs 104.21.58.67, 172.67.201.130, 2606:4700:3033::6815:3a43, 2606:4700:3035::ac43:c982
Response IP 104.21.58.67
Found Yes
Hash 011eace6508845ef707336b8e100d3e3666dc41a11d06a49de8fc8d0d0d4efd1
SimHash 492dd44074f1

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /trucs-et-astuces/
Disallow /cdn-cgi/
Allow /*.js
Allow /wp-admin/admin-ajax.php
Allow /wp-admin/*.js
Allow /wp-admin/admin-ajax.php
Disallow /wp-includes/

grapeshot

Rule Path
Disallow
Allow /*css?*
Allow /*js?*
Allow /*?utm*
Allow /css/?

adsbot-google-mobile-apps
feedfetcher-google
apis-google
googlebot
googlebot-news
googlebot-image
googlebot-video

Rule Path
Allow /*

Other Records

Field Value
sitemap https://www.pressesante.com/sitemap_index.xml
sitemap https://www.pressesante.com/news-sitemap.xml

Comments

  • URLs autorisées CSS JS Analytics pour les Bots
  • Autoriser Google Image