lepetitgrassois.com
robots.txt

Robots Exclusion Standard data for lepetitgrassois.com

Resource Scan

Scan Details

Site Domain lepetitgrassois.com
Base Domain lepetitgrassois.com
Scan Status Ok
Last Scan2026-01-18T23:47:48+00:00
Next Scan 2026-02-17T23:47:48+00:00

Last Scan

Scanned2026-01-18T23:47:48+00:00
URL https://lepetitgrassois.com/robots.txt
Domain IPs 104.26.2.184, 104.26.3.184, 172.67.75.16, 2606:4700:20::681a:2b8, 2606:4700:20::681a:3b8, 2606:4700:20::ac43:4b10
Response IP 104.26.2.184
Found Yes
Hash 53e13cab8c88340b5142e4b3d646197b53aaae50334428ada2a416529d165091
SimHash c84174414fb0

Groups

*

Rule Path
Disallow
Disallow */feed
Disallow *.pdf
Disallow *.docx
Disallow */?s=*
Disallow */?add*

Other Records

Field Value
sitemap https://lepetitgrassois.com/sitemap_index.xml

Comments

  • Ne pas indexer les flux RSS
  • Ne pas indexer les fichiers pdf
  • Ne pas indexer les fichiers pdf