marsouin.org
robots.txt

Robots Exclusion Standard data for marsouin.org

Resource Scan

Scan Details

Site Domain marsouin.org
Base Domain marsouin.org
Scan Status Ok
Last Scan 2025-09-12T16:36:19+00:00
Next Scan 2025-10-12T16:36:19+00:00

Last Scan

Scanned 2025-09-12T16:36:19+00:00
URL https://marsouin.org/robots.txt
Redirect https://www.marsouin.org/robots.txt
Redirect Domain www.marsouin.org
Redirect Base marsouin.org
Domain IPs 213.186.33.40
Redirect IPs 213.186.33.40
Response IP 213.186.33.40
Found Yes
Hash a7f4078d079788e69f12cfaa0c042d2650fc9477dacf9a3107368082bc812324
SimHash 6b07dc759f31

Groups

*

Rule Path
Allow /local/cache-css/
Allow /local/cache-js/
Disallow /ecrire/
Disallow /lib/
Disallow /prive/
Disallow /spip.php?action=*
Disallow /spip.php?page=login*
Disallow /*.api/
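
The rules above can be checked against concrete paths. The following is a minimal sketch, assuming the longest-match semantics described in RFC 9309 (the most specific matching pattern wins, `*` matches any run of characters, a trailing `$` anchors the end, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers introduced for illustration; individual crawlers may interpret the rules differently.

```python
import re

# Rules for the "*" group, copied from the table above.
RULES = [
    ("allow", "/local/cache-css/"),
    ("allow", "/local/cache-js/"),
    ("disallow", "/ecrire/"),
    ("disallow", "/lib/"),
    ("disallow", "/prive/"),
    ("disallow", "/spip.php?action=*"),
    ("disallow", "/spip.php?page=login*"),
    ("disallow", "/*.api/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' is a wildcard and a trailing '$' anchors the end of
    the path; every other character is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    The longest matching pattern wins; on equal length, Allow beats
    Disallow. A path that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


print(is_allowed("/local/cache-css/site.css"))    # True
print(is_allowed("/ecrire/"))                     # False
print(is_allowed("/spip.php?action=logout"))      # False
print(is_allowed("/spip.php?page=article&id=1"))  # True
print(is_allowed("/foo.api/bar"))                 # False
```

The paths in the usage lines are made-up examples, not URLs known to exist on the site.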

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.marsouin.org/sitemap.xml
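
The crawl-delay and sitemap records can be read with Python's standard `urllib.robotparser`. This is a sketch only: `ROBOTS_TXT` is a reconstruction from the tables in this report, not the file as served (the original layout, its comments, and the `noindex` line are not shown here), and `ExampleBot` is a hypothetical user agent. The wildcard rules are left out because `urllib.robotparser` matches paths by plain prefix and does not interpret `*`.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the scanned file, based on the report above.
ROBOTS_TXT = """\
User-agent: *
Allow: /local/cache-css/
Allow: /local/cache-js/
Disallow: /ecrire/
Disallow: /lib/
Disallow: /prive/
Crawl-delay: 1

Sitemap: https://www.marsouin.org/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Delay in seconds that the "*" group requests between fetches.
delay = parser.crawl_delay("ExampleBot")

# Sitemap URLs declared in the file (site_maps() needs Python 3.8+).
sitemaps = parser.site_maps()

print(delay)     # 1
print(sitemaps)  # ['https://www.marsouin.org/sitemap.xml']
print(parser.can_fetch("ExampleBot", "https://www.marsouin.org/ecrire/"))  # False
```

Parsing an in-memory string keeps the example offline; calling `set_url()` and `read()` on the parser would fetch the live file instead.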

Comments

  • robots.txt
  • @url: https://www.marsouin.org
  • @generator: SPIP 4.3.9
  • @template: squelettes-dist/robots.txt.html

Warnings

  • `noindex` is not a known field.