licra.org
robots.txt

Robots Exclusion Standard data for licra.org

Resource Scan

Scan Details

Site Domain licra.org
Base Domain licra.org
Scan Status Ok
Last Scan2025-05-19T11:41:32+00:00
Next Scan 2025-06-18T11:41:32+00:00

Last Scan

Scanned2025-05-19T11:41:32+00:00
URL https://licra.org/robots.txt
Domain IPs 213.186.33.18
Response IP 213.186.33.18
Found Yes
Hash da485a15b4380b7892e581d833dc53d472c4d8d163272ab023b246b39ba934dd
SimHash 484cda126023

Groups

scrapy

Rule Path
Allow /

scrapy

Rule Path
Allow /
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /organigramme-equipe-siege
Disallow /wp-content/uploads/_pda/ORGANIGRAMME-EQUIPE-SIEGE.xlsx
Disallow /*.pdf$
Disallow /*.xlsx$
Disallow /*.xls$
Disallow /*.docx$
Disallow /*.doc$

*

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.licra.org/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK