licra.org
robots.txt

Robots Exclusion Standard data for licra.org

Archived Snapshots

Resource Scan

Scan Details

Site Domain	licra.org
Base Domain	licra.org
Scan Status	Ok
Last Scan	2025-05-19T11:41:32+00:00
Next Scan	2025-06-18T11:41:32+00:00

Last Scan

Scanned	2025-05-19T11:41:32+00:00
URL	https://licra.org/robots.txt
Domain IPs	213.186.33.18
Response IP	213.186.33.18
Found	Yes
Hash	da485a15b4380b7892e581d833dc53d472c4d8d163272ab023b246b39ba934dd
SimHash	484cda126023

Groups

scrapy

Rule	Path
Allow	/

Rule

Path

Allow

/

scrapy

Rule	Path
Allow	/
Disallow	/wp-includes/
Disallow	/wp-content/plugins/
Disallow	/organigramme-equipe-siege
Disallow	/wp-content/uploads/_pda/ORGANIGRAMME-EQUIPE-SIEGE.xlsx
Disallow	/*.pdf$
Disallow	/*.xlsx$
Disallow	/*.xls$
Disallow	/*.docx$
Disallow	/*.doc$

Rule

Path

Allow

/

Disallow

/wp-includes/

Disallow

/wp-content/plugins/

Disallow

/organigramme-equipe-siege

Disallow

/wp-content/uploads/_pda/ORGANIGRAMME-EQUIPE-SIEGE.xlsx

Disallow

/*.pdf$

Disallow

/*.xlsx$

Disallow

/*.xls$

Disallow

/*.docx$

Disallow

/*.doc$

*

Rule	Path
Disallow

Rule

Path

Disallow

Back to top

Other Records

Field	Value
sitemap	https://www.licra.org/sitemap_index.xml

Field

Value

sitemap

https://www.licra.org/sitemap_index.xml

Back to top

Comments

START YOAST BLOCK
---------------------------
---------------------------
END YOAST BLOCK

Back to top

licra.orgrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

scrapy

scrapy

*

Other Records

Comments

licra.org
robots.txt