anil.org
robots.txt

Robots Exclusion Standard data for anil.org

Resource Scan

Scan Details

Site Domain anil.org
Base Domain anil.org
Scan Status Ok
Last Scan2024-08-30T14:00:49+00:00
Next Scan 2024-09-29T14:00:49+00:00

Last Scan

Scanned2024-08-30T14:00:49+00:00
URL https://anil.org/robots.txt
Redirect https://www.anil.org/robots.txt
Redirect Domain www.anil.org
Redirect Base anil.org
Domain IPs 95.128.74.61
Redirect IPs 95.128.74.61
Response IP 95.128.74.61
Found Yes
Hash 0e71a773d57a22e9f55b843b29b0aa42251af0903e3698d82ccf49e3d5750bb5
SimHash 4b08c631d770

Groups

twitterbot

Rule Path
Allow /
Disallow

*

Rule Path
Allow /
Disallow /t3lib/
Disallow /typo3/
Disallow /recherche/
Disallow /*cHash%3D*
Allow /fileadmin/user_upload/logo-anil-social.png
Allow /fileadmin/user_upload/logo-anil-vertical.png
Allow /fileadmin/ANIL/ANIL_VERTICAL_bis.png
Allow /fileadmin/ANIL/favicon.png
Disallow /fileadmin*
Disallow /*?tx*
Disallow /*?type=
Disallow /*.pdf
Disallow /?id=
Disallow /cookies
Disallow /erreur-404
Disallow /fileadmin/ANIL/Espace_partenaires/
Disallow /fileadmin/ANIL/RDV_dunkerque_sept22/

tinytestbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.anil.org/index.php?eID=dd_googlesitemap

Warnings

  • `noindex` is not a known field.