amentsoc.org
robots.txt

Robots Exclusion Standard data for amentsoc.org

Resource Scan

Scan Details

Site Domain amentsoc.org
Base Domain amentsoc.org
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-08-06T20:36:39+00:00
Next Scan 2025-10-05T20:36:39+00:00

Last Successful Scan

Scanned 2025-05-16T20:35:37+00:00
URL https://amentsoc.org/robots.txt
Redirect https://www.amentsoc.org/robots.txt
Redirect Domain www.amentsoc.org
Redirect Base amentsoc.org
Domain IPs 185.181.117.71
Redirect IPs 185.181.117.71
Response IP 185.181.117.71
Found Yes
Hash 28a6c6b7b20f4995e20828172ab36b64f3a06bc0c26664ae113de68b86a26e8a
SimHash 34857c66ec9b

Groups

*

Rule Path
Disallow /members/restrict/
Disallow /help/tell-a-friend.html
Disallow /help/tellfriend.pl
Disallow /about/feedback.pl
Disallow /search/search.pl
Disallow /stats/
Disallow /links
Disallow /links/
Disallow /insects/glossary/li
Disallow /insects/glossary/ap
Disallow /publications/invertebrate-conservation-news/subscribers/
Disallow /publications/invertebrate-conservation-news/benhs/
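
The rules above can be checked against candidate paths with Python's standard urllib.robotparser module. The following is a minimal sketch, not the site's actual file: the robots.txt text is reconstructed from the table above, and the user agent name "ExampleBot", the helper names, and the sample paths (such as /insects/glossary/lichen.html) are hypothetical and chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group listed in the scan; the live file may differ.
ROBOTS_TXT = """\
User-agent: *
Disallow: /members/restrict/
Disallow: /help/tell-a-friend.html
Disallow: /help/tellfriend.pl
Disallow: /about/feedback.pl
Disallow: /search/search.pl
Disallow: /stats/
Disallow: /links
Disallow: /links/
Disallow: /insects/glossary/li
Disallow: /insects/glossary/ap
Disallow: /publications/invertebrate-conservation-news/subscribers/
Disallow: /publications/invertebrate-conservation-news/benhs/
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text locally, without any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the (hypothetical) agent may fetch the given path."""
    return parser.can_fetch(agent, path)


parser = build_parser(ROBOTS_TXT)

# Hypothetical sample paths, used only to exercise the rules.
for path in ["/", "/stats/2024.html", "/links", "/insects/glossary/lichen.html"]:
    print(path, "allowed" if is_allowed(parser, path) else "disallowed")
```

Because this parser matches rules by path prefix, "Disallow /links" already covers everything under /links/, and the two glossary rules block any path that begins with /insects/glossary/li or /insects/glossary/ap.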

Comments

  • Issues with sitemap (07/13)
  • Exclude ICN