forums.aaca.org
robots.txt

Robots Exclusion Standard data for forums.aaca.org

Resource Scan

Scan Details

Site Domain forums.aaca.org
Base Domain aaca.org
Scan Status Ok
Last Scan 2025-06-23T21:08:05+00:00
Next Scan 2025-07-23T21:08:05+00:00

Last Scan

Scanned 2025-06-23T21:08:05+00:00
URL https://forums.aaca.org/robots.txt
Domain IPs 162.159.141.105, 172.66.1.101, 2606:4700:7::165, 2a06:98c1:58::165
Response IP 172.66.1.101
Found Yes
Hash 03b0a0f81751feff51f5440d69769f512b2e359ce8ab26f770af4ef108f12465
SimHash 30202301a49a

Groups

*

Rule Path
Disallow /startTopic/
Disallow /discover/unread/
Disallow /markallread/
Disallow /staff/
Disallow /cookies/
Disallow /online/
Disallow /discover/
Disallow /leaderboard/
Disallow /search/
Disallow /*?advancedSearchForm=
Disallow /register/
Disallow /lostpassword/
Disallow /login/
Disallow /*currency%3D
Disallow /*?sortby=
Disallow /*?filter=
Disallow /*?tab=
Disallow /*?do=
Disallow /*ref%3D
Disallow /*?forumId*
Disallow /*?&controller=embed
Disallow /cdn-cgi/
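
Several of the rules above use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The sketch below shows one way such patterns could be evaluated against a URL path. It assumes `*` matches any run of characters, a trailing `$` anchors the end of the path, and matching starts at the beginning of the path. The names `pattern_to_regex` and `is_disallowed` are hypothetical helpers, and percent-encoding normalization is left out for brevity.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW_PATTERNS = [
    "/startTopic/", "/discover/unread/", "/markallread/", "/staff/",
    "/cookies/", "/online/", "/discover/", "/leaderboard/", "/search/",
    "/*?advancedSearchForm=", "/register/", "/lostpassword/", "/login/",
    "/*currency%3D", "/*?sortby=", "/*?filter=", "/*?tab=", "/*?do=",
    "/*ref%3D", "/*?forumId*", "/*?&controller=embed", "/cdn-cgi/",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW_PATTERNS) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    for candidate in ("/search/?q=ford",
                      "/topic/123-model-a/?sortby=date",
                      "/topic/123-model-a/"):
        print(candidate, is_disallowed(candidate))
```

Because this group contains only Disallow rules, the sketch does not need the longest-match precedence that applies when Allow and Disallow rules compete.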

*

Rule Path
Disallow /cdn-cgi/
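
The scan lists two separate groups for the `*` user agent. Under the Robots Exclusion Protocol, a crawler combines the rules of all groups that match the same user agent, so the second group adds nothing beyond the `/cdn-cgi/` rule already present in the first. A minimal sketch of that merge follows. The `merge_groups` helper and the abbreviated rule list are illustrative assumptions, not the scanner's own logic.

```python
def merge_groups(groups):
    """Combine rules from groups that share a user agent.

    `groups` is a list of (user_agent, rules) pairs, where each rule is a
    (directive, path) tuple. Duplicate rules are kept only once, in the
    order they first appear.
    """
    merged = {}
    for agent, rules in groups:
        bucket = merged.setdefault(agent.lower(), [])
        for rule in rules:
            if rule not in bucket:
                bucket.append(rule)
    return merged


# Abbreviated sample: the first group is shortened to two of its rules.
groups = [
    ("*", [("disallow", "/search/"), ("disallow", "/cdn-cgi/")]),
    ("*", [("disallow", "/cdn-cgi/")]),
]

merged = merge_groups(groups)
print(merged)
```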

Other Records

Field Value
sitemap http://forums.aaca.org/sitemap.php

Comments

  • Rules for Invision Community (https://invisioncommunity.com)
  • Block pages with no unique content
  • Block faceted pages and 301 redirect pages
  • Block CDN endpoints
  • Sitemap URL