greatwarforum.org
robots.txt

Robots Exclusion Standard data for greatwarforum.org

Resource Scan

Scan Details

Site Domain greatwarforum.org
Base Domain greatwarforum.org
Scan Status Ok
Last Scan2024-09-19T11:19:29+00:00
Next Scan 2024-10-19T11:19:29+00:00

Last Scan

Scanned2024-09-19T11:19:29+00:00
URL https://greatwarforum.org/robots.txt
Redirect https://www.greatwarforum.org/robots.txt
Redirect Domain www.greatwarforum.org
Redirect Base greatwarforum.org
Domain IPs 2600:9000:2024:4000:b:f46a:7d80:93a1, 2600:9000:2024:600:b:f46a:7d80:93a1, 2600:9000:2024:b000:b:f46a:7d80:93a1, 2600:9000:2024:d200:b:f46a:7d80:93a1, 2600:9000:2024:da00:b:f46a:7d80:93a1, 2600:9000:2024:dc00:b:f46a:7d80:93a1, 2600:9000:2024:e800:b:f46a:7d80:93a1, 2600:9000:2024:f800:b:f46a:7d80:93a1, 65.9.112.113, 65.9.112.129, 65.9.112.38, 65.9.112.64
Redirect IPs 2600:9000:2792:3800:b:f46a:7d80:93a1, 2600:9000:2792:4000:b:f46a:7d80:93a1, 2600:9000:2792:8800:b:f46a:7d80:93a1, 2600:9000:2792:9a00:b:f46a:7d80:93a1, 2600:9000:2792:9c00:b:f46a:7d80:93a1, 2600:9000:2792:a00:b:f46a:7d80:93a1, 2600:9000:2792:c800:b:f46a:7d80:93a1, 2600:9000:2792:dc00:b:f46a:7d80:93a1, 3.164.182.126, 3.164.182.14, 3.164.182.78, 3.164.182.82
Response IP 18.165.122.97
Found Yes
Hash 63f1f40463dab5d262b58365df82e2a19d4ef40ecfe78facdbf2d1afd1a4b284
SimHash 30307a83849a

Groups

*

Rule Path
Disallow /startTopic/
Disallow /discover/unread/
Disallow /markallread/
Disallow /staff/
Disallow /cookie/
Disallow /online/
Disallow /discover/
Disallow /leaderboard/
Disallow /search/
Disallow /tags/
Disallow /*?advancedSearchForm=
Disallow /register/
Disallow /lostpassword/
Disallow /login/
Disallow /*?sortby=
Disallow /*?filter=
Disallow /*?tab=
Disallow /*?do=
Disallow /*ref%3D
Disallow /*?forumId*
Disallow /*?&controller=embed

Other Records

Field Value
sitemap https://greatwarforum.org/sitemap.php

Comments

  • Rules for Invision Community (https://invisioncommunity.com)
  • Block pages with no unique content
  • Block faceted pages and 301 redirect pages
  • Sitemap URL