njwoodsandwater.com
robots.txt

Robots Exclusion Standard data for njwoodsandwater.com

Resource Scan

Scan Details

Site Domain njwoodsandwater.com
Base Domain njwoodsandwater.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-04-11T05:28:31+00:00
Next Scan 2024-07-10T05:28:31+00:00

Last Successful Scan

Scanned 2023-09-13T10:26:44+00:00
URL https://njwoodsandwater.com/robots.txt
Redirect https://www.njwoodsandwater.com/robots.txt
Redirect Domain www.njwoodsandwater.com
Redirect Base njwoodsandwater.com
Domain IPs 104.26.14.10, 104.26.15.10, 172.67.70.115, 2606:4700:20::681a:e0a, 2606:4700:20::681a:f0a, 2606:4700:20::ac43:4673
Redirect IPs 104.26.14.10, 104.26.15.10, 172.67.70.115, 2606:4700:20::681a:e0a, 2606:4700:20::681a:f0a, 2606:4700:20::ac43:4673
Response IP 172.67.70.115
Found Yes
Hash 1e4233c40e4281961767109ee08a2783295bdec605648dd32f7a50c166b2b941
SimHash b8304d010c9a

Groups

*

Rule Path
Disallow /startTopic/
Disallow /markallread/
Disallow /staff/
Disallow /online/
Disallow /leaderboard/
Disallow /search/
Disallow /*?advancedSearchForm=
Disallow /register/
Disallow /lostpassword/
Disallow /login/
Disallow /*?sortby=
Disallow /*?filter=
Disallow /*?tab=
Disallow /*?do=
Disallow /*ref%3D
Disallow /*?forumId*
Disallow /profile/
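
Several of the rules above use the `*` wildcard, so whether a given URL is blocked is not always obvious from a plain prefix comparison. The following is a minimal sketch of how such patterns could be evaluated, assuming the usual wildcard semantics (`*` matches any run of characters, a trailing `$` anchors the end of the path). The helper names `pattern_to_regex` and `is_disallowed` and the sample paths are hypothetical. The sketch compares paths literally and does not normalise percent-encoding, and because the group holds only Disallow rules, any match is treated as blocked.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = [
    "/startTopic/", "/markallread/", "/staff/", "/online/",
    "/leaderboard/", "/search/", "/*?advancedSearchForm=",
    "/register/", "/lostpassword/", "/login/",
    "/*?sortby=", "/*?filter=", "/*?tab=", "/*?do=",
    "/*ref%3D", "/*?forumId*", "/profile/",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    '*' becomes '.*'; a trailing '$' anchors the end of the path.
    Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    # re.Pattern.match anchors at the beginning of the string, which mirrors
    # the prefix-style matching used for robots.txt rules.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    # Illustrative paths only; they are not taken from the scanned site.
    for sample in ["/login/", "/topic/123-example/?do=getNewComment",
                   "/forum/5-example/", "/profile/42-example/"]:
        print(sample, "blocked" if is_disallowed(sample) else "allowed")
```

Python's standard `urllib.robotparser` matches rules by plain prefix and does not interpret `*`, which is why this sketch converts the patterns by hand.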

Other Records

Field Value
sitemap https://www.njwoodsandwater.com/sitemap.php

Comments

  • Rules for Invision Community (https://invisioncommunity.com)
  • Block pages with no unique content
  • Disallow: /discover/unread/
  • Disallow: /discover/
  • Block faceted pages and 301 redirect pages
  • Block profile pages as these have little unique value, consume a lot of crawl time and contain hundreds of 301 links
  • Sitemap URL