corsair.wtf
robots.txt

Robots Exclusion Standard data for corsair.wtf

Resource Scan

Scan Details

Site Domain corsair.wtf
Base Domain corsair.wtf
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-04-13T23:46:55+00:00
Next Scan 2025-07-12T23:46:55+00:00

Last Successful Scan

Scanned2024-12-15T18:25:46+00:00
URL https://corsair.wtf/robots.txt
Domain IPs 104.21.40.180, 172.67.156.19, 2606:4700:3030::6815:28b4, 2606:4700:3037::ac43:9c13
Response IP 104.21.40.180
Found Yes
Hash 4958dedcbb27ef133afd30933c817bb4f8bb6446464a6bb1d96276af54bd7c5d
SimHash b92048108e88

Groups

*

Rule Path
Disallow /startTopic/
Disallow /*?do=add
Disallow /*?do=submit
Disallow /discover/unread/
Disallow /markallread/
Disallow /staff/
Disallow /online/
Disallow /discover/
Disallow /leaderboard/
Disallow /search/
Disallow /*?advancedSearchForm=
Disallow /register/
Disallow /lostpassword/
Disallow /login/
Disallow /*?sortby=
Disallow /*?filter=
Disallow /*?tab=comments
Disallow /*?do=email
Disallow /*?do=findComment
Disallow /*?do=getLastComment
Disallow /*?do=getNewComment
Disallow /profile/

Other Records

Field Value
sitemap https://corsair.wtf/sitemap.php

Comments

  • Block pages with no unique content
  • Block faceted pages and 301 redirect pages
  • Block profile pages as these have little unique value, consume a lot of crawl time and contain hundreds of 301 links