curvage.org
robots.txt

Robots Exclusion Standard data for curvage.org

Resource Scan

Scan Details

Site Domain curvage.org
Base Domain curvage.org
Scan Status Ok
Last Scan2024-11-04T14:50:22+00:00
Next Scan 2024-11-18T14:50:22+00:00

Last Scan

Scanned2024-11-04T14:50:22+00:00
URL https://curvage.org/robots.txt
Domain IPs 104.21.14.204, 172.67.160.139, 2606:4700:3032::6815:ecc, 2606:4700:3037::ac43:a08b
Response IP 104.21.14.204
Found Yes
Hash 27da85167024f2e5d3ae6f009971507f32397ee7498278fe9d122faf0ab94af1
SimHash 43406a8382b2

Groups

*

Rule Path
Disallow /mvf/
Allow /favicon.ico
Disallow /forum/startTopic/
Disallow /forum/discover/unread/
Disallow /forum/markallread/
Disallow /forum/staff/
Disallow /forum/cookie/
Disallow /forum/online/
Disallow /forum/discover/
Disallow /forum/leaderboard/
Disallow /forum/search/
Disallow /forum/tags/
Disallow /*?advancedSearchForm=
Disallow /forum/register/
Disallow /forum/lostpassword/
Disallow /forum/login/
Disallow /*?sortby=
Disallow /*?filter=
Disallow /*?tab=
Disallow /*?do=
Disallow /*ref%3D
Disallow /*?forumId*
Disallow /*?&controller=embed

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.curvage.org/forum/sitemap.php

Comments

  • Rules for Curvage Community (https://www.Curvage.org)
  • Block pages with no unique content
  • Block faceted pages and 301 redirect pages
  • Sitemap URL