forum-bateau.caradisiac.com
robots.txt

Robots Exclusion Standard data for forum-bateau.caradisiac.com

Resource Scan

Scan Details

Site Domain forum-bateau.caradisiac.com
Base Domain caradisiac.com
Scan Status Ok
Last Scan 2024-09-22T22:06:22+00:00
Next Scan 2024-09-29T22:06:22+00:00
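
The two timestamps above suggest a weekly rescan, although the report does not state the schedule explicitly. A minimal sketch that checks the gap, with variable names chosen here for illustration only:

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details fields above.
last_scan = datetime.fromisoformat("2024-09-22T22:06:22+00:00")
next_scan = datetime.fromisoformat("2024-09-29T22:06:22+00:00")

# Difference between the scheduled next scan and the last one.
interval = next_scan - last_scan
print(interval == timedelta(days=7))
```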

Last Scan

Scanned 2024-09-22T22:06:22+00:00
URL https://forum-bateau.caradisiac.com/robots.txt
Redirect https://forum-auto.caradisiac.com:443/robots.txt
Redirect Domain forum-auto.caradisiac.com
Redirect Base caradisiac.com
Domain IPs 2600:9000:23d0:0:8:21f6:5240:93a1, 2600:9000:23d0:2e00:8:21f6:5240:93a1, 2600:9000:23d0:7200:8:21f6:5240:93a1, 2600:9000:23d0:7c00:8:21f6:5240:93a1, 2600:9000:23d0:8e00:8:21f6:5240:93a1, 2600:9000:23d0:9800:8:21f6:5240:93a1, 2600:9000:23d0:c600:8:21f6:5240:93a1, 2600:9000:23d0:fe00:8:21f6:5240:93a1, 65.9.189.111, 65.9.189.121, 65.9.189.61, 65.9.189.62
Redirect IPs 2600:9000:23d0:1000:11:9afd:c740:93a1, 2600:9000:23d0:400:11:9afd:c740:93a1, 2600:9000:23d0:5e00:11:9afd:c740:93a1, 2600:9000:23d0:7800:11:9afd:c740:93a1, 2600:9000:23d0:9000:11:9afd:c740:93a1, 2600:9000:23d0:a00:11:9afd:c740:93a1, 2600:9000:23d0:b600:11:9afd:c740:93a1, 2600:9000:23d0:ba00:11:9afd:c740:93a1, 3.160.212.117, 3.160.212.19, 3.160.212.22, 3.160.212.41
Response IP 52.85.49.109
Found Yes
Hash 33616c7cba3a548eb9f2a05837837a71f6a38df2ed1efbb2195c89fd36a23f8c
SimHash b83049138c8a

Groups

*

Rule Path
Disallow /startTopic/
Disallow /discover/unread/
Disallow /markallread/
Disallow /staff/
Disallow /online/
Disallow /discover/
Disallow /leaderboard/
Disallow /search/
Disallow /*?advancedSearchForm=
Disallow /register/
Disallow /lostpassword/
Disallow /login/
Disallow /*?sortby=
Disallow /*?filter=
Disallow /*?tab=
Disallow /*?do=
Disallow /*ref%3D
Disallow /*?forumId*
Disallow /?app*
Disallow /profile/
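
Several of the rules above use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch, not the scanner's or any crawler's actual implementation, of how such rules could be matched against a URL path plus query string, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end. The helper names `rule_to_regex` and `is_disallowed` are hypothetical. Because this group contains no Allow rules, the sketch skips longest-match precedence, and it does not normalize percent-encoding.

```python
import re

# Disallow paths copied from the "*" group in this report.
DISALLOW_RULES = [
    "/startTopic/", "/discover/unread/", "/markallread/", "/staff/",
    "/online/", "/discover/", "/leaderboard/", "/search/",
    "/*?advancedSearchForm=", "/register/", "/lostpassword/", "/login/",
    "/*?sortby=", "/*?filter=", "/*?tab=", "/*?do=", "/*ref%3D",
    "/*?forumId*", "/?app*", "/profile/",
]


def rule_to_regex(rule):
    """Translate one rule path into a compiled regex (hypothetical helper)."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape the literal pieces and let each "*" match any run of characters.
    pattern = ".*".join(re.escape(piece) for piece in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


COMPILED = [rule_to_regex(rule) for rule in DISALLOW_RULES]


def is_disallowed(path_and_query):
    """Return True if any Disallow rule matches from the start of the path."""
    # re.match anchors at the start of the string, so rules act as prefixes.
    return any(rx.match(path_and_query) for rx in COMPILED)


# The example paths below are made up for illustration.
print(is_disallowed("/profile/42-someone/"))                # True
print(is_disallowed("/topic/42-some-thread/"))              # False
print(is_disallowed("/topic/42-some-thread/?sortby=date"))  # True
```

Compiling the rules once keeps repeated lookups cheap; a fuller matcher would also handle Allow rules and per-agent groups.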

Other Records

Field Value
sitemap https://forum-auto.caradisiac.com/sitemap.php

Comments

  • Block pages with no unique content
  • Block faceted pages and 301 redirect pages
  • Block profile pages as these have little unique value, consume a lot of crawl time and contain hundreds of 301 links
  • Sitemap URL