parentscafe.gr
robots.txt

Robots Exclusion Standard data for parentscafe.gr

Resource Scan

Scan Details

Site Domain parentscafe.gr
Base Domain parentscafe.gr
Scan Status Ok
Last Scan2024-11-12T16:39:53+00:00
Next Scan 2024-11-19T16:39:53+00:00

Last Scan

Scanned2024-11-12T16:39:53+00:00
URL https://parentscafe.gr/robots.txt
Redirect https://www.parentscafe.gr/forum/robots.txt
Redirect Domain www.parentscafe.gr
Redirect Base parentscafe.gr
Domain IPs 104.21.42.134, 172.67.206.25, 2606:4700:3033::6815:2a86, 2606:4700:3037::ac43:ce19
Redirect IPs 104.21.42.134, 172.67.206.25, 2606:4700:3033::6815:2a86, 2606:4700:3037::ac43:ce19
Response IP 172.67.206.25
Found Yes
Hash 6db563c2d8a0ab30d08e735735d5be619aebed53248919504f230fb97e9fbad5
SimHash cb403da7c98c

Groups

*

Rule Path
Disallow /forum/startTopic/
Disallow /forum/discover/unread/
Disallow /forum/markallread/
Disallow /forum/staff/
Disallow /forum/online/
Disallow /forum/discover/
Disallow /forum/leaderboard/
Disallow /forum/search/
Disallow /forum/tags/
Disallow /forum/*?advancedSearchForm=
Disallow /forum/register/
Disallow /forum/lostpassword/
Disallow /forum/login/
Disallow /forum/*?sortby=
Disallow /forum/*?filter=
Disallow /forum/*?tab=
Disallow /forum/*?do=
Disallow /forum/*ref%3D
Disallow /forum/*?forumId*
Disallow /forum/profile/

Other Records

Field Value
sitemap https://www.parentscafe.gr/forum/sitemap.php

Comments

  • Rules for Invision Community (https://invisioncommunity.com)
  • Block pages with no unique content
  • Block faceted pages and 301 redirect pages
  • Block profile pages as these have little unique value, consume a lot of crawl time and contain hundreds of 301 links
  • Sitemap URL