insomnia.gr
robots.txt

Robots Exclusion Standard data for insomnia.gr

Resource Scan

Scan Details

Site Domain insomnia.gr
Base Domain insomnia.gr
Scan Status Ok
Last Scan 2024-11-09T06:39:29+00:00
Next Scan 2024-11-16T06:39:29+00:00

Last Scan

Scanned 2024-11-09T06:39:29+00:00
URL https://insomnia.gr/robots.txt
Redirect https://www.insomnia.gr/robots.txt
Redirect Domain www.insomnia.gr
Redirect Base insomnia.gr
Domain IPs 104.26.4.164, 104.26.5.164, 172.67.72.120, 2606:4700:20::681a:4a4, 2606:4700:20::681a:5a4, 2606:4700:20::ac43:4878
Redirect IPs 104.26.4.164, 104.26.5.164, 172.67.72.120, 2606:4700:20::681a:4a4, 2606:4700:20::681a:5a4, 2606:4700:20::ac43:4878
Response IP 172.67.72.120
Found Yes
Hash b465e833a2e09d0efafde12ab1fffa4481491ad15ce783707aa10b9ed73d3bb8
SimHash 1c304d010488

Groups

*

Rule Path
Disallow /startTopic/
Disallow /*?do=add
Disallow /*?do=submit
Disallow /discover/unread/
Disallow /markallread/
Disallow /staff/
Disallow /online/
Disallow /discover/
Disallow /leaderboard/
Disallow /search/
Disallow /*?advancedSearchForm=
Disallow /register/
Disallow /lostpassword/
Disallow /login/
Disallow /*?sortby=
Disallow /*?filter=
Disallow /*?tab=
Disallow /*?do=
Disallow /*?ref=
Disallow /*?forumId=
Disallow /profile/
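
To illustrate how the rules in this group apply to concrete URLs, here is a minimal sketch of wildcard matching in the style of RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end, and everything else is a literal prefix match. The helper names `rule_to_regex` and `is_disallowed` and the sample paths are hypothetical. Because this group has no Allow rules, the sketch skips longest-match precedence; it is not a description of how any particular crawler implements matching.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW = [
    "/startTopic/", "/*?do=add", "/*?do=submit", "/discover/unread/",
    "/markallread/", "/staff/", "/online/", "/discover/", "/leaderboard/",
    "/search/", "/*?advancedSearchForm=", "/register/", "/lostpassword/",
    "/login/", "/*?sortby=", "/*?filter=", "/*?tab=", "/*?do=", "/*?ref=",
    "/*?forumId=", "/profile/",
]


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" for each "*".
    regex = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    if anchored:
        regex += "$"
    return re.compile(regex)


def is_disallowed(path_and_query: str) -> bool:
    """True if any Disallow rule matches from the start of the path."""
    return any(rule_to_regex(rule).match(path_and_query) for rule in DISALLOW)


if __name__ == "__main__":
    for sample in ("/profile/42-user/", "/forums/?sortby=date",
                   "/forums/topic/1-hello/", "/x?a=1&tab=2"):
        print(sample, is_disallowed(sample))
```

Note that the query-string rules contain a literal `?`, so under this reading `/*?tab=` matches only when `tab` is the first query parameter; a URL such as `/x?a=1&tab=2` is not caught.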

Other Records

Field Value
sitemap https://www.insomnia.gr/sitemap.php
sitemap https://www.insomnia.gr/insomnia/news_sitemap

Comments

  • Block pages with no unique content
  • Block faceted pages and 301 redirect pages
  • Block profile pages as these have little unique value, consume a lot of crawl time and contain hundreds of 301 links
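
The scan splits the file into groups, other records, and comments. As a rough sketch of how those pieces relate to the robots.txt text, the following reassembles a file from them. The function name `render_robots` is hypothetical, the rule list is abbreviated, and the placement of comments at the top is an assumption: the scan does not record where each comment sits in the original file.

```python
def render_robots(comments, groups, sitemaps):
    """Build robots.txt text from parsed parts (layout is an assumption)."""
    lines = [f"# {comment}" for comment in comments]
    for agent, rules in groups.items():
        lines.append("")
        lines.append(f"User-agent: {agent}")
        lines.extend(f"{directive}: {path}" for directive, path in rules)
    lines.append("")
    lines.extend(f"Sitemap: {url}" for url in sitemaps)
    return "\n".join(lines) + "\n"


comments = [
    "Block pages with no unique content",
    "Block faceted pages and 301 redirect pages",
]
# Abbreviated: only the first and last rules of the "*" group are shown.
groups = {"*": [("Disallow", "/startTopic/"), ("Disallow", "/profile/")]}
sitemaps = [
    "https://www.insomnia.gr/sitemap.php",
    "https://www.insomnia.gr/insomnia/news_sitemap",
]

text = render_robots(comments, groups, sitemaps)

if __name__ == "__main__":
    print(text)
```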