encontre.tv
robots.txt

Robots Exclusion Standard data for encontre.tv

Resource Scan

Scan Details

Site Domain encontre.tv
Base Domain encontre.tv
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-03-01T23:34:10+00:00
Next Scan 2025-05-30T23:34:10+00:00

Last Successful Scan

Scanned2024-04-13T23:13:50+00:00
URL https://encontre.tv/robots.txt
Domain IPs 104.21.58.151, 172.67.205.44, 2606:4700:3034::6815:3a97, 2606:4700:3036::ac43:cd2c
Response IP 104.21.58.151
Found Yes
Hash f6160c4467756fdef2794d13624665430fcdd6e19179cd78184ceef3c5e75e42
SimHash 18a04830ca88

Groups

*

Rule Path
Disallow /startTopic/
Disallow /*?do=add
Disallow /*?do=submit
Disallow /discover/unread/
Disallow /markallread/
Disallow /staff/
Disallow /online/
Disallow /discover/
Disallow /leaderboard/
Disallow /search/
Disallow /*?advancedSearchForm=
Disallow /tags/
Disallow /forums/
Disallow /browse/
Disallow /new/
Disallow /assistir-*
Disallow /*?sortby=
Disallow /*?alphabet=
Disallow /*?type=
Disallow /*?filter=
Disallow /*?tab=comments
Disallow /*?do=findComment
Disallow /*?do=getLastComment
Disallow /*?do=getNewComment
Disallow /*?do=reportComment
Disallow /*?do=markRead
Disallow /*?area=online
Disallow /*?area=info
Disallow /*?p=
Disallow /profile/

Other Records

Field Value
sitemap https://encontre.tv/sitemap.php

Comments

  • Block pages with no unique content
  • Disallow: /register/
  • Disallow: /lostpassword/
  • Disallow: /login/
  • Block faceted pages and 301 redirect pages
  • Block profile pages as these have little unique value, consume a lot of crawl time and contain hundreds of 301 links
  • Sitemap URL