startv.pt
robots.txt

Robots Exclusion Standard data for startv.pt

Resource Scan

Scan Details

Site Domain startv.pt
Base Domain startv.pt
Scan Status Ok
Last Scan2024-11-09T05:05:42+00:00
Next Scan 2024-11-16T05:05:42+00:00

Last Scan

Scanned2024-11-09T05:05:42+00:00
URL https://startv.pt/robots.txt
Domain IPs 37.59.93.230
Response IP 37.59.93.230
Found Yes
Hash 4505e7f002760d714b77f1a8704ca512100e8d96c83c5cf6be04ce3eb8f51a55
SimHash 6f17d420c333

Groups

*

Rule Path
Allow /$
Allow /series
Allow /filmes
Allow /programacao
Allow /especial
Allow /info
Allow /contacte-nos
Allow /videos
Allow /recipe
Allow /sitemap.xml
Allow /sitemap-video.xml
Disallow /

Other Records

Field Value
crawl-delay 5

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.startv.pt/sitemap.xml
sitemap https://www.startv.pt/sitemap-video.xml

Comments

  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449
  • PT