pressa.tv
robots.txt

Robots Exclusion Standard data for pressa.tv

Resource Scan

Scan Details

Site Domain pressa.tv
Base Domain pressa.tv
Scan Status Ok
Last Scan 2024-11-14T23:11:12+00:00
Next Scan 2024-11-21T23:11:12+00:00

Last Scan

Scanned 2024-11-14T23:11:12+00:00
URL https://pressa.tv/robots.txt
Domain IPs 2.58.67.220
Response IP 2.58.67.220
Found Yes
Hash f90cdb74ac1a5e8ad0994d4dc80d20a1bd002940c07d1bb48086d4956233bbc8
SimHash 750d85718033

Groups

mediapartners-google

Rule Path
Allow /page/
Allow */page/*

*

Rule Path
Disallow /engine/go.php
Disallow /engine/download.php
Disallow /newposts/
Disallow /statistics.html
Disallow /*subaction%3Duserinfo
Disallow /*subaction%3Dnewposts
Disallow /*do%3Dlastcomments
Disallow /*do%3Dfeedback
Disallow /*do%3Dregister
Disallow /*do%3Dlostpassword
Disallow /*do%3Daddnews
Disallow /*do%3Dstats
Disallow /*do%3Dpm
Disallow /*do%3Dsearch
Disallow /lastnews/
Disallow /page/
Disallow */page/*
Disallow /*print
Disallow /2011/
Disallow /2012/
Disallow /2013/
Disallow /2014/
Disallow /2015/
Disallow /2016/
Disallow /2017/
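
The two groups above give opposite answers for paginated URLs: `mediapartners-google` is allowed to fetch `/page/` paths, while every other crawler falls into the `*` group and is not. The sketch below illustrates how such rules are commonly evaluated (longest matching pattern wins, ties go to Allow, as described in RFC 9309). It is a minimal illustration, not the scanner's own logic: the function names are hypothetical, only a subset of the rules is included, and user agents are matched by exact name rather than by product-token matching.

```python
import re

# Subset of the rules listed in the report, keyed by group name.
GROUPS = {
    "mediapartners-google": [
        ("allow", "/page/"),
        ("allow", "*/page/*"),
    ],
    "*": [
        ("disallow", "/engine/go.php"),
        ("disallow", "/page/"),
        ("disallow", "*/page/*"),
        ("disallow", "/*print"),
        ("disallow", "/2015/"),
    ],
}


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def rules_for(user_agent: str):
    """Pick a group by exact (case-insensitive) name, else fall back to '*'.

    Simplification: real crawlers match on the product token.
    """
    return GROUPS.get(user_agent.lower(), GROUPS["*"])


def is_allowed(rules, path: str) -> bool:
    """Longest matching pattern wins; on equal length, Allow wins.

    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for agent in ("Mediapartners-Google", "Googlebot"):
        for path in ("/page/2/", "/2015/01/story.html", "/2018/01/story.html"):
            print(agent, path, is_allowed(rules_for(agent), path))
```

The percent-encoded patterns such as `/*do%3Dsearch` are left out of the sketch on purpose: whether `%3D` is compared literally or as `=` depends on how a given crawler normalizes URLs.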

Other Records

Field Value
sitemap https://pressa.tv/sitemap.xml

Warnings

  • `host` is not a known field.
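
A warning like the one above can be reproduced with a simple field check. The sketch below is an assumption about how such a check might work, not the scanner's actual code: the set of known fields is a guess, the function name is hypothetical, and the `Host` value in the sample is a placeholder because the report does not show the original value.

```python
# Assumed list of recognized field names; the scanner's real list is not shown.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def unknown_fields(robots_txt: str) -> list[str]:
    """Return field names (lowercased, in first-seen order) that are not known."""
    seen: list[str] = []
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue  # blank or malformed line
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in seen:
            seen.append(field)
    return seen


# Sample input; the Host value is a placeholder, not taken from the report.
SAMPLE = """\
User-agent: *
Disallow: /engine/go.php
Host: example.com
Sitemap: https://pressa.tv/sitemap.xml
"""

if __name__ == "__main__":
    for field in unknown_fields(SAMPLE):
        print(f"`{field}` is not a known field.")
```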