ccrtv.cat
robots.txt

Robots Exclusion Standard data for ccrtv.cat

Resource Scan

Scan Details

Site Domain ccrtv.cat
Base Domain ccrtv.cat
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-10-01T20:00:08+00:00
Next Scan 2024-12-30T20:00:08+00:00

Last Successful Scan

Scanned 2024-06-04T19:12:29+00:00
URL http://ccrtv.cat/robots.txt
Redirect https://www.ccma.cat/robots.txt
Redirect Domain www.ccma.cat
Redirect Base ccma.cat
Domain IPs 185.104.134.129
Redirect IPs 154.47.23.177, 212.102.42.89, 2a02:6ea0:d342::4, 2a02:6ea0:d638::4
Response IP 212.102.42.89
Found Yes
Hash 01fabb7667ea87948358040fd2bc20b772ec3fed56d5b1e416fd92f0bb4c29cb
SimHash 0ae05a0c8c53
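
The Hash value above is 64 hexadecimal digits, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256 over the raw response body (an assumption, not something the scan states), a later fetch could be compared against the recorded value roughly as follows. The function names are hypothetical.

```python
import hashlib

# Hash recorded for the last successful scan (copied from the report).
RECORDED_HASH = "01fabb7667ea87948358040fd2bc20b772ec3fed56d5b1e416fd92f0bb4c29cb"


def robots_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def is_unchanged(body: bytes) -> bool:
    """True if the body hashes to the value recorded in the report."""
    return robots_digest(body) == RECORDED_HASH
```

If the scanner normalises the body before hashing (line endings, trailing whitespace), the digests will differ even for an unchanged file.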

Groups

*

Rule Path
Disallow /*/standalone/
Disallow /app_*/
Disallow /324/homes/
Disallow /catradio/clickat/
Disallow /catradio/homes/
Disallow /corporatiu/rs/contacte/*/
Disallow /cultura/homes/
Disallow /el-temps/homes/
Disallow /esport3/homes/
Disallow /iptv/
Disallow /qa/modul/*
Disallow /qa/test/*
Disallow /qa/redl/*
Disallow /tv3/homes/
Disallow /tv3/marato/recerca/proposta-malalties/*/
Disallow /video/ad-integration/*
Disallow /tv3/sx3/*/joc/pantalla-completa/
Disallow /tv3/sx3/families-escola/activitats/cercador/
Disallow /tv3/sx3/families-escola/activitats/cercador-mapa/

mediapartners-google

Rule Path
Disallow /tv3/sx3/
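
Several of the rules above use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how the two groups could be evaluated, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the path. The helper names and the sample paths in the usage note are hypothetical; the rule lists are copied from the groups above.

```python
import re

# Rules from the "*" group.
STAR_RULES = [
    "/*/standalone/",
    "/app_*/",
    "/324/homes/",
    "/catradio/clickat/",
    "/catradio/homes/",
    "/corporatiu/rs/contacte/*/",
    "/cultura/homes/",
    "/el-temps/homes/",
    "/esport3/homes/",
    "/iptv/",
    "/qa/modul/*",
    "/qa/test/*",
    "/qa/redl/*",
    "/tv3/homes/",
    "/tv3/marato/recerca/proposta-malalties/*/",
    "/video/ad-integration/*",
    "/tv3/sx3/*/joc/pantalla-completa/",
    "/tv3/sx3/families-escola/activitats/cercador/",
    "/tv3/sx3/families-escola/activitats/cercador-mapa/",
]

# Rules from the "mediapartners-google" group.
MEDIAPARTNERS_RULES = ["/tv3/sx3/"]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces and join them with ".*" for each "*".
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, rules: list[str]) -> bool:
    """True if any Disallow rule matches from the start of the path."""
    return any(rule_to_regex(rule).match(path) for rule in rules)
```

For example, `is_disallowed("/tv3/homes/", STAR_RULES)` is true, while a hypothetical path such as `/tv3/sx3/capitols/` is not blocked by the `*` group but is blocked for `mediapartners-google`. Because both groups contain only Disallow rules, the sketch skips the longest-match precedence that would be needed if Allow rules were present.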

Comments

  • ccma.cat