ccma.cat
robots.txt

Robots Exclusion Standard data for ccma.cat

Resource Scan

Scan Details

Site Domain ccma.cat
Base Domain ccma.cat
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-08-06T02:45:08+00:00
Next Scan 2024-10-05T02:45:08+00:00

Last Successful Scan

Scanned 2024-06-08T02:43:04+00:00
URL https://ccma.cat/robots.txt
Redirect https://www.ccma.cat/robots.txt
Redirect Domain www.ccma.cat
Redirect Base ccma.cat
Domain IPs 185.104.134.129
Redirect IPs 138.199.8.193, 143.244.35.226, 2a02:6ea0:d342::4, 2a02:6ea0:d638::4, 37.19.207.209
Response IP 154.47.23.177
Found Yes
Hash 01fabb7667ea87948358040fd2bc20b772ec3fed56d5b1e416fd92f0bb4c29cb
SimHash 0ae05a0c8c53

Groups

*

Rule Path
Disallow /*/standalone/
Disallow /app_*/
Disallow /324/homes/
Disallow /catradio/clickat/
Disallow /catradio/homes/
Disallow /corporatiu/rs/contacte/*/
Disallow /cultura/homes/
Disallow /el-temps/homes/
Disallow /esport3/homes/
Disallow /iptv/
Disallow /qa/modul/*
Disallow /qa/test/*
Disallow /qa/redl/*
Disallow /tv3/homes/
Disallow /tv3/marato/recerca/proposta-malalties/*/
Disallow /video/ad-integration/*
Disallow /tv3/sx3/*/joc/pantalla-completa/
Disallow /tv3/sx3/families-escola/activitats/cercador/
Disallow /tv3/sx3/families-escola/activitats/cercador-mapa/
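
Several of the paths above contain the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how such patterns could be matched, assuming the common convention that `*` matches any run of characters and that a rule otherwise matches by prefix. The helper names and the sample request paths are hypothetical; only the rule strings come from the listing above.

```python
import re

def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any run of characters, a trailing
    '$' anchors the end of the path, everything else is a literal prefix.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

def is_disallowed(path: str, rules: list[str]) -> bool:
    """Return True if any Disallow pattern matches the start of the path."""
    return any(rule_to_regex(rule).match(path) is not None for rule in rules)

# A subset of the Disallow rules from the '*' group above.
RULES = ["/*/standalone/", "/app_*/", "/qa/test/*", "/iptv/"]

# Hypothetical request paths, used only for illustration.
print(is_disallowed("/tv3/standalone/player/", RULES))
print(is_disallowed("/tv3/alacarta/", RULES))
```

Because this group contains only Disallow rules, the sketch does not need to resolve precedence between Allow and Disallow.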

mediapartners-google

Rule Path
Disallow /tv3/sx3/
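
A crawler that finds a group naming its own user agent follows that group and ignores the `*` group, so `mediapartners-google` is bound only by the single rule above. A rough sketch of that selection step follows, assuming case-insensitive matching of the group name against the crawler's user-agent string. The substring test is a simplification, and the function name, the abbreviated rule lists, and the `SomeBot/1.0` agent are hypothetical.

```python
def select_group(user_agent: str, groups: dict[str, list[str]]) -> list[str]:
    """Pick the rules for the most specific matching group, else fall back to '*'."""
    agent = user_agent.lower()
    # Simplified matching: a group applies if its name occurs in the agent string.
    matches = [name for name in groups if name != "*" and name.lower() in agent]
    if matches:
        # Prefer the longest (most specific) matching group name.
        return groups[max(matches, key=len)]
    return groups.get("*", [])

# Abbreviated copy of the two groups listed in this report.
GROUPS = {
    "*": ["/iptv/", "/tv3/homes/"],
    "mediapartners-google": ["/tv3/sx3/"],
}

print(select_group("Mediapartners-Google", GROUPS))
print(select_group("SomeBot/1.0", GROUPS))
```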

Comments

  • ccma.cat