catorze.cat
robots.txt

Robots Exclusion Standard data for catorze.cat

Resource Scan

Scan Details

Site Domain catorze.cat
Base Domain catorze.cat
Scan Status Ok
Last Scan 2024-09-19T18:08:49+00:00
Next Scan 2024-09-26T18:08:49+00:00

Last Scan

Scanned 2024-09-19T18:08:49+00:00
URL https://catorze.cat/robots.txt
Redirect https://www.catorze.cat/robots.txt
Redirect Domain www.catorze.cat
Redirect Base catorze.cat
Domain IPs 104.21.59.62, 172.67.216.158, 2606:4700:3031::6815:3b3e, 2606:4700:3035::ac43:d89e
Redirect IPs 104.21.59.62, 172.67.216.158, 2606:4700:3031::6815:3b3e, 2606:4700:3035::ac43:d89e
Response IP 172.67.216.158
Found Yes
Hash 3f59532d519518f94fc9b0b79091da66d060056550fa12808f63ecbee1d0f488
SimHash 914048408690

Groups

*

Rule Path
Disallow /_call*
Disallow /*breaking-news-es.json*
Disallow /*breaking-news-ca.json*
Disallow /buscador.html?*
Disallow /cercador.html?*
Disallow /amp-news-list.html*
Disallow *?idComment=*
Disallow /tag/*
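
The Disallow paths above use the `*` wildcard, which matches any sequence of characters. The following is a minimal sketch of how a crawler might test a URL path against these rules. It assumes the common convention that a rule matches from the start of the path plus query string and that a trailing `$` anchors the end. The function names `rule_to_regex` and `is_disallowed` are hypothetical, and the sketch ignores Allow rules and longest-match precedence, neither of which appears in this group.

```python
import re

# Disallow rules for the "*" group, copied from the scan above.
DISALLOW_RULES = [
    "/_call*",
    "/*breaking-news-es.json*",
    "/*breaking-news-ca.json*",
    "/buscador.html?*",
    "/cercador.html?*",
    "/amp-news-list.html*",
    "*?idComment=*",
    "/tag/*",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any character sequence and a trailing '$'
    anchors the end of the path; everything else is literal.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW_RULES) -> bool:
    """Return True if any rule matches the path from its first character."""
    # re.match anchors at the start of the string, giving prefix matching.
    return any(rule_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for candidate in ["/tag/politica", "/cercador.html?q=x", "/cercador.html", "/"]:
        print(candidate, is_disallowed(candidate))
```

Note that under this reading the search pages `/buscador.html` and `/cercador.html` are blocked only when a query string is present, because the rules include a literal `?`.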

Other Records

Field Value
sitemap https://mercurium.wearebab.com/uploads/feeds/google_sitemap_es.xml
sitemap https://mercurium.wearebab.com/uploads/feeds/google_sitemap_ca.xml
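
Both sitemap records point to a different host (mercurium.wearebab.com) than the scanned site. As a rough illustration, the sketch below reads sitemap records with Python's standard `urllib.robotparser`. The `ROBOTS_LINES` content is a reconstruction from the records in this report, not the verbatim file, and `extract_sitemaps` is a hypothetical helper name. `RobotFileParser.site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan records above; the real file's exact
# formatting and line order are assumptions.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /tag/*",
    "Sitemap: https://mercurium.wearebab.com/uploads/feeds/google_sitemap_es.xml",
    "Sitemap: https://mercurium.wearebab.com/uploads/feeds/google_sitemap_ca.xml",
]


def extract_sitemaps(lines):
    """Return the sitemap URLs declared in the given robots.txt lines."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no sitemap record is present.
    return parser.site_maps() or []


if __name__ == "__main__":
    for url in extract_sitemaps(ROBOTS_LINES):
        print(url)
```

The standard-library parser is used here only for the sitemap records; it does not interpret `*` wildcards in Disallow paths.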