nuevatribuna.es
robots.txt

Robots Exclusion Standard data for nuevatribuna.es

Resource Scan

Scan Details

Site Domain nuevatribuna.es
Base Domain nuevatribuna.es
Scan Status Ok
Last Scan 2024-05-25T04:07:40+00:00
Next Scan 2024-06-01T04:07:40+00:00

Last Scan

Scanned 2024-05-25T04:07:40+00:00
URL https://nuevatribuna.es/robots.txt
Redirect https://www.nuevatribuna.es/robots.txt
Redirect Domain www.nuevatribuna.es
Redirect Base nuevatribuna.es
Domain IPs 104.26.10.213, 104.26.11.213, 172.67.73.173, 2606:4700:20::681a:ad5, 2606:4700:20::681a:bd5, 2606:4700:20::ac43:49ad
Redirect IPs 104.26.10.213, 104.26.11.213, 172.67.73.173, 2606:4700:20::681a:ad5, 2606:4700:20::681a:bd5, 2606:4700:20::ac43:49ad
Response IP 172.67.73.173
Found Yes
Hash 179d9475df624643da7c0164589f1945bb3dfa2b2bff90377113f09caa0e3c72
SimHash 98204a04e9b2
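
The Hash value above is 64 hexadecimal digits, which matches the length of a SHA-256 digest. The report does not say which algorithm was used or exactly which bytes were hashed, so the following is only a sketch under the assumption that it is SHA-256 over the raw response body. The function name content_hash is hypothetical.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw bytes with SHA-256; the
    report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# A digest is always 64 hex characters, the same length as the Hash field.
digest = content_hash(b"User-agent: *\nDisallow: /admin\n")
print(len(digest))
```

Comparing the digest from two scans is a cheap way to tell whether the file changed at all; it says nothing about how much it changed.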

Groups

*

Rule Path
Disallow /harming/humans
Disallow /ignoring/human/orders
Disallow /harm/to/self
Disallow /api
Disallow /admin

*

Rule Path
Disallow /ads/
Disallow /content/print/
Disallow /comments/
Disallow /articulo/actualidad/montero-iglesias-temieron-seguridad-hijos-acosados-periodista/20220209131649195291.html
Disallow /articulo/actualidad/juicio-eduardo-inda-acosar-hijos-iglesias-montero/20210312170920185518.html
Disallow /articulo/actualidad/fiscalia-pide-archive-causa-inda-acoso-hijos-iglesias/20210408192425186525.html

Other Records

Field Value
sitemap https://www.nuevatribuna.es/sitemap.news.xml.gz
sitemap https://www.nuevatribuna.es/sitemap.xml
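
The groups and sitemap records listed above can be checked against concrete URLs with Python's standard urllib.robotparser. This is a minimal sketch, not the scanner's own method. It assumes the two "*" groups are merged into a single group, which is how RFC 9309 says crawlers should combine groups that match the same user agent; Python's parser keeps only the first "*" group if they are left separate. The layout of ROBOTS_TXT is a reconstruction from this report, and the helper name allowed is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Rules copied from the two "*" groups and the sitemap records in this report.
# Assumption: merged into one group; the live file may be laid out differently.
ROBOTS_TXT = """\
User-agent: *
Disallow: /harming/humans
Disallow: /ignoring/human/orders
Disallow: /harm/to/self
Disallow: /api
Disallow: /admin
Disallow: /ads/
Disallow: /content/print/
Disallow: /comments/
Disallow: /articulo/actualidad/montero-iglesias-temieron-seguridad-hijos-acosados-periodista/20220209131649195291.html
Disallow: /articulo/actualidad/juicio-eduardo-inda-acosar-hijos-iglesias-montero/20210312170920185518.html
Disallow: /articulo/actualidad/fiscalia-pide-archive-causa-inda-acoso-hijos-iglesias/20210408192425186525.html
Sitemap: https://www.nuevatribuna.es/sitemap.news.xml.gz
Sitemap: https://www.nuevatribuna.es/sitemap.xml
"""

BASE = "https://www.nuevatribuna.es"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(path: str, agent: str = "*") -> bool:
    """Return True if `agent` may fetch BASE + path under the rules above."""
    return parser.can_fetch(agent, BASE + path)


for path in ("/admin", "/ads/banner.js", "/comments/123", "/articulo/actualidad/"):
    print(path, "allowed" if allowed(path) else "disallowed")

# site_maps() requires Python 3.8 or later.
print(parser.site_maps())
```

Disallow paths are prefix matches, so /api also blocks /api/v1/items, while /ads/ (with the trailing slash) does not block a path such as /adserver.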