comuniate.com
robots.txt

Robots Exclusion Standard data for comuniate.com

Resource Scan

Scan Details

Site Domain comuniate.com
Base Domain comuniate.com
Scan Status Ok
Last Scan2024-11-04T18:49:32+00:00
Next Scan 2024-11-11T18:49:32+00:00

Last Scan

Scanned2024-11-04T18:49:32+00:00
URL https://comuniate.com/robots.txt
Redirect https://www.comuniate.com/robots.txt
Redirect Domain www.comuniate.com
Redirect Base comuniate.com
Domain IPs 185.140.32.61
Redirect IPs 185.140.32.61
Response IP 185.140.32.61
Found Yes
Hash 4a85b302f3c06df1566f8a78c62abf5e6ec2a396bc5f22cf2737945730999828
SimHash c1159d024171

Groups

*

Rule Path
Allow *
Disallow /ajax/*
Disallow /php/*
Disallow /intranet/*
Disallow /articulos/*
Allow /intranet/noticias/*
Allow /intranet/jugadores/*
Allow /intranet/equipos/*

googlebot-news

Rule Path
Disallow /
Allow /noticias/*
Allow /noticias/
Allow /apuestas/*
Allow /apuesta/*
Allow /noticias_amp/*
Allow /noticias_amp/

Other Records

Field Value
sitemap https://www.comuniate.com/sitemap_news.php
sitemap https://www.comuniate.com/sitemap_autores.php