ac24horas.com
robots.txt

Robots Exclusion Standard data for ac24horas.com

Resource Scan

Scan Details

Site Domain ac24horas.com
Base Domain ac24horas.com
Scan Status Ok
Last Scan 2024-06-10T04:35:02+00:00
Next Scan 2024-06-17T04:35:02+00:00

Last Scan

Scanned 2024-06-10T04:35:02+00:00
URL https://ac24horas.com/robots.txt
Domain IPs 104.26.10.125, 104.26.11.125, 172.67.69.212, 2606:4700:20::681a:a7d, 2606:4700:20::681a:b7d, 2606:4700:20::ac43:45d4
Response IP 104.26.11.125
Found Yes
Hash 0e42bee69be77ab4ca1a7ec5d2acacfd604acb3136adf52d4c5d12e9547f02b1
SimHash e900ccc20013

Groups

mediapartners-google

Rule Path
Disallow (empty)

googlebot-image

Rule Path
Disallow (empty)

*

Rule Path
Disallow /?s=
Disallow /page/*/?s=
Disallow /search/

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
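
How a crawler might evaluate the two "*" groups above can be sketched in Python. This is a minimal illustration, not the scanner's or any crawler's actual code. It assumes the RFC 9309 behaviour: groups for the same user agent are merged, "*" in a path matches any run of characters, the longest matching pattern wins, and Allow wins a tie. The names RULES and is_allowed are hypothetical.

```python
import re

# Rules copied from the two "*" groups in the scan, merged into one list
# (merging is an assumption based on RFC 9309).
RULES = [
    ("disallow", "/?s="),
    ("disallow", "/page/*/?s="),
    ("disallow", "/search/"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def _pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    # A trailing "$" anchors the end of the path; "*" matches any characters.
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be crawled."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty path matches nothing
        if _pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longest pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


for candidate in ("/wp-admin/admin-ajax.php", "/wp-admin/options.php",
                  "/page/2/?s=acre", "/politica/"):
    print(candidate, is_allowed(candidate))
```

Under these assumptions, /wp-admin/admin-ajax.php stays crawlable because the Allow pattern is longer than the /wp-admin/ Disallow, while search result URLs are blocked.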

Other Records

Field Value
sitemap https://ac24horas.com/news.xml
sitemap https://ac24horas.com/sitemaps.xml
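
The sitemap records and the two named groups with an empty Disallow can be read back with Python's standard urllib.robotparser module. The sketch below is an illustration only: the ROBOTS_TXT string is an assumed reconstruction from the fields in this scan (the raw file text is not shown here), the "*" groups are left out for brevity, and site_maps() requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of part of the file; the scan lists only parsed
# fields, so the exact original text and ordering are not known.
ROBOTS_TXT = """\
User-agent: mediapartners-google
Disallow:

User-agent: googlebot-image
Disallow:

Sitemap: https://ac24horas.com/news.xml
Sitemap: https://ac24horas.com/sitemaps.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Sitemap URLs declared in the file, in order of appearance.
sitemaps = parser.site_maps()

# An empty Disallow places no restriction on the named user agent.
image_bot_ok = parser.can_fetch(
    "googlebot-image", "https://ac24horas.com/wp-admin/"
)

print(sitemaps)
print(image_bot_ok)
```

An empty Disallow value means nothing is disallowed for that agent, so in this reconstruction the named bots are not bound by the "*" restrictions.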