institucional.cienradios.com
robots.txt

Robots Exclusion Standard data for institucional.cienradios.com

Resource Scan

Scan Details

Site Domain institucional.cienradios.com
Base Domain cienradios.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-07-17T08:00:35+00:00
Next Scan 2024-10-15T08:00:35+00:00

Last Successful Scan

Scanned2023-11-28T07:58:18+00:00
URL https://institucional.cienradios.com/robots.txt
Domain IPs 184.87.193.142, 184.87.193.143, 2600:1413:b000:13::b857:c18e, 2600:1413:b000:13::b857:c18f
Response IP 42.99.140.195
Found Yes
Hash 42d08ed0343a01c1c4df3b8624d8f33a8f8a6f78315edc47fa1856c6e60c07cb
SimHash 7a19b872d3f2

Groups

*

Rule Path
Allow /
Disallow /pf/api/v3/*
Disallow /registro/*
Disallow /admin-concursos/*
Disallow /mantenimiento/*
Disallow /archivo/*
Allow /?p=*
Allow /?outputType=*

Other Records

Field Value
crawl-delay 1

*

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://institucional.cienradios.com/arc/outboundfeeds/sitemap/?outputType=xml
sitemap https://institucional.cienradios.com/arc/outboundfeeds/news-sitemap-index?outputType=xml
sitemap https://institucional.cienradios.com/arc/outboundfeeds/sitemap-index?outputType=xml
sitemap https://institucional.cienradios.com/arc/outboundfeeds/google-discover-feed/?outputType=xml
sitemap https://institucional.cienradios.com/arc/outboundfeeds/google-news-feed/?outputType=xml