Robots Exclusion Standard data for rte.ie

Resource Scan

Scan Details

Site Domain rte.ie
Base Domain rte.ie
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-04-14T18:37:46+00:00
Next Scan 2024-07-13T18:37:46+00:00

Last Successful Scan

Scanned 2023-06-20T17:58:06+00:00
URL https://rte.ie/robots.txt
Redirect https://www.rte.ie/robots.txt
Redirect Domain www.rte.ie
Redirect Base rte.ie
Domain IPs 104.18.142.17, 104.18.143.17, 2606:4700::6812:8e11, 2606:4700::6812:8f11
Redirect IPs 104.18.142.17, 104.18.143.17, 2606:4700::6812:8e11, 2606:4700::6812:8f11
Response IP 104.18.142.17
Found Yes
Hash 9cfa7b4bb819d92497cdb0078c63e4c6c833e17667770bf3cd15f0652b088461
SimHash 0b95ac54bd98

Groups

*

Rule Path
Disallow /inc/
Disallow /images/
Disallow /image/
Disallow /style/
Disallow /script/
Disallow /errorpages/
Disallow /*/inc/
Disallow /*/images/
Disallow /*/image/
Disallow /*/style/
Disallow /*/script/
Disallow /*.gif$
Disallow /*.ppt$
Disallow /*.doc$
Disallow /*.docx$
Disallow /*.pptx$
Disallow /*.xls$
Disallow /*.xlsx$
Disallow /*.csv$
Disallow /*.js$
Disallow /*.inc$
Disallow /*.css$
Disallow /*/wp-content/
Disallow /*search/*%26sort%3D*
Disallow /*/search/*
Disallow /news/search/
Disallow /news/search
Disallow /*/admin/
Disallow /tv/prosperity/
Disallow /rnag/search
Disallow /modules/
Disallow /sport/results/img/
Disallow /sport/results/widgets/
Disallow /aertel/assets/
Disallow /aertel/desktopxhtml/
Disallow /aertel//desktopxhtml/
Disallow /aertel///desktopxhtml/
Disallow /aertel/travelfinders/
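
Several paths in this group use the `*` and `$` characters. Under the Robots Exclusion Protocol (RFC 9309), `*` matches any run of characters and a trailing `$` anchors the pattern to the end of the URL path; every other pattern is a prefix match. The following is a minimal sketch of that matching, not the scanner's or any crawler's actual implementation. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the rule list is a small sample taken from the group above.

```python
import re

def rule_to_regex(path_pattern):
    """Translate one robots.txt path pattern into a compiled regex."""
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    # Escape each literal piece, then rejoin the pieces with '.*' for every '*'.
    body = ".*".join(re.escape(part) for part in path_pattern.split("*"))
    # Without a trailing '$' the pattern is a prefix match on the path.
    return re.compile("^" + body + ("$" if anchored else ""))

def is_disallowed(url_path, disallow_patterns):
    """True if any non-empty Disallow pattern matches the URL path."""
    # An empty Disallow value matches nothing, so it blocks nothing.
    return any(
        bool(p) and rule_to_regex(p).match(url_path) is not None
        for p in disallow_patterns
    )

# Sample of rules copied from the '*' group above (assumption: a subset only).
SAMPLE_RULES = ["/inc/", "/*/images/", "/*.gif$", "/*/search/*", "/news/search"]

print(is_disallowed("/news/images/logo.png", SAMPLE_RULES))
print(is_disallowed("/sport/banner.gif", SAMPLE_RULES))
print(is_disallowed("/sport/banner.gif?v=2", SAMPLE_RULES))
```

Note that the sketch ignores Allow rules and longest-match precedence, which a complete evaluator would also need; this group contains only Disallow rules, so any match is enough here.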

mediapartners-google

Rule Path
Disallow

googlebot

Rule Path
Disallow /*/search/*
Disallow /search
Disallow /rnag/search

bingbot(at)microsoft.com

Rule Path
Disallow /*/search/*
Disallow /rnag/search
Disallow /search

bingbot

Rule Path
Disallow /*/search/*
Disallow /search
Disallow /rnag/search

baiduspider

Rule Path
Disallow /*/search/*
Disallow /search
Disallow /rnag/search

yandexbot

Rule Path
Disallow /*/search/*
Disallow /search
Disallow /rnag/search

isposure agent

Rule Path
Disallow *

mediapartners-google

Rule Path
Disallow

proximic

Rule Path
Disallow /player/

ahrefsbot

Rule Path
Disallow /

go-http-client

Rule Path
Disallow /

giant oak mn

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /radio
Disallow /radio1
Disallow /lyricfm
Disallow /rnag
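
The groups above apply per crawler: under RFC 9309 a crawler uses the group whose user-agent line matches its product token, compared case-insensitively, and falls back to the `*` group when none matches. A minimal sketch of that selection follows. The `GROUPS` dictionary is an abbreviated, hand-copied subset of the groups listed above, and `select_group` is a hypothetical helper, not part of any library.

```python
# Abbreviated subset of the groups listed above (assumption: not the full file).
GROUPS = {
    "*": ["/inc/", "/images/"],
    "googlebot": ["/*/search/*", "/search", "/rnag/search"],
    "ahrefsbot": ["/"],
    "mediapartners-google": [""],  # empty Disallow: nothing is blocked
}

def select_group(product_token, groups):
    """Return the Disallow paths for this crawler, else those of the '*' group."""
    token = product_token.lower()  # user-agent matching is case-insensitive
    if token in groups:
        return groups[token]
    return groups.get("*", [])

print(select_group("Googlebot", GROUPS))
print(select_group("SomeOtherBot", GROUPS))
```

A specific group replaces the `*` group rather than adding to it, so in this sketch `googlebot` is bound only by its three search rules. A fuller parser would also merge groups that name the same user agent more than once, as happens with `mediapartners-google` in this file.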

Other Records

Field Value
sitemap https://www.rte.ie/sitemap.xml