carlow-nationalist.ie
robots.txt

Robots Exclusion Standard data for carlow-nationalist.ie

Resource Scan

Scan Details

Site Domain carlow-nationalist.ie
Base Domain carlow-nationalist.ie
Scan Status Ok
Last Scan2024-11-12T03:29:11+00:00
Next Scan 2024-11-19T03:29:11+00:00

Last Scan

Scanned2024-11-12T03:29:11+00:00
URL https://www.carlow-nationalist.ie/robots.txt
Domain IPs 213.182.15.181
Response IP 213.182.15.181
Found Yes
Hash a810cac78ac1306848cde38ca8188b03bcc827420dccdfd752e0c2517f434aef
SimHash 0224c155c254

Groups

*

Rule Path
Disallow /cms_addon
Disallow /cms_docs
Disallow /redFACT
Disallow /REST/frontend/itemstatistics

*

Rule Path
Disallow /pu_all
Allow /pu_all/img
Disallow /pu_carlow/
Allow /pu_carlow/img
Disallow /pu_kildare/
Allow /pu_kildare/img
Disallow /pu_laois/
Allow /pu_laois/img
Disallow /pu_roscommon/
Allow /pu_roscommon/img
Disallow /pu_waterford/
Allow /pu_waterford/img
Disallow /pu_western/
Allow /pu_western/img

googlebot

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-news

Rule Path
Disallow /sponsored/
Disallow /sponsored-content/
Disallow /sponsoredshowcase/
Disallow /test

ia_archiver

Rule Path
Disallow /

backlink-check.de

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

bloodhound

Rule Path
Disallow /

cydralspider

Rule Path
Disallow /

downloadexpress

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

fasterfox

Rule Path
Disallow /

gammaspider

Rule Path
Disallow /

linkextractorpro

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

node/simplecrawler

Rule Path
Disallow /

node/simplecrawler 0.7.0 (git+https://github.com/cgiffard/node-simplecrawler.git)

Rule Path
Disallow /

objectssearch

Rule Path
Disallow /

openbot

Rule Path
Disallow /

pimptrain

Rule Path
Disallow /

raven

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

searchpreview

Rule Path
Disallow /

simplecrawler

Rule Path
Disallow /

seodat

Rule Path
Disallow /

seoengbot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

url control

Rule Path
Disallow /

url_spider_pro

Rule Path
Disallow /

wapspider

Rule Path
Disallow /

webzinger

Rule Path
Disallow /

xovi

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.carlow-nationalist.ie/sitemap-index/1271-google_channel_sitemap_cn.xml
sitemap https://www.carlow-nationalist.ie/sitemap-index/1276-google_sitemap_cn.xml
sitemap https://www.carlow-nationalist.ie/sitemap-index/1280-google_news_cn.xml

Comments

  • global live settings :
  • customised settings :