thedailyedge.thejournal.ie
robots.txt

Robots Exclusion Standard data for thedailyedge.thejournal.ie

Resource Scan

Scan Details

Site Domain thedailyedge.thejournal.ie
Base Domain thejournal.ie
Scan Status Ok
Last Scan2024-06-08T20:54:29+00:00
Next Scan 2024-06-15T20:54:29+00:00

Last Scan

Scanned2024-06-08T20:54:29+00:00
URL https://thedailyedge.thejournal.ie/robots.txt
Redirect https://www.dailyedge.ie/robots.txt?utm_source=thedailyedge
Redirect Domain www.dailyedge.ie
Redirect Base dailyedge.ie
Domain IPs 18.202.156.188, 54.155.27.8, 54.195.75.121
Redirect IPs 18.202.156.188, 54.155.27.8, 54.195.75.121
Response IP 54.155.27.8
Found Yes
Hash f47814ec26ea87c0bcc24cb567d5beb983c812150add30f41cd76d4ffd1cc882
SimHash e93f5f14f236

Groups

*

Rule Path
Disallow

*

Rule Path
Disallow /search/*
Disallow /article-search*
Disallow */feed/
Disallow */feed/*
Disallow *oauth*
Disallow *subscription-admin%3D*
Disallow *switcher%3D*
Disallow *logout.php*
Disallow *category*Khadr*
Disallow *category*What%20are%20we%20voting%20on%20and%20why*
Disallow *category*Should%20the%20President%20be%20more%20than%20an%20ambassadorial%20role*
Disallow *category*AP%20Photo*
Disallow *category*who%20had%20been%20in%20power%20for%2023%20years*
Disallow *category*currentvacancies*
Disallow *category*THE%20BIGGEST%20GAME%20of%20the*
Disallow /profile/
Disallow /profile/*
Disallow /topic/*

bingbot

Rule Path
Disallow */*news*
Disallow /author*

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

Warnings

  • 4 invalid lines.