newscabal.co.uk
robots.txt

Robots Exclusion Standard data for newscabal.co.uk

Resource Scan

Scan Details

Site Domain newscabal.co.uk
Base Domain newscabal.co.uk
Scan Status Ok
Last Scan2024-05-22T22:49:54+00:00
Next Scan 2024-05-29T22:49:54+00:00

Last Scan

Scanned2024-05-22T22:49:54+00:00
URL https://newscabal.co.uk/robots.txt
Domain IPs 2a02:2350:5:101:8072:5cb6:3de:6c43, 77.111.240.238
Response IP 77.111.240.238
Found Yes
Hash 9e48350d6ae2be9b2cbbcd922e2d8be1174416c8b1e21794c471773e3c7401e1
SimHash 6614cdf2ce91

Groups

*

Rule Path
Disallow

*

Rule Path
Disallow /

googlebot

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

bingbot

Rule Path
Allow /

slurp

Rule Path
Allow /

*

Rule Path
Allow /ads.txt

yandex

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

dotbot

Rule Path
Allow /

org_bot

Rule Path
Disallow /

accompanybot

Rule Path
Allow /

tweetmemebot

Rule Path
Disallow /

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://www.newscabal.co.uk/sitemap_index.xml