travelden.co.uk
robots.txt

Robots Exclusion Standard data for travelden.co.uk

Resource Scan

Scan Details

Site Domain travelden.co.uk
Base Domain travelden.co.uk
Scan Status Ok
Last Scan2024-04-28T02:05:50+00:00
Next Scan 2024-05-05T02:05:50+00:00

Last Scan

Scanned2024-04-28T02:05:50+00:00
URL https://travelden.co.uk/robots.txt
Redirect https://www.travelcaribou.com/robots.txt
Redirect Domain www.travelcaribou.com
Redirect Base travelcaribou.com
Domain IPs 104.21.34.107, 172.67.159.66, 2606:4700:3031::6815:226b, 2606:4700:3032::ac43:9f42
Redirect IPs 104.26.2.15, 104.26.3.15, 172.67.70.195, 2606:4700:20::681a:20f, 2606:4700:20::681a:30f, 2606:4700:20::ac43:46c3
Response IP 172.67.70.195
Found Yes
Hash e26d03acb7d48ae962a2df5537f6aada747af5446e918bc5168931cb4be3d10b
SimHash ca10c8c0a6a9

Groups

*

Rule Path
Allow /ads.txt
Disallow /trackback/
Disallow /comments/
Disallow /cgi-bin
Disallow /wp-admin/
Disallow /?s=*
Disallow /stats/
Disallow /events/
Disallow /search/
Disallow /21287525/
Disallow /author
Disallow /wget/
Disallow /httpd/
Disallow /wp-content/uploads/

Other Records

Field Value
crawl-delay 30

googlebot-image

Rule Path
Disallow /

twitterbot

Rule Path
Allow /wp-content/uploads/

facebookexternalhit

Rule Path
Allow /wp-content/uploads/

ia_archiver

Rule Path
Disallow /