
Robots Exclusion Standard data for avrupagazete.co.uk

Resource Scan

Scan Details

Site Domain avrupagazete.co.uk
Base Domain avrupagazete.co.uk
Scan Status Ok
Last Scan 6/12/2025, 2:22:08 AM
Next Scan 6/19/2025, 2:22:08 AM

Last Scan

Scanned 6/12/2025, 2:22:08 AM
URL https://avrupagazete.co.uk/robots.txt
Redirect https://www.avrupagazete.co.uk/robots.txt
Redirect Domain www.avrupagazete.co.uk
Redirect Base avrupagazete.co.uk
Domain IPs 104.21.8.67, 172.67.138.162, 2606:4700:3037::6815:843, 2606:4700:3037::ac43:8aa2
Redirect IPs 104.21.8.67, 172.67.138.162, 2606:4700:3037::6815:843, 2606:4700:3037::ac43:8aa2
Response IP 104.21.8.67
Found Yes
Hash 4a6a02f04983dbad78e6b6e5aaa70ed45907dca26fff65f9b6ea94a56f40b71b
SimHash 6c381e36ee12
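
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256 over the raw response body, a digest of the same shape could be computed as in the sketch below. The function name `content_hash` is hypothetical, and whether the scanner normalizes the body before hashing is unknown, so the result is not guaranteed to reproduce the value listed.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a fetched robots.txt body.

    Assumption: the report's Hash field is SHA-256 of the raw bytes.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Placeholder body for illustration only, not the site's actual file.
    sample = b"User-agent: *\nAllow: /\n"
    print(content_hash(sample))
```

Comparing the digest between two scans is a cheap way to detect that the file changed at all; it says nothing about what changed.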

Groups

*

Rule Path
Disallow /public
Disallow /public/*
Disallow /public/index.php
Disallow /public/index.php/*
Disallow /service*
Disallow /share*
Disallow /tr/*
Disallow /*?ref=
Disallow /*?q=
Disallow /*?preview=
Disallow /*?utm_source=
Disallow /*?ref=
Disallow /*?page=
Allow /
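
The rules in the * group use the `*` wildcard, which Python's `urllib.robotparser` does not interpret. The sketch below shows one way to evaluate them, assuming the longest-match semantics described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The helper names `pattern_to_regex` and `is_allowed` are hypothetical, and the duplicate `/*?ref=` rule is listed once.

```python
import re

# Rules copied from the "*" group above (duplicate /*?ref= listed once).
RULES = [
    ("disallow", "/public"),
    ("disallow", "/public/*"),
    ("disallow", "/public/index.php"),
    ("disallow", "/public/index.php/*"),
    ("disallow", "/service*"),
    ("disallow", "/share*"),
    ("disallow", "/tr/*"),
    ("disallow", "/*?ref="),
    ("disallow", "/*?q="),
    ("disallow", "/*?preview="),
    ("disallow", "/*?utm_source="),
    ("disallow", "/*?page="),
    ("allow", "/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be crawled."""
    best = None  # (pattern length, is_allow); tuples compare element-wise
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    for p in ["/", "/public/index.php", "/news?page=2", "/news?a=1&page=2"]:
        print(p, is_allowed(p))
```

Note that a pattern such as `/*?page=` only matches when `page` is the first query parameter; under this reading, `/news?a=1&page=2` remains allowed.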

adsbot-google

Rule Path
Disallow /advert/*
Allow /

semrushbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

Other Records

Field Value
sitemap https://www.avrupagazete.co.uk/sitemap.xml
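
The crawl-delay and sitemap records above can be read with Python's standard `urllib.robotparser`. The sketch below parses a fragment reconstructed from the values in this report; the exact layout, casing, and ordering of the site's real file are assumptions. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Fragment reconstructed from the report; the real file's layout is assumed.
ROBOTS_FRAGMENT = """\
User-agent: semrushbot
Crawl-delay: 2

User-agent: ahrefsbot
Crawl-delay: 2

Sitemap: https://www.avrupagazete.co.uk/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_FRAGMENT.splitlines())

if __name__ == "__main__":
    # Seconds to wait between requests for each named crawler.
    print(parser.crawl_delay("semrushbot"))
    print(parser.crawl_delay("ahrefsbot"))
    # Crawlers without a matching group in this fragment get None.
    print(parser.crawl_delay("googlebot"))
    # Sitemap URLs declared anywhere in the file.
    print(parser.site_maps())
```

Crawl-delay is not part of RFC 9309, and support varies by crawler, so a value of 2 here is a request rather than a guarantee.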