avrupagazete.com
robots.txt

Robots Exclusion Standard data for avrupagazete.com

Resource Scan

Scan Details

Site Domain avrupagazete.com
Base Domain avrupagazete.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a server error.
Last Scan 5/19/2025, 11:59:21 AM
Next Scan 8/17/2025, 11:59:21 AM

Last Successful Scan

Scanned 1/27/2023, 7:11:43 PM
URL https://avrupagazete.com/robots.txt
Redirect https://www.avrupagazete.co.uk/robots.txt
Redirect Domain www.avrupagazete.co.uk
Redirect Base avrupagazete.co.uk
Domain IPs 78.135.109.97
Redirect IPs 104.21.8.67, 172.67.138.162, 2606:4700:3037::6815:843, 2606:4700:3037::ac43:8aa2
Response IP 104.21.8.67
Found Yes
Hash 4a6a02f04983dbad78e6b6e5aaa70ed45907dca26fff65f9b6ea94a56f40b71b
SimHash 6c381e36ee12

Groups

*

Rule Path
Disallow /public
Disallow /public/*
Disallow /public/index.php
Disallow /public/index.php/*
Disallow /service*
Disallow /share*
Disallow /tr/*
Disallow /*?ref=
Disallow /*?q=
Disallow /*?preview=
Disallow /*?utm_source=
Disallow /*?ref=
Disallow /*?page=
Allow /
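
The wildcard rules in the `*` group can be tried against sample paths with a small matcher. The sketch below is only an illustration, not the scanner's or any crawler's actual code. It assumes RFC 9309 matching semantics: `*` matches any run of characters, a trailing `$` anchors the end, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed`, and the sample paths, are hypothetical.

```python
import re

# Rules of the "*" group, copied from the listing above (duplicate included).
RULES = [
    ("disallow", "/public"),
    ("disallow", "/public/*"),
    ("disallow", "/public/index.php"),
    ("disallow", "/public/index.php/*"),
    ("disallow", "/service*"),
    ("disallow", "/share*"),
    ("disallow", "/tr/*"),
    ("disallow", "/*?ref="),
    ("disallow", "/*?q="),
    ("disallow", "/*?preview="),
    ("disallow", "/*?utm_source="),
    ("disallow", "/*?ref="),
    ("disallow", "/*?page="),
    ("allow", "/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the longest matching rule for `path` is an Allow."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            # Longer pattern wins; on equal length, Allow wins (assumed tie rule).
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for sample in ["/haber/123", "/haber?page=2", "/tr", "/tr/haber"]:
        print(sample, is_allowed(sample))
```

Under these assumptions, `/*?page=` blocks a path only when `page` is the first query parameter, because the pattern requires the literal `?page=`. A path such as `/haber?x=1&page=2` falls through to `Allow /`.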

adsbot-google

Rule Path
Disallow /advert/*
Allow /

semrushbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

Other Records

Field Value
sitemap https://www.avrupagazete.co.uk/sitemap.xml
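
The crawl-delay and sitemap records above can be read back with Python's standard `urllib.robotparser`. This is a minimal sketch under stated assumptions: `ROBOTS_TXT` is a reconstruction of the file from this report, not the original bytes, so its exact spelling, ordering, and capitalization are guesses. Note also that `urllib.robotparser` compares rule paths by prefix and does not expand `*` wildcards, so it is used here only for the crawl-delay and sitemap fields.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report above; layout and casing are assumptions.
ROBOTS_TXT = """\
User-agent: *
Disallow: /public
Disallow: /public/*
Disallow: /public/index.php
Disallow: /public/index.php/*
Disallow: /service*
Disallow: /share*
Disallow: /tr/*
Disallow: /*?ref=
Disallow: /*?q=
Disallow: /*?preview=
Disallow: /*?utm_source=
Disallow: /*?ref=
Disallow: /*?page=
Allow: /

User-agent: adsbot-google
Disallow: /advert/*
Allow: /

User-agent: semrushbot
Crawl-delay: 2

User-agent: ahrefsbot
Crawl-delay: 2

Sitemap: https://www.avrupagazete.co.uk/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Per-crawler delay in seconds; None when the matching group sets no delay.
semrush_delay = parser.crawl_delay("SemrushBot")
ahrefs_delay = parser.crawl_delay("AhrefsBot")
default_delay = parser.crawl_delay("Googlebot")

# Sitemap URLs declared anywhere in the file (requires Python 3.8+).
sitemaps = parser.site_maps()

if __name__ == "__main__":
    print(semrush_delay, ahrefs_delay, default_delay)
    print(sitemaps)
```

A crawler that honors these records would wait the returned number of seconds between requests and would fetch the sitemap from the redirect domain rather than from avrupagazete.com.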