gazeteemek.net
robots.txt

Robots Exclusion Standard data for gazeteemek.net

Resource Scan

Scan Details

Site Domain gazeteemek.net
Base Domain gazeteemek.net
Scan Status Ok
Last Scan2025-04-12T11:15:03+00:00
Next Scan 2025-04-19T11:15:03+00:00

Last Scan

Scanned2025-04-12T11:15:03+00:00
URL https://gazeteemek.net/robots.txt
Redirect https://www.gazeteemek.net/robots.txt
Redirect Domain www.gazeteemek.net
Redirect Base gazeteemek.net
Domain IPs 104.21.16.2, 172.67.165.159, 2606:4700:3033::ac43:a59f, 2606:4700:3034::6815:1002
Redirect IPs 104.21.16.2, 172.67.165.159, 2606:4700:3033::ac43:a59f, 2606:4700:3034::6815:1002
Response IP 104.21.16.2
Found Yes
Hash 57b1cb12272e2bc335944935fc6a417707a4f8b94271a25471843f29d4420d5a
SimHash 69383622ae13

Groups

*

Rule Path
Disallow /arama
Disallow /public
Disallow /public/*
Disallow /public/index.php
Disallow /public/index.php/*
Disallow /service*
Disallow /share*
Disallow /tr/*
Disallow /*?ref=
Disallow /*?q=
Disallow /*?preview=
Disallow /*?utm_source=
Disallow /*?ref=
Disallow /*?page=
Disallow /*?cursor=
Allow /

adsbot-google

Rule Path
Disallow /advert/*
Allow /

semrushbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

Other Records

Field Value
sitemap https://www.gazeteemek.net/sitemap.xml