turkuazgazetesi.net
robots.txt

Robots Exclusion Standard data for turkuazgazetesi.net

Resource Scan

Scan Details

Site Domain turkuazgazetesi.net
Base Domain turkuazgazetesi.net
Scan Status Ok
Last Scan2024-09-20T06:37:35+00:00
Next Scan 2024-09-27T06:37:35+00:00

Last Scan

Scanned2024-09-20T06:37:35+00:00
URL https://turkuazgazetesi.net/robots.txt
Redirect https://www.turkuazgazetesi.net/robots.txt
Redirect Domain www.turkuazgazetesi.net
Redirect Base turkuazgazetesi.net
Domain IPs 104.21.31.6, 172.67.174.68, 2606:4700:3032::6815:1f06, 2606:4700:3037::ac43:ae44
Redirect IPs 104.21.31.6, 172.67.174.68, 2606:4700:3032::6815:1f06, 2606:4700:3037::ac43:ae44
Response IP 104.21.31.6
Found Yes
Hash f9f970b8ed690d70bffdefb119a95e76015674eb91335993176f4990d93d1a7b
SimHash 6d149f36e411

Groups

*

Rule Path
Disallow /ara
Disallow /basin_ilan
Disallow /paylas
Disallow /mesaj
Disallow /user/
Disallow /entry/
Disallow /service/
Disallow /yazdir/
Disallow /cdn-cgi/
Disallow /themes/enerjik/assets/img/bos.png
Disallow /themes/enerjik/assets/img/mask-16-9.png
Disallow /haberleri?filter=
Disallow /etiket?filter=
Disallow /*.php$
Allow /

googlebot-image

Rule Path
Disallow /user
Allow /

adsbot-google

Rule Path
Disallow /service/stats/visitors
Disallow /service/advertchannels
Allow /

Other Records

Field Value
sitemap https://www.turkuazgazetesi.net/google-news.xml
sitemap https://www.turkuazgazetesi.net/sitemap.xml
sitemap https://www.turkuazgazetesi.net/sitemap-latest.xml