timeturk.com
robots.txt

Robots Exclusion Standard data for timeturk.com

Resource Scan

Scan Details

Site Domain timeturk.com
Base Domain timeturk.com
Scan Status Ok
Last Scan 2024-10-30T21:53:02+00:00
Next Scan 2024-11-06T21:53:02+00:00

Last Scan

Scanned 2024-10-30T21:53:02+00:00
URL https://timeturk.com/robots.txt
Redirect https://www.timeturk.com/robots.txt
Redirect Domain www.timeturk.com
Redirect Base timeturk.com
Domain IPs 104.21.87.236, 172.67.148.109, 2606:4700:3032::6815:57ec, 2606:4700:3037::ac43:946d
Redirect IPs 104.21.87.236, 172.67.148.109, 2606:4700:3032::6815:57ec, 2606:4700:3037::ac43:946d
Response IP 172.67.148.109
Found Yes
Hash 4765f9cac0ca133738860fc7d91bee0efe24d8360d731829b1e7231c293c97d4
SimHash 41051940c711

Groups

*

Rule Path
Allow /
Disallow /cdn-cgi/
Disallow /sayac/
Disallow /embed/
Disallow /arama

gptbot

Rule Path
Disallow /
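
To illustrate how the two groups above might be evaluated for a given crawler and path, here is a minimal Python sketch. It assumes the longest-match precedence described in RFC 9309 (the most specific matching path wins, and Allow wins a tie). The rule data is transcribed from the tables above; the names `GROUPS` and `is_allowed` are hypothetical, and wildcard (`*`) and end-of-path (`$`) patterns are deliberately not handled.

```python
# Rules transcribed from the scan's "Groups" tables (user-agent -> rule list).
GROUPS = {
    "*": [
        ("allow", "/"),
        ("disallow", "/cdn-cgi/"),
        ("disallow", "/sayac/"),
        ("disallow", "/embed/"),
        ("disallow", "/arama"),
    ],
    "gptbot": [("disallow", "/")],
}


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if `path` may be crawled by `user_agent`.

    Simplified sketch: exact (case-insensitive) agent-name lookup and
    plain prefix matching, with longest-match precedence.
    """
    # Use the group named for this agent, else fall back to the "*" group.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    # Collect (path length, is_allow) for every rule that prefixes the path.
    matches = [(len(p), kind == "allow") for kind, p in rules if path.startswith(p)]
    if not matches:
        return True  # no rule applies, so crawling is permitted by default
    # Longest path wins; on equal length, allow (True) sorts above disallow.
    return max(matches)[1]


if __name__ == "__main__":
    for agent, path in [("*", "/arama"), ("*", "/haber/ornek"), ("GPTBot", "/")]:
        print(agent, path, is_allowed(agent, path))
```

Precedence is computed explicitly here because the `*` group lists `Allow /` before its Disallow rules. A parser that applies rules in file order and stops at the first match would treat every path in that group as allowed.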

Other Records

Field Value
sitemap https://www.timeturk.com/timenews.xml
sitemap https://www.timeturk.com/harita-index2.xml
sitemap https://www.timeturk.com/time_haber_sitemap.xml
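
The sitemap records above can be read out of a robots.txt body with Python's standard `urllib.robotparser` (its `site_maps()` method requires Python 3.8 or later). The sketch below is illustrative only: `ROBOTS_TXT` is a partial reconstruction from the scan tables, not the file as served, and `list_sitemaps` is a hypothetical helper name.

```python
from typing import List
from urllib.robotparser import RobotFileParser

# Partial reconstruction from the scan data (assumption: the real file's
# exact layout and ordering are not shown in the scan record).
ROBOTS_TXT = """\
User-agent: *
Allow: /

Sitemap: https://www.timeturk.com/timenews.xml
Sitemap: https://www.timeturk.com/harita-index2.xml
Sitemap: https://www.timeturk.com/time_haber_sitemap.xml
"""


def list_sitemaps(robots_txt: str) -> List[str]:
    """Return the Sitemap URLs declared in a robots.txt body."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    # site_maps() returns None when no Sitemap records are present.
    return parser.site_maps() or []


if __name__ == "__main__":
    for url in list_sitemaps(ROBOTS_TXT):
        print(url)
```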