timeturk.com
robots.txt
Robots Exclusion Standard data for timeturk.com
Resource Scan
Scan Details
Site Domain | timeturk.com |
Base Domain | timeturk.com |
Scan Status | Ok |
Last Scan | 2024-10-30T21:53:02+00:00 |
Next Scan | 2024-11-06T21:53:02+00:00 |
Last Scan
Scanned | 2024-10-30T21:53:02+00:00 |
URL | https://timeturk.com/robots.txt |
Redirect | https://www.timeturk.com/robots.txt |
Redirect Domain | www.timeturk.com |
Redirect Base | timeturk.com |
Domain IPs | 104.21.87.236, 172.67.148.109, 2606:4700:3032::6815:57ec, 2606:4700:3037::ac43:946d |
Redirect IPs | 104.21.87.236, 172.67.148.109, 2606:4700:3032::6815:57ec, 2606:4700:3037::ac43:946d |
Response IP | 172.67.148.109 |
Found | Yes |
Hash | 4765f9cac0ca133738860fc7d91bee0efe24d8360d731829b1e7231c293c97d4 |
SimHash | 41051940c711 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /cdn-cgi/ |
Disallow | /sayac/ |
Disallow | /embed/ |
Disallow | /arama |
Other Records
Field | Value |
---|---|
sitemap | https://www.timeturk.com/timenews.xml |
sitemap | https://www.timeturk.com/harita-index2.xml |
sitemap | https://www.timeturk.com/time_haber_sitemap.xml |