gucluanadolugazetesi.com
robots.txt
Robots Exclusion Standard data for gucluanadolugazetesi.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | gucluanadolugazetesi.com |
Base Domain | gucluanadolugazetesi.com |
Scan Status | Ok |
Last Scan | 2024-09-26T06:34:13+00:00 |
Next Scan | 2024-10-03T06:34:13+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-26T06:34:13+00:00 |
URL | https://gucluanadolugazetesi.com/robots.txt |
Domain IPs | 104.21.67.126, 172.67.174.200, 2606:4700:3032::ac43:aec8, 2606:4700:3037::6815:437e |
Response IP | 104.21.67.126 |
Found | Yes |
Hash | e2f52288b0b0a06fda6ddf7217abef60608b4ff5598c28becebd4de7467d25a7 |
SimHash | c9303432ae13 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /arama |
Disallow | /public |
Disallow | /public/* |
Disallow | /service* |
Disallow | /share* |
Disallow | /tr/* |
Disallow | /*?ref= |
Disallow | /*?q= |
Disallow | /*?preview= |
Disallow | /*?utm_source= |
Disallow | /*?page= |
Disallow | /*?cursor= |
Allow | / |
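
To see how the rules in this group combine, the following is a minimal sketch of a matcher, assuming the longest-match semantics described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the matching rule with the longest pattern wins, and `Allow` wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and not part of the scan data; real crawlers may differ in details such as percent-encoding normalization.

```python
import re

# Rules copied from the "*" group in the table above (assumed to be complete).
RULES = [
    ("disallow", "/arama"),
    ("disallow", "/public"),
    ("disallow", "/public/*"),
    ("disallow", "/service*"),
    ("disallow", "/share*"),
    ("disallow", "/tr/*"),
    ("disallow", "/*?ref="),
    ("disallow", "/*?q="),
    ("disallow", "/*?preview="),
    ("disallow", "/*?utm_source="),
    ("disallow", "/*?page="),
    ("disallow", "/*?cursor="),
    ("allow", "/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be crawled."""
    best_len, best_kind = -1, "allow"  # no matching rule means allowed
    for kind, pattern in rules:
        # re.match anchors at the start, so patterns act as prefixes.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longest pattern wins; on equal length, prefer "allow".
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, best_kind = length, kind
    return best_kind == "allow"
```

Under these assumptions, `is_allowed("/arama")` and `is_allowed("/haber?page=2")` return `False`, while `is_allowed("/haber/123")` returns `True` (the `/haber` paths are hypothetical examples). Note that `is_allowed("/haber?foo=1&page=2")` also returns `True`, because the query-string patterns contain a literal `?` and therefore only match when the parameter comes first in the query string.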
Other Records
Field | Value |
---|---|
sitemap | https://www.gucluanadolugazetesi.com/sitemap.xml |