gazetenews.com
robots.txt
Robots Exclusion Standard data for gazetenews.com
Resource Scan
Scan Details
Site Domain | gazetenews.com |
Base Domain | gazetenews.com |
Scan Status | Ok |
Last Scan | 2024-05-15T01:42:02+00:00 |
Next Scan | 2024-05-22T01:42:02+00:00 |
Last Scan
Scanned | 2024-05-15T01:42:02+00:00 |
URL | https://gazetenews.com/robots.txt |
Domain IPs | 104.21.94.139, 172.67.136.139, 2606:4700:3030::ac43:888b, 2606:4700:3036::6815:5e8b |
Response IP | 172.67.136.139 |
Found | Yes |
Hash | d10af6b34150b79896c94750199d64d9fe18b70d587024b9294f31eb262fe86d |
SimHash | 42a0d54f4f7b |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /?ref= |
Disallow | /?q= |
Disallow | /?arama= |
Disallow | /*?ref= |
Disallow | /advertorial/* |
Disallow | /bakim.php |
Disallow | /?utm_source= |
Disallow | /*page/* |
Disallow | /cron/* |
Disallow | /*sayfa/* |
Disallow | /?p=* |
Disallow | /?p* |
Disallow | /ara?s=* |
Disallow | /index.php?page=* |
Disallow | /?id=* |
Disallow | /xpanel |
Disallow | /xpanel/* |
Disallow | /g.php |
Disallow | /info.php |
Disallow | /amp//m/* |
Other Records
Field | Value |
---|---|
sitemap | https://gazetenews.com/sitemap_index.xml |
sitemap | https://gazetenews.com/sitemap_news.xml |
sitemap | https://gazetenews.com/yandex_news.xml |