gomantaktimes.com
robots.txt
Robots Exclusion Standard data for gomantaktimes.com
Resource Scan
Scan Details
Site Domain | gomantaktimes.com |
Base Domain | gomantaktimes.com |
Scan Status | Ok |
Last Scan | 2024-06-06T09:07:21+00:00 |
Next Scan | 2024-06-13T09:07:21+00:00 |
Last Scan
Scanned | 2024-06-06T09:07:21+00:00 |
URL | https://gomantaktimes.com/robots.txt |
Redirect | https://www.gomantaktimes.com/robots.txt |
Redirect Domain | www.gomantaktimes.com |
Redirect Base | gomantaktimes.com |
Domain IPs | 23.20.179.164, 54.158.195.16 |
Redirect IPs | 104.18.90.198, 104.18.91.198, 104.18.92.198, 104.18.93.198, 104.18.94.198, 2606:4700::6812:5ac6, 2606:4700::6812:5bc6, 2606:4700::6812:5cc6, 2606:4700::6812:5dc6, 2606:4700::6812:5ec6 |
Response IP | 104.18.91.198 |
Found | Yes |
Hash | 5ace8963f2d33f0665dd9ed2a6a0d218eefc5aa2ff47fa3089a4d4a5aafbd55b |
SimHash | 09148860c710 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /topicPage/ |
Disallow | *?utm_* |
Disallow | /115394472* |
Disallow | *%2C* |
Disallow | *%* |
Disallow | /search* |
Other Records
Field | Value |
---|---|
sitemap | https://www.gomantaktimes.com/sitemap.xml |
sitemap | https://www.gomantaktimes.com/news_sitemap.xml |