umannews.city
robots.txt
Robots Exclusion Standard data for umannews.city
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | umannews.city |
Base Domain | umannews.city |
Scan Status | Ok |
Last Scan | 2024-11-10T23:59:31+00:00 |
Next Scan | 2024-11-17T23:59:31+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-10T23:59:31+00:00 |
URL | https://umannews.city/robots.txt |
Domain IPs | 104.21.25.164, 172.67.134.98, 2606:4700:3034::ac43:8662, 2606:4700:3036::6815:19a4 |
Response IP | 172.67.134.98 |
Found | Yes |
Hash | 3a07cbb6466bf23e956c23f08b0f1554774f87b89b85ed1c4a02f50f2a633bf3 |
SimHash | f2103b422e73 |
Groups
*
Rule | Path |
---|---|
Disallow | /account* |
Disallow | /ajax/* |
Disallow | /read/articles/search?* |
Disallow | /articles/search |
Disallow | *?fbclid=* |
Disallow | *?_ga=* |
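
The Disallow rules above can be checked against sample paths with a short sketch. It assumes the common robots.txt matching conventions: a pattern matches from the start of the path, `*` stands for any run of characters, a trailing `$` anchors the end, and `?` is a literal character. The function names `pattern_to_regex` and `is_disallowed` and the sample paths are illustrative and do not come from the scan. Because this group contains only Disallow rules, the sketch ignores Allow rules and longest-match precedence.

```python
import re

# Disallow patterns copied from the "*" group above.
DISALLOW = [
    "/account*",
    "/ajax/*",
    "/read/articles/search?*",
    "/articles/search",
    "*?fbclid=*",
    "*?_ga=*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper)."""
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    # Escape each literal piece (so '?' stays literal), then join with '.*'
    # so that every '*' in the pattern matches any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored_end else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any Disallow pattern matches the path and query string."""
    # re.match anchors at the start of the string, which gives prefix matching.
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    for sample in ["/account/settings", "/news/1?fbclid=abc", "/news/1"]:
        print(sample, is_disallowed(sample))
```

Under these assumptions `*?fbclid=*` matches only when `fbclid` is the first query parameter, since the pattern requires the literal `?` immediately before it. A URL such as `/news/1?a=b&fbclid=abc` is not matched by this group.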
Other Records
Field | Value |
---|---|
sitemap | https://umannews.city/sitemap.xml |
Warnings
- 12 invalid lines.