news.nicekoora.com
robots.txt
Robots Exclusion Standard data for news.nicekoora.com
Resource Scan
Scan Details
Site Domain | news.nicekoora.com |
Base Domain | nicekoora.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2025-04-22T01:12:42+00:00 |
Next Scan | 2025-06-21T01:12:42+00:00 |
Last Successful Scan
Scanned | 2025-01-30T01:08:25+00:00 |
URL | https://news.nicekoora.com/robots.txt |
Domain IPs | 104.21.77.31, 172.67.203.253, 2606:4700:3033::6815:4d1f, 2606:4700:3034::ac43:cbfd |
Response IP | 104.21.77.31 |
Found | Yes |
Hash | 4f4174582011f7ae5ab643c172093b999706f48d6dc995e9ebae4249073e6495 |
SimHash | ed3998056653 |
Groups
*
Rule | Path |
---|---|
Disallow | /*? |
Disallow | /*?page= |
Disallow | /forum/*?page= |
Disallow | /search/* |
Disallow | /search.html/* |
Disallow | /search.html*?query= |
Disallow | /archive.html/* |
Disallow | /archive.html*?publishDateDay= |
Disallow | /panel |
Disallow | /cron |
Disallow | /ajax |
Disallow | /widgets_factory |
Disallow | /auth |
Disallow | /login |
Disallow | /register |
Disallow | /style |
Disallow | /printit |
Disallow | /emailthis |
Disallow | /outside |
Other Records
Field | Value |
---|---|
sitemap | https://news.nicekoora.com/sitemap.xml |
sitemap | https://news.nicekoora.com/sitemap.xml?format=google_news |