niarn.org
robots.txt
Robots Exclusion Standard data for niarn.org
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | niarn.org |
Base Domain | niarn.org |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2025-06-10T08:33:07+00:00 |
Next Scan | 2025-09-08T08:33:07+00:00 |
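
The two timestamps above imply a rescan interval, which can be checked with a little date arithmetic. The sketch below only parses the values listed in Scan Details. The variable names are illustrative, and the idea that the scanner uses a fixed interval is an assumption, not something the report states.

```python
from datetime import datetime

# Timestamps copied verbatim from the Scan Details table.
last_scan = datetime.fromisoformat("2025-06-10T08:33:07+00:00")
next_scan = datetime.fromisoformat("2025-09-08T08:33:07+00:00")

# Gap between the failed scan and the next scheduled attempt.
interval = next_scan - last_scan
print(interval.days)  # → 90
```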
Last Successful Scan
Field | Value |
---|---|
Scanned | 2025-02-11T07:42:32+00:00 |
URL | https://niarn.org/robots.txt |
Domain IPs | 104.21.10.192, 172.67.190.203, 2606:4700:3030::6815:ac0, 2606:4700:3036::ac43:becb |
Response IP | 104.21.10.192 |
Found | Yes |
Hash | c3f3e8d99675cde120ab25cbf61c459176edd0c51c03a79278ae9486a96b8124 |
SimHash | 701ad032f881 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /404 |
Disallow | /data-deletion |
Disallow | /login |
Disallow | /logout |
Disallow | /goto |
Disallow | /goto/ |
Disallow | /search-article |
Disallow | /search |
Disallow | /tim-kiem/ |
Disallow | /tim-kiem-truyen |
Disallow | /w/ |
Disallow | /wp-content/ |
Disallow | /sw.js |
Disallow | /*/games/search |
Disallow | /games/search |