top-10.in
robots.txt
Robots Exclusion Standard data for top-10.in
Resource Scan
Scan Details
Site Domain | top-10.in |
Base Domain | top-10.in |
Scan Status | Ok |
Last Scan | 2024-10-02T19:02:53+00:00 |
Next Scan | 2024-10-09T19:02:53+00:00 |
Last Scan
Scanned | 2024-10-02T19:02:53+00:00 |
URL | https://top-10.in/robots.txt |
Domain IPs | 139.59.28.140 |
Response IP | 139.59.28.140 |
Found | Yes |
Hash | 39a5ff4a04d2f0da0892500796a66cc3cc8615c95d6b6bda2bc1b1bbaa8ebf9a |
SimHash | 420055427b1b |
Groups
*
Rule | Path |
---|---|
Allow | /wp-admin/admin-ajax.php |
Disallow | /wp-admin/ |
Disallow | /cgi-bin/ |
Disallow | /linkout/ |
Disallow | /recommended/ |
Disallow | /comments/feed/ |
Disallow | /trackback/ |
Disallow | /index.php |
Disallow | /xmlrpc.php |
Disallow | /tag/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.top-10.in/sitemap_index.xml |
sitemap | https://www.top-10.in/category-sitemap.xml |
sitemap | https://www.top-10.in/post-sitemap.xml |
sitemap | https://www.top-10.in/page-sitemap.xml |
Warnings
- 2 invalid lines.
Comments