in.wiki
robots.txt
Robots Exclusion Standard data for in.wiki
Resource Scan
Scan Details
Site Domain | in.wiki |
Base Domain | in.wiki |
Scan Status | Ok |
Last Scan | 2025-05-25T06:02:26+00:00 |
Next Scan | 2025-06-01T06:02:26+00:00 |
Last Scan
Scanned | 2025-05-25T06:02:26+00:00 |
URL | https://in.wiki/robots.txt |
Domain IPs | 104.21.10.117, 172.67.163.39, 2606:4700:3030::ac43:a327, 2606:4700:3034::6815:a75 |
Response IP | 104.21.10.117 |
Found | Yes |
Hash | bfff79f549b880df060adc721dce5f80b0603bb1b858e965dc93631cf96f6c28 |
SimHash | ea59d8d18525 |
Groups
*
Rule | Path |
---|---|
Disallow | /modified_extensions/ |
Disallow | /proprietary_extensions/ |
Disallow | /w/ |
Disallow | /w/index.php? |
Disallow | /files/thumb/ |
Disallow | /files/archive/ |
Disallow | /files/deleted/ |
Disallow | /files/math/ |
Disallow | /files/temp/ |
Disallow | /files/tmp/ |
Disallow | /Special |
Disallow | /%D0%A1%D0%BB%D1%83%D0%B6%D0%B5%D0%B1%D0%BD%D0%B0%D1%8F |
Other Records
Field | Value |
---|---|
crawl-delay | 20 |
Other Records
Field | Value |
---|---|
sitemap | http://in.wiki/.sitemap/sitemap-index-in.wiki.xml |
Warnings
- `host` is not a known field.
Comments