archive.vn
robots.txt
Robots Exclusion Standard data for archive.vn
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | archive.vn |
Base Domain | archive.vn |
Scan Status | Ok |
Last Scan | 2024-11-13T09:12:13+00:00 |
Next Scan | 2024-11-20T09:12:13+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-13T09:12:13+00:00 |
URL | https://archive.vn/robots.txt |
Domain IPs | 23.137.248.133 |
Response IP | 23.137.249.77 |
Found | Yes |
Hash | 6fb550cedde2fef9d0bdaf00f3fd1887390c5d8cc0a236cd3e2463fccbbc1761 |
SimHash | c2188ea907b3 |
Groups
User-agent: `*`
Rule | Path |
---|---|
Allow | / |
Disallow | /faq.html$ |
Disallow | /*.xml$ |
Allow | /*sitemap*.xml$ |
Disallow | /*/abuse$ |
Disallow | /*/share$ |
Disallow | /*/again$ |
Disallow | /download/ |
Disallow | /link%3A |
Disallow | /o/ |
Disallow | /search/ |
Disallow | /timegate/ |
Disallow | /timemap/ |
Disallow | /offset%3D |
Disallow | /http%3A |
Disallow | /https%3A |
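
The rules above combine `*` wildcards, `$` end anchors, and overlapping Allow/Disallow entries, so the outcome for a given path is not always obvious. The following is a minimal sketch of how a crawler might evaluate them, assuming the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and not part of the scan. Paths are compared as raw strings; percent-encoding normalisation is deliberately left out.

```python
import re

# Rules copied from the `*` group above, in file order.
RULES = [
    ("allow", "/"),
    ("disallow", "/faq.html$"),
    ("disallow", "/*.xml$"),
    ("allow", "/*sitemap*.xml$"),
    ("disallow", "/*/abuse$"),
    ("disallow", "/*/share$"),
    ("disallow", "/*/again$"),
    ("disallow", "/download/"),
    ("disallow", "/link%3A"),
    ("disallow", "/o/"),
    ("disallow", "/search/"),
    ("disallow", "/timegate/"),
    ("disallow", "/timemap/"),
    ("disallow", "/offset%3D"),
    ("disallow", "/http%3A"),
    ("disallow", "/https%3A"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    `*` matches any run of characters; a trailing `$` anchors the end.
    Without `$`, the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: the longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/faq.html", "/feed.xml", "/sitemap.xml", "/abc12/share"]:
        print(p, "allowed" if is_allowed(p) else "disallowed")
```

Under these assumptions, `/sitemap.xml` stays crawlable because `/*sitemap*.xml$` is longer than `/*.xml$`, while other `.xml` paths are blocked.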
Other Records
Field | Value |
---|---|
sitemap | https://archive.vn/sitemap.xml |
Warnings
- `host` is not a known field.